Finding Supporters. Political Predictive Analytics Using Logistic Regression. Multivariate Solutions



Similar documents
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Binary Logistic Regression

Medicare Advantage National Senior Survey 600 Senior Registered Voters in the Medicare Advantage Program February 24-28, 2015


The relationship between mental wellbeing and financial management among older people

Racial and Ethnic Differences in Health Insurance Coverage Among Adult Workers in Florida. Jacky LaGrace Mentor: Dr. Allyson Hall

Credit Risk Analysis Using Logistic Regression Modeling

Free Trial - BIRT Analytics - IAAs

Enrollment Data Undergraduate Programs by Race/ethnicity and Gender (Fall 2008) Summary Data Undergraduate Programs by Race/ethnicity

If You Think Investing is Gambling, You re Doing it Wrong!

MULTIVARIATE ANALYSIS OF BUYERS AND NON-BUYERS OF THE FEDERAL LONG-TERM CARE INSURANCE PROGRAM

Criminal Justice Professionals Attitudes Towards Offenders: Assessing the Link between Global Orientations and Specific Attributions

HOUSEHOLDS WITH HIGH LEVELS OF NET ASSETS

How to set the main menu of STATA to default factory settings standards

Release #2301 Release Date and Time: 6:00 a.m., Tuesday, March 10, 2009

When to Use a Particular Statistical Test

11. Analysis of Case-control Studies Logistic Regression

Marist College Institute for Public Opinion Poughkeepsie, NY Phone Fax

ABSTRACT INTRODUCTION STUDY DESCRIPTION

America Is Changing. National Conference of State Legislatures. August 15, 2013 Atlanta, GA

SPSS Guide: Regression Analysis

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SURVEY OF ILLINOIS VOTERS. Conducted by the UIS Survey Research Office, Center for State Policy & Leadership

ln(p/(1-p)) = α +β*age35plus, where p is the probability or odds of drinking

Row vs. Column Percents. tab PRAYER DEGREE, row col

Americans Current Views on Smoking 2013: An AARP Bulletin Survey

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

National Survey Results: Federal Vote Intention Tight 3-Way Race June 25, 2015

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

Simple Predictive Analytics Curtis Seare

USING LOGIT MODEL TO PREDICT CREDIT SCORE

Jon A. Krosnick and LinChiat Chang, Ohio State University. April, Introduction

Western New England University Polling Institute

Florida Poll Results Trump 47%, Clinton 42% (Others 3%, 8% undecided) Rubio re-elect: 38-39% (22% undecided)

THE FIELD POLL. UCB Contact

Internet Gambling in Canada: Prevalence, Patterns and Land-Based Comparisons

Graduating to a Pay Gap

End User Satisfaction With a Food Manufacturing ERP

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank

Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign

An Analysis of the Health Insurance Coverage of Young Adults

Customer Profiling for Marketing Strategies in a Healthcare Environment MaryAnne DePesquo, Phoenix, Arizona

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

Where there is a will

SATURATION LISTS. Reach all households in a geographic area at lowest postage rates

Children in Egypt 2014 A STATISTICAL DIGEST

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

THE FIELD POLL. By Mark DiCamillo, Director, The Field Poll

Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables

San Antonio Voter Support for Protecting Water Supply in the Edwards Aquifer and Linear Parks

Logistic (RLOGIST) Example #7

An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China

Attrition in Online and Campus Degree Programs

Examining Professional and Academic Culture in Chilean Journalism and Mass Communication Education

NEW JERSEY VOTERS DIVIDED OVER SAME-SEX MARRIAGE. A Rutgers-Eagleton Poll on same-sex marriage, conducted in June 2006, found the state s

With Depression Without Depression 8.0% 1.8% Alcohol Disorder Drug Disorder Alcohol or Drug Disorder

Does Financial Sophistication Matter in Retirement Preparedness of U.S Households? Evidence from the 2010 Survey of Consumer Finances

Illinois 8th Congressional District Survey Results

MEDICAL ASSISTANCE (MA)/MCHP APPLICATION FOR FAMILIES, PREGNANT WOMEN, AND CHILDREN

Mind on Statistics. Chapter 15

Introduction to Statistics and Quantitative Research Methods

Stockholm Research Reports in Demography 2013:19

5. Survey Samples, Sample Populations and Response Rates

Prediction of Car Prices of Federal Auctions

Relating the ACT Indicator Understanding Complex Texts to College Course Grades

Quick Start. Creating a Scoring Application. RStat. Based on a Decision Tree Model

Clinton Leads Sanders by 29%

PharmaSUG2011 Paper HS03

Multivariate Models of Student Success

Multivariate Logistic Regression

Education and Wage Differential by Race: Convergence or Divergence? *

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

Modeling Lifetime Value in the Insurance Industry

Survey of Nevada Casino Gaming Employees

Better decision making under uncertain conditions using Monte Carlo Simulation

Transcription:

Finding Supporters Political Predictive Analytics Using Logistic Regression Multivariate Solutions

What is Logistic Regression? In a political application, logistic regression can describe the outcome of whether a person will vote or not vote ; or Vote for my candidate, Not Vote for my candidate. Logistic regression should be used in situations when a campaign would like to ascertain which voters are: Likely Voters Inclined to vote for their candidate Undecided Donor possibility Logistic regression in a nutshell: Logistic regression is used for prediction of the probability of occurrence of an event by fitting data to a logistic curve. Logistic regression makes use of several predictor variables that may be either numerical or categorical.

Example: Finding the Liz Anderson Supporter For example, in a political race, logistic regression could be used to analyze the factors that determine whether an individual should be targeted with strategic communications; in the same sense as a swing state in president elections. In our example, our candidate, Liz Anderson, is running for Congress. Let us define a variable Outcome Vote for Liz Anderson Outcome = 1 If 'The Individual Is Likely to vote for Liz = 0 If 'The Individual Is Not Likely To Vote for Liz. The outcome takes only two possible values.

Variables in the Model These are the variables in the Anderson for Congress model Gender-Male=1,Female=2 Married Own home Urban=1/Rural=0 Working women in Household Completed college Credit card debt

Regression Output Vote for 'Liz' Logistic Regression Regression Output Regression Beta Sig. Odds Ratio Gender 1=Male 2=Female Marital Status 1=Married/0=Not Married College Graduate 1=Yes, 0=No Working Woman in House 1=Yes, 0=No Urban 1=Urban, 0=Not urban Housing 1=Own, 0=Rent Credit Card Debt 1=Yes, 0=No 2.670 0.029 14.440 2.095 0.015 8.126 1.800 0.120 6.050 1.762 0.038 5.826 1.649 0.024 5.202 0.800 0.506 2.226-0.860 0.252 0.423 Shading indicated statistical significance This slide is descriptive, and shows which of the variables are most influential in determining whether a voter is a Liz for Congress supporter. The campaign can now determine, by examining this table, that educated married, working woman who live in urban areas are a key demographic to target in the days leading up to the election. These should be targets for both Strategic Communications and Get Out the Vote field operations.

Odds-Ratio When a respondent s choices are set within the regression model, an odds-ratio for each respondent is created using the formula of 1/(1+e -z ). Z is the outcome of the regression equation once all the questions are input. A simulator can be used to classify individuals based on demographic data or a survey screen. Two examples follow:

Example One Urban College-Educated Working Woman Vote for 'Liz' Logistic Regression Regression Output Answer Regression Beta Sum (b*c) Gender 1=Male 2=Female 2 2.670 5.340 Marital Status 1=Married/0=Not Married 1 2.095 2.095 College Graduate 1=Yes, 0=No 1 1.800 1.800 Working Woman in House 1=Yes, 0=No 1 1.762 1.762 Urban 1=Urban, 0=Not urban 1 1.649 1.649 Housing 1=Own, 0=Rent 1 0.800 0.800 Credit Card Debt 1=Yes, 0=No 0-0.860 0.000 Sum 3.446 Odds Ratio (1/(1+e -z ) 0.97 (Chances to 'Vote for Liz') 97% Using the logistic output, the chances of a college educated, urban, working, married woman who owns her own home and has no credit card debt to be a supporter shown above. The logistic regression formula (1/(1=e -z ) gives this woman an odd-ratio (in purple) of.97, or 97% chance that she will vote for Liz.

Example Two Urban College-Educated Working Man Vote for 'Liz' Logistic Regression Regression Output Answer Regression Beta Sum (b*c) Gender 1=Male 2=Female 1 2.670 2.670 Marital Status 1=Married/0=Not Married 1 2.095 2.095 College Graduate 1=Yes, 0=No 1 1.800 1.800 Working Woman in House 1=Yes, 0=No 1 1.762 1.762 Urban 1=Urban, 0=Not urban 1 1.649 1.649 Housing 1=Own, 0=Rent 1 0.800 0.800 Credit Card Debt 1=Yes, 0=No 0-0.860 0.000 Sum 0.776 Odds Ratio (1/(1+e -z ) 0.68 (Chances to 'Vote for Liz') 68% Using the logistic output, the chances of a college educated, urban, working, married MAN who owns his own home and has no credit card debt to be a Liz supporter is shown above. The logistic regression formula (1/(1=e -z ) gives this man an odd-ratio (in purple) of.68, or 68% chance that he will vote for Liz (down from 97% for a women with the same demographic characteristics).