Nominal and ordinal logistic regression



Similar documents
Generalized Linear Models

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Multinomial and Ordinal Logistic Regression

Lecture 3: Linear methods for classification

SAS Software to Fit the Generalized Linear Model

Classification Problems

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Poisson Models for Count Data

Interpretation of Somers D under four simple models

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Multiple Choice Models II

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Statistics 305: Introduction to Biostatistical Methods for Health Sciences

11. Analysis of Case-control Studies Logistic Regression

Regression Modeling Strategies

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

Pearson's Correlation Tests

LOGIT AND PROBIT ANALYSIS

Some Essential Statistics The Lure of Statistics

Week TSX Index

Christfried Webers. Canberra February June 2015

Microsoft Azure Machine learning Algorithms

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Statistical Machine Learning

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Regression with a Binary Dependent Variable

STA 4273H: Statistical Machine Learning

Regression III: Advanced Methods

Logistic regression modeling the probability of success

Examples of Using R for Modeling Ordinal Data

Univariate Regression

Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541

Machine Learning Logistic Regression

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Regression 3: Logistic Regression

VI. Introduction to Logistic Regression

Introduction to General and Generalized Linear Models

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

Statistics in Retail Finance. Chapter 2: Statistical models of default

Logit Models for Binary Data

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Lecture 14: GLM Estimation and Logistic Regression

Nonlinear Regression Functions. SW Ch 8 1/54/

Simple Linear Regression Inference

Elements of statistics (MATH0487-1)

It is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3.

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Linear Classification. Volker Tresp Summer 2015

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Logit and Probit. Brad Jones 1. April 21, University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science

Introduction to mixed model and missing data issues in longitudinal studies

HYPOTHESIS TESTING: POWER OF THE TEST

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Least Squares Estimation

Statistics for Sports Medicine

Natural Language Processing. Today. Logistic Regression Models. Lecture 13 10/6/2015. Jim Martin. Multinomial Logistic Regression

DATA INTERPRETATION AND STATISTICS

Re-analysis using Inverse Probability Weighting and Multiple Imputation of Data from the Southampton Women s Survey

Multivariate Logistic Regression

Pr(X = x) = f(x) = λe λx

Introduction to Fixed Effects Methods

Statistical Models in R

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

HLM software has been one of the leading statistical packages for hierarchical

Categorical Data Analysis

Chapter 4: Statistical Hypothesis Testing

A survey on click modeling in web search

13. Poisson Regression Analysis

End User Satisfaction With a Food Manufacturing ERP

Multivariate Normal Distribution

Statistics Graduate Courses

HETEROGENEOUS AGENTS AND AGGREGATE UNCERTAINTY. Daniel Harenberg University of Mannheim. Econ 714,

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Logistic Regression (1/24/13)

SUMAN DUVVURU STAT 567 PROJECT REPORT

Logistic Regression (a type of Generalized Linear Model)

Example: Boats and Manatees

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

17. SIMPLE LINEAR REGRESSION II

Interaction between quantitative predictors

CS 688 Pattern Recognition Lecture 4. Linear Models for Classification

The zero-adjusted Inverse Gaussian distribution as a model for insurance claims

Non-Inferiority Tests for Two Means using Differences

Multiple Linear Regression

Standard errors of marginal effects in the heteroskedastic probit model

Lesson Outline Outline

Agenda. Mathias Lanner Sas Institute. Predictive Modeling Applications. Predictive Modeling Training Data. Beslutsträd och andra prediktiva modeller

Statistics in Retail Finance. Chapter 6: Behavioural models

Penalized Logistic Regression and Classification of Microarray Data

Transcription:

Nominal and ordinal logistic regression April 26

Nominal and ordinal logistic regression Our goal for today is to briefly go over ways to extend the logistic regression model to the case where the outcome can have multiple categories (i.e., not binary) We will discuss two approaches: Multinomial logistic regression, which makes no assumptions regarding the relationship between the categories, and is most appropriate for nominal outcomes, which assumes an ordering to the categories and is most appropriate for ordinal outcomes

Notation Nominal and ordinal logistic regression We will use the following notation to describe these multi-class models: Let Y be a random variable that can on one of K discrete value (i.e., fall into one of K classes) Number the classes 1,..., K Thus, π i2 = Pr(Y i = 2) denotes the probability that the ith individual s outcome belongs to the second class More generally, π ik = Pr(Y i = k) denotes the probability that the ith individual s outcome belongs to the kth class

The multinomial logistic regression model Multinomial logistic regression is equivalent to the following: Let k = 1 denote the reference category Fit separate logistic regression models for k = 2,..., K, comparing each outcome to the baseline: ( ) πik log = x T i β k π i1 Note that this will result in K 1 vectors of regression coefficients (we don t need to estimate the Kth vector because k π k = 1)

Probabilities and odds ratios The fitted class probabilities for an observation with explanatory variable vector x are therefore 1 ˆπ 1 = 1 + k exp(xt β k ) exp(x T βk ) ˆπ k = 1 + l exp(xt β l )

Probabilities and odds ratios Like logistic regression, odds ratios in the multinomial model are easily estimated as exponential functions of the regression coefficients: OR kl = π k π l = π k/π 1 π l /π 1 = exp ( (x 2 x 1 ) T β k ) exp ((x 2 x 1 ) T β l ) = exp ( (x 2 x 1 ) T (β k β l ) ) In the simple case of changing x j by δ j and comparing k to the reference category, OR kl = exp(δ j β kj )

Flu vaccine data: Results This model estimates the following odds ratios, comparing vaccinated to control: β ÔR Moderate 2.24 9.38 Large 2.22 9.17 A test of the null hypothesis that the odds ratios are all 1 is significant (p = 0.00009) Note: These are the same coefficients, the same ratios (replacing OR with RR), and the same p-value for the hypothesis test as the Poisson regression approach

Proportional odds: requires the estimation of (K 1)p parameters, and assumes nothing about the relationship between the categories to assist in that estimation This is very flexible of course, but has the potential to lead to large variability in the estimates, especially when the number of categories is large A common alternative when the categories are ordered to assume that the log odds of Y k is linearly related to the explanatory variables This is called the proportional odds model, and requires the estimate of only one regression coefficient per explanatory variable

Proportional odds model Specifically, the proportional odds model assumes ( ) πk + + π K log = β 0k + x T β 1 + + π k 1 Thus, we still have to estimate K 1 intercepts, but only p linear effects, where p is the number of explanatory variables (note that K + p 1 < (K 1)(p + 1) if K > 2) Note: Writing down the proportional odds model requires us to modify the notation we ve used all semester so in the above, x and β do not include a term for the intercept

Proportional odds: Results estimates that the odds ratio for Y {Moderate, Large} given vaccination is exp( ˆβ 1 ) = 6.3; furthermore, by assumption of the model, this is also the odds ratio for a large response relative to {Small, Moderate} given vaccination Multinomial Proportional odds Small Moderate Large Placebo Vaccine 0.6 0.5 Prob 0.4 0.3 0.2 0.1 Small Moderate Large Response

Non-linear models Nominal and ordinal logistic regression Linear and generalized linear models are certainly the most important class of models, but they are not the only kind of model For example, we have already alluded to the idea that we sometimes wish to allow the effect of an explanatory variable to be a smooth curve rather than a line: g(µ i ) = f(x i )

Non-linear models: Example 4 6 8 10 12 14 3 2 1 0 1 2 3 4 log(cafo) Log odds (Asthma)

Tree-based models Nominal and ordinal logistic regression There are also tree-based models: PPD < 2.25 >= 2.25 SmokerTime None, Intermittent Daily PctOut >= 95 < 95 Nsmokers < 1.5 >= 1.5 25 20 15 10 5 0 n = 64 25 20 15 10 5 0 n = 7 25 20 15 10 5 0 n = 11 25 20 15 10 5 0 n = 11 25 20 15 10 5 0 n = 9

And much more... There are many additional extensions/modifications/alternatives that have been proposed as well: Robust regression Distribution-free methods for inference Discriminant analysis Principal component analysis Methods for dealing with highly correlated explanatory variables Methods for variable selection that avoid the problems of subset selection These topics form the basis of BST 764: Applied Statistical Modeling, which I will be teaching next fall