The Analysis of Covariance. ERSH 8310 Keppel and Wickens Chapter 15

Similar documents
SIMPLE LINEAR CORRELATION

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

Part 1: quick summary 5. Part 2: understanding the basics of ANOVA 8

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

1. Measuring association using correlation and regression

Economic Interpretation of Regression. Theory and Applications

STATISTICAL DATA ANALYSIS IN EXCEL

7 ANALYSIS OF VARIANCE (ANOVA)

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Analysis of Premium Liabilities for Australian Lines of Business

Statistical Methods to Develop Rating Models

Calculation of Sampling Weights

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

How To Calculate The Accountng Perod Of Nequalty

Evaluating the generalizability of an RCT using electronic health records data

PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB.

Forecasting the Direction and Strength of Stock Market Movement

Portfolio Loss Distribution

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Sketching Sampled Data Streams

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

Credit Limit Optimization (CLO) for Credit Cards

Calibration and Linear Regression Analysis: A Self-Guided Tutorial

The OC Curve of Attribute Acceptance Plans

Regression Models for a Binary Response Using EXCEL and JMP

GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM

RECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY:

Meta-Analysis of Hazard Ratios

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract

Measuring Ad Effectiveness Using Geo Experiments

An Interest-Oriented Network Evolution Mechanism for Online Communities

5 Multiple regression analysis with qualitative information

An Alternative Way to Measure Private Equity Performance

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

Two Faces of Intra-Industry Information Transfers: Evidence from Management Earnings and Revenue Forecasts

Design and Analysis of Benchmarking Experiments for Distributed Internet Services

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

Single and multiple stage classifiers implementing logistic discrimination

Lecture 14: Implementing CAPM

HARVARD John M. Olin Center for Law, Economics, and Business

A Multi-mode Image Tracking System Based on Distributed Fusion

Is There A Tradeoff between Employer-Provided Health Insurance and Wages?

Stress test for measuring insurance risks in non-life insurance

What is Candidate Sampling

Student Performance in Online Quizzes as a Function of Time in Undergraduate Financial Management Courses

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*

14.74 Lecture 5: Health (2)

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University

the Manual on the global data processing and forecasting system (GDPFS) (WMO-No.485; available at

Evaluating credit risk models: A critique and a new proposal

Calculating the high frequency transmission line parameters of power cables

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

L10: Linear discriminants analysis

n + d + q = 24 and.05n +.1d +.25q = 2 { n + d + q = 24 (3) n + 2d + 5q = 40 (2)

total A A reag total A A r eag

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

The Application of Fractional Brownian Motion in Option Pricing

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

This circuit than can be reduced to a planar circuit

Traffic-light a stress test for life insurance provisions

Understanding the Impact of Marketing Actions in Traditional Channels on the Internet: Evidence from a Large Scale Field Experiment

Quantization Effects in Digital Filters

International University of Japan Public Management & Policy Analysis Program

Media Mix Modeling vs. ANCOVA. An Analytical Debate

Survival analysis methods in Insurance Applications in car insurance contracts

Approximating Cross-validatory Predictive Evaluation in Bayesian Latent Variables Models with Integrated IS and WAIC

Diagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models

NBER WORKING PAPER SERIES THE SOCIAL MULTIPLIER. Edward L. Glaeser Bruce I. Sacerdote Jose A. Scheinkman

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Statistical algorithms in Review Manager 5

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Meta-analysis in Psychological Research.

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET *

The Greedy Method. Introduction. 0/1 Knapsack Problem

Efficient Project Portfolio as a tool for Enterprise Risk Management

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Online Appendix Supplemental Material for Market Microstructure Invariance: Empirical Hypotheses

Management Quality, Financial and Investment Policies, and. Asymmetric Information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

AN EMPIRICAL INVESTIGATION OF META-ANALYSIS USING RANDOMIZED CONTROLLED CLINICAL TRIALS IN A PARTICULAR CENTRE

Social Nfluence and Its Models

Returns to Experience in Mozambique: A Nonparametric Regression Approach

A statistical approach to determine Microbiologically Influenced Corrosion (MIC) Rates of underground gas pipelines.

Heterogeneous Paths Through College: Detailed Patterns and Relationships with Graduation and Earnings

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

BERNSTEIN POLYNOMIALS

Gender differences in revealed risk taking: evidence from mutual fund investors

Learning from Large Distributed Data: A Scaling Down Sampling Scheme for Efficient Data Processing

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Evaluating the Effects of FUNDEF on Wages and Test Scores in Brazil *

Method for assessment of companies' credit rating (AJPES S.BON model) Short description of the methodology

Fuzzy Regression and the Term Structure of Interest Rates Revisited

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Transcription:

The Analyss of Covarance ERSH 830 Keppel and Wckens Chapter 5

Today s Class Intal Consderatons Covarance and Lnear Regresson The Lnear Regresson Equaton TheAnalyss of Covarance Assumptons Underlyng the Analyss of Covarance Example ANCOVA Analyss

THE ANALYSIS OF COVARIANCE

The Analyss of Covarance In a completely randomzed desgn subjects are randomly assgned to the expermental treatments The completely randomzed desgn s relatvely defcent n power If there s a varable avalable before the start of experment that s reasonably correlated wth the dependent varable abe(e,co (.e., control o varable, abe,concomtant varable, or covarate; e.g., ntellgence, grade pont average, etc.), we may ether employ blockng or statstcal ttt adjustment t

The Analyss of Covarance The randomzed blocks desgn ncludes groups of homogeneous subjects drawn from respectve blocks Advantages of the randomzed blocks desgn are: Blockng helps to equate the treatment groups before the start of the experment more effectvely than s accomplshed n the completely l randomzed ddesgn The power s ncreased because smallererrorerror term usually assocated wth the blockng desgn Interactons can be assessed

The Analyss of Covarance Dsadvantages may nclude: There wll be cost of ntroducng the blockng factor It may be dffcult to fnd blockng factors that are hghly correlated ltdwth the dependent d varable bl Loss of power may occur f a poorly correlated blockng p y p y g factor s used

The Analyss of Covarance The analyss of covarance reduces expermental error by statstcal, rather than expermental, means Subjects arefrstmeasured on the concomtant varable called the covarate whch conssts of some relevant ablty or characterstcs Subjects are then randomly assgned to the treatment j y g group wthout regard for ther scores on the covarate

The Analyss of Covarance The analyss of covarance refnes estmates of expermental error and uses the adjust treatment effects for any dfferences between the treatment groups that exsted before the expermental treatments were admnstered

Today s Example Data We wll rehash data from Secton.5 A researcher was studyng the effects of nstructonal materal on how well college students learn basc concepts n statstcs Two nstructonal materal groups (a 2) Pretest gven General quanttatve ablty (now called the covarate X) DV Scores on a test on basc statstcs test (Y)

The Data Group a Group a 2 X Y X Y 9 3 2 6 7 3 5 2 7 8 3 9 2 9 5 5 5 4 2 0 9 9 4 7 6 2 6 4 4 Mean: 8 4 7 6

Our Data As a Plot

COVARIANCE AND LINEAR REGRESSION

Covarance and Lnear Regresson The correlaton coeffcent between two varables X and Y s: r XY xy Where the covarance between X and Y s: ( X X )( Y Y ) s s x s y N N The standard devaton of The standard devaton of X s: Y s: s x N ( X X ) 2 N s XY s Y N ( ) 2 Y Y N

Wth Our Data Set The correlaton coeffcent between two varables X and Y s: xy 5.667 2.805*3.055 Where the covarance between X and Y s: ( X X )( Y Y ) s 0.66 r XY s 5. sxs 667 XY y N The standard devaton of The standard devaton of X s: Y s: s x N ( X X ) 2 N 2.805 s Y N N ( ) 2 Y Y N 3.055

Other Useful Terms (Wll Help n ANCOVA) We may defne the sum of products: SP And the sums of squares: 2 XY N ( X X )( Y Y ) 85 N SS ( ) X X X 8 N SS ( ) Y Y Y 40 Consequently: 2 r XY SP SS X XY SS Y 85 8*40 0.66

THE LINEAR REGRESSION EQUATION

The Lnear Regresson Equaton The lnear regresson lne relatng the dependent varable Y to the covarate X s: Y β 0 + β X + And the predcton equaton for s: Where: ˆ + E Y b0 b X b 0 Y b X S SPXY S X SS X Y b rxy

Wth Our Data The lnear regresson lne relatng the dependent varable Y to the covarate X s: Y β 0 + β X + And the predcton equaton for s: Where: Yˆ b. 720X E 0 + b X.403+ SY SPXY b0 Y b X.403 b rxy. 720 S SS X X

Regresson Hypothess Tests In regresson, the key hypothess test we are worred about s that for the slope, p, b H 0 : β 0 H : β 0 Usng these hypotheses, and our method for calculatng sums of squares from Chapter 4 (the GLM chapter), we can then create an ANOVA lke table for the regresson. We need: H0 H SSun exp SS unexp

Regresson Sums of Squares H Recall that SS 0exp un came from calculatng the sums of squares where the predcted value came from the lnear model when H 0 was true It was treatment + error Inregresson the correspondng predcted valuesare: are: Ths comes from: df un Yˆ b 0 Y b Y b X Y 0* X 0 H 0 The exp comes from the number of observatons (here 6) mnus the number of parameters () so H 0 df un exp 5 Y

Regresson Sums of Squares H Recall that SS exp un came from calculatng the sums of squares where the predcted value came from the lnear model when H was true It was error In regresson the correspondng predcted values are: ˆ + Y b 0 b X H The df un exp comes from the number of observatons (6) mnus the number of parameters (2), so H df un exp 4

Resdual Varaton and the Lnear Model Another way of lookng at Sums of Squares: The sum of the squared devaton from the mean s the Sums of Squares Total: T N SSY ( ) 2 H 0 Y Y SS exp SS Thesum of the squared resduals fromthe regresson lne s: N SS Re gresson SSY X un ( ) 2 ˆ H H Y Y SS SS Fnally, the sum of squares for error s: N 2 ˆ H ( ) Y Y SS SS Error un exp 0 unexp unexp

Puttng t Together n an ANOVA Table: See Excel worksheet on ELC for computatons

Resdual Varaton and the Lnear Model The amount of the Y varablty that can be attrbuted to the regresson equaton s: SS regresson SSY SSY X The squared correlaton coeffcent (or what we call the effect sze eta squared) s: r SS 2 regresson XY SSY.437

Now wth SPSS

THE ANALYSIS OF COVARIANCE

The Analyss of Covarance Now that we know a lttle about regresson, we can use t to mprove power to detect group dfferences n an ANOVA lke expermental desgn ANCOVA tests for dfferences between groups by comparng a descrpton of the data based on a sngle regresson lne to one based on lnes wth the same slope and dfferent ntercepts for each group

In the Begnnng There was the ANOVA Model Recall our GLM representaton of ANOVA: The H model: The H 0 model: Y μ + α + j T j E j Y μ + j μ T E j ANCOVA adds n the covarate X to the model

The Analyss of Covarance and the General Lnear Model For the analyss of covarance, the alternatvehypothess model s: Y X + j β0 + α j + β j E j And the null hypothess model (testng the effect of the treatment means) s: Y β 0 + β X + j j E j

Decomposng ANCOVA If the H model holds n ANCOVA, t means that the ntercepts of the regresson lne dffer across groups: For our 2 group example that means, for group : For group 2: ( β ) 0 + α + X E Y β + ( β ) 0 + α 2 + β X 2 E 2 Y β + 2

Does t? Hmm H 0 Model same ntercepts H Model Dff. Intercepts

...Its a Queston of Statstcal Sgnfcance Usng our knowledge of GLM, we could form SS for each of the possble categores or we could use SPSS: Covarate (X the score on the general quanttatve test) Treatment group (A)

Comparng ANOVA and ANCOVA ANOVA ANCOVA

The Dfference? Adjusted Means For ANCOVA, the adjusted mean (or Least Squares Mean) s: ' Y b ( X X ) Y j T

What about the Rest of ANOVA All of the rest of ANOVA happens on the adjusted means lke contrasts or post hoc tests Both of whch are made easy n SPSS The desgn can be extended to nclude more factors and more covarates Of course, thngs get more tedous to understand as you add more varables Th k t l th t f t f tl The key s to realze that f your covarate sgnfcantly predcts Y (above and beyond the group varable), you wll have more power to detect group dfferences

ASSUMPTIONS UNDERLYING THE ANALYSIS OF COVARIANCE

Assumptons Underlyng the Analyss of Covarance Three assumptons n addton to the usual analyss of varance assumptons are:. The assumpton of lnear regresson: The devatons from regresson are normally and ndependently dstrbuted n the populaton, wth means of zero and homogeneous varances Hard to check defntvely 2. The assumpton of homogeneous group regresson coeffcents: The wthn groupsregressoncoeffcent regresson s actually an average of the regresson coeffcents for the respectve treatment groups Can be checked by addng an nteracton term between the covarate and group varable 3. The exact measurement of the covarate: The covarate s measured wthout error Ubqutous term

Checkng the Homogenety of Slopes Assumpton You can easly check the homogenety of slopes assumpton by testng the nteracton between your group IV and the covarate The H model becomes: j β 0 + α j + j ( β ) X j E j Y β X + αβ + If true, t mples, for group : ( β ) [ ( ) ] 0 + α + β + X E Y αβ + And for group 2: ( β ) [ ( ) ] 0 + α 2 + β + X 2 E 2 Y αβ + 2 2 j

In SPSS Does t meet our assumpton of homogenety of slopes?

Wrappng Up Today s class covered a method for controllng for mportant varables n an experment: ANCOVA ANCOVA s a general technque that adds addtonal (contnuous/quanttatve) varables to a model and adjusts for the values of such varables Any ANOVA desgn can nclude such varables

Up Next In lab tonght: How to do ANCOVA n SPSS Homework: Assgned n the mornng, due next week before class Readng for next week: Chapter 6: Wthn Subject Desgns Only two lectures left!