Chapter 11. Chapter 11 Overview. Chapter 11 Objectives 11/24/2015. Other Chi-Square Tests

Similar documents
The Chi-Square Test. STAT E-50 Introduction to Statistics

Elementary Statistics

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Is it statistically significant? The chi-square test

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Solutions to Homework 10 Statistics 302 Professor Larget

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Chi-square test Fisher s Exact test

Study Guide for the Final Exam

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Chapter 23. Two Categorical Variables: The Chi-Square Test

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Descriptive Analysis

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Simulating Chi-Square Test Using Excel

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Nonparametric Tests. Chi-Square Test for Independence

Section 12 Part 2. Chi-square test

Association Between Variables

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Math 108 Exam 3 Solutions Spring 00

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

Statistical tests for SPSS

Goodness of Fit. Proportional Model. Probability Models & Frequency Data

8 6 X 2 Test for a Variance or Standard Deviation

Lesson 3: Calculating Conditional Probabilities and Evaluating Independence Using Two-Way Tables

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Statistics 2014 Scoring Guidelines

Chapter 23 Inferences About Means

Crosstabulation & Chi Square

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Section 13, Part 1 ANOVA. Analysis Of Variance

Table 1: Profile of Consumer Particulars Classification Numbers Percentage Upto Age. 21 to Above

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

3.4 Statistical inference for 2 populations based on two samples

STAT 145 (Notes) Al Nosedal Department of Mathematics and Statistics University of New Mexico. Fall 2013

UNDERSTANDING THE TWO-WAY ANOVA

Descriptive Statistics

3. Analysis of Qualitative Data

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Mind on Statistics. Chapter 15

Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate

Independent t- Test (Comparing Two Means)

The Analysis of Variance ANOVA

This chapter discusses some of the basic concepts in inferential statistics.

CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V

Chapter 7 Probability. Example of a random circumstance. Random Circumstance. What does probability mean?? Goals in this chapter

Topic 8. Chi Square Tests

Chapter 7 Section 7.1: Inference for the Mean of a Population

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

12: Analysis of Variance. Introduction

Clocking In Facebook Hours. A Statistics Project on Who Uses Facebook More Middle School or High School?

9 Testing the Difference

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

Difference of Means and ANOVA Problems

Lesson 17: Margin of Error When Estimating a Population Proportion

Independent samples t-test. Dr. Tom Pierce Radford University

Fairfield Public Schools

An introduction to IBM SPSS Statistics

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

Chi Square Distribution

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

Chapter 5 Analysis of variance SPSS Analysis of variance

A Study On Customer Purchase Behaviourtowards Mobile Phone With Special Reference To Erode City

Nonparametric Statistics

Linear Models in STATA and ANOVA

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

One-Way Analysis of Variance (ANOVA) Example Problem

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION

Mind on Statistics. Chapter 13

Additional sources Compilation of sources:

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Blood Type Probability O 0.42 A 0.43 B 0.11 AB 0.04

The Dummy s Guide to Data Analysis Using SPSS

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

CHAPTER 12. Chi-Square Tests and Nonparametric Tests LEARNING OBJECTIVES. USING T.C. Resort Properties

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

Unit 26 Estimation with Confidence Intervals

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

Transcription:

11/4/015 Chapter 11 Overview Chapter 11 Introduction 11-1 Test for Goodness of Fit 11- Tests Using Contingency Tables Other Chi-Square Tests McGraw-Hill, Bluman, 7th ed., Chapter 11 1 Bluman, Chapter 11 Chapter 11 Objectives 1. Test a distribution for goodness of fit, using chi-square.. Test two variables for independence, using chi-square. 3. Test proportions for homogeneity, using chi-square. Characteristics of the Chi-Square Distribution 1. It is not symmetric. Bluman, Chapter 11 3 Characteristics of the Chi-Square Distribution 1. It is not symmetric.. The shape of the chi-square distribution depends upon the degrees of freedom, just like Student s t-distribution. Characteristics of the Chi-Square Distribution 1. It is not symmetric.. The shape of the chi-square distribution depends upon the degrees of freedom, just like Student s t-distribution. 3. As the number of degrees of freedom increases, the chi-square distribution becomes more symmetric as is illustrated in Figure 1. 1

11/4/015 Characteristics of the Chi-Square Distribution 1. It is not symmetric.. The shape of the chi-square distribution depends upon the degrees of freedom, just like Student s t-distribution. 3. As the number of degrees of freedom increases, the chi-square distribution becomes more symmetric as is illustrated in Figure 1. 4. The values are non-negative. That is, the values of are greater than or equal to 0. The Chi-Square Distribution 11.1 Test for Goodness of Fit Reading the Table The chi-square statistic can be used to see whether a frequency distribution fits a specific pattern. This is referred to as the chi-square goodness-of-fit test. Find the value of for 7 df and an area of 0.10 in the right tail of the chi-square distribution curve. Bluman, Chapter 11 9 Bluman, Chapter 11 10 Reading the Table Find the value of for 1 df and an area of 0.05 in the left tail of the chi-square distribution curve. Test for Goodness of Fit Formula for the test for goodness of fit: O where d.f. = k 1 ; number of categories minus 1 O = observed frequency = expected frequency Bluman, Chapter 11 11 Bluman, Chapter 11 1

11/4/015 Assumptions for Goodness of Fit 1. The data are obtained from a random sample.. The expected frequency for each category must be 5 or more. Bluman, Chapter 11 13 xample 1: Fruit Soda Flavors A market analyst wished to see whether consumers have any preference among five flavors of a new fruit soda. A sample of 100 people provided the following data. Is there enough evidence to reject the claim that there is no preference in the selection of fruit soda flavors, using the data shown previously? Let α = 0.05. Cherry Strawberry Orange Lime Grape Observed 3 8 16 14 10 xpected 0 0 0 0 0 H 0 : Consumers show no preference (claim). (H 0 : p 1 = p = p 3 = p 4 = p 5 ) H 1 : Consumers show a preference. (H 1 : At least of the 5 proportions are 0.0) Bluman, Chapter 11 14 xample 1: Fruit Soda Flavors Cherry Strawberry Orange Lime Grape Observed 3 8 16 14 10 xpected 0 0 0 0 0 xample 1: Fruit Soda Flavors The decision is to reject the null hypothesis, since 18.0 > 9.488. Step : Find the critical value. D.f. = 5 1 = 4, and α = 0.05. CV = 9.488. O 3 0 8 0 16 0 14 0 0 0 0 0 10 0 18.0 0 Bluman, Chapter 11 15 There is enough evidence to reject the claim that consumers show no preference for the flavors. Bluman, Chapter 11 16 xample 1: Using your graphing calc. Cherry Strawberry Orange Lime Grape Observed 3 8 16 14 10 xpected 0 0 0 0 0 Step : nter the observed values into L 1 nter the expected values into L calculate first and enter or enter as 100*. Step 3: STATS TST- D: GOF Test Observed: L 1 xpected: L df: 4 Calculate This should be displayed on your screen: =18 p =.00134098 df = 4 and CNTRB= (7. 3. Bluman, Chapter 11 17 xample : Retirees The Russel Reynold Association surveyed retired senior executives who had returned to work. They found that after returning to work, 38% were employed by another organization, 3% were self-employed, 3% were either freelancing or consulting, and 7% had formed their own companies. To see if these percentages are consistent with those of Allegheny County residents, a local researcher surveyed 300 retired executives who had returned to work and found that 1 were working for another company, 85 were self-employed, 76 were either freelancing or consulting, and 17 had formed their own companies. At α = 0.10, test the claim that the percentages are the same for those people in Allegheny County. Bluman, Chapter 11 18 3

11/4/015 xample : Retirees New Self- mployed Owns Observed 1 85 76 17 xpected.38(300)= 114.3(300)= 96.3(300)= 69.07(300)= 1 H 0 : The retired executives who returned to work are distributed as follows: 38% are employed by another organization, 3% are self-employed, 3% are either freelancing or consulting, and 7% have formed their own companies (claim). (H 0 : p 1 =.38, p =.3 p 3 =.3 p 4 =.07) H 1 : The distribution is not the same as stated in the null hypothesis. (H 1 : At least one of the percentages is not as stated.) Bluman, Chapter 11 19 xample : Retirees New Freelancing Self- mployed Owns Observed 1 85 76 17 xpected.38(300)= 114.3(300)= 96.3(300)= 69 Step : Find the critical value. D.f. = 4 1 = 3, and α = 0.10. CV = 6.51..07(300)= 1 O 1 114 85 96 76 69 17 1 114 96 69 1 3.939 Bluman, Chapter 11 0 xample : Retirees Since 3.939 < 6.51, the decision is not to reject the null hypothesis. There is not enough evidence to reject the claim. It can be concluded that the percentages are not significantly different from those given in the null hypothesis. Bluman, Chapter 11 1 xample : Graphing Calculator New Freelancing Self- mployed Freelancing Owns Observed 1 85 76 17.38(300)=.3(300)=.3(300)= xpected 114 96 69 Step : nter the observed values into L 1 nter the expected values into L Step 3: STATS TST- D: GOF Test Observed: L 1 xpected: L df: 3 Calculate This should be displayed on your screen: =3.93869865 p =.3484967603 df = 3 and CNTRB= (.561403.07(300)= 1 Bluman, Chapter 11 xample 11-3: ATM Usage (p. 495) A bank has an ATM installed inside the bank, and it is available to its customers only from 7 AM to 6 PM Mon Fri. The manager of the bank wanted to investigate if the percentage of transactions made on the ATM is the same for each of the five days (M F) of the week. She randomly selected one week and counted the number of transactions made on this ATM on each of the 5 days. At α = 0.01, can we reject the null hypothesis that the proportion of people who use this ATM each of the 5 days is the same? (Assume this week is typical of all weeks regarding ATM use) DAY # of users Mon Tues Wed Thurs Fri 53 197 04 79 67 xample 11-3: ATM Usage DAY # of users Mon Tues Wed Thurs Fri 53 197 04 79 67 H 0 : The proportion of people using the ATM is the same for all 5 days of the week. (H 0 : p 1 = p = p 3 = p 4 = p 5 ) H 1 : The distribution is not the same as stated in the null hypothesis. (H 1 : At least of the 5 proportions are 0.0) Bluman, Chapter 11 3 Bluman, Chapter 11 4 4

11/4/015 xample 11-3: ATM Usage DAY Mon Tues Wed Thurs Fri # of users 53 197 04 79 67 xample 11-3: ATM Usage Reject the null hypothesis, since 3.184 > 13.77. Step : Find the critical value. Df = k - 1 D.f. = 5 1 = 4, and α = 0.01. CV = 13.77. O 53 40 197 40 04 40 97 40 67 40 40 40 40 40 40 = 3.183 13.77 3.184 There is enough evidence to reject the claim that the proportion of people using the ATM is the same for all 5 days of the week. Bluman, Chapter 11 5 Bluman, Chapter 11 6 11.3 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using the chi-square test. The test of independence of variables is used to determine whether two variables are independent of or related to each other when a single sample is selected. The test of homogeneity of proportions is used to determine whether the proportions for a variable are equal when several samples are selected from different populations. Test for Independence The chi-square goodness-of-fit test can be used to test the independence of two variables. The hypotheses are: H 0 : There is no relationship between two variables. H 1 : There is a relationship between two variables. If the null hypothesis is rejected, there is some relationship between the variables. Bluman, Chapter 11 7 Bluman, Chapter 11 8 Test for Independence In order to test the null hypothesis, one must compute the expected frequencies, assuming the null hypothesis is true. When data are arranged in table form for the independence test, the table is called a contingency table. Contingency Tables The degrees of freedom for any contingency table are d.f. = (rows 1) (columns 1) = (R 1)(C 1). Bluman, Chapter 11 9 Bluman, Chapter 11 30 5

11/4/015 Test for Independence The formula for the test for independence: O where d.f. = (R 1)(C 1) O = observed frequency row sumcolumn sum = expected frequency = grand total xample 5: College ducation and A sociologist wishes to see whether the number of years of college a person has completed is related to her or his place of residence. A sample of 88 people is selected and classified as shown. At α = 0.05, can the sociologist conclude that a person s location is dependent on the number of years of college? Location No College Four-Year Degree Advanced Degree Total Urban 15 1 8 35 Suburban 8 15 9 3 Rural 6 8 7 1 Total 9 35 4 88 Bluman, Chapter 11 31 Bluman, Chapter 11 3 xample 5: College ducation and H 0 : A person s place of residence is independent of the number of years of college completed. H 1 : A person s place of residence is dependent on the number of years of college completed (claim). Step : Find the critical value. The critical value is 9.488, since the degrees of freedom are (3 1)(3 1) = 4. xample 5: College ducation and Compute the expected values. row sumcolumn sum grand total Location Urban Suburba n Rural No College Four-Year Degree Advanced Degree 15 1 8 (11.53) (13.9) (9.55) 8 15 9 (10.55) (1.73) (8.73) 6 8 7 (6.9) (8.35) (5.73) 359 11.53 88 Total Total 9 35 4 88 1,1 35 3 1 Bluman, Chapter 11 33 Bluman, Chapter 11 34 xample 5: College ducation and O 15 11.53 1 13.9 8 9.55 11.53 13.9 9.55 8 10.55 15 1.73 9 8.73 10.55 1.73 8.73 6 6.9 8 8.35 7 5.73 6.9 8.35 5.73 3.01 Bluman, Chapter 11 35 xample 5: College ducation and Do not reject the null hypothesis, since 3.01<9.488. A person s place of residence is independent of the number of years of college completed. Bluman, Chapter 11 36 6

11/4/015 Graphing Calculator Press nd x -1 for MATRIX and move the cursor to edit, then press enter nter the number of rows & columns then enter nter the values in the matrix as they appear in the contingency table Press STAT TST C: Test Make sure the observed matrix is [A] and the expected matrix is [B] Calculate, enter Bluman, Chapter 11 37 You should see Test = 3.00553884 P =.5568989167 df = 4 Bluman, Chapter 11 38 xample6: Alcohol and Gender A researcher wishes to determine whether there is a relationship between the gender of an individual and the amount of alcohol consumed. A sample of 68 people is selected, and the following data are obtained. At α = 0.10, can the researcher conclude that alcohol consumption is related to gender? Gender Alcohol Consumption Low Moderate High Total Male 10 9 8 7 Female 13 16 1 41 Total 3 5 0 68 Bluman, Chapter 11 39 Graphing Calculator Press nd x -1 for MATRIX and move the cursor to edit, then press enter nter the number of rows & columns then enter nter the values in the matrix as they appear in the contingency table Press STAT TST C: Test Make sure the observed matrix is [A] and the expected matrix is [B] Calculate, enter Bluman, Chapter 11 40 xample 6: Alcohol and Gender H 0 : The amount of alcohol that a person consumes is independent of the individual s gender. H 1 : The amount of alcohol that a person consumes is dependent on the individual s gender (claim). Step : Find the critical value. The critical value is 9.488, since the degrees of freedom are (3 1 )(3 1) = ()() = 4. xample 6: Alcohol and Gender Compute the expected values. row sumcolumn sum grand total Gender Male Female Alcohol Consumption Low Moderate High 10 9 8 (9.13) (9.93) (7.94) 13 16 1 (13.87) (15.07) (1.06) 73 9.13 68 Total Total 3 5 0 68 1,1 7 41 Bluman, Chapter 11 41 Bluman, Chapter 11 4 7

11/4/015 xample 6: Alcohol and Gender O 10 9.13 9 9.93 8 7.94 9.13 9.93 7.94 13 13.87 16 15.07 1 1.06 13.87 15.07 1.06 0.83 Bluman, Chapter 11 43 xample 6: Alcohol and Gender Do not reject the null hypothesis, since 0.83 < 4.605.. There is not enough evidence to support the claim that the amount of alcohol a person consumes is dependent on the individual s gender. Bluman, Chapter 11 44 Test for Homogeneity of Proportions Homogeneity of proportions test is used when samples are selected from several different populations and the researcher is interested in determining whether the proportions of elements that have a common characteristic are the same for each population. Test for Homogeneity of Proportions The hypotheses are: H 0 : p 1 = p = p 3 = = p n H 1 : At least one proportion is different from the others. When the null hypothesis is rejected, it can be assumed that the proportions are not all equal. Bluman, Chapter 11 45 Bluman, Chapter 11 46 Assumptions for Homogeneity of Proportions 1. The data are obtained from a random sample.. The expected frequency for each category must be 5 or more. xample 7: Lost Luggage A researcher selected 100 passengers from each of 3 airlines and asked them if the airline had lost their luggage on their last flight. The data are shown in the table. At α = 0.05, test the claim that the proportion of passengers from each airline who lost luggage on the flight is the same for each airline. Airline 1 Airline Airline 3 Total Yes 10 7 4 1 No 90 93 96 79 Total 100 100 100 300 Bluman, Chapter 11 47 Bluman, Chapter 11 48 8

11/4/015 Graphing Calculator Press nd x -1 for MATRIX and move the cursor to edit, then press enter nter the number of rows & columns then enter nter the values in the matrix as they appear in the contingency table Press STAT TST C: Test Make sure the observed matrix is [A] and the expected matrix is [B] Calculate, enter Bluman, Chapter 11 49 xample 7: Lost Luggage Step 1: State the hypotheses. H 0 : p 1 = p = p 3 = = p n H 1 : At least one mean differs from the other. Step : Find the critical value. The critical value is 5.991, since the degrees of freedom are ( 1 )(3 1) = (1)() =. Bluman, Chapter 11 50 xample 7: Lost Luggage Compute the expected values. row sumcolumn sum grand total Yes 1,1 1100 7 300 Airline 1 Airline Airline 3 Total 10 7 4 (7) (7) (7) 90 93 96 No 79 (93) (93) (93) Total 100 100 100 300 1 xample 7: Luggage O 10 7 7 7 4 7 7 7 7 90 93 93 93 96 93 93 93 93.765 Bluman, Chapter 11 51 Bluman, Chapter 11 5 xample 7: Lost Luggage Do not reject the null hypothesis, since.765 < 5.991. There is no difference in the proportions of the luggage lost by each airline. Bluman, Chapter 11 53 9