The Chi-Square Goodness-of-Fit Test, Equal Proportions

Similar documents
Is it statistically significant? The chi-square test

The Chi-Square Test. STAT E-50 Introduction to Statistics

Chapter 5 Analysis of variance SPSS Analysis of variance

8 6 X 2 Test for a Variance or Standard Deviation

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

ABSORBENCY OF PAPER TOWELS

Data Analysis Tools. Tools for Summarizing Data

Chapter 23. Two Categorical Variables: The Chi-Square Test

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

IBM SPSS Statistics for Beginners for Windows

Scatter Plots with Error Bars

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Fixture List 2018 FIFA World Cup Preliminary Competition

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

MTH 140 Statistics Videos

SPSS Tests for Versions 9 to 13

Testing Hypotheses About Proportions

Cruise Line Agencies of Alaska. Cruise Ship Calendar for 2016 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Cruise Line Agencies of Alaska. Cruise Ship Calendar for 2016 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL

One-Way Analysis of Variance (ANOVA) Example Problem

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

DIRECTIONS. Exercises (SE) file posted on the Stats website, not the textbook itself. See How To Succeed With Stats Homework on Notebook page 7!

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Mind on Statistics. Chapter 15

Using SPSS, Chapter 2: Descriptive Statistics

Minitab Tutorials for Design and Analysis of Experiments. Table of Contents

Independent t- Test (Comparing Two Means)

STEP TWO: Highlight the data set, then select DATA PIVOT TABLE

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Statistics 2014 Scoring Guidelines

Analysis of categorical data: Course quiz instructions for SPSS

Table of Contents TASK 1: DATA ANALYSIS TOOLPAK... 2 TASK 2: HISTOGRAMS... 5 TASK 3: ENTER MIDPOINT FORMULAS... 11

In this example, Mrs. Smith is looking to create graphs that represent the ethnic diversity of the 24 students in her 4 th grade class.

When to use Excel. When NOT to use Excel 9/24/2014

Analysis of Variance. MINITAB User s Guide 2 3-1

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Gestation Period as a function of Lifespan

Two Correlated Proportions (McNemar Test)

Federal Employee Viewpoint Survey Online Reporting and Analysis Tool

SPSS 3: COMPARING MEANS

Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Assessing Measurement System Variation

Gage Studies for Continuous Data

TIPS FOR DOING STATISTICS IN EXCEL

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

How To Run Statistical Tests in Excel

How to set the main menu of STATA to default factory settings standards

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise #5 Analysis of Time of Death Data for Soldiers in Vietnam

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

How To Check For Differences In The One Way Anova

Use of deviance statistics for comparing models

Budgeting in QuickBooks

Regression step-by-step using Microsoft Excel

Main Effects and Interactions

STATISTICA Formula Guide: Logistic Regression. Table of Contents

An introduction to IBM SPSS Statistics

Descriptive Statistics

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Cruise Line Agencies of Alaska. Cruise Ship Calendar for 2012 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL

Information Technology

Factors affecting online sales

Cruise Line Agencies of Alaska. Cruise Ship Calendar for 2013 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL

Solutions to Homework 10 Statistics 302 Professor Larget

People like to clump things into categories. Virtually every research

Getting Started with Minitab 17

Elementary Statistics

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Modifying Colors and Symbols in ArcMap

Tutorial Segmentation and Classification

Using Excel for Statistics Tips and Warnings

MBA 611 STATISTICS AND QUANTITATIVE METHODS

Using Microsoft Excel to Analyze Data

Fairfield Public Schools

Using Microsoft Excel to Analyze Data from the Disk Diffusion Assay

Module 4 (Effect of Alcohol on Worms): Data Analysis

Binary Diagnostic Tests Two Independent Samples

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

3.4 Statistical inference for 2 populations based on two samples

Charts, Tables, and Graphs

Simulating Chi-Square Test Using Excel

Drawing a histogram using Excel

Recall this chart that showed how most of our course would be organized:

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

Introduction Course in SPSS - Evening 1

Table of Contents. Preface

Adverse Impact and Test Validation Book Series: Multiple Regression. Introduction. Comparison of Compensation using

Directions for using SPSS

An SPSS companion book. Basic Practice of Statistics

Section 12 Part 2. Chi-square test

Appendix 2.1 Tabular and Graphical Methods Using Excel

2 Sample t-test (unequal sample sizes and unequal variances)

Transcription:

Chapter 11 Chi-Square Tests 1 Chi-Square Tests Chapter 11 The Chi-Square Goodness-of-Fit Test, Equal Proportions A hospital wants to know if the proportion of births are the same for each day of the week. In a random sample of 100 births, the number on each day of the week is shown below: Day Monday Tuesday Wednesday Thursday Friday Saturday Sunday Number Births 32 16 7 20 11 8 6 To enter the raw data, in column C1 enter 32 Monday rows, 16 Tuesday rows, 7 Wednesday rows, and so on, until you have 100 rows corresponding to each birth.

Chapter 11 Chi-Square Tests 2 To perform the Goodness of Fit Test to determine if the proportion of each day of the week is the same, click on Stat Tables Chi-square Goodness of Fit Test (One Variable). Click on the circle next to Categorical data and select the data in column C1. Select the circle to test Equal proportions. Click OK. Two graphs will be displayed. The first is a Chart of Contribution to the Chi-Square Value by Category. These are the values of ( O E) E graph by clicking on the X in the right corner of the window. 2 for each category. Close this

Chapter 11 Chi-Square Tests 3 The second graph that is displayed is the Chart of Observed and Expected Values for each category. Close this graph by clicking on the X in the right corner of the window. The results of the Chi-Square Goodness of Fit Test will be displayed in the Session 2 window. is 36.5, and the p-value is 0.000. Therefore, for any level of 0.000 the correct decision would be to reject the null hypothesis.,

Chapter 11 Chi-Square Tests 4 The summarized data may also be entered into Minitab. In a Minitab worksheet, enter each Day of the Week in column C1 and the Number of Births for each day in column C2. To perform the Goodness of Fit Test that the proportion of each day of the week is the same, click on Stat Tables Chi-square Goodness of Fit Test (One Variable). Click on the circle next to Observed counts and select the data in column C2. In the Category name field, select the data in column C1. Select the circle to test Equal proportions. Click OK. The two graphs will be displayed, and the results of the Chi-Square Goodness of Fit Test will be displayed in the Session window.

Chapter 11 Chi-Square Tests 5 The Chi-Square Goodness-of-Fit Test, Unequal Proportions Using the above data for the number of births on each day of the week, assume the hospital instead wants to test that the proportion of births on Mondays is twice that of every other day of the week. This means that the proportion of births on Mondays is 0.25 and each of the other days is 0.125. The data may be entered in either raw format or summary format, as described above. For this example, the summary format will be shown. To perform the Goodness of Fit Test for these proportions, click on Stat Tables Chi-square Goodness of Fit Test (One Variable). Click on the circle next to Observed counts and select the data in column C2. In the Category name field, select the data in column C1. Select the circle to test Specific proportions. In the pulldown, select the option to Input Constants. A table will appear on the right side of the window, with each of the Category Names displayed in the first column. In the proportions column, enter the test proportions as described above. Click OK.

Chapter 11 Chi-Square Tests 6 The two graphs will be displayed, and the results of the Chi-Square Goodness of Fit Test 2 will be displayed in the Session window. is 15.04, and the p-value is 0.020. Therefore, for any level of 0.020, the correct decision would be to reject the null hypothesis.

Chapter 11 Chi-Square Tests 7 The Chi-Square Test of Independence or Homogeneity A random sample of 100 adult Americans was selected and they were asked if they favor higher taxes in return for government-funded healthcare. The data is shown below. In Favor Against No Opinion Males 16 30 6 Females 26 18 4 To enter the raw data, label column C1 Gender and label column C2 Opinion. Enter the 100 rows of data. There should be 16 rows of data with a gender of Male and an Opinion of In Favor, 26 rows of data with a Gender of Female and an Opinion of In Favor, and so forth.

Chapter 11 Chi-Square Tests 8 To perform the Chi-Square Independence test, click on Stat Tables Cross Tabulation and Chi-square. In the Categorical Variables fields, select column C1 for rows and column C2 for columns. Click the Chi-Square button. Select the checkbox next to Chi-square analysis. Click OK. Click OK on the Cross Tabulation and Chi-Square window.

Chapter 11 Chi-Square Tests 9 The results of the Chi-Square Test will be displayed in the Session window. 2 is 5.630, and the p-value is 0.060. Therefore, for any level of 0.060, the correct decision would be to reject the null hypothesis.

Chapter 11 Chi-Square Tests 10 The summarized data may also be entered into Minitab. In a Minitab worksheet, enter the values of each cell from the table into columns C1-C3. There is no need to label the columns or the rows. To perform the chi-square independence test, click on Stat Tables Chi-square Test (Two-Way Table in Worksheet). In the Columns containing the table field, select columns C1, C2, and C3. Click OK.

Chapter 11 Chi-Square Tests 11 The results of the Chi-Square Test will be displayed in the Session window. 2 is 5.630, and the p-value is 0.060. Therefore, for any level of 0.060, the correct decision would be to reject the null hypothesis.

Chapter 11 Chi-Square Tests 12 Inferences Concerning the Population Variance A machine that fills packages of pretzels is set up so that the average net weight of a package is 32 ounces with a variance of.02 square ounces. An acceptable variance is.02 sq. ounces or less. A sample of 35 packages from the production line gave a sample variance of.0316. The weight of each package in the sample is shown below: 31.9925 32.1883 32.0588 32.0574 31.9404 32.2078 32.0573 31.7231 31.8077 32.1175 31.7734 31.7593 31.9207 32.0505 32.1513 32.3108 31.7970 31.5387 31.8856 32.0117 31.9628 31.9412 32.0298 32.0125 31.7906 32.1660 32.0081 31.6126 32.1061 31.8992 31.9314 32.2252 32.1494 32.1850 31.9980 Confidence Interval for the Population Variance To analyze the data in raw format, enter the data points into column C1 of a Minitab worksheet.

Chapter 11 Chi-Square Tests 13 To construct a 99% confidence interval for the variance, click on Stat Basic Statistics 1 Variance. In the Data pulldown, select the option Samples in columns. In the Columns field, select the data in column C1. Select the Options button. Enter 99 in the Confidence Level field. Make sure the Alternative field has the not equal option selected. Click OK. Click OK on the 1 Variance window.

Chapter 11 Chi-Square Tests 14 The results will be displayed in the Session window. The 99% confidence interval for the variance (assuming a normal distribution) is (0.0182, 0.0651).

Chapter 11 Chi-Square Tests 15 In the above example, the data was entered in raw format. If instead you have the summary data, click on Stat Basic Statistics 1 Variance. In the pulldown, select Sample Variance. Enter a Sample size of 35 and a Sample variance of 0.0316. Select the Options button. Enter 99 in the Confidence Level field. Make sure the Alternative field has the not equal option selected. Click OK. Click OK on the 1 Variance window. The results will be displayed in the Session window. The 99% confidence interval for the variance (assuming a normal distribution) is (0.0182, 0.0651).

Chapter 11 Chi-Square Tests 16 Hypothesis Test for the Population Variance Using the example from above regarding the weight of the pretzel packages, assume that if the variance is greater than.02, the machine filling the bags needs to be adjusted. Perform the hypothesis test to determine if the machine needs to be adjusted. To analyze the data in raw format, enter the data points into column C1 of a Minitab worksheet.

Chapter 11 Chi-Square Tests 17 To test the hypothesis 2 H O : 0.02 H 2 1 : 0.02 click on Stat Basic Statistics 1 Variance. In the Data pulldown, select Samples in columns. In the Columns field, select the data in column C1. Select the checkbox to Perform hypothesis test. In the pulldown, select the Hypothesized variance and enter a Value of 0.02. Select the Options button. The Confidence Level field will be defaulted to 95.0, for a 95% confidence interval. This will not need to be edited this unless you would also like to perform a different confidence interval at this time. This will not influence the results of the hypothesis test. Since the alternative hypothesis is greater than for this example, make sure the Alternative field has the greater than option selected. Click OK. Click OK on the 1 Variance window.

Chapter 11 Chi-Square Tests 18 The results will be displayed in the Session window. In the test of Sigma-squared = 2 0.02 vs > 0.02, is 53.70, and the p-value is 0.017. Therefore, for any level of 0.017, the correct decision would be to not reject the null hypothesis.

Chapter 11 Chi-Square Tests 19 In the above example, the data was entered in raw format. If instead you have the summary data, click on Stat Basic Statistics 1 Variance. In the pulldown, select Sample Variance. Enter a Sample size of 35 and a Sample variance of 0.0316. Select the checkbox to Perform hypothesis test. In the pulldown, select the Hypothesized variance and enter a Value of 0.02. Select the Options button. The Confidence Level field will be defaulted to 95.0, for a 95% confidence interval. This will not need to be edited this unless you would also like to perform a different confidence interval at this time. This will not influence the results of the hypothesis test. Since the alternative hypothesis is greater than for this example, make sure the Alternative field has the greater than option selected. Click OK. Click OK on the 1 Variance window. The results will be displayed in the Session window. In the test of Sigma-squared = 2 0.02 vs > 0.02, is 53.70, and the p-value is 0.017. Therefore, for any level of 0.017, the correct decision would be to not reject the null hypothesis.

Chapter 11 Chi-Square Tests 20 Suggested Exercises Section 11.2 11.14, 11.15, 11.17, 11.18, 11.19, 11.22 Section 11.3 11.28, 11.29, 11.31, 11.32, 11.37, 11.38 Section 11.4 11.46, 11.47, 11.48, 11.49 Supplementary Exercises 11.50, 11.51, 11.53, 11.54, 11.58, 11.69, 11.72, 11.73, 11.75 Technology Assignments TA 11.1, TA 11.2, TA 11.3