11-2 Goodness of Fit Test

Similar documents
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Odds ratio, Odds ratio test for independence, chi-squared statistic.

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Hypothesis Testing --- One Mean

Goodness of Fit. Proportional Model. Probability Models & Frequency Data

Study Guide for the Final Exam

Is it statistically significant? The chi-square test

Chi-square test Fisher s Exact test

Chapter 23. Two Categorical Variables: The Chi-Square Test

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 12 Part 2. Chi-square test

The Chi-Square Test. STAT E-50 Introduction to Statistics

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

1.5 Oneway Analysis of Variance

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Hypothesis testing - Steps

Elementary Statistics

Solutions to Homework 10 Statistics 302 Professor Larget

Association Between Variables

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Final Exam Practice Problem Answers

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics

3.4 Statistical inference for 2 populations based on two samples

An Automated Test for Telepathy in Connection with s

Math 108 Exam 3 Solutions Spring 00

Elementary Statistics Sample Exam #3

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Regression Analysis: A Complete Example

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Simulating Chi-Square Test Using Excel

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Stats Review Chapters 9-10

Rank-Based Non-Parametric Tests

HYPOTHESIS TESTING: POWER OF THE TEST

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

UNDERSTANDING THE TWO-WAY ANOVA

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Crosstabulation & Chi Square

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

Linear Models in STATA and ANOVA

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Online 12 - Sections 9.1 and 9.2-Doug Ensley

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Introduction to Quantitative Methods

Solutions: Problems for Chapter 3. Solutions: Problems for Chapter 3

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Statistics 2014 Scoring Guidelines

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Mind on Statistics. Chapter 15

Testing Research and Statistical Hypotheses

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Hypothesis Test for Mean Using Given Data (Standard Deviation Known-z-test)

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Module 2 Probability and Statistics

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Statistical tests for SPSS

MBA 611 STATISTICS AND QUANTITATIVE METHODS

The Dummy s Guide to Data Analysis Using SPSS

MATH 140 Lab 4: Probability and the Standard Normal Distribution

How To Test For Significance On A Data Set

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

Independent samples t-test. Dr. Tom Pierce Radford University

MULTIPLE REGRESSION EXAMPLE

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Introduction. Statistics Toolbox

UWW Wireless: Creating Bulk Wireless Guest Accounts

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

1 Basic ANOVA concepts

A) B) C) D)

Variables Control Charts

Chapter 3 RANDOM VARIATE GENERATION

Transcription:

11-2 Goodness of Fit Test In This section we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way frequency table). We will use a hypothesis test for the claim that the observed frequency counts agree with some claimed distribution, so that there is a good fit of the observed data with the claimed distribution. A goodness-of-fit test is used to test the hypothesis that an observed frequency distribution fits (or conforms to) some claimed distribution. H 0 : The random variable follows a particular distribution. H 1 : The random variable does not follow the distribution specified in H 0. Ex 1) Consider the observed frequencies and relative frequencies of browser preference form a survey of 200 Internet users. Browser Observed frequency Relative frequency Microsoft Internet Explorer 140 0.785 Firefox 40 0.15 Safari/other 20 0.065 The following model shows how the market shares are distributed in the null hypothesis: H 0 : P Ms IE = 0.785, P firefox =0.15, P Safari/other = 0.065 H 1: The random variable does not follow the distribution specified in H 0.

How a Goodness- of -Fit Test Works The goodness -of -fit test is based on a comparison of the observed frequencies (actual data from the field) with the expected frequencies when H 0 is true. That is, we compare what we actually see with what would expect to see if H 0 were true. If the difference between the observed and expected frequencies is large, we reject H 0. As usual, it comes down to how large a difference is large. The hypothesis we conduct to answer this question relies on χ 2 distribution. Performing the χ 2 Goodness of Fit Test The following conditions must be met: Finding Expected Frequencies The data have been randomly selected. The sample data consist of frequency counts for each of the different categories. None of the expected frequencies is less than 1. For each category, the expected frequency is at least 5. The expected frequency for a category is the frequency that would occur if the data actually have the distribution that is being claimed. For the i th category, the expected frequency is E i = n. p i, where n represent the number of trials and p i represents the population proportion for the i th category. If we assume that all expected frequencies are equal, then each expected frequency is E = n/k, where n is the total number of observations and k is the number of categories. The χ 2 goodness of fit test may be performed using (a) the critical value, and (b) the p-value method.

(a) χ 2 goodness of fit test. (Critical value method) Step 1: State the hypotheses and check the conditions. The null hypothesis is states that the qualitative random variable follows a particular distribution. The alternative hypothesis states that the random variable does not follow that distribution. Step 2: Find the χ 2 critical value, χ 2 critical, from table A-4 by using k -1 degrees of freedom, where k is the number of categories. Note, Goodness-of-fit hypothesis are always right tailed. And state the rejection rule. Reject if χ 2 data > χ 2 critical. Step 3: Find the test statistic χ 2 data. χ 2 data= O i E 2 i E i Where O i = observed frequency, and E i = expected frequency. Step 4: State the conclusion and the interpretation.

Ex 2) Perform the hypothesis test shown in example 1, use 0.05 as a significance level. H 0 : P Ms IE = 0.785, P firefox =0.15, P Safari/other = 0.065 H 1: The random variable does not follow the distribution specified in H 0. Browser Observed freq Relative freq Category O i p i MS IE 140 0.785 Firefox 40 0.15 Safari/other 20 0.065 Interpretation: There is evidence that the random variable browser does not follow the distribution in H 0. In other words, there is evidence that the market shares for internet browsers have changed.

Note carefully what this conclusion says and what it doesn t say. The χ 2 goodness of fit test shows that there is evidence that the random variable does not follow the distribution specified in H 0. In particular, the conclusion does not state, for example, that Firefox s proportion is significantly greater. ` (b) χ 2 goodness of fit test: (p-value Method) Step 1: State the hypotheses and check the conditions. Step 2: Find the test statistic χ 2 data. χ 2 data= O i E i 2 E i Where O i = observed frequency, and E i = expected frequency Step 3: Find the p-value. p-value = P (χ 2 > χ 2 data) Step 4: State the conclusion and the interpretation. Ex (3) The following tables show figures on the market share of cable modem, DSL, and wireless broadband from a 2002 survey and a 2006 survey which was based on a random sample of 1000 home broadband users. Test whether the population proportions have changed since 2002, using the p-value method, and level of significance is 0.05. 2002 broadband adoption survey Cable modem DSL Wireless/other 67% 28% 5% 2006 broadband adoption survey Cable modem DSL Wireless/other 410 500 90