MATH 10: Elementary Statistics and Probability Chapter 11: The Chi-Square Distribution

Similar documents
Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

Topic 8. Chi Square Tests

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Chapter 23. Two Categorical Variables: The Chi-Square Test

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Is it statistically significant? The chi-square test

The Chi-Square Test. STAT E-50 Introduction to Statistics

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Chi-square test Fisher s Exact test

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Section 12 Part 2. Chi-square test

Chi Square Distribution

Elementary Statistics Sample Exam #3

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Elementary Statistics

Odds ratio, Odds ratio test for independence, chi-squared statistic.

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

Simulating Chi-Square Test Using Excel

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Chapter 19 The Chi-Square Test

Mind on Statistics. Chapter 15

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Chi Square Tests. Chapter Introduction

First-year Statistics for Psychology Students Through Worked Examples

Distribution is a χ 2 value on the χ 2 axis that is the vertical boundary separating the area in one tail of the graph from the remaining area.

Goodness of Fit. Proportional Model. Probability Models & Frequency Data

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

People like to clump things into categories. Virtually every research

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Elements of statistics (MATH0487-1)

Math 108 Exam 3 Solutions Spring 00

Study Guide for the Final Exam

Testing Research and Statistical Hypotheses

Testing differences in proportions

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Recall this chart that showed how most of our course would be organized:

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

1.5 Oneway Analysis of Variance

Lecture Notes Module 1

One-Way Analysis of Variance

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Biostatistics: Types of Data Analysis

CHAPTER 5 COMPARISON OF DIFFERENT TYPE OF ONLINE ADVERTSIEMENTS. Table: 8 Perceived Usefulness of Different Advertisement Types

1 Basic ANOVA concepts

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Solutions to Homework 10 Statistics 302 Professor Larget

Crosstabulation & Chi Square

Chapter 23 Inferences About Means

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

One-Way Analysis of Variance (ANOVA) Example Problem

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

IB Practice Chi Squared Test of Independence

Introduction to General and Generalized Linear Models

Association Between Variables

Multivariate Analysis of Variance (MANOVA): I. Theory

Chapter 7. One-way ANOVA

Stats Review Chapters 9-10

STAT 145 (Notes) Al Nosedal Department of Mathematics and Statistics University of New Mexico. Fall 2013

Evaluation of Total Quality Management (TQM) Application in the Nigerian Telecommunication Industry: A Case study of Imo NITEL Owerri

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Normality Testing in Excel

Additional sources Compilation of sources:

DATA INTERPRETATION AND STATISTICS

3. Analysis of Qualitative Data

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

This chapter discusses some of the basic concepts in inferential statistics.

Using Stata for Categorical Data Analysis

Final Exam Practice Problem Answers

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050

Descriptive Statistics

Online 12 - Sections 9.1 and 9.2-Doug Ensley

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

Chapter 7 Section 7.1: Inference for the Mean of a Population

individualdifferences

Tests of Hypotheses Using Statistics

A and B This represents the probability that both events A and B occur. This can be calculated using the multiplication rules of probability.

5 Cumulative Frequency Distributions, Area under the Curve & Probability Basics 5.1 Relative Frequencies, Cumulative Frequencies, and Ogives

Chapter 3 RANDOM VARIATE GENERATION

CHAPTER 14 NONPARAMETRIC TESTS

Fat Content in Ground Meat: A statistical analysis

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Exact Confidence Intervals

13: Additional ANOVA Topics. Post hoc Comparisons

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Name: Date: Use the following to answer questions 3-4:

Information Technology and the Business Communities: A Case Study of Smallscale Business Enterprises in Nigeria

Transcription:

MATH 10: Elementary Statistics and Probability Chapter 11: The Chi-Square Distribution Tony Pourmohamad Department of Mathematics De Anza College Spring 2015

Objectives By the end of this set of slides, you should be able to: 1 Learn about the chi-square distribution 2 Conduct and interpret Goodness-of-Fit Test 3 Conduct and interpret tests of independence 4 Conduct and interpret test for homogeneity 2 / 15

The Chi-Square Distribution The major characteristics of the chi-square distribution are: It is positively skewed The distribution is not symmetric It contains only positive values It is based on degrees of freedom When the degrees of freedom change a new distribution is created An example of the chi-square distribution Density 0.00 0.05 0.10 0.15 0 2 4 6 8 10 3 / 15

The Chi-Square Distribution Some more chi-square distributions Chi Square Distributions Density 0.0 0.1 0.2 0.3 0.4 0.5 df = 1 df = 2 df = 3 df = 4 0 2 4 6 8 10 x 4 / 15

The Chi-Square Distribution The notation for a chi-square random variable as df is the degrees of freedom X χ 2 df The chi-square table can be found on the course webpage under handouts Let s take look at the table to understand it 5 / 15

Goodness-of-Fit Test In this type of hypothesis test, you determine whether the data "fit" a particular distribution or not You will use a chi-square test to determine if there is a fit or not H 0 : The observed frequency distribution is the same as the hypothesized frequency distribution H a : The observed and hypothesized frequency distributions are different Example: # of Absences Expected # of Absences Actual # of Absences 0-2 50 35 3-5 30 40 6-8 12 20 9-12 6 1 12+ 2 4 6 / 15

Goodness-of-Fit Test The test statistic for a goodness-of-fit test is the following x 2 = k (O i E i ) 2 E i i=1 O is the observed frequency E is the expected frequency k is the number of different categories or outcomes The critical value is a chi-square value with (k 1) degrees of freedom, where k is the number of categories or outcomes 7 / 15

Goodness-of-Fit Test We can calculate the test statistic and then compare it to the critical value If the test statistic is inside the critical region then we reject H 0 If the test statistic is outside the critical region then we fail to reject H 0 Let s take a look at handout 9 for examples 8 / 15

Test of Independence Tests of independence involve using a contingency table of observed data values A test of independence tests the null hypothesis that there is no association between the row variable and the column variable in the contingency table H 0 : The row and column variables are independent H a : The row and column variables are not independent Recall what a contingency table looks like Lunger Cancer No Lung Cancer Row Total Smoker 70 20 90 Non-Smoker 5 5 10 Column Total 75 25 100 9 / 15

Test of Independence The test statistic for a test of independence is the following x 2 = O is the observed frequency E is the expected frequency r c i=1 r is the number of rows of the table (O i E i ) 2 c is the number of columns of the table E i The critical value is a chi-square value with (r 1)(c 1) degrees of freedom 10 / 15

Test of Independence We can calculate the test statistic and then compare it to the critical value If the test statistic is inside the critical region then we reject H 0 If the test statistic is outside the critical region then we fail to reject H 0 Important: For a contingency table E = (row total)(column total) (grand total) Let s take a look at handout 9 for examples 11 / 15

Test for Homogeneity Tests of homogeneity involve using a contingency table of observed data values Used to test whether two populations have the same distribution of some characteristic In a test of homogeneity, we test the claim that different populations have the same proportion characteristics H 0 : The distribution of the two populations are the same H a : The distribution of the two populations are different Brown Eye Blue Eye Green Eye Row Total Smoker 70 20 10 100 Non-Smoker 5 5 5 15 Column Total 75 25 15 115 12 / 15

Test for Homogeneity The test statistic for a test of homogeneity is the following x 2 = O is the observed frequency E is the expected frequency r c i=1 r is the number of rows of the table (O i E i ) 2 c is the number of columns of the table E i The critical value is a chi-square value with c 1 degrees of freedom 13 / 15

Test for Homogeneity We can calculate the test statistic and then compare it to the critical value If the test statistic is inside the critical region then we reject H 0 If the test statistic is outside the critical region then we fail to reject H 0 Important: For a contingency table E = (row total)(column total) (grand total) Let s take a look at handout 9 for examples 14 / 15

Summary of Tests Goodness-of-Fit: Use to decide whether a population with an unknown distribution "fits" a known distribution H 0 : The population fits the given distribution H a : The population does not fit the given distribution Independence: Use to decide whether two variables are independent or dependent H 0 : The two variables are independent H a : The two variables are dependent Homogeneity: Use to decide if two populations with unknown distributions have the same distribution as each other H 0 : The two populations follow the same distribution H a : The two populations have different distributions 15 / 15