Chapter 13. Inferences for Tables

Size: px
Start display at page:

Download "Chapter 13. Inferences for Tables"

Transcription

1 Chapter 13 Inferences for Tables

2 Lesson 13-1 Test for Goodness of Fit

3 Test for Goodness of Fit To analyze categorical data, we construct two-way tables and examine the counts or percents of the explanatory and response variables

4 Two Way Table Proportion of M&M Colors 1.69 ounce bag of plain M&M Color Observed Count, O Expected Count, E O E 2 E Blue (57) = Brown (57) = Green (57) = Orange (57) = Red (57) = Yellow (57) = Total

5 Null and Alternative Hypothesis We want to compare the observed counts to the expected counts The null hypothesis is that there is no difference between the observed and expected counts. The alternative hypothesis is that there is a difference between the observed and expected counts.

6 Chi-Square Statistic The chi-square statistic measures how well the observed counts fit the expected counts, assuming that the null hypothesis is true. In order to determine whether the distribution has changes we need a way to measure the observed counts (O) from the expected counts (E) X 2 O E E 2

7 Chi-Square Distribution, 2 The distribution of the chisquare statistic is called the chi-square distribution, 2. This distribution is a density curve. The total area under the curve is 1. The curve begins at zero on the horizontal axis and is skewed right. As the degree of freedom increase, the sample of the curve becomes more symmetric.

8 Goodness of Fit Test Use the M&M chi-square statistic, find the probability of obtaining a X 2 value at least this extreme assuming that the null hypothesis is true. This is known as the Goodness of Fit Test Degree of freedom (n 1), where is the number of categories.

9 Goodness of Fit Test Using the M&M data chi-square statistic X 2 = Degree of freedom (n 1) = 6 1 = 5. If the M&Ms were distributed as claim by Mars Inc, an observed value of or higher would occur about 0.50% of the time.

10 Assumptions and Conditions Counted Data Condition Check to see if that the data are counts for categories of categorical variables. Randomization Condition Expected Cell Frequency Condition We should expect to see at least 5 individuals in each cell.

11 Example Chi-Square Test for Goodness of Fit Does your zodiac sign determine how successful you will be later in life? Fortune magazine collected the zodiac signs of 256 heads of the largest 400 companies. Here are the number of births for each sign: We can see some variation in the number of births per sign and there are more Pisces, but is it enough to claim that successful people are more likely to be under some signs than others? Births Signs 23 Aries 20 Taurus 18 Gemini 23 Cancer 20 Leo 19 Virgo 18 Libra 21 Scorpio 19 Sagittarius 22 Capricorn 24 Aquarius 29 Pisces

12 Example Chi-Square Test for Goodness of Fit If births were distributed uniformly across the year, we would expect about 1/12 of them to occur under each sign of the zodiac. That suggests 256/12, or about 21.3 births per sign. How closely do the observed numbers of births per sign fit this simple null model?

13 Example Chi-Square Test for Goodness of Fit Step 1 State appropriate null and alternative hypotheses for investigating the genetic model. H o : Births are uniformly distributed over zodiac signs. H a : Births are not uniformly distributed over zodiac signs Formal approach specifies the parameter values H o : P Aries = P Taurus =.=P Pisces H a : P Aries P Taurus. P Pisces

14 Example Chi-Square Test for Goodness of Fit Step 2 Verify conditions for performing inference in this setting. 1. Counted Data: we have counts of the number of executives in categories. 2. Randomization: we have convenience sample of of executives, but no reason to suspect bias 3. Expected cell frequency: The null hypothesis expects that 1/12 or the 256 births, or , should occur in each sign. These expected values are greater than 5

15 Example Chi-Square Test for Goodness of Fit Step 3 Calculate the test statistic and the P-value ( Obs Exp) ( ) ( ) X Exp p value

16 Example Chi-Square Test for Goodness of Fit Step 4 State the conclusion. There is sufficient evidence to fail to reject H o since, p- value = > = 0.05, and conclude that there is no evidence of non-uniform distribution of zodiac signs among executives.

17 Example Chi-Square Test for Goodness of Fit The p-value is the area in the upper tail of the 2 model for 12 1 = 11 degrees of freedom above the computed X 2 value. The p-value of says that if the zodiac signs of executives were fact distributed uniformly, an observed chi-square value of 5.09 or higher would occur about 93% of the time. P P X 2 2 ( χ ) P 2 ( χ 5.094) 0.926

18 Lesson 13-2, Part 1 Inference for Two-Way Tables

19 Two-Way Tables The first step in the overall test for comparing several proportions is to arrange the data in a two-way table. Example Many high school graduating classes determine the plans of the graduates. We might wonder whether the plans of students have stayed roughly the same over past decades or whether they have changed. Here is a summary table from one high school.

20 Two-Way Tables Each cell of the table shows how many students from a particular class (the column) made a certain choice (row). We might wonder about the changes in the choice of military service. The numbers for 1980 and 1990 look similar, until you notice the class sizes are quite different. Table Total College/Post-HS-Education Employment Military Travel Total

21 Two-Way Tables The 18 seniors in 1980 who chose military service were only about 18/453 = 3.97% of the graduating class. In 1990, the 19 seniors making the same choice represented 19/290 = 6.55% of the class. Because the class sizes change so much, we re better off examining the proportions for each class rather than counts Percentages of the class Total College/Post-HS-Education Employment Military Travel Total

22 Chi-Square Test of Homogeneity We already know how to test whether two proportions are the same. For example, we could consider whether the proportion of students choosing military service was the same between 1980 and 1990 with a two proportion z test. But know we have more than two groups. We want to test whether students choices are the same across all three graduating classes. The z-test for two proportions generalizes to a chisquare test of homogeneity.

23 Chi-Square Test of Homogeneity Why chi-square? It turns out that the statistic is identical to the chisquare statistic for goodness-of-fit that we just saw. The goodness-of-fit test compared counts with a model. But here we re asking whether choices have changes, so we have no model. We find the expected counts for each category directly from the data.

24 Chi-Square Test of Homogeneity As a result, we count the degree of freedom slightly different as well. df = (R 1)(C 1) where R is the number of rows and C is the number of columns. The term homogeneity means that things are the same. Here, we ask whether the post-high school choices made by students are the same for these graduating classes.

25 Null and Alternative Hypothesis The null hypothesis is that there is no difference between the row variable and column variable. The alternative hypothesis is that there is some difference between the row variable and column variable.

26 Assumptions and Conditions Counted Data Condition The data must be counts. You can t do a test of homogeneity on proportions, so we have to work with counts of graduates given in the first table. Also, you can t do a chi-square test on measurements. Example If we recorded GPAs for these same groups over the time span, we wouldn t be able to test whether the mean GPAs had changes using this test.

27 Assumptions and Conditions Randomization Condition Often when we test for homogeneity; we aren t interested in some large population, so we don t really need a random sample. We would need a random sample if we wanted to draw more general conclusions. Example Graduating choices made by all high school students in these graduation years.

28 Assumptions and Conditions The null hypothesis can be thought of as a model in which the counts in the table are distributed as if each student chose a plan randomly according to the overall proportions of choices, regardless of the student s class. As long as we don t want to generalize, we don t have to check the randomization condition.

29 Assumptions and Conditions Expected Cell Frequency Condition We must be sure we have enough data for this method to work. The expected count in each cell must be at least 5.

30 Expected Count The expected count in any cell of two-way table when H o is true is Expected count = Row total x Column total Table total To calculate the expected counts, multiply the row total by the column total, and divide by table total.

31 Chi-Square Test Statistic The chi-square statistic is the sum over all r x c cell in the table. X O E 2 2 Remember that the chi-square statistic measures of how far the observed counts in a two-way table are from the expected counts. The degrees of freedom is (R 1)(C 1). The p-value is the area to the right of the X 2 statistic under the chi-square density curve. E

32 Example Chi-Square Test of Homogeneity We have counts of 1058 students observed in three different years a decade apart and categorized according to their post-graduation activities. Table Total College/Post-HS-Education Employment Military Travel Total Is the post-high school choices made by students the same for these graduating classes?

33 Example Chi-Square Test of Homogeneity Step 1 Identify population of interest. State the hypothesis in words and symbols. H o : The post-high school choices made by classes of 1980, 1990, and 2000 have the same distribution H a : The post-high school choices made by classes of 1980, 1990, and 2000 do not have the same distribution

34 Example Chi-Square Test of Homogeneity Step 2 Choose the appropriate inference procedure and verify conditions for it use. Conditions: 1. Counted Data We have counts of the number of students in categories. 2. Randomization Condition We don t want to draw inferences to other high schools or other classes, so there is no need to check for a random sample.

35 Example Chi-Square Test of Homogeneity 3. Expected cell frequency condition The expected value are (shown below) are all at least 5. Table Total College/Post-HS-Education 320/ / / Employment 98/ / / Military 18/ / / Travel 17/ /6.58 5/ Total (853) (24)

36 Example Chi-Square Test of Homogeneity Step 3 Carry out he procedure. 2 ( Obs Exp) x Exp nd Matrix Observed 2 nd Matrix p value Expected

37 Example Chi-Square Test of Homogeneity Step 4 Interpret your results in context of the problem. There is sufficient evidence to reject the H o since p- value = < = 0.05, and conclude that the choices made by high school graduates have indeed changed over the two decades examined.

38 Example Chi-Square Test of Homogeneity The shape of a 2 model depends on the degrees of freedom. A 2 model with 6 df is skewed right to the high end. The p-value is very small, indicating that the pattern we would see would be very unlikely to occur by chance were post high school choices homogeneous. 2 P P( χ 72.77)

39 Lesson 13-2, Part 2 Inference for Two-Way Tables

40 Independence A study from the University of Texas Southwestern Medical Center examined whether the risk of hepatitis C was related to whether people had tattoos and to where they got their tattoos. Hepatitis C causes about 10,000 deaths each year in the United States, but often lies undetected for many years after infection. Hepatitis C No Hepatitis C Total Tattoo, parlor Tattoo, elsewhere None Total

41 Contingency Tables These data differ from the kinds of data we ve considered before in this chapter because they categorize subjects from a single group on two categorical variables rather than only one. The categorical variables are hepatitis C status ( Hepatitis C or No Hepatitis C ) and tattoo status ( Parlor, Elsewhere, or None ). Contingency Tables Are tables that display categorize counts on two (or more variables so that we can see whether the distribution of counts on one variable is contingent on the other.

42 Chi-Square Test for Independence The natural question to ask of these data is whether the chance of having hepatitis C is independent of tattoo status. Here, this means the probability that randomly selected patient has hepatitis C should change conditional on learning the patient s tattoo status. The test to use here is the chi-square test for independence.

43 Chi-Square Test for Independence If hepatitis is independent of tattoos, we d expect the proportion of people testing positive for hepatitis to be the same for the three levels of tattoo status. This sounds a lot like the test for homogeneity. In fact the calculations are the same. The difference is that we have two categorical variables measured on a single population. Homogeneity test, we have a single categorical variable measured independently on two or more populations.

44 Null and Alternative Hypothesis The null hypothesis is that the row variable is independent (not related to) of the column variable. The alternative hypothesis is that the row variable is dependent (related to) of the column variable.

45 Assumptions and Conditions Counted Data Condition Randomization Condition and < 10% All expected counts are at least 5

46 Example Chi-Square Test of Independence We have counts of 626 individuals categorized according to their tattoo status and their hepatitis status. Hepatitis C No Hepatitis C Total Tattoo, parlor Tattoo, elsewhere None Total Is the chance of having hepatitis C related to whether people had tattoos and where they got their tattoos?

47 Example Chi-Square Test of Independence Step 1 Identify population of interest. State the hypothesis in words and symbols. H o : Tattoo status and hepatitis status are independent. H a : Tattoo status and hepatitis status are not independent

48 Example Chi-Square Test of Independence Step 2 Choose the appropriate inference procedure and verify conditions for it use. Conditions: 1. Counted Data We have counts of the number of individuals in categories of two categorical variables. 2. Randomization Condition These data are from a retrospective study of patients being treated for something unrelated to hepatitis. Although they are not SRS, they were selected to avoid biases and should be representative of the general population.

49 Example Chi-Square Test of Independence 3. Expected cell frequency condition The expected value are (shown below) are all at least 5. Hepatitis C No Hepatitis C Total Tattoo, parlor 17/ / Tattoo, elsewhere 8/ / None 22/ / Total (52) (513)

50 Example Chi-Square Test of Independence Step 3 Carry out the procedure. 2 ( Obs Exp) x Exp nd Matrix Observed 2 nd Matrix p value Expected

51 Example Chi-Square Test of Independence Step 4 Interpret your results in context of the problem. There is sufficient evidence to reject the H o since p-value = < = 0.05, and conclude that the hepatitis status is not independent of tattoo status.

52 Chi-Square and Causation Chi-square tests are common. Test for independence are especially widespread. Unfortunately, many people interpret a small p- value as proof of causation. Just as correlation between quantitative variables does not demonstrate causation, failure of independence between two categorical variables does not show a causeand-effect relationship between them.

53 Chi-Square and Causation The chi-square test for independence treats the two variables symmetrically. There is no way to differentiate the direction of any possible causation from variable to the other. In our example it is unlikely that have hepatitis causes one to crave a tattoo. Also there s never any way to eliminate the possibility that a lurking variable is responsible for the observed lack of independence. For example It might be people who have body piercings or those who inject drugs are both more likely to get tattooed and more likely to contract hepatitis C.

54 Types of Chi-Square Distributions Goodness-of-fit A test of whether the distribution counts in one categorical variable matches the distribution predicted by a model. Homogeneity A test comparing the distribution of counts for two or more groups on the same categorical variable Independence A test of whether two categorical variables are independent examines the distribution of counts for one group of individuals classified according to both variables.

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 23. Two Categorical Variables: The Chi-Square Test Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170 Work Sheet 2: Calculating a Chi Square Table 1: Substance Abuse Level by ation Total/Marginal 63 (a) 17 (b) 10 (c) 90 35 (d) 20 (e) 25 (f) 80 Totals/Marginal 98 37 35 170 Step 1: Label Your Table. Label

More information

Crosstabulation & Chi Square

Crosstabulation & Chi Square Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

More information

Chi Square Distribution

Chi Square Distribution 17. Chi Square A. Chi Square Distribution B. One-Way Tables C. Contingency Tables D. Exercises Chi Square is a distribution that has proven to be particularly useful in statistics. The first section describes

More information

Solutions to Homework 10 Statistics 302 Professor Larget

Solutions to Homework 10 Statistics 302 Professor Larget s to Homework 10 Statistics 302 Professor Larget Textbook Exercises 7.14 Rock-Paper-Scissors (Graded for Accurateness) In Data 6.1 on page 367 we see a table, reproduced in the table below that shows the

More information

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Comparing Multiple Proportions, Test of Independence and Goodness of Fit Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2

More information

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Chi-square test Fisher s Exact test

Chi-square test Fisher s Exact test Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory LA-UR-12-24572 Approved for public release; distribution is unlimited Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory Alicia Garcia-Lopez Steven R. Booth September 2012

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

Statistical tests for SPSS

Statistical tests for SPSS Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

Chi Square Tests. Chapter 10. 10.1 Introduction

Chi Square Tests. Chapter 10. 10.1 Introduction Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution

More information

Topic 8. Chi Square Tests

Topic 8. Chi Square Tests BE540W Chi Square Tests Page 1 of 5 Topic 8 Chi Square Tests Topics 1. Introduction to Contingency Tables. Introduction to the Contingency Table Hypothesis Test of No Association.. 3. The Chi Square Test

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

POPULATION AND ZODIAC RHYTHMS

POPULATION AND ZODIAC RHYTHMS POPULATION AND ZODIAC RHYTHMS Didier CASTILLE 56 million French people, as estimated in 1990 from the General Population Census, have been analysed in a descriptive way according to the signs transited

More information

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals Summary sheet from last time: Confidence intervals Confidence intervals take on the usual form: parameter = statistic ± t crit SE(statistic) parameter SE a s e sqrt(1/n + m x 2 /ss xx ) b s e /sqrt(ss

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA Chapter 13 introduced the concept of correlation statistics and explained the use of Pearson's Correlation Coefficient when working

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

The Chi-Square Test. STAT E-50 Introduction to Statistics

The Chi-Square Test. STAT E-50 Introduction to Statistics STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed

More information

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails. Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Section 12 Part 2. Chi-square test

Section 12 Part 2. Chi-square test Section 12 Part 2 Chi-square test McNemar s Test Section 12 Part 2 Overview Section 12, Part 1 covered two inference methods for categorical data from 2 groups Confidence Intervals for the difference of

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

Elementary Statistics

Elementary Statistics lementary Statistics Chap10 Dr. Ghamsary Page 1 lementary Statistics M. Ghamsary, Ph.D. Chapter 10 Chi-square Test for Goodness of fit and Contingency tables lementary Statistics Chap10 Dr. Ghamsary Page

More information

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

More information

Descriptive Analysis

Descriptive Analysis Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

University of Arkansas Libraries ArcGIS Desktop Tutorial. Section 2: Manipulating Display Parameters in ArcMap. Symbolizing Features and Rasters:

University of Arkansas Libraries ArcGIS Desktop Tutorial. Section 2: Manipulating Display Parameters in ArcMap. Symbolizing Features and Rasters: : Manipulating Display Parameters in ArcMap Symbolizing Features and Rasters: Data sets that are added to ArcMap a default symbology. The user can change the default symbology for their features (point,

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency

More information

Solutions to Homework 6 Statistics 302 Professor Larget

Solutions to Homework 6 Statistics 302 Professor Larget s to Homework 6 Statistics 302 Professor Larget Textbook Exercises 5.29 (Graded for Completeness) What Proportion Have College Degrees? According to the US Census Bureau, about 27.5% of US adults over

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

8 6 X 2 Test for a Variance or Standard Deviation

8 6 X 2 Test for a Variance or Standard Deviation Section 8 6 x 2 Test for a Variance or Standard Deviation 437 This test uses the P-value method. Therefore, it is not necessary to enter a significance level. 1. Select MegaStat>Hypothesis Tests>Proportion

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

EMPIRICAL FREQUENCY DISTRIBUTION

EMPIRICAL FREQUENCY DISTRIBUTION INTRODUCTION TO MEDICAL STATISTICS: Mirjana Kujundžić Tiljak EMPIRICAL FREQUENCY DISTRIBUTION observed data DISTRIBUTION - described by mathematical models 2 1 when some empirical distribution approximates

More information

individualdifferences

individualdifferences 1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Chapter 26: Tests of Significance

Chapter 26: Tests of Significance Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under

More information

LESSON 9. Vimshottari Dasa and Manual Calculations

LESSON 9. Vimshottari Dasa and Manual Calculations LESSON 9 58 Vimshottari Dasa and Manual Calculations To find the timing event, we must know the period ruled by the significator planet. There are so many types of dasha systems. But I found the Vimshottari

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2 About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Clocking In Facebook Hours. A Statistics Project on Who Uses Facebook More Middle School or High School?

Clocking In Facebook Hours. A Statistics Project on Who Uses Facebook More Middle School or High School? Clocking In Facebook Hours A Statistics Project on Who Uses Facebook More Middle School or High School? Mira Mehta and Joanne Chiao May 28 th, 2010 Introduction With Today s technology, adolescents no

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

What Does the Normal Distribution Sound Like?

What Does the Normal Distribution Sound Like? What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

1051-232 Imaging Systems Laboratory II. Laboratory 4: Basic Lens Design in OSLO April 2 & 4, 2002

1051-232 Imaging Systems Laboratory II. Laboratory 4: Basic Lens Design in OSLO April 2 & 4, 2002 05-232 Imaging Systems Laboratory II Laboratory 4: Basic Lens Design in OSLO April 2 & 4, 2002 Abstract: For designing the optics of an imaging system, one of the main types of tools used today is optical

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Drawing a histogram using Excel

Drawing a histogram using Excel Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

MTH 140 Statistics Videos

MTH 140 Statistics Videos MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Simulating Chi-Square Test Using Excel

Simulating Chi-Square Test Using Excel Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 lchandra@jjay.cuny.edu

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

AP STATISTICS REVIEW (YMS Chapters 1-8)

AP STATISTICS REVIEW (YMS Chapters 1-8) AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

November 08, 2010. 155S8.6_3 Testing a Claim About a Standard Deviation or Variance

November 08, 2010. 155S8.6_3 Testing a Claim About a Standard Deviation or Variance Chapter 8 Hypothesis Testing 8 1 Review and Preview 8 2 Basics of Hypothesis Testing 8 3 Testing a Claim about a Proportion 8 4 Testing a Claim About a Mean: σ Known 8 5 Testing a Claim About a Mean: σ

More information

Testing differences in proportions

Testing differences in proportions Testing differences in proportions Murray J Fisher RN, ITU Cert., DipAppSc, BHSc, MHPEd, PhD Senior Lecturer and Director Preregistration Programs Sydney Nursing School (MO2) University of Sydney NSW 2006

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information