# The Chi-Square Test. STAT E-50 Introduction to Statistics

Size: px
Start display at page:

Transcription

1 STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed frequencies with expected frequencies. In a hypothesis test, the expected frequencies are those we would expect if the null hypothesis our test is true. O The formula is where O represents the observed frequency and represents the expected frequency. The value df depends on the type test you are performing. The Chi-Square Distribution The χ distribution is nonnegative not symmetrical; it is skewed to the right distributed to form a family distributions, with a separate distribution for each different degrees freedom. The Chi-Square Test for Goodness Fit The goodness--fit test compares the distribution observed outcomes for a single categorical variable to the expected outcomes predicted by a probability model. This test involves one sample, and one variable. Assumptions and Conditions: Be sure that the data is counts, or frequencies Independence assumption Sample size assumption xpected cell frequency condition: each expected frequency is at least The Chi-square test is one-sided 0 (df, α) Automobile insurance is much more expensive for teenage than for older. To justify this cost difference, insurance companies claim that the younger are much more likely to be involved in costly. To test this claim, a researcher obtains information about registered from the Department Motor Vehicles and selects a sample 300 accident reports from the police department. The DMV reports the age registered in each age category as reported below. The accident reports is also shown. Does this data indicate that occur with the same distribution as the ages the? H 0 : H a : 5 6 1

2 Automobile insurance is much more expensive for teenage than for older. To justify this cost difference, insurance companies claim that the younger are much more likely to be involved in costly. To test this claim, a researcher obtains information about registered from the Department Motor Vehicles and selects a sample 300 accident reports from the police department. The DMV reports the age registered in each age category as reported below. The accident reports is also shown. Does this data indicate that occur with the same distribution as the ages the? H 0 : The distribution the ages involved in is the same as the distribution the ages registered. H a : The distribution the ages involved in is not the same as the distribution the ages registered. xpected cell frequency condition Under or over (this is the data) expected O - (O - ) (O - ) 7 8 xpected cell frequency condition xpected cell frequency condition Under or over n = Note: Σ observed = Σ expected expected O - (O - ) (O - ) Under or over n = Note: Σ observed = Σ expected expected O - (O - ) (O - ) 9 10 xpected cell frequency condition xpected cell frequency condition Under or over n = Note: Σ observed = Σ expected expected O - (O - ) (O - ) Under or over n = Note: Σ observed = Σ expected expected O - (O - ) (O - ) 11 1

3 xpected cell frequency condition - each expected frequency 5 Under or over n = Note: Σ observed = Σ expected expected O - (O - ) (O - ) 13 expected O - (O - ) (O - ) Under or over Specify the sampling distribution model and the test you will use. O, with df = k-1 = df = 14 expected O - (O - ) (O - ) Under or over Note: Σ(O - ) = 0 Specify the sampling distribution model and the test you will use. expected O - (O - ) (O - ) Under or over Note: Σ(O - ) = 0 Specify the sampling distribution model and the test you will use. O, with df = k-1 O, with df = k-1 = df = = df = expected O - (O - ) (O - ) Under or over expected O - (O - ) (O - ) Under or over Specify the sampling distribution model and the test you will use. Since the conditions are met, we will use a Chi-square model with degrees freedom, and do a Chi-square goodness--fit test. O, with df = k-1 = df = 17 O, with df = k-1 = df = 3-1 = P-value: 18 3

4 = df = 3-1 = P-value: P <.005 Statistical conclusion: Conclusion in context: expected O - (O - ) (O - ) Under or over expected O - (O - ) (O - ) Under or over = df = 3-1 = Using SPSS for a Goodness Fit Test If you have the expected proportions: 1. Create a numeric variable with a width 1 and no decimal places for the categories. Code the values this variable as follows: In the Values column, click on the box with the three dots: P-value: P <.005 Statistical conclusion: Since the p-value is small, reject the null hypothesis. Conclusion in context: The data indicates that the distribution ages involved in is not the same as the distribution ages the in the population. 1 You will then see the Value Labels dialog box. Since there are three categories ages, enter the values 1,, and 3 as coding variables: Then click on Add and you will see the results: nter the value "1" and code it as "under 0". (You do not have to use quotation marks; they will be added by SPSS.) 3 4 4

5 Continue adding all categories, one at a time, and then click on OK. You will see the results in the Values column in Variable View Create a numeric variable with no decimal places for the observed frequencies. You can then enter the observed frequency for that category. Then, for each category, enter the coded value: Repeat this until all observed frequencies have been entered: As you enter each value you will see a drop-down box. If you click on it, you can choose from the list labels. However, if you just move to the next column, you will see the category name associated with the coded value Weight the cases using the observed frequencies. 4. Now select > Analyze > Nonparametric Tests > Legacy Dialogs > Chi-Square

6 5. Select the variable with the observed frequencies as the Test Variable In the xpected Values box, select Values: 6. nter the expected s (as decimals) one at a time, and click on Add until all have been entered: nter the expected s (as decimals) one at a time, and click on Add until all have been entered: 7. After the last value has been entered, click on OK. You should see a table showing the observed and expected frequencies and a table with the results the Chi-square test: count Observed N xpected N Residual Test Statistics count Chi-Square a df Asymp. Sig..001 a. 0 cells (.0%) have expected frequencies less than 5. The minimum expected cell frequency is These results show that χ = 13.76, and p =.001 (Note that you also have the option to choose All categories equal if that is appropriate.) The Chi-Square Test for Homogeneity In a test for homogeneity, we compare observed distributions for several groups to see if there are differences among the respective populations. The central issue is whether the category proportions are the same for all the populations. The test involves several samples but only one variable. The article Relationship Health Behaviors to Alcohol and Cigarette Use by College Students (J. College Student Development (199)) included data on drinking behavior for independently chosen random samples male and female students similar to the data shown below. Does there appear to be a gender difference with respect to drinking behavior? None 140 ( ) 186 ( ) Low (1-7) 478 ( ) 661 ( ) Moderate (8-4) 300 ( ) 173 ( ) High (5 or more) 63 ( ) 16 ( )

7 The Chi-Square Test for Homogeneity Assumptions and Conditions: Be sure that the data is counts, or frequencies Independence assumption If you want to generalize from the data to a population. Sample size assumption xpected cell frequency condition ach expected frequency is at least 5 The article Relationship Health Behaviors to Alcohol and Cigarette Use by College Students (J. College Student Development (199)) included data on drinking behavior for independently chosen random samples male and female students similar to the data shown below. Does there appear to be a gender difference with respect to drinking behavior? H 0 : H a : xpected cell frequency condition The article Relationship Health Behaviors to Alcohol and Cigarette Use by College Students (J. College Student Development (199)) included data on drinking behavior for independently chosen random samples male and female students similar to the data shown below. Does there appear to be a gender difference with respect to drinking behavior? H 0 : The proportions the four drinking levels are the same for males and for females H a : The proportions the four drinking levels are not the same for males and for females xpected cell frequency condition: (row total)(column total) n 39 Specify the sampling distribution model and the test you will use. df = (R - 1)(C - 1) None 140 ( ) 186 ( ) Low (1-7) 478 ( ) 661 ( ) Moderate (8-4) 300 ( ) 173 ( ) High (5 or more) 63 ( ) 16 ( ) 40 None 140 ( ) 186 ( ) Low (1-7) 478 ( ) 661 ( ) Moderate (8-4) 300 ( ) 173 ( ) High (5 or more) 63 ( ) 16 ( ) None 140 ( ) 186 ( ) Low (1-7) 478 ( ) 661 ( ) Moderate (8-4) 300 ( ) 173 ( ) High (5 or more) 63 ( ) 16 ( ) Specify the sampling distribution model and the test you will use. df = (R - 1)(C - 1) = (4-1)( - 1) = (3)(1) = 3 Fill in the row and column totals. The conditions are met, so we will use a Chi-square model with 3 degrees freedom, and do a Chi-square test homogeneity

8 None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( ) 473 High (5 or more) 63 ( ) 16 ( ) None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( ) 473 High (5 or more) 63 ( ) 16 ( ) Calculate the expected frequencies for each cell, using (row total)(column total) = n Calculate the expected frequencies for each cell, using (row total)(column total) = n None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( ) 473 High (5 or more) 63 ( ) 16 ( ) Calculate the expected frequencies for each cell, using (row total)(column total) = n O None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( ) O.17 + None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( ) O None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( )

9 O None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( ) O = None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( ) O None 140 ( ) 186 ( ) 36 Low (1-7) 478 ( ) 661 ( ) 1139 Moderate (8-4) 300 ( ) 173 ( 4.95 ) 473 High (5 or more) 63 ( 38.4 ) 16 ( ) = The article Relationship Health Behaviors to Alcohol and Cigarette Use by College Students (J. College Student Development (199)) included data on drinking behavior for independently chosen random samples male and female students similar to the data shown below. Does there appear to be a gender difference with respect to drinking behavior? H 0 : The proportions the four drinking levels are the same for males and females H a : The proportions the four drinking levels are not the same for males and females = df = 3 P-value: p <.005 Statistical conclusion: Conclusion in context: The article Relationship Health Behaviors to Alcohol and Cigarette Use by College Students (J. College Student Development (199)) included data on drinking behavior for independently chosen random samples male and female students similar to the data shown below. Does there appear to be a gender difference with respect to drinking behavior? H 0 : The proportions the four drinking levels are the same for males and females H a : The proportions the four drinking levels are not the same for males and females = df = 3 P-value: p <.005 Statistical conclusion: p is small, so the null hypothesis is rejected Conclusion in context: The data does indicate a gender difference with respect to drinking behavior

10 Using SPSS for a Test for Homogeneity 1. Create a string variable for each the categories, and a numeric variable for the observed frequencies. Be sure to make the columns wide enough ("columns" in Variable View). 3. Select > Analyze > Descriptive Statistics > Crosstabs Select one variable as the row variable and the other as the column variable. Click on Statistics and then on Chi-square. Then enter the values these two variables:. Weight the cases using the observed frequencies. (> Data > Weight Cases ) Click on the Cells button, and select Observed and xpected in the Cell Display window. Then click on Continue. Your output should include a table showing the observed and expected frequencies: Click on Display clustered bar charts to produce the graph shown in the results. Click on Continue and then click on OK. gender * level Crosstabulation level high low moderate none gender female Count xpected Count male Count xpected Count Count xpected Count and a table with the results your Chi-square test: Here is the graph that represents the results: Chi-Square Tests Value df Asymp. Sig. (- sided) Pearson Chi-Square a Likelihood Ratio N Valid Cases 017 a. 0 cells (.0%) have expected count less than 5. The minimum expected count is These results show that χ = 96.56, and p =

11 The Chi-Square Test for Independence In a test for independence, we investigate association between two categorical variables in a single population. There is one sample, but there are two variables. Assumptions and Conditions: If you want to generalize from the data to a population. xpected cell frequency condition 61 The table shown below was constructed using data in the article Television Viewing and Physical Fitness in Adults (Research Quarterly for xercise and Sport (1990)). The author hoped to determine whether time spent watching television is associated with cardiovascular fitness. Subjects were asked about their television viewing time (per day, rounded to the nearest hour) and were classified as physically fit if they scored in the excellent or very good category on a step test. H o : H a : 0 35 ( ) 147 ( ) ( ) 69 ( ) ( ) ( ) 5 or more 4 ( ) 34 ( ) 6 The table shown below was constructed using data in the article Television Viewing and Physical Fitness in Adults (Research Quarterly for xercise and Sport (1990)). The author hoped to determine whether time spent watching television is associated with cardiovascular fitness. Subjects were asked about their television viewing time (per day, rounded to the nearest hour) and were classified as physically fit if they scored in the excellent or very good category on a step test. xpected cell frequency condition 0 35 ( ) 147 ( ) ( ) 69 ( ) ( ) ( ) 5 or more 4 ( ) 34 ( ) H o : Fitness and TV viewing are independent H a : Fitness and TV viewing are not independent ( ) 147 ( ) ( ) 69 ( ) ( ) ( ) 5 or more 4 ( ) 34 ( ) Specify the sampling distribution model and the test you will use ( ) 147 ( ) ( ) 69 ( ) ( ) ( ) 5 or more 4 ( ) 34 ( ) Find the row and column totals. df = (R - 1)(C - 1) = (4-1)( - 1) = (3)(1) = 3 Since the conditions are met, we will use a Chi-square model with 3 degrees freedom, and do a Chi-square test for independence

12 0 35 ( ) 147 ( ) ( ) 69 ( ) ( ) ( ) 50 5 or more 4 ( ) 34 ( ) ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( ) (row total)(column total) = n (row total)(column total) = n ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) (row total)(column total) = n O ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) O O =

13 6.161 df = ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) P-value: df = ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) df = ( 5.48 ) 147 ( ) ( 10.0 ) 69 ( ) ( ) ( ) 50 5 or more 4 ( 5.3 ) 34 ( 3.68 ) P-value: p >.10 Statistical conclusion: Conclusion in context: 75 P-value: p >.10 Statistical conclusion: Since the p-value is large, we cannot reject the null hypothesis. Conclusion in context: There is not enough evidence to conclude that time spent watching television is associated with cardiovascular fitness. 76 Using SPSS for a Test for Independence Then enter the frequencies as before: Follow the instructions for a Chi-Square test for homogeneity. You may define two string variables for the categories and one numeric variable for the counts, or you may choose to use coding for one or either the variables representing the categories

14 Weight the cases by counts, and then use > Analyze > Descriptive Statistics > Crosstabs SPSS output: Select one variable as the row variable and the other as the column variable. TVGroup * Fitness Crosstabulation Fitness Fit Not Fit Click on Statistics and then on Chi-square. Click on the Cells button, and select Observed and xpected in the Cell Display window. Click on Display clustered bar charts to produce the graph shown in the results. Then click on Continue and on OK. TVGroup 0 Count xpected Count Count xpected Count Count 8 50 xpected Count or more Count xpected Count Count xpected Count SPSS output: Here is the graph that supports these results: Chi-Square Tests Value df Asymp. Sig. (- sided) Pearson Chi-Square a Likelihood Ratio N Valid Cases 100 a. 0 cells (.0%) have expected count less than 5. The minimum expected count is 5.3. These results show that χ = and p = A health pressional selected a random sample 100 patients from each four major hospital emergency rooms to see if the major reasons for emergency room visits (accident, illegal activity, illness, other) are the same in all four hospitals. This is an example a. A goodness--fit test b. A test for homogeneity c. A test for independence 1. A health pressional selected a random sample 100 patients from each four major hospital emergency rooms to see if the major reasons for emergency room visits (accident, illegal activity, illness, other) are the same in all four hospitals. This is an example a. A goodness--fit test b. A test for homogeneity c. A test for independence

15 . An urban economist wants to determine whether the region the United States a resident lives in is related to his level education. He randomly selects 1800 US residents and asks them to report their level education and the region the US in which they live. The economist is using a. A goodness--fit test b. A test for homogeneity c. A test for independence. An urban economist wants to determine whether the region the United States a resident lives in is related to his level education. He randomly selects 1800 US residents and asks them to report their level education and the region the US in which they live. The economist is using a. A goodness--fit test b. A test for homogeneity c. A test for independence As part a class project, a student asked a random sample students about their preferred st drink: Pepsi, Coke, or 7-Up, to determine whether these three drinks were equally preferred by students. 3. As part a class project, a student asked a random sample students about their preferred st drink: Pepsi, Coke, or 7-Up, to determine whether these three drinks were equally preferred by students. The student should use a. A goodness--fit test b. A test for homogeneity c. A test for independence The student should use a. A goodness--fit test b. A test for homogeneity c. A test for independence

### SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer

### Is it statistically significant? The chi-square test

UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

### An introduction to IBM SPSS Statistics

An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive

### Nonparametric Tests. Chi-Square Test for Independence

DDBA 8438: Nonparametric Statistics: The Chi-Square Test Video Podcast Transcript JENNIFER ANN MORROW: Welcome to "Nonparametric Statistics: The Chi-Square Test." My name is Dr. Jennifer Ann Morrow. In

### Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution

### Analysis of categorical data: Course quiz instructions for SPSS

Analysis of categorical data: Course quiz instructions for SPSS The dataset Please download the Online sales dataset from the Download pod in the Course quiz resources screen. The filename is smr_bus_acd_clo_quiz_online_250.xls.

### Main Effects and Interactions

Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly

### Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

### SPSS TUTORIAL & EXERCISE BOOK

UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS

### Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

### This chapter discusses some of the basic concepts in inferential statistics.

Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details

### Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate

1 Chapter 13 Chi-Square This section covers the steps for running and interpreting chi-square analyses using the SPSS Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running

### Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

### Two Related Samples t Test

Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person

### How to Make APA Format Tables Using Microsoft Word

How to Make APA Format Tables Using Microsoft Word 1 I. Tables vs. Figures - See APA Publication Manual p. 147-175 for additional details - Tables consist of words and numbers where spatial relationships

### IBM SPSS Statistics for Beginners for Windows

ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

### EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

EPS 625 INTERMEDIATE STATISTICS The Friedman test is an extension of the Wilcoxon test. The Wilcoxon test can be applied to repeated-measures data if participants are assessed on two occasions or conditions

### 3. Analysis of Qualitative Data

3. Analysis of Qualitative Data Inferential Stats, CEC at RUPP Poch Bunnak, Ph.D. Content 1. Hypothesis tests about a population proportion: Binomial test 2. Chi-square testt for goodness offitfit 3. Chi-square

### Simulating Chi-Square Test Using Excel

Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 lchandra@jjay.cuny.edu

### CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent

### People like to clump things into categories. Virtually every research

05-Elliott-4987.qxd 7/18/2006 5:26 PM Page 113 5 Analysis of Categorical Data People like to clump things into categories. Virtually every research project categorizes some of its observations into neat,

### Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

### Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise

### Introduction to Statistics with SPSS (15.0) Version 2.3 (public)

Babraham Bioinformatics Introduction to Statistics with SPSS (15.0) Version 2.3 (public) Introduction to Statistics with SPSS 2 Table of contents Introduction... 3 Chapter 1: Opening SPSS for the first

### MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

### 4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

### Whitney Colbert Research Methods for the Social Sciences Trinity College Spring 2012

ALCOHOL IN COLLEGE ATHLETICS: THE FIGHT TO RAISE AWARENESS OF BINGE DRINKING ON COLLEGE ATHLETIC TEAMS Whitney Colbert Research Methods for the Social Sciences Trinity College Spring 2012 While there is

### Using SPSS, Chapter 2: Descriptive Statistics

1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

### Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal

### UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

### Difference of Means and ANOVA Problems

Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

### CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS Hypothesis 1: People are resistant to the technological change in the security system of the organization. Hypothesis 2: information hacked and misused. Lack

### Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

### Data Analysis for Marketing Research - Using SPSS

North South University, School of Business MKT 63 Marketing Research Instructor: Mahmood Hussain, PhD Data Analysis for Marketing Research - Using SPSS Introduction In this part of the class, we will learn

### SPSS/Excel Workshop 3 Summer Semester, 2010

SPSS/Excel Workshop 3 Summer Semester, 2010 In Assignment 3 of STATS 10x you may want to use Excel to perform some calculations in Questions 1 and 2 such as: finding P-values finding t-multipliers and/or

### Probability Distributions

CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution

### MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

### Two Correlated Proportions (McNemar Test)

Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

### Data analysis process

Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

### Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

### TI-Inspire manual 1. Instructions. Ti-Inspire for statistics. General Introduction

TI-Inspire manual 1 General Introduction Instructions Ti-Inspire for statistics TI-Inspire manual 2 TI-Inspire manual 3 Press the On, Off button to go to Home page TI-Inspire manual 4 Use the to navigate

### Nonparametric Statistics

Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics

### Q1. Where else, other than your home, do you use the internet? (Check all that apply). Library School Workplace Internet on a cell phone Other

Exploring Check-All Questions: Frequencies, Multiple Response, and Aggregation Target Software & Version: SPSS v19 Last Updated on May 4, 2012 Created by Laura Atkins Sometimes several responses or measurements

### An analysis method for a quantitative outcome and two categorical explanatory variables.

Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

### Mind on Statistics. Chapter 15

Mind on Statistics Chapter 15 Section 15.1 1. A student survey was done to study the relationship between class standing (freshman, sophomore, junior, or senior) and major subject (English, Biology, French,

### TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

### Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

### Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2

### LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

### Crosstabulation & Chi Square

Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

### SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

### Descriptive Statistics

Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

### Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Opgaven Onderzoeksmethoden, Onderdeel Statistiek 1. What is the measurement scale of the following variables? a Shoe size b Religion c Car brand d Score in a tennis game e Number of work hours per week

### Mind on Statistics. Chapter 13

Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question

### Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

### Simple Linear Regression Inference

Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

### Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Contingency Tables and the Chi Square Statistic Interpreting Computer Printouts and Constructing Tables Contingency Tables/Chi Square Statistics What are they? A contingency table is a table that shows

### Profiles and Data Analysis. 5.1 Introduction

Profiles and Data Analysis PROFILES AND DATA ANALYSIS 5.1 Introduction The survey of consumers numbering 617, spread across the three geographical areas, of the state of Kerala, who have given information

### Topic 8. Chi Square Tests

BE540W Chi Square Tests Page 1 of 5 Topic 8 Chi Square Tests Topics 1. Introduction to Contingency Tables. Introduction to the Contingency Table Hypothesis Test of No Association.. 3. The Chi Square Test

### Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

### SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS

### Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

### Using Excel for inferential statistics

FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

### Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

### HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

### 12.5: CHI-SQUARE GOODNESS OF FIT TESTS

125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability

### An SPSS companion book. Basic Practice of Statistics

An SPSS companion book to Basic Practice of Statistics SPSS is owned by IBM. 6 th Edition. Basic Practice of Statistics 6 th Edition by David S. Moore, William I. Notz, Michael A. Flinger. Published by

### Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours

### INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

### 3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

### Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

### Statistics 2014 Scoring Guidelines

AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

### Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

### Working with SPSS. A Step-by-Step Guide For Prof PJ s ComS 171 students

Working with SPSS A Step-by-Step Guide For Prof PJ s ComS 171 students Contents Prep the Excel file for SPSS... 2 Prep the Excel file for the online survey:... 2 Make a master file... 2 Clean the data

### SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Ch. 10 Chi SquareTests and the F-Distribution 10.1 Goodness of Fit 1 Find Expected Frequencies Provide an appropriate response. 1) The frequency distribution shows the ages for a sample of 100 employees.

### Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared

### Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

LA-UR-12-24572 Approved for public release; distribution is unlimited Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory Alicia Garcia-Lopez Steven R. Booth September 2012

### II. DISTRIBUTIONS distribution normal distribution. standard scores

Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

### SPSS Notes (SPSS version 15.0)

SPSS Notes (SPSS version 15.0) Annie Herbert Salford Royal Hospitals NHS Trust July 2008 Contents Page Getting Started 1 1 Opening SPSS 1 2 Layout of SPSS 2 2.1 Windows 2 2.2 Saving Files 3 3 Creating

### Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

### IB Practice Chi Squared Test of Independence

1. A university required all Science students to study one language for one year. A survey was carried out at the university amongst the 150 Science students. These students all studied one of either French,

### Descriptive Analysis

Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,

### Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)

### SPSS (Statistical Package for the Social Sciences)

SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies

### November 08, 2010. 155S8.6_3 Testing a Claim About a Standard Deviation or Variance

Chapter 8 Hypothesis Testing 8 1 Review and Preview 8 2 Basics of Hypothesis Testing 8 3 Testing a Claim about a Proportion 8 4 Testing a Claim About a Mean: σ Known 8 5 Testing a Claim About a Mean: σ

Table of Contents Preface Chapter 1: Introduction 1-1 Opening an SPSS Data File... 2 1-2 Viewing the SPSS Screens... 3 o Data View o Variable View o Output View 1-3 Reading Non-SPSS Files... 6 o Convert

### 1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

### Non-Inferiority Tests for Two Means using Differences

Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

### DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript JENNIFER ANN MORROW: Welcome to The t Test for Independent Samples. My name is Dr. Jennifer Ann Morrow. In today's demonstration,

### UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

### Directions for using SPSS

Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

### Association Between Variables

Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

### SPSS Explore procedure

SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

### Questionnaire design and analysing the data using SPSS page 1

Questionnaire design and analysing the data using SPSS page 1 Questionnaire design. For each decision you make when designing a questionnaire there is likely to be a list of points for and against just

### SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list