Testing differences in proportions
|
|
|
- Lewis Kelly
- 9 years ago
- Views:
Transcription
1 Testing differences in proportions Murray J Fisher RN, ITU Cert., DipAppSc, BHSc, MHPEd, PhD Senior Lecturer and Director Preregistration Programs Sydney Nursing School (MO2) University of Sydney NSW 2006 Phone: Fax: [email protected] Andrea P Marshall Sesqui Senior Lecturer (Critical Care) and Director Postgraduate Programs Sydney Nursing School (MO2) University of Sydney NSW 2006 Marion Mitchell Senior Research Fellow Critical Care Griffith University & Princess Alexandra Hospital Address: Nurse Practice Development Unit Princess Alexandra Hospital Ipswich Road Woolloongabba, Qld of 17
2 Abstract This paper is the sixth in a series of statistics articles recently published by Australian Critical Care. In this paper we explore the most commonly used statistical tests to compare groups of data at the nominal level of measurement. The chosen statistical tests are the chisquare test, chi-square test for goodness of fit, chi-square test for independence, Fisher's exact test, McNemar's test and the use of confidence intervals for proportions. Examples of how to use and interpret the tests are provided. Introduction This article presents the most commonly used statistical tests to compare groups of data at the nominal level of measurement. Nominal (or categorical) level of measurement is the sorting of cases into one of several categories (for example, types of religion), where the measure of dispersion is based on the count or frequency of cases in each category of measurement. Concepts around levels of measurement are explained in the second article of this series Understanding Descriptive Statistics 1. The number of cases in each category for a given sample is known as the frequency distribution. A common way of presenting frequency distributions for nominal data is in a table, sometimes referred to as a contingency table or cross-tabulation. An example is depicted in Table 1, which shows the frequency distribution of the incidence of diarrhoea in an intensive care unit (ICU) population over a 12 month period during which time an intervention was introduced 2. 2 of 17
3 Specific methods of inferential statistics are required to determine differences between samples in nominal level measurement. In the example depicted in Table 1, we would be determining the difference in frequency of diarrhoea in patients before implementation of a bowel management protocol with those after the protocol was implemented. The tests of significance for nominal data vary depending on the nature of the chosen measurements for the variables. Table 2 presents the most commonly used tests for comparing two groups using nominal measurement level. Chi-square test The chi-square test compares the observed frequency distribution (ƒ o ) for each category of the scale with the expected frequency distribution (ƒ e ) of the null hypothesis. When using a chi-square test it is assumed that there has been random sampling; that 80% of the cells have an expected frequency of greater than five; that no cell has an observed frequency of 0; and, that a large sample is used, as small sample sizes lead to a small expected frequency which causes large chi-square values 3. A limitation of the chi- square test is that it is sensitive to either very small or large samples. Quantifying the minimum sample is difficult as it is dependent on the number of cells in the crosstab. A sample is considered too small when the above assumptions are not met. When these assumptions are not met the chi-square cannot be meaningfully interpreted. 4 The chance of finding a significant difference between samples is greater with larger samples. If you double the sample size, the chi-square statistic will double due to the large sample size rather than a strong pattern of dependence between the variables 5. 3 of 17
4 When these assumptions are violated the results may lead to erroneous interpretation of the data; that is, the results may be considered statistically significant when in reality they may not be statistically significant. When samples are small and the assumptions for the chisquare are violated, the Fisher s exact test could be used 6. The formula for calculating the chi-square statistic is χ 2 = (ƒ o ƒ e ) 2 ƒ e Where χ 2 is equal to the sum of the squared difference between the observed and the expected frequencies divided by the expected frequency for each cell. The concept of degrees of freedom (df) is important and is a mathematical limitation that needs to be factored in when calculating an estimate of one statistic from an estimate of another. The df are used in conjunction with the table of critical values for chi- square. The df for a chi- square test is calculated with the following equation: df = (R-1) x (C 1) Where: R equals the number of rows C equals the number of columns 3 Chi-square test for goodness of fit 4 of 17
5 The chi-square test for goodness of fit is used for a single population and is a test used when you have one categorical variable. This test determines how well the frequency distribution from that sample fits the model distribution. Consider the data provided in the contingency table (Table 3) which reports the frequency of patients who developed diarrhoea for three different wards within a hospital. The chi-square test for goodness of fit determines difference by comparing the observed frequency distribution with the frequency distribution of the null hypothesis. The null hypothesis is the expected frequency distribution of all wards is the same. That is, approximately 33.3 patients in each ward would be expected to have developed diarrhoea. χ 2 = (-3.3) 2 + (-8.3) 2 + (6.7) = = 3.78 The critical value for χ 2 needs to be determined. First determine the df (see textbox) and determine the level of significance (often set at 0.05) (please refer to the fourth article of this series Statistical and clinical significance, and how to use confidence intervals to help interpret both 8 ). Referring to a table listing the critical values of chi-square (available in most statistics texts) and using the calculated degrees of freedom (df = 1) and level of significance 5 of 17
6 of 0.05, the critical value for χ 2 is The computed chi-square value of 3.78 is lower than the critical value of 3.84, therefore the null hypothesis is not accepted and we conclude that there is not a statistical difference in the distribution of the frequency of patients who developed diarrhoea for the three different wards. Chi-square test for independence The chi-square test for independence is also used for a single population but where there are two categorical variables. The test examines if there is a relationship between the two variables for the one sample. 3 Consider the observed frequency distribution on the difference in the incidence of diarrhoea before and after the implementation of a bowel management protocol (Table 1). The contingency table (Table 1) demonstrates that 36.41% of the pre-intervention sample and 22.74% of the post-intervention sample experienced diarrhoea. In order to determine whether there is a statistical difference between the pre-intervention and post-intervention groups, the chi-squared test of independence is used as these are two independent samples. The chisquare statistic compares the observed frequency distribution (ƒ o ), for example the frequencies that are depicted in Table 1, with the expected frequency distribution of the null hypothesis (ƒ e ). The null hypothesis expresses the expected frequency for each category if there is no statistical difference between categories (see previous publication in this series 8 for further information on hypothesis testing). In this case the null hypothesis is that there is no statistical difference between the number of patients with diarrhoea in the pre-intervention sample as compared to those in the post- 6 of 17
7 intervention sample. To calculate the frequency distribution of the null hypothesis (ƒ e ) the following formula is used: ƒ e = ƒ c ƒ r n Where, ƒ c is the frequency total for the column, ƒ r is the frequency total for the row and n is the total sample size. To calculate the expected frequency for each cell you simply substitute the observed frequency with the calculated expected frequency using the formula. Refer to Table 4 and the calculation below to determine the expected number of patients with diarrhoea in the pre-intervention sample for the null hypothesis. In this case the ƒ e would be: ƒ e = 379(201) 656 = The expected frequency distribution for the null hypothesis in this example would be calculated as depicted in Table 5. 7 of 17
8 At a glance it would appear that in this example there is a difference between frequency observed (ƒ o ) and the expected frequency (ƒ e ). Table 6 presents the difference between the observed and expected frequency for each cell. To calculate whether there is a statistical difference the chi-square formula is used. χ 2 = (ƒ o ƒ e ) 2 ƒ e Where χ 2 is equal to the sum of the squared difference between the observed and the expected frequency divided by the expected frequency for each cell. In this case the chi-square statistic is equal to: χ 2 = (21.87) 2 + (-21.87) 2 + (-21.87) 2 + (21.87) = = The critical value for χ 2 needs to be determined; first calculate the df (see textbox) and determine the level of significance. Referring to a table listing the critical values of chisquare and using the calculated df (1) and level of significance of 0.05, the critical value for 8 of 17
9 χ 2 is The chi-square value of calculated above exceeds that of the critical value, therefore the null hypothesis is rejected and we conclude that there is a statistical difference between the number of patients with diarrhoea in the pre-intervention sample as compared to those in the post-intervention sample. The Statistical Package for the Social Sciences (SPSS version 18) was used to examine this sample of patients with or without diarrhea. The reported SPSS output confirms that there was a statistical difference in the incidence of diarrhoea between the pre-intervention and post-intervention samples χ 2 (1, n=656) =14.06, p< In the original study Ferrie and East 2 identified a statistical difference in the incidence of diarrhoea between the two samples (p<0.0001), however this claim could have been strengthened by reporting the χ 2 statistic. Fisher s exact test The Fisher s exact test is used in cases where there are cells with an expected frequency (ƒ e ) less than 5 and/or with small sample sizes, as the Fisher s exact test has no sample size restriction 6. The method of calculation of the Fisher s exact test is different to the chi-square statistic and is calculated by determining the probability of getting the observed frequency distribution by establishing and comparing to all other possible distributions where the column and row totals remain the same as the observed distribution. In this case the null hypothesis indicates that all the cells would be close to equal. The calculation of the Fisher s exact test is complex and is not available in all statistical packages but can be performed using the Statistical Package for Social Sciences. 9 of 17
10 McNemar s test The McNemar test compares dependent (paired or matched) samples in terms of a dichotomous variable. 4 It is the best test for comparing dichotomous variables with two dependent sample studies as opposed to the chi-square test which examines nominal level variables with two samples that are independent of each other. 4 A dichotomous variable has only two possible outcomes, for example yes or no and it results in a binomial distribution. 9 The McNemar test may be used for pretest-posttest design or in time series data where the same sample is tested at least in two points in time. The main assumption of the McNemar text is that the data comes from two samples that are matched. This can either be as a paired sample or a before/after sample. The McNemar test is a non parametric test and thus assumes that the data are not normally distributed. 4 Consider the following fictitious two by two contingency table (Table 7) which shows the incidence of diarrhoea at two time periods in a sample (n=200).the McNemar test is similar to the chi-squared test in that it examines the difference between expected and observed cell frequencies. The following formula is used to calculate the McNemar test. There is one df which is derived from the following equation: (rows 1)x (columns 1) = 1. χ 2 M = (N a - N d -1) 2 N a + N d Where: N a equals the frequency of observed responses see Cell marked A in Table 7 10 of 17
11 N d equals the frequency of observed responses see Cell D marked in Table 7 In the above example the χ 2 M is equal to : χ 2 M = ( ) = = 0.04 With one degree of freedom and level of significance of.05, based on the chi-square distribution the critical value for χ 2 M is The McNemar chi-square value is less than that of the critical value, therefore the Null Hypothesis (H o ) is retained and we conclude that there is not a statistical difference in the incidence of diarrhoea between the two time periods. Confidence intervals for differences in proportion Confidence intervals (CI) are now being reported along with p values in clinical studies and their use has been described in an earlier paper in this series 5. Calculation of CI is based on the assumption that the variable is normally distributed in the population and is dependent on the level of measurement and therefore the statistical test used. The formula to construct the CI for a proportion will be available in most statistics textbooks 10. There are also computer programmes that perform these tasks for researchers. 11 of 17
12 The statistical programme will calculate the CI and the researcher selects the level of confidence (for example, 95%). Below is an example of how CIs for nominal data may be used in determining clinical significance. In this sample the proportional difference of those with diarrhoea before the intervention compared to those with diarrhoea after the intervention is 13% (36% - 23% - Table 1). We will now calculate the CI around this sample result. The equation for an approximate 95% confidence interval for the difference between two population proportions (p 1 p 2 ) based on two independent samples of size n1 and n2 with sample proportions and is given by the following equation: Example Using the data provided in Table 1, we will calculate the 95% CI using the equation above where = 36%; = 23%; n 1 = 379 and n 2 = 277. The figure of 1.96 indicates we are computing a CI of 95%. CI = ( ) ± (1 0.36) (1 0.23) = 0.13 ± = 0.13 ± 1.96 x = 0.13 ± upper limit = and lower limit = of 17
13 These results indicate that the lower limit of a 95% CI is 6% and the upper limit is 20% with the sample proportion difference at 13%. Note that CIs may not be symmetrical around the sample proportion, it just happens to be in this instance. With a 0.05 level of significance, there is a significant result with p< (as reported earlier in the paper) and the CI provides additional information as it gives a range of where the population proportion is likely to lie. Patients with the intervention are somewhere between 6% and 20% more likely to experience no diarrhea than those without the intervention. The clinical significance and research conclusions should be drawn from the individual context for the study 3. Conclusion This paper has provided an introduction to the statistical tests commonly used to test differences in proportions for nominal level data. Chi-square tests are commonly used in health care research and where sample sizes are small, the Fisher s Exact Test may be used. If the data from dependent, paired samples are binomial, then the McNemar s test may be more appropriate. As with many other statistical tests, assessment of the critical values, p values and CI may assist in the reader determining clinical and statistical significance of the results. Table 1: Incidence of diarrhoea in intensive care following a bowel management protocol 13 of 17
14 Pre-intervention n (%) Post-intervention n (%) Total Patients with diarrhoea Patients without diarrhoea 138 (36) 63 (23) (77) 214 (64) 455 Total Table 2 The tests of significance for nominal data Sample types One-sample case Two or more independent samples Two dependent (paired) samples Small samples Test of Significance Chi-square goodness of fit Chi-square test for independence McNemar test for binomial distributions Fisher s exact test Table 3 Frequency of diarrhea in patients admitted to three wards Ward A Ward B Ward C Total Table 4 Calculating the expected number of patients with diarrhoea in the preintervention sample for the null hypothesis Pre-intervention Post-intervention Total Patients with diarrhoea Patients without diarrhoea? (fe) (fr) Total 379 (fc) (n) 14 of 17
15 Table 5 Expected frequency distribution for patients with and without diarrhoea at preintervention and post-intervention time periods Pre-intervention Post-intervention Total Patients with diarrhoea Patients without diarrhoea Total Table 6 Difference between frequency observed and expected frequency Patients with diarrhoea Patients without diarrhoea Pre-intervention Post-intervention Table 7 Contingency table of diarrhoea at two time periods (with cells named) Time 1 No Yes Total Time 2 Yes Cell A = 40 Cell B = No Cell C= 60 Cell D = 33 Total Text box 1 In statistics the degree of freedom is the number of values in the calculation of a statistic that are free to vary. To calculate the degree of freedom for a chi-square test you count the number of rows and subtract 1 and multiply with the number of columns with 1 subtracted. So for Table 2, DF = (number of rows 1) x (number of columns 1) or (2-1) x (2-1) = of 17
16 16 of 17
17 References 1. Fisher M, Marshall AP. Understanding descriptive statistics. Aust CritCare 2009; 22: Ferrie S, East V. Managing diarrhoea in intensive care. Aust Crit Care 2007; 20: Corty EW. Using and interpreting statistics: A Practical text for the Health, Behavioral, and Social Sciences. St. Louis: Mosby Elsevier; Argyrous G. Statistics for Social Research. Melbourne: Macmillan Education Australia Pty. Ltd; Smithson, M.J. Statistics with Confidence: An Introduction for Psychologists Sage, Canberra Altman DG. Practical Statistics for Medical Research. London: Chapman & Hall; Fethney J. Statistical and clinical significance, and how to use confidence intervals to help interpret both. Aust Crit Care 2010; 23: Periera SMC, Leslie G. Hypothesis testing. Aust Crit Care 2009; 22: Polit DF, Beck CT. Nursing Research: Generating and Assessing Evidence for Nursing Practice, 8th ed. St. Louis: Mosby Elsevier; Carlin JB, Doyle LW. Statistics for Clinicians 6: Comparison of means and proportions using confidence intervals. J Paediatr Child Health 2001; 37: of 17
Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170
Work Sheet 2: Calculating a Chi Square Table 1: Substance Abuse Level by ation Total/Marginal 63 (a) 17 (b) 10 (c) 90 35 (d) 20 (e) 25 (f) 80 Totals/Marginal 98 37 35 170 Step 1: Label Your Table. Label
Chi-square test Fisher s Exact test
Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions
Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table
ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live
Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions
BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution
Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate
1 Chapter 13 Chi-Square This section covers the steps for running and interpreting chi-square analyses using the SPSS Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running
Crosstabulation & Chi Square
Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
Section 12 Part 2. Chi-square test
Section 12 Part 2 Chi-square test McNemar s Test Section 12 Part 2 Overview Section 12, Part 1 covered two inference methods for categorical data from 2 groups Confidence Intervals for the difference of
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables
Contingency Tables and the Chi Square Statistic Interpreting Computer Printouts and Constructing Tables Contingency Tables/Chi Square Statistics What are they? A contingency table is a table that shows
Odds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
Is it statistically significant? The chi-square test
UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
Nonparametric Statistics
Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics
Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples
Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours
TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2
About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing
SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
Analysis of categorical data: Course quiz instructions for SPSS
Analysis of categorical data: Course quiz instructions for SPSS The dataset Please download the Online sales dataset from the Download pod in the Course quiz resources screen. The filename is smr_bus_acd_clo_quiz_online_250.xls.
Chi Square Tests. Chapter 10. 10.1 Introduction
Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square
Projects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
Confidence Intervals for Cp
Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process
Descriptive Analysis
Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,
Testing Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.
Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
Simulating Chi-Square Test Using Excel
Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 [email protected]
Chapter 23. Two Categorical Variables: The Chi-Square Test
Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise
One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Using Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
Chapter 2 Probability Topics SPSS T tests
Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)
1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting
Comparing Multiple Proportions, Test of Independence and Goodness of Fit
Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2
3. Analysis of Qualitative Data
3. Analysis of Qualitative Data Inferential Stats, CEC at RUPP Poch Bunnak, Ph.D. Content 1. Hypothesis tests about a population proportion: Binomial test 2. Chi-square testt for goodness offitfit 3. Chi-square
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
First-year Statistics for Psychology Students Through Worked Examples
First-year Statistics for Psychology Students Through Worked Examples 1. THE CHI-SQUARE TEST A test of association between categorical variables by Charles McCreery, D.Phil Formerly Lecturer in Experimental
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent
Come scegliere un test statistico
Come scegliere un test statistico Estratto dal Capitolo 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc. (disponibile in Iinternet) Table
The Chi-Square Test. STAT E-50 Introduction to Statistics
STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed
Chapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or
1 Chapter 7 Comparing Means in SPSS (t-tests) This section covers procedures for testing the differences between two means using the SPSS Compare Means analyses. Specifically, we demonstrate procedures
Terminating Sequential Delphi Survey Data Collection
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to
statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals
Summary sheet from last time: Confidence intervals Confidence intervals take on the usual form: parameter = statistic ± t crit SE(statistic) parameter SE a s e sqrt(1/n + m x 2 /ss xx ) b s e /sqrt(ss
Lecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
Introduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
Nonparametric Tests. Chi-Square Test for Independence
DDBA 8438: Nonparametric Statistics: The Chi-Square Test Video Podcast Transcript JENNIFER ANN MORROW: Welcome to "Nonparametric Statistics: The Chi-Square Test." My name is Dr. Jennifer Ann Morrow. In
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine
2 - Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels
Statistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York
Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York . NONPARAMETRIC STATISTICS I. DEFINITIONS A. Parametric
CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13
COMMON DESCRIPTIVE STATISTICS / 13 CHAPTER THREE COMMON DESCRIPTIVE STATISTICS The analysis of data begins with descriptive statistics such as the mean, median, mode, range, standard deviation, variance,
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
12.5: CHI-SQUARE GOODNESS OF FIT TESTS
125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability
2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures
Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Chi Square Distribution
17. Chi Square A. Chi Square Distribution B. One-Way Tables C. Contingency Tables D. Exercises Chi Square is a distribution that has proven to be particularly useful in statistics. The first section describes
Topic 8. Chi Square Tests
BE540W Chi Square Tests Page 1 of 5 Topic 8 Chi Square Tests Topics 1. Introduction to Contingency Tables. Introduction to the Contingency Table Hypothesis Test of No Association.. 3. The Chi Square Test
Two Related Samples t Test
Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person
People like to clump things into categories. Virtually every research
05-Elliott-4987.qxd 7/18/2006 5:26 PM Page 113 5 Analysis of Categorical Data People like to clump things into categories. Virtually every research project categorizes some of its observations into neat,
Introduction to Statistics with SPSS (15.0) Version 2.3 (public)
Babraham Bioinformatics Introduction to Statistics with SPSS (15.0) Version 2.3 (public) Introduction to Statistics with SPSS 2 Table of contents Introduction... 3 Chapter 1: Opening SPSS for the first
Statistics in Medicine Research Lecture Series CSMC Fall 2014
Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS
CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS Hypothesis 1: People are resistant to the technological change in the security system of the organization. Hypothesis 2: information hacked and misused. Lack
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Chapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
Introduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
Two Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation
Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.
Final Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
Statistical estimation using confidence intervals
0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
Multiple samples: Pairwise comparisons and categorical outcomes
Multiple samples: Pairwise comparisons and categorical outcomes Patrick Breheny May 1 Patrick Breheny Introduction to Biostatistics (171:161) 1/19 Introduction Pairwise comparisons In the previous lecture,
Common Univariate and Bivariate Applications of the Chi-square Distribution
Common Univariate and Bivariate Applications of the Chi-square Distribution The probability density function defining the chi-square distribution is given in the chapter on Chi-square in Howell's text.
CHAPTER 12. Chi-Square Tests and Nonparametric Tests LEARNING OBJECTIVES. USING STATISTICS @ T.C. Resort Properties
CHAPTER 1 Chi-Square Tests and Nonparametric Tests USING STATISTICS @ T.C. Resort Properties 1.1 CHI-SQUARE TEST FOR THE DIFFERENCE BETWEEN TWO PROPORTIONS (INDEPENDENT SAMPLES) 1. CHI-SQUARE TEST FOR
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
This chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives
C H 8A P T E R Outline 8 1 Steps in Traditional Method 8 2 z Test for a Mean 8 3 t Test for a Mean 8 4 z Test for a Proportion 8 6 Confidence Intervals and Copyright 2013 The McGraw Hill Companies, Inc.
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U
CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U Previous chapters of this text have explained the procedures used to test hypotheses using interval data (t-tests and ANOVA s) and nominal
Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2
Math 58. Rumbos Fall 2008 1 Solutions to Review Problems for Exam 2 1. For each of the following scenarios, determine whether the binomial distribution is the appropriate distribution for the random variable
One-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other
1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric
