Testing Research and Statistical Hypotheses
|
|
|
- Joy Barker
- 9 years ago
- Views:
Transcription
1 Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you learned can be described in terms of the central tendency, such as mean length, and the variability about that mean, as expressed by the variance and standard deviation. But many archaeological data come to us as discrete variables, which need to be treated very differently. For example, among an assemblage of artifacts including handaxes, cleavers and scrapers there is no mean artifact type. In faunal assemblages there is no mean animal species. For these kinds of data we need to treat the individual categories, whether they are artifact types, raw material types or categories of ceramic decorations. These are discrete variables, and to test hypotheses about them we need to use appropriate statistical procedures. We will give you the opportunity to do this by applying discrete statistical analysis to the testing of hypotheses about differences in lithic artifact samples. Background Statistical analysis is very common to archaeology and other sciences, because it is the way we qualify our conclusions, rather than just make claims that they are true. It is true that many scientific conclusions can be reached without statistics. Indeed many major contributions from science are observations. Also, we need to understand when we need statistical support for our work and when we don t. For example, we do not need statistics to test the hypothesis that one group used pottery and another did not; we document this by the presence or absence of ceramics in their assemblages. But a hypothesis concerning the cultural relationships between prehistoric groups based on the similarities and differences in their pottery styles may very well require that we use statistical analysis, since this involves degrees of similarity, and a decision about what is significantly similar or different. Statistics are most commonly applied when we are testing hypotheses about archaeological phenomena, such that it is the archaeological hypotheses that drive our work. We perform statistical analyses in order to verify that our observations on archaeological data are valid and not just impressions or erroneous conclusions based on small samples. In most cases there are two kinds of hypotheses that are being tested. One, we ll call the research hypothesis, is based on the actual archaeological data, and our understanding of how those data are related. So if we pose the hypothesis that the cultural relationships between two groups are accurately reflected by their pottery styles, we need to be confidant that pottery indeed can be used to measure cultural similarities. We can support this by analogy to modern potters, and from the archaeological record itself. Once we have defined our research hypothesis, we can decide what kind of statistical hypothesis is appropriate. In the following exercises we will illustrate the sequence of testing a research hypothesis with a statistical hypothesis that is appropriate for the situation. But in the end, it is the archaeological issue that we are concerned with. There are many discrete statistical analytical methods. We will use the chi-square (X 2 ) analysis. The X 2 statistic is commonly used in testing archaeological hypotheses because we deal with so 51
2 many categorical variables as mentioned above. These can involve single classifications or analysis of independence in two-way (two categories compared at the same time) classification. In both, the X 2 statistic allows us to evaluate the observed frequency of certain categories against a theoretical frequency of those categories. In some cases we are simply deciding if variations in our data are random, and therefore of little meaning. For example, I claim that in my site I found 46 handaxes and 54 choppers, and claim that the higher number of choppers is a significant difference, indicative of the cultural tradition that made those artifacts. You are not convinced, and ask me to test whether this difference could be due to chance. Let s use this as an example of hypothesis testing in archaeology, supported by use of the the X 2 statistic. Testing for Randomness of Observations One Variable In this example, we are going to test the following research hypothesis: H o : Any differences between handaxe and chopper frequencies in my assemblage are due to chance. Statistically, we need to compare the observed frequencies of these artifact types against their theoretical frequencies that would be expected if their frequencies were indeed random. If two variables occur with random frequencies, such as tossing of coins, then their theoretical frequencies are Once we have determined the theoretical frequencies, we can proceed with testing the hypothesis using the X 2 statistic that is calculated with the following equation: ( O X 2 i = Ti T ) i 2 where, O T i "the sum of' observed frequency (count) theoretical frequency (count) label for categories (1 for handaxes, 2 for choppers) Let s consider this equation briefly, to make sure you understand what it s about. First, note that the X 2 statistic is going to be calculated by completing the operations shown on the right side of the equation. These can be summarized as follows: 1. For the first category (handaxes) subtract the theoretical frequency from that which we have observed. 2. Then square this difference to make all differences a positive value. We need to do this because some differences are negative and some are positive, but we re only interested in how large the difference is. 3. Next, divide the squared difference by the theoretical frequency. This is actually computing the ratio of the difference to the absolute value. If this ratio is a large number it will 52
3 make the X 2 larger, which will increase the chance that the difference is in fact a significant one. 4. Now perform steps 1-3 for the second variable (choppers). 5. Last, add the results for both variables to calculate the X 2. This will always be a positive number, and as it gets bigger, the chances that the difference between handaxe and chopper frequencies is significant ( that is to say nonrandom) will be increased.after we have done this on our data, we will see how we determine whether or not my claim is supported.the first thing we need to do is determine the theoretical number of handaxes and choppers. We calculate this by dividing the sum of all cases by the number of categories. In this case, the sum of all artifacts is 100 and the number of categories (handaxes and choppers) is two. Thus, the theoretical frequency of handaxes and choppers is 100/2 =50. Handaxes Choppers Sum Next, we ll calculate the differences between observed and theoretical for each category and square those differences: Handaxes Choppers Sum Next we divide each squared difference by the theoretical frequency for each category, which in our situation is 16/50 = 0.32 for each category: Handaxes Choppers Sum
4 Last, we simply add the last column in our table. This sum is the X 2 statistic. Handaxes Choppers Sum We now have to determine if the X 2 we calculated indicates that the difference in frequencies of handaxes and choppers is statistically significant, which is the way that we ll decide to accept or reject our hypothesis. So now we re going to compare our X 2 with the probability distribution of the X 2 statistic as shown in the X 2 Table. The X 2 Table lists values of the X 2 statistic based on two considerations: probability level and degrees of freedom. What do these mean? We determine the level of probability by deciding how often we are willing to make a mistake in our conclusion. If, for example, we are willing to be wrong 10% of the time, we will use the P 90 column in the table. If we are willing to be wrong only 5% of the time, we ll select the P 95 column. Notice that the X 2 cut-off values for P 90 and P 95 increase in the first row from 2.71 to Remember we said earlier that the larger the X 2 value, the greater the chance (probability) that the differences between observed and theoretical frequencies are significant. This should help you to see that the term significance is really a statement of how often we could be wrong in accepting or rejecting a hypothesis. So hypothesis testing in a situation like this can yield an all-or-nothing result only if we define ahead of time how often we are willing to make a mistake. Alternatively, we could just see what the chances are for a mistake by looking at the X 2 probability we came up with. The degrees of freedom (df) which make up the first column in the X 2 Table simply adjusts for the number of categories we are comparing. The degrees of freedom is one less than the number of categories (2-1) As you can see in the table under any probability column, that the cut-off X 2 values increase as the number of categories increase. We have two categories, so our degree of freedom is 1. If we had more categories, the X 2 cut-off would increase simply because we would add together more summed differences between observed and theoretical frequencies. Making a Decision About the Hypothesis We can now see the result of our X 2 analysis. The X 2 for our handaxe-chopper example is First, we have to predetermine the probability level we want to use, let s say P 95, which would mean that we could make a mistake 5% of the time. By looking at the X 2 cut-off for the P 95 probability and 1 df you can see that our X 2 is smaller than the expected X 2 value of If the X 2 value we calculate is greater than the selected X 2 probability, then the null 54
5 hypothesis is rejected. Thus, the likelihood is that the frequencies are not due to chance and the difference between the observed and theoretical frequencies of handaxes and choppers is statistically significant. If the X 2 value we calculate is less than the selected X 2 probability then the null hypothesis cannot be rejected and the likelihood is that the frequencies are due to chance and the difference between the observed and theoretical frequencies of handaxes and choppers is not statistically significant. So now you know that the frequencies of handaxes and choppers in my sample could easily be due to chance, and the difference in their frequencies is not statistically significant. Therefore we reject my hypothesis, and your suspicions are now statistically affirmed. We can also use X 2 analysis to compare the frequencies of categories between two archaeological samples not to see if they are random, but to see if they are different or similar. This requires a slightly different approach, and one that you will use in your lab exercise to compare artifact samples you will analyze. Testing for Randomness of Observations Multiple Variables In this next example, we are going to examine differences across two variables. The mechanics of conducting a X 2 analysis is similar to the previous example. The main difference is in the calculation of the theoretical values and the degrees of freedom. In this example, we will examine whether there are differences in the raw materials used to make handaxes and choppers. Here are the frequencies of raw materials for handaxes and choppers from a site in East Africa: Basalt Rhyolite Total Handaxes Choppers Total Overall, it appears that basalt was somewhat preferred for making these tools than rhyolite, but were these raw materials used the same for the two tool categories or was there some preference by their makers? We can find out by testing the following null hypothesis: H o : Any differences between raw materials for handaxes and chopper are due to chance. To calculate the theoretical values for our X 2 analysis, we use the following formula: (R i x C i )/T where R i is the row total ( 1 for handaxes and 2 for choppers), C i is the column total (1 for basalt 55
6 and 2 for rhyolite), and T is the total of all artifacts (109). Here are the theoretical frequencies: Basalt handaxes (54x48)/109 = 23.8 Basalt choppers (55x48)/109 = 24.2 Rhyolite handaxes (54x61)/109 = 30.2 Rhyolite choppers (55x61)/109 = 30.8 Calculating the X 2 is the same as in the previous example: X 2 ( Oi Ti ) = T i 2 Basalt Rhyolite Total Handaxes ( ) 2 /23.8=1.62 ( ) 2 /30.2= Choppers ( ) 2 /24.2=1.59 ( ) 2 /30.8= Total X 2 =5.73 The degrees of freedom for multiple variables is calculated slightly differently from in the previous example. Because we now have more than one column, we use the following formula: df= (r-1)x(c-1) where r is the number of rows and c is the number of columns. Thus in our example, there are two columns (basalt and rhyolite) and two rows (handaxes and choppers). So the degrees of freedom are calculated as follows: (2-1) x (2-1) = 1. If we use the probability level of 95% with the degrees of freedom of 1, the selected probability cut-off ( P 95 )in the X 2 Table 2 is The X 2 value for our analysis (5.73) is greater than the expected value, thus the null hypothesis that raw material selection for handaxes and choppers is due to chance is rejected if we are willing to make an error in this decision no more than 5% of the time. On the other hand, if we wanted to risk an error in testing our hypothesis no more than 1% of the time ( P 99 ) our X 2 is less than the cut-off of 6.63, and we would conclude that the differences in raw material selection are not significant. Thus, we get to decide the level of significance. 56
7 Percentiles of the X 2 Distribution df P O. 5 P I P 2.5 P 5 P IO P 90 P 95 P P 99 P
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.
Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal
LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation
Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
Crosstabulation & Chi Square
Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Independent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable
Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable Application: This statistic has two applications that can appear very different,
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
Two-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Introduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
Chapter 23. Two Categorical Variables: The Chi-Square Test
Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise
Using Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
One-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA
CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA Chapter 13 introduced the concept of correlation statistics and explained the use of Pearson's Correlation Coefficient when working
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
Chapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or
1 Chapter 7 Comparing Means in SPSS (t-tests) This section covers procedures for testing the differences between two means using the SPSS Compare Means analyses. Specifically, we demonstrate procedures
Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation
Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus
Standard Deviation Estimator
CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of
One-Way Analysis of Variance
One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
Chi-square test Fisher s Exact test
Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions
Weight of Evidence Module
Formula Guide The purpose of the Weight of Evidence (WoE) module is to provide flexible tools to recode the values in continuous and categorical predictor variables into discrete categories automatically,
Topic 8. Chi Square Tests
BE540W Chi Square Tests Page 1 of 5 Topic 8 Chi Square Tests Topics 1. Introduction to Contingency Tables. Introduction to the Contingency Table Hypothesis Test of No Association.. 3. The Chi Square Test
CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS
CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS Hypothesis 1: People are resistant to the technological change in the security system of the organization. Hypothesis 2: information hacked and misused. Lack
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Introduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true
Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory
LA-UR-12-24572 Approved for public release; distribution is unlimited Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory Alicia Garcia-Lopez Steven R. Booth September 2012
SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170
Work Sheet 2: Calculating a Chi Square Table 1: Substance Abuse Level by ation Total/Marginal 63 (a) 17 (b) 10 (c) 90 35 (d) 20 (e) 25 (f) 80 Totals/Marginal 98 37 35 170 Step 1: Label Your Table. Label
Hypothesis Testing: Two Means, Paired Data, Two Proportions
Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this
Odds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010
MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times
CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U
CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U Previous chapters of this text have explained the procedures used to test hypotheses using interval data (t-tests and ANOVA s) and nominal
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
8 6 X 2 Test for a Variance or Standard Deviation
Section 8 6 x 2 Test for a Variance or Standard Deviation 437 This test uses the P-value method. Therefore, it is not necessary to enter a significance level. 1. Select MegaStat>Hypothesis Tests>Proportion
CHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
MULTIPLE REGRESSION WITH CATEGORICAL DATA
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Hypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
First-year Statistics for Psychology Students Through Worked Examples
First-year Statistics for Psychology Students Through Worked Examples 1. THE CHI-SQUARE TEST A test of association between categorical variables by Charles McCreery, D.Phil Formerly Lecturer in Experimental
Chi Square Tests. Chapter 10. 10.1 Introduction
Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square
2 GENETIC DATA ANALYSIS
2.1 Strategies for learning genetics 2 GENETIC DATA ANALYSIS We will begin this lecture by discussing some strategies for learning genetics. Genetics is different from most other biology courses you have
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217
Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent
CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V
CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V Chapters 13 and 14 introduced and explained the use of a set of statistical tools that researchers use to measure
3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
Goodness of Fit. Proportional Model. Probability Models & Frequency Data
Probability Models & Frequency Data Goodness of Fit Proportional Model Chi-square Statistic Example R Distribution Assumptions Example R 1 Goodness of Fit Goodness of fit tests are used to compare any
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
Testing differences in proportions
Testing differences in proportions Murray J Fisher RN, ITU Cert., DipAppSc, BHSc, MHPEd, PhD Senior Lecturer and Director Preregistration Programs Sydney Nursing School (MO2) University of Sydney NSW 2006
Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table
ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live
Describing Populations Statistically: The Mean, Variance, and Standard Deviation
Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly
Confidence intervals
Confidence intervals Today, we re going to start talking about confidence intervals. We use confidence intervals as a tool in inferential statistics. What this means is that given some sample statistics,
12.5: CHI-SQUARE GOODNESS OF FIT TESTS
125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability
Is it statistically significant? The chi-square test
UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical
Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
Solutions to Homework 10 Statistics 302 Professor Larget
s to Homework 10 Statistics 302 Professor Larget Textbook Exercises 7.14 Rock-Paper-Scissors (Graded for Accurateness) In Data 6.1 on page 367 we see a table, reproduced in the table below that shows the
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1
BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine
Analysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
In the past, the increase in the price of gasoline could be attributed to major national or global
Chapter 7 Testing Hypotheses Chapter Learning Objectives Understanding the assumptions of statistical hypothesis testing Defining and applying the components in hypothesis testing: the research and null
CS 147: Computer Systems Performance Analysis
CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding
ABSORBENCY OF PAPER TOWELS
ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?
Comparing Multiple Proportions, Test of Independence and Goodness of Fit
Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2
Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions
BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared
CHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
Rank-Based Non-Parametric Tests
Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs
Objective. Materials. TI-73 Calculator
0. Objective To explore subtraction of integers using a number line. Activity 2 To develop strategies for subtracting integers. Materials TI-73 Calculator Integer Subtraction What s the Difference? Teacher
Analyzing and interpreting data Evaluation resources from Wilder Research
Wilder Research Analyzing and interpreting data Evaluation resources from Wilder Research Once data are collected, the next step is to analyze the data. A plan for analyzing your data should be developed
SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.
SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation
Content Sheet 7-1: Overview of Quality Control for Quantitative Tests
Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other
1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric
AN ANALYSIS OF INSURANCE COMPLAINT RATIOS
AN ANALYSIS OF INSURANCE COMPLAINT RATIOS Richard L. Morris, College of Business, Winthrop University, Rock Hill, SC 29733, (803) 323-2684, [email protected], Glenn L. Wood, College of Business, Winthrop
Chapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate
1 Chapter 13 Chi-Square This section covers the steps for running and interpreting chi-square analyses using the SPSS Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running
Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing
This chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
The F distribution and the basic principle behind ANOVAs. Situating ANOVAs in the world of statistical tests
Tutorial The F distribution and the basic principle behind ANOVAs Bodo Winter 1 Updates: September 21, 2011; January 23, 2014; April 24, 2014; March 2, 2015 This tutorial focuses on understanding rather
