First-year Statistics for Psychology Students Through Worked Examples. 3. Analysis of Variance
|
|
|
- Arthur Hubert Clark
- 9 years ago
- Views:
Transcription
1 First-year Statistics for Psychology Students Through Worked Examples 3. Analysis of Variance by Charles McCreery, D.Phil Formerly Lecturer in Experimental Psychology Magdalen College Oxford Copyright Charles McCreery, 2007 OXFORD FORUM Psychological Paper No
2 Contents 1. Introduction 1.1 Concerning the Appendices 1.2 Recommended reading: 1.3 Comments 2. The Question 3. The Answer 3.1 What is meant by randomization? 3.2 The plot 3.3 The ANOVA analysis Hypothesis and assumptions Computation of the ANOVA The results 3.4 Design precautions which should have been taken 3.5 Two potential benefits from a repeated measures design Appendix I steps in a two-way ANOVA Appendix II the theoretical distinction between populations and samples Appendix III repeated measures and interaction effects Appendix IV parametric versus non-parametric tests 2
3 1. Introduction This tutorial is the third in a series based on handouts I developed while teaching statistics to first-year psychology students at Magdalen College, Oxford. The main body of the tutorial is a detailed worked answer to a question from the Oxford University Psychology Prelims examination. The rationale for basing teaching of statistics on worked examples is given in the Introduction to the first tutorial Concerning the Appendices In this worked example of a Two-way ANOVA the arithmetical workings are set out in the most explicit way possible for heuristic purposes; i.e. the object is to show as clearly as possible what is going on. In practice, various short-cut methods of calculation are possible, particularly in an examination context. For example, first-year Oxford University psychology students are allowed the use of a hand calculator in the Psychology Prelims exam, and the Department of Statistics recommends the method of calculation summarized in Appendix 1. The remaining three appendices deal with more general theoretical points. The first of these, Appendix II, summarises in schematic form the distinction between populations and samples. This distinction is important to an understanding of the theoretical basis of the ANOVA, as well as to the understanding of other tests in statistics. Appendix III summarises some points concerning repeated measures and interaction effects. These are topics which are not directly involved in the particular worked example in this tutorial, although the concept of repeated measures is alluded to. However, the Appendix may be found useful by people learning about other forms of the ANOVA. ANOVA is an example of a parametric statistical test, and the requirements of such a test are alluded to in the present example. Appendix IV summarises the differences between parametric and non-parametric tests, and lists a number of parametric tests and their non-parametric equivalents. 1 McCreery, C. (2007). First-year statistics for psychology students through worked examples: I The Chi-square test, a test of association between categorical variables. Psychological paper No. 2007, 1. Oxford: Oxford Forum. Online pdf at: 3
4 1.2 Recommended reading: The present tutorial will probably be found more useful if some preliminary reading around the topic has already been done in advance. I suggest that it does not matter if not everything in the reading is clear on the first go through. After the preliminary reading you can work through the example in the present tutorial and then return to the books, when you may have a better feel for the topic. I particularly suggest that it is important not to get bogged down in trying to understand the details of areas of the topic which may not be relevant to one s syllabus. Some of the books mentioned below go into the subject in great depth, and with a degree of detail which is probably unnecessary for the purposes of most examinations. For the purposes of understanding what is going on in a two-way ANOVA I particularly recommend following the worked example given on pp of Murray Spiegel s book listed below. The arithmetical workings are laid out very explicitly, with a minimum of abstract notation. In my view going through an explicit worked example of this kind can give a better understanding of the underlying theoretical basis of ANOVA than a verbal exposition. Hoel, Paul G. (1976). Elementary Statistics (4 th edition). New York: Wiley. Chapter 11. Covers one-way and two-way ANOVAs in a relatively user-friendly manner. Howell, David C. (1997). Statistical Methods for Psychology (4 th London: Duxbury Press. Chapters edition). There are nearly 200 pages on ANOVA in these four chapters, covering all aspects of the subject, including repeated measures designs. Hays, William L. (1994). Statistics (5 th edition). Orlando, Florida: Harcourt Brace. Chapters Rather more than 200 pages in this case. Similar in comprehensiveness to Howell. Spiegel, Murray R. (1992). Schaum s Outline of Theory and Problems of Statistics (2 nd edition). New York: McGraw-Hill. Chapter 16. 4
5 The theoretical introduction to this chapter is compressed, but the worked examples in the remainder may be found helpful. 1.3 Comments If anyone has any comment to make on this tutorial, or notices any typographical or other error which should be corrected, I should be pleased to hear from them, via: [email protected] 5
6 2. The Question 2 (i) What is meant by randomisation? An undesirable effect of some antihistamines is drowsiness, which is a consequence of the effect of the drugs on the central nervous system. These data come from an experiment of Hedges, Hills, Maclay, Newman-Taylor and Turner (1971) to compare the effect on the central nervous system of a placebo and two antihistamines. This was done by measuring the flicker frequency 3 some time after drug administration in four volunteers who have taken the three treatments. The data presented here are scaled measures based on the flicker frequency. Subject Number Meclastine Promethazine Placebo (ii) Plot these data in a meaningful way and comment. (iii) Carry out an appropriate analysis to examine whether there is a difference between the effects of the different drugs, stating clearly your hypotheses, conclusions and any assumptions made. (iv) State two precautions which should have been taken in running this experiment. (v) Give two benefits which would have resulted if more than one measurement for each drug for each subject had been obtained. (3;4;7;3;3) [The numbers in the bottom right hand corner of the question are as they appear in the Oxford Prelims exam paper and indicate to the candidate how many marks each part of the question is potentially worth if answered correctly.] 2 The question is taken from the Prelims Statistics paper for first-year psychology students at Oxford University, Hilary Term, The flicker frequency, often referred to as the critical flicker frequency or CFF, is that frequency at which a flickering light appears to a particular observer to be continuous rather than discontinuous. It is usually estimated by taking a series of readings from both below and above; that is to say, the experimenter starts with a frequency below the fusion point and gradually increases it until the flicker disappears; he/she then starts again from a frequency above the fusion point and lowers the frequency until the flicker appears again. The CFF reading for a given subject at a given time would then be the mean of several observations taken in both directions, to allow for noise (i.e. random moment-to-moment small fluctuations in the CFF). 6
7 CFF 3. The answer 3.1 What is meant by randomization? Random sampling: a sample is said to be taken at random from a population when each member of the population has an equal chance of being chosen. One result is that each member would be selected approximately the same number of times as any other member given a large number of repetitions of the experiment. The concept of randomisation in the design of experiments is a distinct but related concept. It refers to the process of allocating subjects to two or more experimental conditions, or 'treatment groups', by some chance process such as the use of random number tables. Assuming that we wish to have an equal number of subjects in each group, the chances of a given subject being allocated to the each possible condition are thus equal. The purpose of random allocation of this kind is to produce treatment groups that are as nearly similar as possible prior to the experimental procedure whose effect it is desired to measure. Thus any difference between the groups subsequent to the treatment may be attributed to the experimental treatment and not to some systematic, prior difference between them. 3.2 The plot Antihistamines and CFF Subject 1 Subject 2 Subject 3 Subject Meclastine Promethazine Placebo Drug 7
8 Comment on the plot: The most notable effect appears to be large individual differences in baseline critical flicker frequency or CFF. It also appears that both antihistamines have the effect of depressing the CFF, since the highest mean value in all four subjects is for the placebo condition. Promethazine would appear to depress it more than Meclastine, though this difference may not be statistically significant, given the small number of subjects. [The data are consistent with the idea that the CFF is a measure of sensory efficiency and hence cortical arousal.] 3.3 The ANOVA analysis We will perform a two-way ANOVA on the data to determine whether there is any significant difference between the effects of the different drugs. There will be two factors in this analysis: 'Drug', which has three levels 4, Meclastine, Promethazine and Placebo; and 'Subject', which has four. The inter-subject variability referred to above suggests that there may be a significant main effect 5 for Subject. Whether the intra-subject variability is sufficient to give us a main effect for Drug is less clear Hypothesis and assumptions The hypothesis is that the two antihistamines depress cortical arousal, and hence visual efficiency as measured by the visual system's power of temporal resolution. We predict that the CFF will therefore be significantly lower under the influence of the antihistamines than under the placebo condition. Our assumptions are: (1) that the observations under each condition are drawn from a normal distribution, even though there are too few under each condition to constitute a normal sample; and (2) that the variances of each of these underlying normal distributions are approximately equal. [These are general assumptions underlying all applications of the ANOVA test. Note that the question explicitly mentions that the data are from 'scaled measures'; i.e. they are at least interval data, so a parametric test such as ANOVA is appropriate. For an explanation of the distinctions between interval and other types of measurements, and the implications for what type of test to 4 Level in analysis of variance is simply a technical term for the different possible values of a variable or factor. 5 For a definition of main effect and a discussion of the difference between this and an interaction effect see Appendix III. 8
9 use on a given type of data, please refer to my tutorial on the chi-square test, Appendix B. 6 Appendix IV to the present tutorial summarises the differences between parametric tests and non-parametric tests in general.] Computation of the ANOVA Subject Number Meclastine Promethazine Placebo Row Mean Column Mean Calculation of the total sum of squares of the data: Grand mean = ( )/12 = 979/12 = Deviation of observation from Deviation Observation grand mean squared , , , , Grand mean =81.58 Total sum of squares (TSS): 11, McCreery, C. (2007). First-year statistics for psychology students through worked examples: I The Chi-square test, a test of association between categorical variables. Psychological paper No. 2007, 1. Oxford: Oxford Forum. Online pdf at: 9
10 Total number of observations: n = (no. of rows) * (no. of columns) = r * c = 4 * 3 = 12. [The calculation of TSS is laid out in the explicit form above for heuristic purposes. In practice you are likely to be able to use a short-cut method for calculating the TSS. For example, in the Oxford University Psychology Prelims examination the student is allowed the use of a hand calculator, and may therefore calculate the TSS by entering all 12 observations into the calculator, working out their standard deviation (SD) and squaring the SD to get the variance. Having got the variance, you can derive the TSS by multiplying the variance by (n-1), where n is the total number of observations. See Appendix I for a summary of the steps using this method. N.B. There are two separate operational definitions of variance (and hence of SD, its square root). If we are calculating the variance of a population, we divide TSS by N, the total number in the population; but if we are dealing with only a sample of a population, as in the present example, we divide by (n-l), where n is the total number in the sample. This is an adjustment designed to allow for sampling error in our data, or random variation from sample to sample. If using a calculator, therefore, it is advisable to be aware which definition is being used by it, as some calculators are able to calculate both versions. For more on the distinction between populations and samples, see Appendix II.] Row calculations: Deviation of row mean from Deviation Row means grand mean squared Grand mean =81.58 Sum of squares: Row Sum of Squares (RSS) = Sum of squares of row means* No. of columns = * 3 = 10, [Note that the sum of squares of the row means is the variation in just one column of figures, albeit a representative one. To get an estimate of the overall 10
11 variation due to the 'row factor', Subject, we have to extrapolate this variation across the whole two-dimensional array of columns. Hence the need to multiply by the number of columns. As with the computation of the TSS, the calculation of the RSS shown above is laid out in a manner designed to show the underlying rationale of ANOVA, whereas a short-cut method may be used in an examination in which a calculator is permitted. For details of this method, see Appendix I.] Column calculations: Column means Deviation of column mean from grand mean Deviation squared Grand mean =81.58 Sum of squares: [Note that the sum of squares of the column means is much less than that of the row means, reflecting the fact that, simply from visual inspection of the data, there seems to be more variability from one subject to another than withinsubject variability due to the different effect of different drugs.] Column Sum of Squares (CSS) = Sum of squares of column means* No. of rows = * 4 = [This time we are multiplying the sum of squares of the column means by the number of rows, for an analogous reason to that given for multiplying the sum of squares of the row means by the number of columns. As with the RSS, an alternative method of calculating the CSS is given in Appendix I.] Within groups/error sum of squares (ESS): - ESS = TSS - RSS - CSS = 11, , = 83. [In an experimental design like this there are only two sources of variation in the CFF readings which are of interest to us: variation associated with different drugs (i.e. between-groups or 'horizontal' variation), and variation associated 11
12 with different subjects (i.e. within-groups or 'vertical' variation). So if we know the value of the total variation and the value of both these two sorts of variation, then 'error variation' is simply what is left over when we have subtracted both sorts from the total. Note that the term 'error' here does not imply any sort of mistake. In another context the error variation might be of interest to us. Some of it might be due to a time of day effect, for example, if the subjects were not all run at the same time of day. The error term in this case is extremely small relative to the main effects for Subject and Drug. The reason for this can be seen from the plot: the pattern of variation due to drug is very similar in each subject, so that by the time one has taken out the variation in the scores due to drug and that due to subject the effect of any other factor appears to be minimal.] Computation of mean squares: Row Mean Square (RMS) = RSS/(r-1) = 10,778.76/3 = Column Mean Square (CMS) = CSS/(c-1) = /2 = Error Mean Square (EMS ) = ESS/(r-1)(c-1) = 83/6 =13.83 Computation of degrees of freedom Rows: Degrees of freedom (d.f.) = (r-1) = (4-1) = 3 Columns: D.f. = (c-1) = (3-1) = 2 12
13 Error D.f. = (r-1)(c-1) = 3 * 2 = 6 [Note that there are always two separate sets of degrees of freedom to be taken into account when looking up the probability of an F value in a two-way ANOVA, one for the numerator in the F ratio (e.g. the RMS if one is looking up the Row main effect) and one for the numerator (always the EMS).] Results and conclusions ANOVA Table Source of Degrees of Sum of Mean F variation freedom Squares Square Rows 3 10, RMS/EMS= Columns CMS/EMS=34.60 Error Total: 11 11, The critical value for F at the 0.05 level with 2 and 6 degrees of freedom is 5.14 (see, for example, Table IX in Hoel, 1976, pp ). Our F value for the Column Mean Square/Error Mean Square variance ratio (also called the Main Effect for Drug) far exceeds this value; we can therefore reject the null hypothesis that the antihistamines do not significantly depress visual performance. The Main Effect for Subject is even more significant, as suggested by the plot. [It can be seen simply from visual inspection of the data that there is far greater variability from subject to subject than there is from one condition to another. This fact is reflected in the far larger value for the row sum of squares than for the columns sum of squares, which is the one we are interested in. In another context we might of course be interested in the within-samples variability, e.g. in a study of individual differences in response to antihistamines, or individual differences in base-line CFF.] 13
14 3.4 Design precautions which should have been taken: (1) The order in which the drugs/placebo were administered to each subject should have been randomised. This would prevent the possibility of the latent effect 7 of one of the drugs potentiating the effect of another. (2) Sufficient time should have been allowed to elapse between the administration of each drug to avoid any possibility of a carry-over effect 8 (i.e. the effect of the first drug treatment not having worn off before the second was given). 3.5 Two potential benefits from a repeated measures design: (1) Administering each drug more than once to each subject would have increased the reliability of the readings. (2) Repeated administration of each drug to each subject would have meant that it was possible to randomly vary the position of each drug for each subject in successive experiments. E.g. if Meclastine was the first drug to be administered to Subject 1 in the first experiment, then it could have been administered second or third in a second experiment. This randomisation of the order of administration between experiments and within subjects would have provided an even stronger control for any latent effect than simply randomising the order of administration between subjects in one experiment as recommended in section 3.4, (1). 7 8 See Appendix III See Appendix III 14
15 Appendix I Short-cut Method for Computing a Two-Way ANOVA 9 Definitions: Number of rows = r Number of columns = c 1. Calculate the row means and column means. 2. (a) Calculate the total variance of all the observations in the data using a calculator, (b) Calculate the total sum of squares (TSS), by multiplying the total variance by (n - 1), where n is the total number of observations. [Caveat: As mentioned in the text above, there are two separate operational definitions of variance (and hence of SD, its square root). Make sure your calculator is using the version for samples, which uses (n-1), rather than the version for populations, which uses N.] 3. Row calculations: (i) Compute the variance of the row means; i.e. enter your various row means from 1. into the calculator and find their variance, (ii) Compute the row mean square (RMS) by multiplying the variance of the row means by c (the number of columns), (iii) Compute row sum of squares (RSS), by multiplying RMS by (r 1). [The same caveat applies here as in the case of the short-cut computation of TSS given above: be sure you know which definition of variance your calculator is using.] 9 This Appendix summarises the method recommended by the handbook accompanying the Oxford University Psychology Prelims Statistics course: Definitions and Formulae with Statistical Tables for Elementary Statistics and Quantitative Methods Courses, Department of Statistics, University of Oxford,
16 4. Column calculations: (i) Compute the variance of the column means; i.e. enter your various column means from 1. into the calculator and find their variance, (ii) Compute column mean square (CMS) by multiplying the variance of the column means by r (the number of rows). (iii) Compute the column sum of squares (CSS) by multiplying CMS by (c 1). [Observe the usual caveat with regard to the definition of variance.] 5. Compute the error sum of squares (ESS): i.e. take both row sum of squares (RSS) and column sum of squares (CSS) from total sum of squares (TSS). In other words, take results 3 (iii) and 4 (iii) from result 2. The error sum of squares is whatever is left. 6. Compute the error mean square (EMS) from (5). i.e. divide the result of (5), the error sum of squares, by the product of (r 1) and (c 1). 7. You now have all the values you need to complete your ANOVA table (see worked example, above, for a model), work out your F ratios and look up their corresponding p-values. Note that there are always two separate sets of degrees of freedom to be taken into account when looking up the probability of an F value in a two-way ANOVA, one for the numerator in the F ratio (e.g. the RMS if one is looking up the Row main effect) and one for the numerator (always the EMS). 16
17 Appendix II THE THEORETICAL DISTINCTION BETWEEN SAMPLES AND POPULATIONS It's importance: 'Statistical methods may described as methods for drawing conclusions about populations by means of samples.' Hoel, p.2 Samples Populations Nature of measures: Always empirical/a posteriori May be theoretical/a priori (e.g. mean IQ score, or predictions from binomial); or, if unknown, may have to be represented by sample. Represented by: English alphabet Mainly Greek alphabet Examples: Mean μ s 2 Variance σ 2 S Standard σ deviation Proportion p SAMPLING - Key concepts Sampling error 10 or variability from sample to sample due to chance Sampling distribution of a statistic 'the most basic concept underlying all statistical tests' (Howell, p.90) 10 N.B. Does not imply any mistake. 17
18 tells us 'what degree of sample-to-sample variability we can expect by chance as a function of sampling error' (Howell, p.90) or: 'the distribution of values obtained for that statistic over repeated sampling' (Ibid.) derived mathematically rather than empirically Sampling distribution for the mean 'distribution of means of an infinite number of random samples' (Howell, p.90) Standard Error the standard deviation of a sampling distribution of a statistic 18
19 Appendix III REPEATED MEASURES AND INTERACTION EFFECTS Repeated Measures Definition: Any experiment in which the same variable is measured more than once for each subject is a repeated measures design; e.g. a paired t-test in which two measures, 'before' and 'after', are taken for each S. Advantage of repeated measures designs: - they reduce overall variability by using fewer subjects - they control for individual differences Disadvantages: ` - the carry-over effect (e.g. effect of first drug treatment has not worn off before second is given and tested) - the latent effect (second drug potentiates or activates the latent effect of the first) - the learning effect (self-explanatory; most relevant to psychology experiments) Interaction An interaction effect should be thought of in contradistinction to a main effect. 19
20 Main Effects A main effect is what we would measure if we were to collapse all our data into the several levels of a single factor and compare the resulting means. Interaction Effects An interaction effect can be thought of as a correlation or interaction between two or more factors. In other words, the influence of the two or more factors is not independent of each other - they interact. Interaction Effects may be 'second-order' but they are not necessarily secondary in importance An interaction effect may be of greater interest than a main effect. In fact it may be the main object of investigation of an experiment. See Howell (1997) for discussions of the basic ideas of both interaction (pp ) and repeated measures (pp ). Reference: Howell, David C. (1997). Statistical Methods for Psychology (4 th London: Duxbury Press. edition). 20
21 Appendix IV PARAMETRIC VERSUS NON-PARAMETRIC TESTS Parametric Non-parametric Differences: Assume normality in underlying population distribution No assumptions about nature of underlying distribution Deal in means Deal in medians Deal with intervals Deal with ranks Advantages: More power Fewer assumptions (so good for skewed distributions, for example) Less affected by outliers (re mean and variance) Simpler to calculate Disadvantages: Mean and variance disproportionately affected by outliers Affected by skewness Less power (because use less information) CORRESPONDING TESTS Parametric T-test, independent samples T-test, matched pairs ANOVA (one-way) ANOVA (2-factor) Chi-square Product-Moment Correlation Non-parametric Mann-Whitney ( Rank Sum ) Median (less powerful than Mann- Whitney) Wilcoxon for matched pairs Sign (less powerful than Wilcoxon) Kruskal-Wallis (generalization of Wilcoxon) Friedman Kolmogorov-Smirnov Spearman s Rank Correlation 21
22 Some other publications from Oxford Forum are described on the following webpage: They are available from Blackwells Bookshop, Oxford (tel ) and from Amazon UK. 22
First-year Statistics for Psychology Students Through Worked Examples
First-year Statistics for Psychology Students Through Worked Examples 1. THE CHI-SQUARE TEST A test of association between categorical variables by Charles McCreery, D.Phil Formerly Lecturer in Experimental
First-year Statistics for Psychology Students Through Worked Examples. 2. Probability and Bayes Theorem
First-year Statistics for Psychology Students Through Worked Examples 2. Probability and Bayes Theorem by Charles McCreery, D.Phil Formerly Lecturer in Experimental Psychology Magdalen College Oxford Copyright
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
UNIVERSITY OF NAIROBI
UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
Introduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
Projects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
Rank-Based Non-Parametric Tests
Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs
CHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
SPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
Post-hoc comparisons & two-way analysis of variance. Two-way ANOVA, II. Post-hoc testing for main effects. Post-hoc testing 9.
Two-way ANOVA, II Post-hoc comparisons & two-way analysis of variance 9.7 4/9/4 Post-hoc testing As before, you can perform post-hoc tests whenever there s a significant F But don t bother if it s a main
Statistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test
Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
Statistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other
1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric
Permutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics
Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to
SPSS Tests for Versions 9 to 13
SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list
Using Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
Parametric and Nonparametric: Demystifying the Terms
Parametric and Nonparametric: Demystifying the Terms By Tanya Hoskin, a statistician in the Mayo Clinic Department of Health Sciences Research who provides consultations through the Mayo Clinic CTSA BERD
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Comparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Nonparametric Statistics
Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics
Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170
Work Sheet 2: Calculating a Chi Square Table 1: Substance Abuse Level by ation Total/Marginal 63 (a) 17 (b) 10 (c) 90 35 (d) 20 (e) 25 (f) 80 Totals/Marginal 98 37 35 170 Step 1: Label Your Table. Label
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2
About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing
Two-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
Analysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217
Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing
Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures
Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:
Non-Parametric Tests (I)
Lecture 5: Non-Parametric Tests (I) KimHuat LIM [email protected] http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent
individualdifferences
1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,
This chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
DATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples
Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours
NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)
NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions
Chapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)
UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:
Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours
One-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
NAG C Library Chapter Introduction. g08 Nonparametric Statistics
g08 Nonparametric Statistics Introduction g08 NAG C Library Chapter Introduction g08 Nonparametric Statistics Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York
Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York . NONPARAMETRIC STATISTICS I. DEFINITIONS A. Parametric
5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.
The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution
Come scegliere un test statistico
Come scegliere un test statistico Estratto dal Capitolo 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc. (disponibile in Iinternet) Table
Normality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected]
StatCrunch and Nonparametric Statistics
StatCrunch and Nonparametric Statistics You can use StatCrunch to calculate the values of nonparametric statistics. It may not be obvious how to enter the data in StatCrunch for various data sets that
Correlation key concepts:
CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)
Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science [email protected]
Dept of Information Science [email protected] October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic
Standard Deviation Estimator
CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of
HYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University
DATA ANALYSIS QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University Quantitative Research What is Statistics? Statistics (as a subject) is the science
1 Nonparametric Statistics
1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions
CHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
THE KRUSKAL WALLLIS TEST
THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON
Testing Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
One-Way Analysis of Variance
One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
The Kruskal-Wallis test:
Graham Hole Research Skills Kruskal-Wallis handout, version 1.0, page 1 The Kruskal-Wallis test: This test is appropriate for use under the following circumstances: (a) you have three or more conditions
Week 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
Sample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
Hypothesis Testing: Two Means, Paired Data, Two Proportions
Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this
Chapter G08 Nonparametric Statistics
G08 Nonparametric Statistics Chapter G08 Nonparametric Statistics Contents 1 Scope of the Chapter 2 2 Background to the Problems 2 2.1 Parametric and Nonparametric Hypothesis Testing......................
SAS/STAT. 9.2 User s Guide. Introduction to. Nonparametric Analysis. (Book Excerpt) SAS Documentation
SAS/STAT Introduction to 9.2 User s Guide Nonparametric Analysis (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
The Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES
MASTER COURSE SYLLABUS-PROTOTYPE THE PSYCHOLOGY DEPARTMENT VALUES ACADEMIC FREEDOM AND THUS OFFERS THIS MASTER SYLLABUS-PROTOTYPE ONLY AS A GUIDE. THE INSTRUCTORS ARE FREE TO ADAPT THEIR COURSE SYLLABI
Intro to Parametric & Nonparametric Statistics
Intro to Parametric & Nonparametric Statistics Kinds & definitions of nonparametric statistics Where parametric stats come from Consequences of parametric assumptions Organizing the models we will cover
Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
Chapter 12 Nonparametric Tests. Chapter Table of Contents
Chapter 12 Nonparametric Tests Chapter Table of Contents OVERVIEW...171 Testing for Normality...... 171 Comparing Distributions....171 ONE-SAMPLE TESTS...172 TWO-SAMPLE TESTS...172 ComparingTwoIndependentSamples...172
Chi Square Tests. Chapter 10. 10.1 Introduction
Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
