Biodiversity Data Analysis: Testing Statistical Hypotheses By Joanna Weremijewicz, Simeon Yurek, Steven Green, Ph. D. and Dana Krempels, Ph. D.


 Victoria Austin
 1 years ago
 Views:
Transcription
1 Biodiversity Data Analysis: Testing Statistical Hypotheses By Joanna Weremijewicz, Simeon Yurek, Steven Green, Ph. D. and Dana Krempels, Ph. D. In biological science, investigators often collect biological observations that can be tabulated as numerical facts, also known as data (singular = datum). Biological research can yield several different types of data. Important measurements include counts (frequency) and those that describe characteristics (length, mass, etc.). Data from a sample are often used to calculate estimates of the average values of the population of interest (mean, mode, and median) and others describing the dispersion around those values (range, variance, and standard deviation). I. Data, Parameters, and Statistics: A Review Recall that data can be of three basic types: 1. Attribute data. These are descriptive, "eitheror" measurements, and usually describe the presence or absence of a particular attribute. The presence or absence of a genetic trait ("freckles" or "no freckles") or the type of genetic trait (type A, B, AB or o blood) are examples. Because such data have no specific sequence, they are considered unordered. 2. Discrete numerical data. These correspond to biological observations counted as integers (whole numbers). The number of leaves on each member of a group of plants, the number of breaths per minute in a group of newborns or the number of beetles per square meter of forest floor are all examples of discrete numerical data. These data are ordered, but do not describe physical attributes of the things being counted. 3. Continuous numerical data. These are data that fall along a numerical continuum. The limit of resolution of such data is the accuracy of the methods and instruments used to collect them. Examples are tail length, brain volume, percent body fat...anything that varies on a continuous scale. Rates (such as decomposition of hydrogen peroxide per minute or uptake of oxygen during respiration over the course of an hour) are also numerical continuous data. (Figure 1). (Continuous numerical data generally fall along a normal (Gaussian) distribution. This distribution is a function indicating the probability that a data point will fall between any two real numbers.) When an investigator collects numerical data from a group of subjects, s/he must determine how and with what frequency the data vary. For example, if one wished to study the distribution of shoe size in the human population, one might measure the shoe size of a sample of the human population (say, 50 individuals) and graph the numbers with "shoe size" on the xaxis and "number of individuals" on the yaxis. The resulting figure shows the frequency distribution of the data, a representation of how often a particular data point occurs at a given measurement. Biodiversity Data Analysis 1
2 Usually, data measurements are distributed over a range of values. Measures of the tendency of measurements to occur near the center of the range include the population mean (the average measurement), the median (the measurement located at the exact center of the range) and the mode (the most common measurement in the range). It is also important to understand how much variation a group of subjects exhibits around the mean. For example, if the average human shoe size is "9," we must determine whether shoe size forms a very wide distribution (with a relatively small number of individuals wearing all sizes from 115) or one which hovers near the mean (with a relatively large number of individuals wearing sizes 7 through 10, and many fewer wearing sizes 16 and 1115). Measurements of dispersion around the mean include the range, variance and standard deviation. Parameters and Statistics If you were able to measure the height of every adult male Homo sapiens who ever existed, and then calculate a mean, median, mode, range, variance and standard deviation from your measurements, those values would be known as parameters. They represent the actual values as calculated from measuring every member of a population of interest. Obviously, it is very difficult to obtain data from every member of a population of interest, and impossible of that population is theoretically infinite in size. However, one can estimate parameters by randomly sampling members of the population. Such an estimate, calculated from measurements of a subset of the entire population, is known as a statistic. In general, parameters are written as Greek symbols equivalent to the Roman symbols used to represent statistics. For example, the standard deviation for a subset of an entire population is written as "s", whereas the true population parameter is written as σ. II. From Raw Data to Index of Biodiversity Now that you ve had a chance to review a bit of statistical information, it s time to apply it to your own project. In this section, you will be guided through the process of calculating indices from your raw data collected over the past two weeks, and then using those indices to compare the two habitat types you chose. A. Ordinal Data Points: Menhinick s Index (D) When you collected and counted organisms in your samples, you were taking a survey of the number of different species present in each of your two habitat types. You counted the number of individuals of various species in 12 samples collected from each of your two selected habitat types. From these counts, you can calculate a Menhinick s Index (D) for each counted sample. At the end of your preliminary calculations, you should have ten D values for each of the two habitats you are comparing. You will use these D values in the MannWhitney U test to determine whether your two habitats differ significantly in the measure of biodiversity you have chosen (species richness). Recall the formula for Menhinick s index, which represents the number of species in the sample divided by the square root of the number of individuals in the sample. s = the number of different species in your sample N = the total number of individual organisms in the sample. Biodiversity Data Analysis 2
3 Your team should have counted at least 10 samples from each of your two habitats, and can now calculate one Menhinick s index (D value) for each sample. Tabulate your D values here: Sample # D habitat1 D habitat So what do we do with these indices? You may have an intuitive sense that they will allow you to determine whether your two sampled habitats overlap in their degrees of biodiversity. But science isn t about intuition. Statistics and statistical tests are used to test whether the results of an experiment are significantly different from the null hypothesis prediction. What is meant by "significant?" For that matter, what is meant by "expected" results? To answer these questions, we must consider the matter of probability. B. Probability The significance level (also known as alpha (α)) for a given study is set by the investigator before the analysis is begun. Alpha is defined as the probability of mistakenly rejecting a null hypothesis that is true (Type I error). By convention, α is usually set at 0.05 (5%). The probability that an observed result is due to some factor other than chance is known as P. The result of a statistical test is a statistic. For example, the student s t test yields a t statistic, the Chisquare test yields a X2 statistic, and the MannWhitney U test yields a U statistic. Every value of a particular statistic is associated with a particular P value. If the P value associated with a calculated statistic (e.g., the U statistic you will calculate with the Mann Whitney test, to be described below) is 0.05, this means that there is only a 5% chance that the rejection of the null hypothesis will be incorrect. A P value of less than 0.05 means that there is an even lower chance of a Type 1 error. (For example, a P value of 0.01 means that there is only a 1% chance that the results are due to chance, and not to the factor you are examining.) In essence, α is a cut off value that defines the area(s) in a probability distribution where a particular value is unlikely to fall. In some studies, a more rigorous α of 0.01 (1%) is required to reject the null hypothesis, and in some others, a more lenient α of 0.1 (10%) is allowed for rejection of the null hypothesis. For our study of biodiversity, you will use an α level of The term "significant" as used in every day conversation is not the same as the statistical meaning of the word. In scientific endeavors, significance has a highly specific and important definition. Every time you read the word "significant" in this lab manual, know that we refer to the following scientifically accepted standard: Biodiversity Data Analysis 3
4 The difference between an observed and expected result is said to be statistically significant if and only if: Under the assumption that there is no true difference, the probability that the observed difference would be at least as large as that actually seen is less than or equal to α (5%; 0.05). Conversely, under the assumption that there is no true difference, the probability that the observed difference would be smaller than that actually seen is greater than 95% (0.95). Once an investigator has calculated a statistic from collected data, s/he must be able to draw conclusions from it. How does one determine whether deviations from the expected (null hypothesis) are significant? There is a specific probability value linked to every possible value of any statistic. A probability distribution assigns a relative probability of any possible outcome (e.g., Menhinick s Index). The species richness calculations you performed for each sample, while expressed as a number, are not distributed along a normal curve. They are ordinal, rather than continuous, data. For this reason, a nonparametric statistical test, the MannWhitney U test, will be employed for your analysis. C. Statistical Hypotheses A nonparametric test is used to test the significance of qualitative or attribute data such as those you have been collecting for this research project. In the following sections, you will learn how to apply a statistical test to your data. Your team should already have devised two statistical hypotheses stated in terms of opposing statements, the null hypothesis (H o ) and the alternative hypothesis (H a ). The null hypothesis states that there is no significant difference between two populations being compared. The alternative hypothesis may be either directional (onetailed), stating the precise way in which the two populations will differ ( Pond A will have greater species richness than Pond B. ), or nondirectional (twotailed), not specifying the way in which two populations will differ ( Pond A and Pond B will differ in species richness ). Your team should already have devised null and alternative hypotheses for your survey of biodiversity. To determine whether or not there is a difference in biodiversity between your two sample sites, you must now perform statistical tests on your data, the series of Menhinick s Indices (D) that you calculated from your individual survey samples. III. Applying a Statistical Test to Your Menhinick s Indices Once your team has calculated a Menhinick s index (D) for each of your 12 samples from each of the two habitats, you are ready to employ a statistical test to determine whether there is overlap between the range of calculated indices. If there is a great deal of overlap, it means that there is not a significant difference between them, and you will fail to reject your null hypothesis. However, if there is very little overlap (5% or less), you can confidently conclude that two habitats do differ significantly in their species richness, and reject your null hypothesis. A. Nonparametric test for two samples: MannWhitney U The MannWhitney test allows the investigator (you) to compare your two habitat types without assuming that your D values are normally distributed. The MannWhitney U does have its rules. For this test to be appropriate: Biodiversity Data Analysis 4
5 You must be comparing two random, independent samples (your two sites) The measurements (Menhinick s Indices, in our case) should be ordinal No two measurements should have exactly the same value (though we can deal with ties in a way that will be explained shortly). The MannWhitney U test allows the investigator to determine whether there is a significant difference between two sets of ordered/ranked data, such as those your team has collected in its biodiversity study. Here is a stepwise explanation and example of how to apply this test to your data. 1. State your null and alternative hypotheses. (You already have done this, right?) H o : H a : Example: H o : There is no difference in the ranks of species richness between a silted pond and a clear pond. H o : There is a difference in the ranks of species richness between a silted pond and a clear pond. 2. State the significance level (alpha, α) necessary to reject H o. This is typically P < Rank your Menhinick s Indices from smallest to largest in a table, noting which index came from which habitat. Example: Table 1 shows 18 (imaginary) values for Menhinick s Indices from the two ponds mentioned before, silted (S) and clear (C). Table 2 shows the values ranked and labeled by pond type. Table 1. Menhinick s Indices Table 2. Ranked Menhinick s Indices for silted and clear ponds D silted D clear Rank Ranked D values Habitat S S S S S S C S S C C C S C C C C C Notice in the ranked table that if two values are the same, then the rank each one receives is the average of the two ranks. For example, value nine appears twice, at rank 6 and 7. Add the two ranks and divide by two to get their mean: 13/2 = 6.5. Each value is assigned their same, mean rank whenever there is a tie. Biodiversity Data Analysis 5
6 4. Assign points to each ranked value. Each silted rank gets one point for every clear rank that appears below it. Every clear value gets one point for every silted value that appears below it. For example, the first rank, 2(s) has 9 clear values below it, so it gets 9 points. Value 9(c) has 3 silted values below it, so it gets 3 points. Table 3. Points assigned to ranked D values in silted and clear ponds. Rank Ranked D Habitat Points values 1 2 S S S S S S C S S C C C S C C C C C 0 5. Calculate a U statistic for each category by adding the points for each habitat. U silted = = 75 U clear = = 6 Your final U value is the smaller of these two values. In this example our U value is 6. In general, the lower the U value, the greater the difference between the two groups being tested. (For example, if none of the D values overlapped, the U value would be zero. That means there is a large difference between the two groups: they do not overlap at all.) 6. You are now ready to move to the final step, determining whether to reject or fail to reject your null hypothesis. (Proceed to Section IV.) A video explanation of the MannWhitney U test procedure can be viewed here: B. Nonparametric test for multiple samples: Kruskal Wallis test We told you not to. But some teams just have to go that extra mile. Biodiversity Data Analysis 6
7 If your team is comparing more than two nonparametric data sets, a useful test, analogous to the ANOVA (Analysis Of Variance), is the KruskalWallis test. This is well explained here: Kruskal Wallis: But you re on your own. We warned you. IV. Critical values for nonparametric statistics As you already know, a specific probability value linked to every possible value of any statistic, including the MannWhitney U statistic you just calculated. A. Critical values for the MannWhitney U statistic Remember that we have defined our significance level (α) as This implies that a correct null hypothesis will be rejected only 5% of the time, but correctly identified as false 95% of the time. A critical value of a statistic (e.g., your MannWhitney U statistic) is that value associated with a significance level of 0.05 or lower. The critical values for the MannWhitney U statistic are listed in Table 4. Compare your U value to those shown in the Table of Critical Values for the MannWhitney U (Table 4). Find the sample size (i.e., the number of Menhinick s Indices (D) you calculated) for each of your two habitats, and use the matrix to find the critical value for U at those two sample sizes. (For example, if you calculated 19 D values for one habitat and 17 for the other, then the critical value of the U statistic would be 99. This means that a U value of 99 or lower indicates rejection of the null hypothesis. In our example calculation, there were nine samples from two different habitats. In the MannWhitney U table, that corresponds to a critical value of 17. Our U statistic was 6, which is quite a bit lower than 17. This means that, if these were real data, we would reject the null hypothesis and fail to reject the alternative hypothesis. There is a significant difference in species richness between the clear and silted ponds. If your U value is lower than the critical value at the appropriate spot in the table, reject your null hypothesis. If your U value is greater than that in the table, fail to reject. B. Critical values for the Kruskall Wallis statistic If your team went crazy and decided to sample more than two different habitats, then your data analysis will be more complex. You will still use a nonparametric test, but it will be analogous to the parametric ANOVA, not the ttest. In this case, you will use the Kruskal Wallis test, as shown in the video linked above. Kruskal Wallis critical values are more complex, as they involve more than two data sets. Fortunately for us, J. Patrick Meyer (University of Virginia) and Michael A. Seaman (University of South Caroina) have made available a limited portion of a table of critical values they have calculated. These can be found here if your project involves either three or four data sets: The tables are not complete, but they do provide critical values for α levels of 0.1, 0.5, and You are unlikely to need other values; these will tell you whether to reject or fail to reject your null hypothesis. Biodiversity Data Analysis 7
8 Table 5. Critical values for the MannWhitney U statistic. Find the value that corresponds to the sample sizes of your two habitats. If your U value is smaller than that shown in the table, then there is less than 5% chance that the difference between your two habitats is due to chance alone. If your U value is smaller than the one shown in this table for your two sample sizes, reject your null hypothesis. If your U value is larger than that shown in the table, fail to reject your null hypothesis. (From The Open Door Web Site, Biodiversity Data Analysis 8
9 V. Project Completed. Is This the End? The study you are now completing is only the beginning of what could be a longterm research project to discover the various factors that affect biodiversity. The only thing you are determining now is whether or not there is a statistically significant difference between your two sample habitats. In other words, the research project you are now completing is a pilot Biodiversity Data Analysis 9
10 study. It establishes an observable fact (i.e., that there is or is not a difference in biodiversity between your two sample habitats). That fact should be subject to further investigation beyond what you have accomplished here. Although you may have established that there is or is not a difference in biodiversity between your two sample habitats, you still cannot definitively state why or why not there is a difference. To do that, you must move to the next step, which is to list as many competing hypotheses as possible as to why there is a difference (or even if your team has obtained negative results why there is not a difference, despite obvious differences in your two sample habitats). Each of these multiple hypotheses could form the basis for a research project that would take your team one step further towards discovering the reasons for your pilot study s observed result. You should be able to give a brief description of an experiment that could be designed to test each of your competing hypotheses. In your presentation, be sure to include a list of hypotheses that could explain your observed results. What factors differed between the two habitats that might cause differences in biodiversity? Would these factors affect the physiology of any organisms that lived there? Or would they simply be more hospitable to certain species and not others? When you consider your results, consider every aspect of your findings, and report anything you find intriguing enough to warrant further study. Science is not a oneproject endeavor. Every finding of every research project can be seen as opening a new doorway to discovery of the most intimate mechanisms of life. Biodiversity Data Analysis 10
Appendix 2 Statistical Hypothesis Testing 1
BIL 151 Data Analysis, Statistics, and Probability By Dana Krempels, Ph.D. and Steven Green, Ph.D. Most biological measurements vary among members of a study population. These variations may occur for
More informationAppendix I The Scientific Method
Appendix I The Scientific Method The study of science is different from other disciplines in many ways. Perhaps the most important aspect of hard science is its adherence to the principle of the scientific
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationInferential Statistics
Inferential Statistics Sampling and the normal distribution Zscores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationLecture Topic 6: Chapter 9 Hypothesis Testing
Lecture Topic 6: Chapter 9 Hypothesis Testing 9.1 Developing Null and Alternative Hypotheses Hypothesis testing can be used to determine whether a statement about the value of a population parameter should
More informationSupplement on the KruskalWallis test. So what do you do if you don t meet the assumptions of an ANOVA?
Supplement on the KruskalWallis test So what do you do if you don t meet the assumptions of an ANOVA? {There are other ways of dealing with things like unequal variances and nonnormal data, but we won
More informationChapter 21 Section D
Chapter 21 Section D Statistical Tests for Ordinal Data The ranksum test. You can perform the ranksum test in SPSS by selecting 2 Independent Samples from the Analyze/ Nonparametric Tests menu. The first
More informationCHAPTER 11 CHISQUARE: NONPARAMETRIC COMPARISONS OF FREQUENCY
CHAPTER 11 CHISQUARE: NONPARAMETRIC COMPARISONS OF FREQUENCY The hypothesis testing statistics detailed thus far in this text have all been designed to allow comparison of the means of two or more samples
More information3. Nonparametric methods
3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests
More informationIntroduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses
Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the
More informationNonparametric Statistics
1 14.1 Using the Binomial Table Nonparametric Statistics In this chapter, we will survey several methods of inference from Nonparametric Statistics. These methods will introduce us to several new tables
More informationNonparametric tests I
Nonparametric tests I Objectives MannWhitney Wilcoxon Signed Rank Relation of Parametric to Nonparametric tests 1 the problem Our testing procedures thus far have relied on assumptions of independence,
More informationCHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U
CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U Previous chapters of this text have explained the procedures used to test hypotheses using interval data (ttests and ANOVA s) and nominal
More informationChapter 3: Nonparametric Tests
B. Weaver (15Feb00) Nonparametric Tests... 1 Chapter 3: Nonparametric Tests 3.1 Introduction Nonparametric, or distribution free tests are socalled because the assumptions underlying their use are fewer
More informationDifference tests (2): nonparametric
NST 1B Experimental Psychology Statistics practical 3 Difference tests (): nonparametric Rudolf Cardinal & Mike Aitken 10 / 11 February 005; Department of Experimental Psychology University of Cambridge
More informationBox plots & ttests. Example
Box plots & ttests Box Plots Box plots are a graphical representation of your sample (easy to visualize descriptive statistics); they are also known as boxandwhisker diagrams. Any data that you can
More informationVariables and Data A variable contains data about anything we measure. For example; age or gender of the participants or their score on a test.
The Analysis of Research Data The design of any project will determine what sort of statistical tests you should perform on your data and how successful the data analysis will be. For example if you decide
More informationComparing two groups (t tests...)
Page 1 of 33 Comparing two groups (t tests...) You've measured a variable in two groups, and the means (and medians) are distinct. Is that due to chance? Or does it tell you the two groups are really different?
More informationTRANSCRIPT: In this lecture, we will talk about both theoretical and applied concepts related to hypothesis testing.
This is Dr. Chumney. The focus of this lecture is hypothesis testing both what it is, how hypothesis tests are used, and how to conduct hypothesis tests. 1 In this lecture, we will talk about both theoretical
More informationLecture 7: Binomial Test, Chisquare
Lecture 7: Binomial Test, Chisquare Test, and ANOVA May, 01 GENOME 560, Spring 01 Goals ANOVA Binomial test Chi square test Fisher s exact test Su In Lee, CSE & GS suinlee@uw.edu 1 Whirlwind Tour of One/Two
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NONPARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NONPARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationSuggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009
Petter Mostad Matematisk Statistik Chalmers Suggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009 1. (a) To use a ttest, one must assume that both groups of
More information1. Why the hell do we need statistics?
1. Why the hell do we need statistics? There are three kind of lies: lies, damned lies, and statistics, British Prime Minister Benjamin Disraeli (as credited by Mark Twain): It is easy to lie with statistics,
More informationSample Size Determination
Sample Size Determination Population A: 10,000 Population B: 5,000 Sample 10% Sample 15% Sample size 1000 Sample size 750 The process of obtaining information from a subset (sample) of a larger group (population)
More informationNonparametric TwoSample Tests. Nonparametric Tests. Sign Test
Nonparametric TwoSample Tests Sign test MannWhitney Utest (a.k.a. Wilcoxon twosample test) KolmogorovSmirnov Test Wilcoxon SignedRank Test TukeyDuckworth Test 1 Nonparametric Tests Recall, nonparametric
More informationTesting Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
More informationStatistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
More informationInferential Statistics. Probability. From Samples to Populations. Katie RommelEsham Education 504
Inferential Statistics Katie RommelEsham Education 504 Probability Probability is the scientific way of stating the degree of confidence we have in predicting something Tossing coins and rolling dice
More informationRankBased NonParametric Tests
RankBased NonParametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs
More informationData Analysis. Lecture Empirical Model Building and Methods (Empirische Modellbildung und Methoden) SS Analysis of Experiments  Introduction
Data Analysis Lecture Empirical Model Building and Methods (Empirische Modellbildung und Methoden) Prof. Dr. Dr. h.c. Dieter Rombach Dr. Andreas Jedlitschka SS 2014 Analysis of Experiments  Introduction
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationOneSample ttest. Example 1: Mortgage Process Time. Problem. Data set. Data collection. Tools
OneSample ttest Example 1: Mortgage Process Time Problem A faster loan processing time produces higher productivity and greater customer satisfaction. A financial services institution wants to establish
More informationPower & Effect Size power Effect Size
Power & Effect Size Until recently, researchers were primarily concerned with controlling Type I errors (i.e. finding a difference when one does not truly exist). Although it is important to make sure
More information1 Nonparametric Statistics
1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions
More informationThe Wilcoxon RankSum Test
1 The Wilcoxon RankSum Test The Wilcoxon ranksum test is a nonparametric alternative to the twosample ttest which is based solely on the order in which the observations from the two samples fall. We
More informationHypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam
Hypothesis Testing 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 3 2. Hypothesis Testing... 3 3. Hypothesis Tests Concerning the Mean... 10 4. Hypothesis Tests
More informationOutline of Topics. Statistical Methods I. Types of Data. Descriptive Statistics
Statistical Methods I Tamekia L. Jones, Ph.D. (tjones@cog.ufl.edu) Research Assistant Professor Children s Oncology Group Statistics & Data Center Department of Biostatistics Colleges of Medicine and Public
More informationResearch Variables. Measurement. Scales of Measurement. Chapter 4: Data & the Nature of Measurement
Chapter 4: Data & the Nature of Graziano, Raulin. Research Methods, a Process of Inquiry Presented by Dustin Adams Research Variables Variable Any characteristic that can take more than one form or value.
More informationStatistics for Management IISTAT 362Final Review
Statistics for Management IISTAT 362Final Review Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. The ability of an interval estimate to
More informationHypothesis Testing with z Tests
CHAPTER SEVEN Hypothesis Testing with z Tests NOTE TO INSTRUCTOR This chapter is critical to an understanding of hypothesis testing, which students will use frequently in the coming chapters. Some of the
More informationLecture 13: Kolmogorov Smirnov Test & Power of Tests
Lecture 13: Kolmogorov Smirnov Test & Power of Tests S. Massa, Department of Statistics, University of Oxford 2 February 2016 An example Suppose you are given the following 100 observations. 0.160.680.320.85
More informationHypothesis Testing. Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University
Hypothesis Testing Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 AMU / BonTech, LLC, JourniTech Corporation Copyright 2015 Learning Objectives Upon successful
More informationHypothesis Testing. Concept of Hypothesis Testing
Quantitative Methods 2013 Hypothesis Testing with One Sample 1 Concept of Hypothesis Testing Testing Hypotheses is another way to deal with the problem of making a statement about an unknown population
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationLAB : THE CHISQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHISQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationWe have already discussed hypothesis testing in study unit 13. In this
14 study unit fourteen hypothesis tests applied to means: two related samples We have already discussed hypothesis testing in study unit 13. In this study unit we shall test a hypothesis empirically in
More informationChapter 16 Multiple Choice Questions (The answers are provided after the last question.)
Chapter 16 Multiple Choice Questions (The answers are provided after the last question.) 1. Which of the following symbols represents a population parameter? a. SD b. σ c. r d. 0 2. If you drew all possible
More informationACTM State ExamStatistics
ACTM State ExamStatistics For the 25 multiplechoice questions, make your answer choice and record it on the answer sheet provided. Once you have completed that section of the test, proceed to the tiebreaker
More informationChapter 8. Professor Tim Busken. April 20, Chapter 8. Tim Busken. 8.2 Basics of. Hypothesis Testing. Works Cited
Chapter 8 Professor April 20, 2014 In Chapter 8, we continue our study of inferential statistics. Concept: Inferential Statistics The two major activities of inferential statistics are 1 to use sample
More informationHypothesis Testing. Bluman Chapter 8
CHAPTER 8 Learning Objectives C H A P T E R E I G H T Hypothesis Testing 1 Outline 81 Steps in Traditional Method 82 z Test for a Mean 83 t Test for a Mean 84 z Test for a Proportion 85 2 Test for
More informationIntroduction to Statistics for Computer Science Projects
Introduction Introduction to Statistics for Computer Science Projects Peter Coxhead Whole modules are devoted to statistics and related topics in many degree programmes, so in this short session all I
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More information103 Measures of Central Tendency and Variation
103 Measures of Central Tendency and Variation So far, we have discussed some graphical methods of data description. Now, we will investigate how statements of central tendency and variation can be used.
More information1/22/2016. What are paired data? Tests of Differences: two related samples. What are paired data? Paired Example. Paired Data.
Tests of Differences: two related samples What are paired data? Frequently data from ecological work take the form of paired (matched, related) samples Before and after samples at a specific site (or individual)
More informationAnswer keys for Assignment 10: Measurement of study variables
Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. In a study, participants are asked to indicate the type of pet they have at home (ex: dog,
More informationHypothesis Testing Summary
Hypothesis Testing Summary Hypothesis testing begins with the drawing of a sample and calculating its characteristics (aka, statistics ). A statistical test (a specific form of a hypothesis test) is an
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationThe Logic of Statistical Inference Testing Hypotheses
The Logic of Statistical Inference Testing Hypotheses Confirming your research hypothesis (relationship between 2 variables) is dependent on ruling out Rival hypotheses Research design problems (e.g.
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jintselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jintselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationThe alternative hypothesis,, is the statement that the parameter value somehow differs from that claimed by the null hypothesis. : 0.5 :>0.5 :<0.
Section 8.28.5 Null and Alternative Hypotheses... The null hypothesis,, is a statement that the value of a population parameter is equal to some claimed value. :=0.5 The alternative hypothesis,, is the
More informationStatistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stemandleaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationStatistical Inference and ttests
1 Statistical Inference and ttests Objectives Evaluate the difference between a sample mean and a target value using a onesample ttest. Evaluate the difference between a sample mean and a target value
More informationHomework #3 is due Friday by 5pm. Homework #4 will be posted to the class website later this week. It will be due Friday, March 7 th, at 5pm.
Homework #3 is due Friday by 5pm. Homework #4 will be posted to the class website later this week. It will be due Friday, March 7 th, at 5pm. Political Science 15 Lecture 12: Hypothesis Testing Sampling
More informationTutorial 5: Hypothesis Testing
Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrclmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................
More informationOverview of NonParametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS
Overview of NonParametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical
More informationData Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools
Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 15 scale to 0100 scores When you look at your report, you will notice that the scores are reported on a 0100 scale, even though respondents
More informationDescribing Populations Statistically: The Mean, Variance, and Standard Deviation
Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly
More informationHypothesis tests, confidence intervals, and bootstrapping
Hypothesis tests, confidence intervals, and bootstrapping Business Statistics 41000 Fall 2015 1 Topics 1. Hypothesis tests Testing a mean: H0 : µ = µ 0 Testing a proportion: H0 : p = p 0 Testing a difference
More informationAP: LAB 8: THE CHISQUARE TEST. Probability, Random Chance, and Genetics
Ms. Foglia Date AP: LAB 8: THE CHISQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationEBM Cheat Sheet Measurements Card
EBM Cheat Sheet Measurements Card Basic terms: Prevalence = Number of existing cases of disease at a point in time / Total population. Notes: Numerator includes old and new cases Prevalence is crosssectional
More informationThere are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month
More informationSampling Distributions and the Central Limit Theorem
135 Part 2 / Basic Tools of Research: Sampling, Measurement, Distributions, and Descriptive Statistics Chapter 10 Sampling Distributions and the Central Limit Theorem In the previous chapter we explained
More informationCHAPTER 3 COMMONLY USED STATISTICAL TERMS
CHAPTER 3 COMMONLY USED STATISTICAL TERMS There are many statistics used in social science research and evaluation. The two main areas of statistics are descriptive and inferential. The third class of
More information1 SAMPLE SIGN TEST. NonParametric Univariate Tests: 1 Sample Sign Test 1. A nonparametric equivalent of the 1 SAMPLE TTEST.
NonParametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A nonparametric equivalent of the 1 SAMPLE TTEST. ASSUMPTIONS: Data is nonnormally distributed, even after log transforming.
More informationHypothesis Testing 1
Hypothesis Testing 1 Statistical procedures for addressing research questions involves formulating a concise statement of the hypothesis to be tested. The hypothesis to be tested is referred to as the
More informationChapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 81 Overview 82 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 81 Overview 82 Basics of Hypothesis Testing 83 Testing a Claim About a Proportion 85 Testing a Claim About a Mean: s Not Known 86 Testing
More informationTHE KRUSKAL WALLLIS TEST
THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKALWALLIS TEST: The nonparametric alternative to ANOVA: testing for difference between several independent groups 2 NON
More informationIntroduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters  they must be estimated. However, we do have hypotheses about what the true
More informationChapter 8 Hypothesis Testing
Chapter 8 Hypothesis Testing Chapter problem: Does the MicroSort method of gender selection increase the likelihood that a baby will be girl? MicroSort: a genderselection method developed by Genetics
More informationHomework 6 Solutions
Math 17, Section 2 Spring 2011 Assignment Chapter 20: 12, 14, 20, 24, 34 Chapter 21: 2, 8, 14, 16, 18 Chapter 20 20.12] Got Milk? The student made a number of mistakes here: Homework 6 Solutions 1. Null
More informationChapter 9. TwoSample Tests. Effect Sizes and Power Paired t Test Calculation
Chapter 9 TwoSample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and TwoSample Tests: Paired Versus
More informationChapter 08. Introduction
Chapter 08 Introduction Hypothesis testing may best be summarized as a decision making process in which one attempts to arrive at a particular conclusion based upon "statistical" evidence. A typical hypothesis
More information2. Describing Data. We consider 1. Graphical methods 2. Numerical methods 1 / 56
2. Describing Data We consider 1. Graphical methods 2. Numerical methods 1 / 56 General Use of Graphical and Numerical Methods Graphical methods can be used to visually and qualitatively present data and
More informationHow to Conduct a Hypothesis Test
How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some
More informationIf you need statistics to analyze your experiment, then you've done the wrong experiment.
When do you need statistical calculations? When analyzing data, your goal is simple: You wish to make the strongest possible conclusion from limited amounts of data. To do this, you need to overcome two
More informationProjects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
More informationCHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 OneWay ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationDescribe what is meant by a placebo Contrast the doubleblind procedure with the singleblind procedure Review the structure for organizing a memo
Readings: Ha and Ha Textbook  Chapters 1 8 Appendix D & E (online) Plous  Chapters 10, 11, 12 and 14 Chapter 10: The Representativeness Heuristic Chapter 11: The Availability Heuristic Chapter 12: Probability
More informationResearch Methods 1 Handouts, Graham Hole,COGS  version 1.0, September 2000: Page 1:
Research Methods 1 Handouts, Graham Hole,COGS  version 1.0, September 000: Page 1: NONPARAMETRIC TESTS: What are nonparametric tests? Statistical tests fall into two kinds: parametric tests assume that
More informationChoosing the correct statistical test made easy
Classroom Choosing the correct statistical test made easy N Gunawardana Senior Lecturer in Community Medicine, Faculty of Medicine, University of Colombo Gone are the days where researchers had to perform
More informationChapter 7 Part 2. Hypothesis testing Power
Chapter 7 Part 2 Hypothesis testing Power November 6, 2008 All of the normal curves in this handout are sampling distributions Goal: To understand the process of hypothesis testing and the relationship
More informationA Guide for a Selection of SPSS Functions
A Guide for a Selection of SPSS Functions IBM SPSS Statistics 19 Compiled by Beth Gaedy, Math Specialist, Viterbo University  2012 Using documents prepared by Drs. Sheldon Lee, Marcus Saegrove, Jennifer
More informationANOVA MULTIPLE CHOICE QUESTIONS. In the following multiplechoice questions, select the best answer.
ANOVA MULTIPLE CHOICE QUESTIONS In the following multiplechoice questions, select the best answer. 1. Analysis of variance is a statistical method of comparing the of several populations. a. standard
More informationAnalysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
More informationUnit 29 ChiSquare GoodnessofFit Test
Unit 29 ChiSquare GoodnessofFit Test Objectives: To perform the chisquare hypothesis test concerning proportions corresponding to more than two categories of a qualitative variable To perform the Bonferroni
More information