Sample Size. When is a coin biased? What does this mean? Why do we sample? Validity and Precision. Populations

Size: px
Start display at page:

Download "Sample Size. When is a coin biased? What does this mean? Why do we sample? Validity and Precision. Populations"

Transcription

1 Sample Size Dr. Randall Singer Professor of Epidemiology Executive Veterinary Program University of Illinois December 11 12, 2014 Why do we sample? We rarely have the time or the resources to gather data on all individuals (or whatever the sampling unit is) We must select a subset of the population to study (i.e. a sample) The data will consist of estimates instead of population parameters The validity of these estimates will depend on the sampling method The precision of the estimate will depend on the sample size Validity and Precision Validity High High Precision Low Low Populations Target population The entire set of individuals to which the findings of the survey will be extrapolated Study population The collection of individuals (sampling units) that are actually studied and from which the sample is drawn Sampling frame A list of all sampling units in the study population What does this mean? When is a coin biased? Odds Ratio = 2.3 (95% CI ) p = 0.02 You want to test whether a coin is biased Null Hypothesis = coin is fair (unbiased) You toss it 5 times 5 consecutive heads What is your conclusion? How many heads do you need to throw? 1

2 How many heads do you need to throw? Frequency distribution: n heads in 10 tosses of an unbiased coin How biased is the coin? Biased coin will still throw tails Bias can be small but real Most (all?) coins may be biased How far from 0.5 is observed proportion? Probability of 5 heads with a fair coin? How confident do you want to be that any bias is of negligible importance? When is a coin biased? Statistical tests report a P-value Probability of obtaining a result as extreme or more extreme than that observed under the assumption that the null hypothesis is true. Declare groups statistically different if the probability of an observed difference occurring by chance is small Conventionally where P<0.05 (arbitrary) Now (finally!) being questioned seriously When is a coin biased? Coin example: Null hypothesis: P of head = heads: deem the coin unbiased (chance as P =.054) 9 or 10 heads statistically significant (P = 0.01). Observed vs. Expected Statistical inference is based on comparison of observed data with expected data Assuming no difference between treatments Always (almost) have some observed difference How different (magnitude of effect)? How likely (P value) are the data if the null hypothesis is true Biological vs. statistical significance Sample size and significance testing Statistically significant difference unlikely due to chance alone Effect of sample size Small sample size large sampling error non-significant result even when real effect exists Large sample size small differences of no biological importance may be statistically significant 12 2

3 Key points on P-values Indicate probability of observed data under the assumption that the null hypothesis is true Do not indicate probability the alternative hypothesis is true (e.g. the coin is unbiased) Do not indicate the magnitude of the effect of interest Assume no errors or biases in the data Cut-off for statistical significance (P = 0.05) is arbitrary! Use of exact P values (e.g. P = 0.06) preferred to use of cutoff values (e.g. P >.05) Statistical significance biological significance Statistical inference Conclusions re population value drawn from a sample (random) estimates vs. true (unknown) values Inference assumes no biases in data collection or analysis Point estimate Value representing the effect under investigation (e.g. mean) Confidence interval Indicates the precision of the point estimate 14 Statistical significance Hypothesis testing Is the risk of disease different between breed A and breed B? Evaluation of an observed difference between breeds Disease status Breed Positive Negative Prevalence A B By chance? Null hypothesis (H 0 ): breed A = breed B No difference in population parameters Observed difference is simply the result of random variation in the data Relationship between results of a statistical test and the true biological state Significant Study finding Not Significant Biological truth Association No association Power: probability a study Correct will find a statistical difference if it exists: = (1 b) Incorrect Type II or β error α: probability Incorrectof observed Type difference I or due Alpha to chance error alone Correct Key points: Error rates in statistical tests Type 1 ( ) error = false rejection of the null hypothesis. Analogous to false positive rate for a test level: predetermined significance threshold (0.05) Biologists accept an error rate of 5% for falsely concluding a relationship exists Type 2 ( ) error = false acceptance of the null hypothesis. Analogous to false negative rate for a test Conventional level of 0.2 (20%) used in study design Trade off between and errors Like sensitivity and specificity in diagnostic testing 3

4 Sample Size Calculations for Disease sample size to estimate disease occurrence prevalence, incidence usually we measure a proportion (%) simplest approach is to use the binomial distribution we may be interested in the mean herd prevalence or incidence; we assume this is a continuous variable that is normally distributed sample size to detect disease presence prevalence near zero assume a perfect diagnostic test Exercise 1 you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in a specific county. the are 165 herds in the county. you have no idea about the herd-level prevalence of BVD antibodies. How many herds would you sample (assuming you have a perfect diagnostic test)? Exercise 2 Exercise 3 before conducting the survey to estimate the prevalence of antibodies against BVD in cattle in the county, a colleague suggests that the prevalence might be about 20%. how many herds would you now sample? why is your answer different to exercise 1? your colleague also says that everyone knows that the prevalence of antibodies against BVD in cattle in the county is somewhere in the range of 10 to 30%. how many herds would you now sample? Exercise 4 you are interested in the average milk production of cows in your BVD survey. you think the average milk production is approximately 10 liters per day. in a pilot study of one herd, you recorded the following milk production: 7, 11, 12, 9, and 11 If you wanted to estimate milk production in a herd with 70 cows, how many would you Exercise 5 in the same village, you want to know if the amount of milk produced by BVD positive cows, versus BVD negative cows, is the same. you think that BVD may suppress milk production by as much as 40% How many cows do you need to sample, if you want to be at least 80% certain of finding a real difference, if it exists? sample? 4

5 Exercise 6 Sample Size Calculations for Disease if you think that BVD may suppress milk production by only about 10%, how does this affect your sample size estimate? what are some ways of minimizing the sample size needed? sample size to estimate disease occurrence prevalence, incidence usually we measure a proportion (%) simplest approach is to use the binomial distribution we may be interested in the mean herd prevalence or incidence; we assume this is a continuous variable that is normally distributed sample size to detect disease presence prevalence near zero assume a perfect diagnostic test suppose we are interested whether a herd is free of disease X If we assume that if the disease is present, 50% of animals will be infected, then if we sample and test just one animal from the herd, the probability of detecting disease is 50% If we sample 2 animals, the probability of detecting disease is 75% If we sample 3 animals, the probability of detecting disease is 87.5% probability of detecting disease = probability that sample 1 (or 2 or 3 or. ) is positive probability of detecting disease = probability that sample 1 (or 2 or 3 or. ) is positive probability of detecting disease = 1 probability of NOT detecting disease = 1 (1 probability animal 1 is positive) x (1 probability animal 2 is positive) x (1 probability animal 3 is positive) = 1 (1 probability each animal is positive) number sampled e.g. 50% probability of animals being infected (= prevalence) 3 animals are sampled e.g. 10% probability of animals being infected (= prevalence) 5 animals are sampled probability of detecting disease =1 (1 probability any animal is positive) number sampled = 1 (1 0.5) 3 = 87.5% probability of detecting disease =1 (1 probability any animal is positive) number sampled = 1 (1 0.1) 5 = 41% note: by convention, we usually either want 95, 98 or 99% confidence that a herd, region or country is free of disease i.e. we attempt to minimize the chances we are wrong to <5, <2 or <1% 5

6 Probability of Detecting Disease probability of detecting disease 1. probability that any given animal is positive (prevalence) 2. number of animals sampled probability of detecting disease if: 1. prevalence is low 2. sample size low sample size = 20 Probability of Detecting Disease sample size needed to find at least one disease animal 1. number of detectable cases in the population 2. number of animals sampled 3. confidence level 4. population size prevalence = 5% e.g. suppose you want to detect whether or not a flock of N = 1000 animal is positive for pathogen X if X is present, you suspect that 50% of the flock is infected d = 500 you desire 95% confidence P = 0.95 e.g. suppose you want to detect whether or not a flock of N = 1000 animal is positive for pathogen X if X is present, you suspect that 5% of the flock is infected d = 50 you desire 95% confidence P = 0.95 n = 4.48 (need to sample ~ 5 animals) n = 56.7 (need to sample ~ 57 animals) 6

7 Maximum Number of Positives sample size needs to be increased if: 1. number of detectable cases in the population is small 2. desired confidence level is high 3. population size is large What is the maximum number of positives in a population given that all animals are sampled? Maximum Number of Positives Exercise 7 e.g. suppose that 1,000 slaughter cows were tested negative for E. coli H7:O157 (n=1000) and the total number of cows slaughtered was 1 million (N = ). What is the maximal prevalence in the population of slaughtered cows, if you desire 95% confidence (P = 0.95) you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in the county. the are 165 herds in the county. how many samples per herd should you take do determine if a herd is BVD positive? d = (maximal prevalence = 0.30%) Exercise 8 you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in the county. you obtain some more information: in most herds there are 50 cows; there are some herds with 80 cows, and a few with 100 cows Exercise 9 in the previous exercise, what is the effect of dropping your confidence to 90%? increasing it to 97.5%? for a representative sample, how many cows should be sampled within these classes? 7

8 Exercise 10 Exercise 11 out of the 165 herds in the county, how many need to be sampled to detect BVD antibodies, if you think that 10% of the herds are positive? What would be the total sample size for a survey to detect BVD in county herds if there are 165 herds and you think that: 10% of herds are positive; 140 herds have 50 cows; 15 herds have 80 cows; 10 herds have 100 cows; and the prevalence of antibodies within herds is 5% Disease Freedom many countries and regions are free of important trade-limiting diseases such as foot-and-mouth disease, classical swine fever and avian influenza there may be periodic incursions of such diseases in order to demonstrate freedom, need to identify incursions quickly and control disease spread, then eradication at the beginning and end of such epidemics, prevalence of disease is very low close to zero failure to detect initial incursion larger, longer epidemics failure to demonstrate freedom following incursion unnecessary trade restrictions Exercise 12 Exercise 13 you want to determine if the village poultry population in the county is free of avian influenza antibodies in the county, there are 141 villages, all of which have poultry for freedom, OIE suggests that seroprevalence should be <0.1%, with 95% confidence how many villages need to be sampled? suppose in your survey of the village poultry population for avian influenza antibodies, you sample 20 villages and all samples are seronegative. what is the maximum number (%) of villages that could be seropositive with a 95% confidence level? 99% confidence level? 8

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

p-values and significance levels (false positive or false alarm rates)

p-values and significance levels (false positive or false alarm rates) p-values and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1 BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects

More information

The Danish veterinary preparedness for avian influenza and Newcastle disease

The Danish veterinary preparedness for avian influenza and Newcastle disease The Danish veterinary preparedness for avian influenza and Newcastle disease Sten Mortensen, Veterinary R&D manager, Animal Health Division, Deputy head 19-04-2016 Livestock statistics, Denmark 2015 Species

More information

Chapter 4. Probability and Probability Distributions

Chapter 4. Probability and Probability Distributions Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

More information

Types of Error in Surveys

Types of Error in Surveys 2 Types of Error in Surveys Surveys are designed to produce statistics about a target population. The process by which this is done rests on inferring the characteristics of the target population from

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9.

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9. SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION Friday 14 March 2008 9.00-9.45 am Attempt all ten questions. For each question, choose the

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

DEVISING IMPORT HEALTH MEASURES FOR ANIMAL COMMODITIES

DEVISING IMPORT HEALTH MEASURES FOR ANIMAL COMMODITIES DEVISING IMPORT HEALTH MEASURES FOR ANIMAL COMMODITIES This paper provides guidance to OIE Members on the use of the animal health information in the OIE World Animal Health Information Database (WAHID)

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Testing Hypotheses About Proportions

Testing Hypotheses About Proportions Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Reflections on Probability vs Nonprobability Sampling

Reflections on Probability vs Nonprobability Sampling Official Statistics in Honour of Daniel Thorburn, pp. 29 35 Reflections on Probability vs Nonprobability Sampling Jan Wretman 1 A few fundamental things are briefly discussed. First: What is called probability

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Binomial Sampling and the Binomial Distribution

Binomial Sampling and the Binomial Distribution Binomial Sampling and the Binomial Distribution Characterized by two mutually exclusive events." Examples: GENERAL: {success or failure} {on or off} {head or tail} {zero or one} BIOLOGY: {dead or alive}

More information

Chapter 3. Sampling. Sampling Methods

Chapter 3. Sampling. Sampling Methods Oxford University Press Chapter 3 40 Sampling Resources are always limited. It is usually not possible nor necessary for the researcher to study an entire target population of subjects. Most medical research

More information

Introduction to Hypothesis Testing

Introduction to Hypothesis Testing I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

More information

Analysis and Interpretation of Clinical Trials. How to conclude?

Analysis and Interpretation of Clinical Trials. How to conclude? www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Mind on Statistics. Chapter 12

Mind on Statistics. Chapter 12 Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference

More information

Inclusion and Exclusion Criteria

Inclusion and Exclusion Criteria Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding

More information

p ˆ (sample mean and sample

p ˆ (sample mean and sample Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

STAT 35A HW2 Solutions

STAT 35A HW2 Solutions STAT 35A HW2 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/09/spring/stat35.dir 1. A computer consulting firm presently has bids out on three projects. Let A i = { awarded project i },

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

What is Statistic? OPRE 6301

What is Statistic? OPRE 6301 What is Statistic? OPRE 6301 In today s world...... we are constantly being bombarded with statistics and statistical information. For example: Customer Surveys Medical News Demographics Political Polls

More information

BVD qpcr Bulk Milk Test

BVD qpcr Bulk Milk Test BVD qpcr Bulk Milk Test NML launches a new method for BVD screening... NML launched their new bulk milk BVD qpcr service at the BCVA in mid November 2012. The service offers a simple and easy method to

More information

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

People have thought about, and defined, probability in different ways. important to note the consequences of the definition: PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

Correlational Research

Correlational Research Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Clinical Study Design and Methods Terminology

Clinical Study Design and Methods Terminology Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Statistics in Medicine Research Lecture Series CSMC Fall 2014 Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

What are confidence intervals and p-values?

What are confidence intervals and p-values? What is...? series Second edition Statistics Supported by sanofi-aventis What are confidence intervals and p-values? Huw TO Davies PhD Professor of Health Care Policy and Management, University of St Andrews

More information

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test. HYPOTHESIS TESTING Learning Objectives Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test. Know how to perform a hypothesis test

More information

How To Understand The State Of The Art In Animal Health Risk Assessment

How To Understand The State Of The Art In Animal Health Risk Assessment Survey Questionnaires Surveillance Activities and University Researchers in Animal Health Risk Assessment Science Advice in the Public Interest Survey Questionnaires 1 Survey Questionnaires Surveillance

More information

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

National FMD Response Planning

National FMD Response Planning National FMD Response Planning Proactive Risk Assessment to Support and Managed Preparedness Movement of Livestock and Poultry Timothy J. Goldsmith DVM, MPH, DACVPM Center for Animal Health and Food Safety

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Crosstabulation & Chi Square

Crosstabulation & Chi Square Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

More information

Selecting Research Participants

Selecting Research Participants C H A P T E R 6 Selecting Research Participants OBJECTIVES After studying this chapter, students should be able to Define the term sampling frame Describe the difference between random sampling and random

More information

Stat 20: Intro to Probability and Statistics

Stat 20: Intro to Probability and Statistics Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum

More information

MATH 140 Lab 4: Probability and the Standard Normal Distribution

MATH 140 Lab 4: Probability and the Standard Normal Distribution MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

Normal Distribution Lecture Notes

Normal Distribution Lecture Notes Normal Distribution Lecture Notes Professor Richard Blecksmith richard@math.niu.edu Dept. of Mathematical Sciences Northern Illinois University Math 101 Website: http://math.niu.edu/ richard/math101 Section

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,

More information

Section 6.2 Definition of Probability

Section 6.2 Definition of Probability Section 6.2 Definition of Probability Probability is a measure of the likelihood that an event occurs. For example, if there is a 20% chance of rain tomorrow, that means that the probability that it will

More information

HPAI Response HPAI Response Goals November 18, 2015

HPAI Response HPAI Response Goals November 18, 2015 HPAI Response HPAI Response Goals November 18, 2015 Please note: These may be revised as the situation continues to change. USDA APHIS will work to achieve these goals for critical activities in a highly

More information

Statistical tests for SPSS

Statistical tests for SPSS Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Bayesian Tutorial (Sheet Updated 20 March)

Bayesian Tutorial (Sheet Updated 20 March) Bayesian Tutorial (Sheet Updated 20 March) Practice Questions (for discussing in Class) Week starting 21 March 2016 1. What is the probability that the total of two dice will be greater than 8, given that

More information

There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month

More information

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

6: Introduction to Hypothesis Testing

6: Introduction to Hypothesis Testing 6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance

More information

Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013

Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013 Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013 Muhammad Shakir Balogun COHORT 3 Nigeria-FELTP Supervisors: Dr. AT Olayinka, Dr. AI Mamman Outline Background Methodology

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

Conditional Probability, Hypothesis Testing, and the Monty Hall Problem

Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Ernie Croot September 17, 2008 On more than one occasion I have heard the comment Probability does not exist in the real world, and

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

Lesson 9 Hypothesis Testing

Lesson 9 Hypothesis Testing Lesson 9 Hypothesis Testing Outline Logic for Hypothesis Testing Critical Value Alpha (α) -level.05 -level.01 One-Tail versus Two-Tail Tests -critical values for both alpha levels Logic for Hypothesis

More information

Descriptive Methods Ch. 6 and 7

Descriptive Methods Ch. 6 and 7 Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational

More information