Sample Size. When is a coin biased? What does this mean? Why do we sample? Validity and Precision. Populations
|
|
- Noreen James
- 7 years ago
- Views:
Transcription
1 Sample Size Dr. Randall Singer Professor of Epidemiology Executive Veterinary Program University of Illinois December 11 12, 2014 Why do we sample? We rarely have the time or the resources to gather data on all individuals (or whatever the sampling unit is) We must select a subset of the population to study (i.e. a sample) The data will consist of estimates instead of population parameters The validity of these estimates will depend on the sampling method The precision of the estimate will depend on the sample size Validity and Precision Validity High High Precision Low Low Populations Target population The entire set of individuals to which the findings of the survey will be extrapolated Study population The collection of individuals (sampling units) that are actually studied and from which the sample is drawn Sampling frame A list of all sampling units in the study population What does this mean? When is a coin biased? Odds Ratio = 2.3 (95% CI ) p = 0.02 You want to test whether a coin is biased Null Hypothesis = coin is fair (unbiased) You toss it 5 times 5 consecutive heads What is your conclusion? How many heads do you need to throw? 1
2 How many heads do you need to throw? Frequency distribution: n heads in 10 tosses of an unbiased coin How biased is the coin? Biased coin will still throw tails Bias can be small but real Most (all?) coins may be biased How far from 0.5 is observed proportion? Probability of 5 heads with a fair coin? How confident do you want to be that any bias is of negligible importance? When is a coin biased? Statistical tests report a P-value Probability of obtaining a result as extreme or more extreme than that observed under the assumption that the null hypothesis is true. Declare groups statistically different if the probability of an observed difference occurring by chance is small Conventionally where P<0.05 (arbitrary) Now (finally!) being questioned seriously When is a coin biased? Coin example: Null hypothesis: P of head = heads: deem the coin unbiased (chance as P =.054) 9 or 10 heads statistically significant (P = 0.01). Observed vs. Expected Statistical inference is based on comparison of observed data with expected data Assuming no difference between treatments Always (almost) have some observed difference How different (magnitude of effect)? How likely (P value) are the data if the null hypothesis is true Biological vs. statistical significance Sample size and significance testing Statistically significant difference unlikely due to chance alone Effect of sample size Small sample size large sampling error non-significant result even when real effect exists Large sample size small differences of no biological importance may be statistically significant 12 2
3 Key points on P-values Indicate probability of observed data under the assumption that the null hypothesis is true Do not indicate probability the alternative hypothesis is true (e.g. the coin is unbiased) Do not indicate the magnitude of the effect of interest Assume no errors or biases in the data Cut-off for statistical significance (P = 0.05) is arbitrary! Use of exact P values (e.g. P = 0.06) preferred to use of cutoff values (e.g. P >.05) Statistical significance biological significance Statistical inference Conclusions re population value drawn from a sample (random) estimates vs. true (unknown) values Inference assumes no biases in data collection or analysis Point estimate Value representing the effect under investigation (e.g. mean) Confidence interval Indicates the precision of the point estimate 14 Statistical significance Hypothesis testing Is the risk of disease different between breed A and breed B? Evaluation of an observed difference between breeds Disease status Breed Positive Negative Prevalence A B By chance? Null hypothesis (H 0 ): breed A = breed B No difference in population parameters Observed difference is simply the result of random variation in the data Relationship between results of a statistical test and the true biological state Significant Study finding Not Significant Biological truth Association No association Power: probability a study Correct will find a statistical difference if it exists: = (1 b) Incorrect Type II or β error α: probability Incorrectof observed Type difference I or due Alpha to chance error alone Correct Key points: Error rates in statistical tests Type 1 ( ) error = false rejection of the null hypothesis. Analogous to false positive rate for a test level: predetermined significance threshold (0.05) Biologists accept an error rate of 5% for falsely concluding a relationship exists Type 2 ( ) error = false acceptance of the null hypothesis. Analogous to false negative rate for a test Conventional level of 0.2 (20%) used in study design Trade off between and errors Like sensitivity and specificity in diagnostic testing 3
4 Sample Size Calculations for Disease sample size to estimate disease occurrence prevalence, incidence usually we measure a proportion (%) simplest approach is to use the binomial distribution we may be interested in the mean herd prevalence or incidence; we assume this is a continuous variable that is normally distributed sample size to detect disease presence prevalence near zero assume a perfect diagnostic test Exercise 1 you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in a specific county. the are 165 herds in the county. you have no idea about the herd-level prevalence of BVD antibodies. How many herds would you sample (assuming you have a perfect diagnostic test)? Exercise 2 Exercise 3 before conducting the survey to estimate the prevalence of antibodies against BVD in cattle in the county, a colleague suggests that the prevalence might be about 20%. how many herds would you now sample? why is your answer different to exercise 1? your colleague also says that everyone knows that the prevalence of antibodies against BVD in cattle in the county is somewhere in the range of 10 to 30%. how many herds would you now sample? Exercise 4 you are interested in the average milk production of cows in your BVD survey. you think the average milk production is approximately 10 liters per day. in a pilot study of one herd, you recorded the following milk production: 7, 11, 12, 9, and 11 If you wanted to estimate milk production in a herd with 70 cows, how many would you Exercise 5 in the same village, you want to know if the amount of milk produced by BVD positive cows, versus BVD negative cows, is the same. you think that BVD may suppress milk production by as much as 40% How many cows do you need to sample, if you want to be at least 80% certain of finding a real difference, if it exists? sample? 4
5 Exercise 6 Sample Size Calculations for Disease if you think that BVD may suppress milk production by only about 10%, how does this affect your sample size estimate? what are some ways of minimizing the sample size needed? sample size to estimate disease occurrence prevalence, incidence usually we measure a proportion (%) simplest approach is to use the binomial distribution we may be interested in the mean herd prevalence or incidence; we assume this is a continuous variable that is normally distributed sample size to detect disease presence prevalence near zero assume a perfect diagnostic test suppose we are interested whether a herd is free of disease X If we assume that if the disease is present, 50% of animals will be infected, then if we sample and test just one animal from the herd, the probability of detecting disease is 50% If we sample 2 animals, the probability of detecting disease is 75% If we sample 3 animals, the probability of detecting disease is 87.5% probability of detecting disease = probability that sample 1 (or 2 or 3 or. ) is positive probability of detecting disease = probability that sample 1 (or 2 or 3 or. ) is positive probability of detecting disease = 1 probability of NOT detecting disease = 1 (1 probability animal 1 is positive) x (1 probability animal 2 is positive) x (1 probability animal 3 is positive) = 1 (1 probability each animal is positive) number sampled e.g. 50% probability of animals being infected (= prevalence) 3 animals are sampled e.g. 10% probability of animals being infected (= prevalence) 5 animals are sampled probability of detecting disease =1 (1 probability any animal is positive) number sampled = 1 (1 0.5) 3 = 87.5% probability of detecting disease =1 (1 probability any animal is positive) number sampled = 1 (1 0.1) 5 = 41% note: by convention, we usually either want 95, 98 or 99% confidence that a herd, region or country is free of disease i.e. we attempt to minimize the chances we are wrong to <5, <2 or <1% 5
6 Probability of Detecting Disease probability of detecting disease 1. probability that any given animal is positive (prevalence) 2. number of animals sampled probability of detecting disease if: 1. prevalence is low 2. sample size low sample size = 20 Probability of Detecting Disease sample size needed to find at least one disease animal 1. number of detectable cases in the population 2. number of animals sampled 3. confidence level 4. population size prevalence = 5% e.g. suppose you want to detect whether or not a flock of N = 1000 animal is positive for pathogen X if X is present, you suspect that 50% of the flock is infected d = 500 you desire 95% confidence P = 0.95 e.g. suppose you want to detect whether or not a flock of N = 1000 animal is positive for pathogen X if X is present, you suspect that 5% of the flock is infected d = 50 you desire 95% confidence P = 0.95 n = 4.48 (need to sample ~ 5 animals) n = 56.7 (need to sample ~ 57 animals) 6
7 Maximum Number of Positives sample size needs to be increased if: 1. number of detectable cases in the population is small 2. desired confidence level is high 3. population size is large What is the maximum number of positives in a population given that all animals are sampled? Maximum Number of Positives Exercise 7 e.g. suppose that 1,000 slaughter cows were tested negative for E. coli H7:O157 (n=1000) and the total number of cows slaughtered was 1 million (N = ). What is the maximal prevalence in the population of slaughtered cows, if you desire 95% confidence (P = 0.95) you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in the county. the are 165 herds in the county. how many samples per herd should you take do determine if a herd is BVD positive? d = (maximal prevalence = 0.30%) Exercise 8 you plan to conduct a survey to estimate the prevalence of antibodies against BVD in cattle in the county. you obtain some more information: in most herds there are 50 cows; there are some herds with 80 cows, and a few with 100 cows Exercise 9 in the previous exercise, what is the effect of dropping your confidence to 90%? increasing it to 97.5%? for a representative sample, how many cows should be sampled within these classes? 7
8 Exercise 10 Exercise 11 out of the 165 herds in the county, how many need to be sampled to detect BVD antibodies, if you think that 10% of the herds are positive? What would be the total sample size for a survey to detect BVD in county herds if there are 165 herds and you think that: 10% of herds are positive; 140 herds have 50 cows; 15 herds have 80 cows; 10 herds have 100 cows; and the prevalence of antibodies within herds is 5% Disease Freedom many countries and regions are free of important trade-limiting diseases such as foot-and-mouth disease, classical swine fever and avian influenza there may be periodic incursions of such diseases in order to demonstrate freedom, need to identify incursions quickly and control disease spread, then eradication at the beginning and end of such epidemics, prevalence of disease is very low close to zero failure to detect initial incursion larger, longer epidemics failure to demonstrate freedom following incursion unnecessary trade restrictions Exercise 12 Exercise 13 you want to determine if the village poultry population in the county is free of avian influenza antibodies in the county, there are 141 villages, all of which have poultry for freedom, OIE suggests that seroprevalence should be <0.1%, with 95% confidence how many villages need to be sampled? suppose in your survey of the village poultry population for avian influenza antibodies, you sample 20 villages and all samples are seronegative. what is the maximum number (%) of villages that could be seropositive with a 95% confidence level? 99% confidence level? 8
Sample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationChapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing
More information"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1
BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine
More informationp-values and significance levels (false positive or false alarm rates)
p-values and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."
More informationSAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.
SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationA POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment
More informationOdds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
More informationBasic research methods. Basic research methods. Question: BRM.2. Question: BRM.1
BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects
More informationThe Danish veterinary preparedness for avian influenza and Newcastle disease
The Danish veterinary preparedness for avian influenza and Newcastle disease Sten Mortensen, Veterinary R&D manager, Animal Health Division, Deputy head 19-04-2016 Livestock statistics, Denmark 2015 Species
More informationChapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
More informationTypes of Error in Surveys
2 Types of Error in Surveys Surveys are designed to produce statistics about a target population. The process by which this is done rests on inferring the characteristics of the target population from
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationSECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9.
SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION Friday 14 March 2008 9.00-9.45 am Attempt all ten questions. For each question, choose the
More informationHYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationDEVISING IMPORT HEALTH MEASURES FOR ANIMAL COMMODITIES
DEVISING IMPORT HEALTH MEASURES FOR ANIMAL COMMODITIES This paper provides guidance to OIE Members on the use of the animal health information in the OIE World Animal Health Information Database (WAHID)
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide
More informationUnit 26 Estimation with Confidence Intervals
Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference
More informationExperimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test
Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely
More informationTesting Hypotheses About Proportions
Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine
More informationTesting Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
More informationAP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationComparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom
Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationReflections on Probability vs Nonprobability Sampling
Official Statistics in Honour of Daniel Thorburn, pp. 29 35 Reflections on Probability vs Nonprobability Sampling Jan Wretman 1 A few fundamental things are briefly discussed. First: What is called probability
More informationIntroduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
More informationBinomial Sampling and the Binomial Distribution
Binomial Sampling and the Binomial Distribution Characterized by two mutually exclusive events." Examples: GENERAL: {success or failure} {on or off} {head or tail} {zero or one} BIOLOGY: {dead or alive}
More informationChapter 3. Sampling. Sampling Methods
Oxford University Press Chapter 3 40 Sampling Resources are always limited. It is usually not possible nor necessary for the researcher to study an entire target population of subjects. Most medical research
More informationIntroduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true
More informationAnalysis and Interpretation of Clinical Trials. How to conclude?
www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationSurvey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationMind on Statistics. Chapter 12
Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference
More informationInclusion and Exclusion Criteria
Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding
More informationp ˆ (sample mean and sample
Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics
More informationIntroduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
More informationStatistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
More informationSTAT 35A HW2 Solutions
STAT 35A HW2 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/09/spring/stat35.dir 1. A computer consulting firm presently has bids out on three projects. Let A i = { awarded project i },
More informationWeek 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
More informationIntroduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses
Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the
More informationLAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationWhat is Statistic? OPRE 6301
What is Statistic? OPRE 6301 In today s world...... we are constantly being bombarded with statistics and statistical information. For example: Customer Surveys Medical News Demographics Political Polls
More informationBVD qpcr Bulk Milk Test
BVD qpcr Bulk Milk Test NML launches a new method for BVD screening... NML launched their new bulk milk BVD qpcr service at the BCVA in mid November 2012. The service offers a simple and easy method to
More informationPeople have thought about, and defined, probability in different ways. important to note the consequences of the definition:
PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationStat 5102 Notes: Nonparametric Tests and. confidence interval
Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions
More informationCorrelational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
More informationHYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationClinical Study Design and Methods Terminology
Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical
More informationNONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)
NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationWeek 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
More informationE3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
More informationStatistics in Medicine Research Lecture Series CSMC Fall 2014
Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More informationCharacteristics of Binomial Distributions
Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation
More informationBinary Diagnostic Tests Two Independent Samples
Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary
More informationIndependent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
More informationWhat are confidence intervals and p-values?
What is...? series Second edition Statistics Supported by sanofi-aventis What are confidence intervals and p-values? Huw TO Davies PhD Professor of Health Care Policy and Management, University of St Andrews
More informationUnderstand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.
HYPOTHESIS TESTING Learning Objectives Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test. Know how to perform a hypothesis test
More informationHow To Understand The State Of The Art In Animal Health Risk Assessment
Survey Questionnaires Surveillance Activities and University Researchers in Animal Health Risk Assessment Science Advice in the Public Interest Survey Questionnaires 1 Survey Questionnaires Surveillance
More informationPoint and Interval Estimates
Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number
More informationAn Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS
The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice
More informationNational FMD Response Planning
National FMD Response Planning Proactive Risk Assessment to Support and Managed Preparedness Movement of Livestock and Poultry Timothy J. Goldsmith DVM, MPH, DACVPM Center for Animal Health and Food Safety
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More information2 Precision-based sample size calculations
Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationCrosstabulation & Chi Square
Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among
More informationSelecting Research Participants
C H A P T E R 6 Selecting Research Participants OBJECTIVES After studying this chapter, students should be able to Define the term sampling frame Describe the difference between random sampling and random
More informationStat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
More informationMATH 140 Lab 4: Probability and the Standard Normal Distribution
MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes
More informationHypothesis testing. c 2014, Jeffrey S. Simonoff 1
Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there
More informationNormal Distribution Lecture Notes
Normal Distribution Lecture Notes Professor Richard Blecksmith richard@math.niu.edu Dept. of Mathematical Sciences Northern Illinois University Math 101 Website: http://math.niu.edu/ richard/math101 Section
More informationVariables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
More informationIS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION
Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,
More informationSection 6.2 Definition of Probability
Section 6.2 Definition of Probability Probability is a measure of the likelihood that an event occurs. For example, if there is a 20% chance of rain tomorrow, that means that the probability that it will
More informationHPAI Response HPAI Response Goals November 18, 2015
HPAI Response HPAI Response Goals November 18, 2015 Please note: These may be revised as the situation continues to change. USDA APHIS will work to achieve these goals for critical activities in a highly
More informationStatistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
More informationSimple Random Sampling
Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for
More informationSIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?
SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties
More informationChi Squared and Fisher's Exact Tests. Observed vs Expected Distributions
BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationBayesian Tutorial (Sheet Updated 20 March)
Bayesian Tutorial (Sheet Updated 20 March) Practice Questions (for discussing in Class) Week starting 21 March 2016 1. What is the probability that the total of two dice will be greater than 8, given that
More informationThere are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More information6: Introduction to Hypothesis Testing
6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance
More informationSeroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013
Seroprevalence and risk factors of Lassa fever infection in Nasarawa State, Nigeria 2013 Muhammad Shakir Balogun COHORT 3 Nigeria-FELTP Supervisors: Dr. AT Olayinka, Dr. AI Mamman Outline Background Methodology
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationNon-Parametric Tests (I)
Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent
More informationConditional Probability, Hypothesis Testing, and the Monty Hall Problem
Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Ernie Croot September 17, 2008 On more than one occasion I have heard the comment Probability does not exist in the real world, and
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationSample Size Planning, Calculation, and Justification
Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationLesson 9 Hypothesis Testing
Lesson 9 Hypothesis Testing Outline Logic for Hypothesis Testing Critical Value Alpha (α) -level.05 -level.01 One-Tail versus Two-Tail Tests -critical values for both alpha levels Logic for Hypothesis
More informationDescriptive Methods Ch. 6 and 7
Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational
More information