Intro to Statistics for Infection Preventionists Presented By: Kelley M. Boston, MPH, CIC
|
|
- Octavia Parsons
- 7 years ago
- Views:
Transcription
1 Intro to Statistics for Infection Preventionists Presented By: Kelley M. Boston, MPH, CIC Infection Prevention & Management Associates at Methodist Healthcare System
2 Role of Statistics in Hospital Epidemiology Aid in organizing and summarizing data Population characteristics Frequency distributions Calculation of infection rates Make inferences about data Suggest association Infer causality Communicate findings Prepare reports for committees Monitor the impact of interventions
3 Descriptive Epidemiology Descriptive Statistics: techniques concerned with the organization, presentation, and summarization of data. Measures of central tendency Measures of dispersion Use of proportions, rates, ratios Who, What, When, Where
4 Variables Anything that is measured or manipulated in a study Types of variables: Qualitative Nominal, Ordinal Quantitative Interval, Ratio Independent vs. Dependant Variables Continuous vs. Discrete variables
5 Variables
6 Measures of Central Tendency
7 Measures of Central Tendency Mean: mathematical average of the values in a data set. Used for describing numeric data Numeric Data = Actual measurements of individuals (temperature, height, pulse) Interval Variables Ratio Variables Continuous Variables Discrete Variables
8 Measures of Central Tendency
9 Measures of Central Tendency Median: the value falling in the middle of the data set. Used for ordinal data Ordinal Data = ordered categories ASA Scores, Wound Class Median also good for numeric data when the distribution is skewed
10 Measures of Central Tendency
11 Measures of Central Tendency Mode: the most frequently occurring value in a data set. Used for nominal data Nominal Data = named category Gender, Smoking Status
12 Measures of Central Tendency
13 Calculating Central Tendency Patient Lengths of Stay in Days: 12, 9, 3, 5, 7, 6, 13, 8, 4, 15, 6 Mean = The sum of each patient s length of stay x The number of patients = = 88 = 8 days 11 11
14 Calculating Central Tendency Patient Lengths of Stay in Days: 12, 9, 3, 5, 7, 6, 13, 8, 4, 15, 6 Mean = The sum of each patient s length of stay x The number of patients = = 88 = 8 days Median = 3, 4, 5, 6, 6, 7, 8, 9, 12, 13, 15 = 7 days
15 Calculating Central Tendency Patient Lengths of Stay in Days: 12, 9, 3, 5, 7, 6, 13, 8, 4, 15, 6 Mean = The sum of each patient s length of stay x The number of patients = = 88 = 8 days Median = 3, 4, 5, 6, 6, 7, 8, 9, 12, 13, 15 = 7 days Mode = 3, 4, 5, 6, 6, 7, 8, 9, 12, 13, 15 = 6 days
16 Excel Hacks: Central Tendency A 1 LOS Mean =AVERAGE(A2:A12) Median =MEDIAN(A2:A12) Mode =MODE(A2:A12) 13 = Formula
17 Measures of Dispersion
18 Measures of Dispersion Range: the difference between the smallest and largest values in a data set. Patient Lengths of Stay in Days: 12, 9, 3, 5, 7, 6, 13, 8, 4, 15, 6
19 Measures of Dispersion Range: the difference between the smallest and largest values in a data set. Patient Lengths of Stay in Days: 3, 4, 5, 6, 6, 7, 8, 9, 12, 13, 15 Range = 15 3 = 12
20 Measures of Dispersion Deviation: The difference between an individual data point and the mean value for the data set Positive, Negative or No Deviation Patient Lengths of Stay in Days: 3, 4, 5, 6, 6, 7, 8, 9, 12, 13, 15 Mean = x = 8 Calculating Deviation
21 Measures of Dispersion Standard Deviation: measure of dispersion that reflects the variability in values around the mean SD = (Xi-X) 2 / n-1 Take all the deviations from the mean, square then, then divide their sum by the total number of observations minus one and take the square root of the resulting number Variance: a measure of variability that is equal to the square of the standard deviation. Patient Lengths of Stay in Days: 12, 9, 3, 5, 7, 6, 13, 8, 4, 15, 6
22 Excel Hacks: Dispersion A 1 LOS Mean =AVERAGE(A2:A2) 8 SD =STDEV(A2:12) 3.87 Variance =VAR(A2:A12) =AVERAGE(A2:A12) 14 =STDEV(A2:A12) 115 =VAR(A2:A12)
23
24
25 Normal Distribution
26 Normal Distribution Properties Continuous distribution Bell shaped curve Symmetric around the mean
27 Non-Normal Distribution: Skew Non-symmetric distribution Positive or Negative Refers to the direction of the long tail (NOT the bulk of the data) Positive skew has the long tail to the right, Negative to the left Mean near the long tail Mode near the short tail Median somewhere in the middle
28 Non-Normal Distribution: Bimodal Two peaks May have 2 populations each with its own central tendency Mean Mode
29 Other Non-Normal Distributions Multi-Modal No central tendency Refer to graph Scattered No central tendency
30 Use of Proportions, Rates & Ratios Proportions: A fraction in which the numerator is part of the denominator. Rates: A fraction in which the denominator involves a measure of time. Ratios: A fraction in which there is not necessarily a relationship between the numerator and the denominator.
31 Proportions
32 Proportions Prevalence: proportion of persons with a particular disease within a given population at a given time.
33 Percent Resistance Proportions Prevalence: proportion of persons with a particular disease within a given population at a given time. Proportion of S. aureus Nosocomial Infections Resistant to Oxacillin (MRSA) Among Intensive Care Unit Patients, * Year *Source: NNIS System, data for 2003 are incomplete
34 Rates Calculation of a Device-associated Infection Rate Step 1: Decide upon the time period for your analysis. Step 2: Select the patient population for analysis. Step 3: Select the infections to be used in the numerator. Step 4: Determine the number of device-days which is used as the denominator of the rate. Device days: total number of days of exposure to the device by all patients in the selected population during the time period. Step 5: Calculate the device-associated infection rate (per 1000 device-days) using the following formula: Number of device-associated infections x 1000 Number of device-days
35 Device-Associated Infection Rate Example: Foley-Associated UTIs in the ICU Step 1: Time period April 2010
36 Device-Associated Infection Rate Example: Foley-Associated UTIs in the ICU Step 1: Time period April 2010 Step 2: Patient population Patients in the ICU of Hospital X who have Foley catheters
37 Device-Associated Infection Rate Example: Foley-Associated UTIs in the ICU Step 1: Time period April 2010 Step 2: Patient population Patients in the ICU of Hospital X who have Foley catheters Step 3: Infections (numerator) April CAUTI infections in the ICU = 2
38 Device-Associated Infection Rate Example: Foley-Associated UTIs in the ICU Step 1: Time period April 2010 Step 2: Patient population Patients in the Medical / Surgical ICU of Hospital X who have Foley catheters Step 3: Infections (numerator) April CAUTI infections in the ICU = 2 Step 4: Device-days (denominator) Total number of days that patients in the ICU had Foley catheters in place = 920
39 Device-Associated Infection Rate Example: Foley-Associated UTIs in the ICU Step 5: Device-associated infection rate Number of device-associated infections x 1000 Number of device-days # of Infections: 2 Foley-days in ICU: 920 Rate = 2 x 1000 = 2.17 per 1000 Foley-days 920
40 NHSN Comparison
41 Common Rates Attack Rate Number of people who develop a certain illness Total number of people at risk Usually measured over an entire period of exposure Often multiplied by 100
42 Common Rates Incidence Rate (Incidence Density) Number of new cases of disease occurring in the population during a specified period of time Number of persons at risk of developing the disease during that period of time Important features are TIME and NEW cases Often multiplied by 1000 in the hospital setting
43 Ratios Calculation of Device Utilization Ratio Step 1: Decide upon the time period for your analysis. Step 2: Select the patient population for analysis. Step 3: Determine the number of device-days. Step 4: Determine the number of patient-days. Patient-days are the total number of days that patients are in the selected population (have the device) during the time period.
44 Device Utilization Ratio Step 5: Calculate the device-utilization ratio using the following formula: Number of device-days Number of patient-days Example: Foley Utilization Ratio in the ICU Foley-days in ICU: 920 Patient-days in ICU: 1176 Ratio = 920 =
45 NHSN Comparison
46 What does this tell you? When examined together, the deviceassociated infection rate and device utilization ratio can be used to appropriately target preventative measures. Consistently high rates and ratios may signify a problem and further investigation is suggested. Potential overuse/improper use of device Consistently low rates and ratios may suggest underreporting of infection or the infrequent use or short duration of use of devices.
47 Analytic Epidemiology Inferential Statistics: procedures used to make inferences about a population based on information from a sample of measurements from that population. Why, How
48 Observational Studies Descriptive Studies Case Series Analytic Studies Cross Sectional: exposure and outcome measured at same time Cohort: exposure defined at start of study, group followed to see if outcome arises Case Control: outcome defined at start of study, look at past exposure status
49 Experimental Studies Exposure status assigned by the researcher Clinical: investigator assigns intervention to individuals in the study population Community: interventions applied to groups
50 2x2: Exposures and Outcomes Patients Exposed Patients With Disease a Patients With No Disease b Patients Not Exposed c d
51 Relative Risk Comparing the risk of disease in exposed individuals to individuals who were not exposed Patients Exposed Patients Not Exposed Patients With Disease a c Patients With No Disease b d RR = Disease incidence in exposed = _a / (a + b)_ Disease incidence in non-exposed c / (c + d) RR = ( a a + b ( c c + d ) )
52 Relative Risk RR = 1 Risk in exposed equal to risk in non-exposed No association RR > 1 Risk in exposed greater than risk in non-exposed Positive association, possibly causal RR < 1 Risk in exposed less than risk in non-exposed Negative association, possibly protective
53 What are the Odds? The ratio of the number of ways an event can occur to the number of ways the event cannot occur Odds are based on Probability Probability of me getting dessert = 60% = P Probability of me not getting dessert = 40% = 1 P Odds = P = _60%_ = 1.5 : 1 = P 40% Probability of dessert = 60% Odds of dessert = 60% / 40% = 1.5
54 Odds Ratio Comparing the odds that a disease will develop Patients With History of Exposure Patients Without History of Exposure Patients With Disease (Cases) a c Patients With No Disease (Controls) b d OR = Odds that a case was exposed_ = _a / c_ = _ad_ Odds that a control was exposed b / d bc
55 Odds Ratio OR = 1 Exposure not related to the disease OR > 1 Exposure positively related to disease OR < 1 Exposure negatively related to the disease
56 Calculating Risk / Odds Patients With Disease Patients With No Disease Patients Exposed Patients Not Exposed RR = a / (a + b) c / (c + d) P / (TP + F N) = 40 / ( ) = / ( ) OR = ad / bc = (40 x 600) / (250 x 110) = 0.87
57 2x2 Table: Test Validity Patients With Disease Patients With No Disease Test is Positive a b Test Is Negative c d
58 Sensitivity / Specificity Sensitivity: the ability of a test to identify correctly those who have a disease = TP / (TP + F N) Specificity: the ability of a test to correctly rule out those who do not have a disease = TN / (TN + F P) Patients With Disease Patients With No Disease Test Positive True Positive (TP) = Have disease and have a positive test False Positive (FP) = No disease, but have positive test Test Negative False Negative (FN) = Have disease but have negative test True Negative (TN) = No disease and have negative test
59 Positive / Negative Predictive Value Positive Predictive Value (PPV): the proportion of patients who test positive that actually have the disease = TP / (TP + F P) Negative Predictive Value (NPV): the proportion of patients who test negative who do not have the disease = TN / (TN + F N) Patients With Disease Patients With No Disease Test Positive True Positive (TP) = Have disease and have a positive test False Positive (FP) = No disease, but have positive test Test Negative False Negative (FN) = Have disease but have negative test True Negative (TN) = No disease and have negative test
60 Calculating Test Validity Patients With Disease Patients With No Disease Test Positive Test Negative Sensitivity = TP / (TP + F N) = 80 / ( ) = 80 / 100 = 80% Specificity = TN / (TN + F P) PPV = TP / (TP + F P) = 800 / ( ) = 800 / 900 = 89% = 80 / ( ) = 80 / 180 = 44% NPV = TN / (TN + F N) = 800 / ( ) = 800 / 820 = 98%
61 Statistical Inference Inferences about populations based on your sample Sampling Allocation Error Systematic Random
62 Statistical Inference Standard Error of the Mean (SE or SEM) SE = SD Sample Size
63 Hypothesis Testing Studies Null Hypothesis (H o ): a hypothesis of no association between two variables. The hypothesis to be tested Alternate Hypothesis (H a ): a hypothesis of association between two variables.
64 Hypothesis Testing Four possibilities when testing whether treatments differ The treatments do not differ, and we correctly conclude that they do not differ The treatments do not differ, but we conclude that they do differ The treatments differ, but we conclude that they do not differ The treatments differ, and we correctly conclude that they do differ
65 Hypothesis Testing: Error Type I Error: Probability of rejecting the null hypothesis when the null hypothesis is true. = probability of making a type I error Type II Error: Probability of accepting the null hypothesis when the alternate hypothesis is true. = probability of making a type II error Power: Probability of correctly concluding that the outcomes differ 1 - = power
66 Hypothesis Testing: Error Reality Treatments are not different Treatments are different Decision Conclude that treatments are not different Conclude that treatments are different Correct Decision Type I Error (Probability = ) Type II Error (Probability = ) Correct Decision (Probability = 1- = Power)
67 Significance Testing Confidence Interval: a computed interval of values that, with a given probability, contains the true value of the population parameter. 95% CI: 95% of the time the true value falls within the interval given. RR 2.5 ( ) p-value ( ): probability that the findings observed could have occurred due to chance alone. p-value usually set at 0.05
68 Interpreting Confidence Intervals
69 Interpreting Confidence Intervals RR = 2 People who were exposed were twice as likely to have the outcome than people who were not exposed RR = 1 People who were exposed were no more or less likely to have the outcome than people who were not exposed RR = 0.5 People who were exposed were half as likely to have the outcome than people who were not exposed Potential Risk Factor RR 95% CI A B C D E
70 Interpreting Confidence Intervals RR = 2 People who were exposed were twice as likely to have the outcome than people who were not exposed RR = 1 People who were exposed were no more or less likely to have the outcome than people who were not exposed RR = 0.5 People who were exposed were half as likely to have the outcome than people who were not exposed Potential Risk Factor RR 95% CI A B C D E
71 Interpreting p-values What the p-values does NOT mean Not the probability of the null hypothesis being true And 1 (p-value) is not the probability of the alternative hypothesis being true Not the probability that your finding is due to random chance Not the probability of a repeat study finding the same result Does not indicate the importance of the observed effect
72 Interpreting p-values The probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true = probability of making a type I error P-value for significance is set when study is designed If p >.05, then the results are considered not statistically significant. If.01 p <.05, then the results are significant. If.001 p <.01, then the results are highly significant. If p <.001, then the results are very highly significant.
73 Interpreting p-values Study A found that the patient s average length of stay was associated with C. difficile colitis (p = 0.002). Highly Significant Study B found that men were more likely to develop a BSI than women (p = 0.09) Not Significant Study C found that redheads were 2% more likely to decline influenza vaccination than blondes (p = ) Statistically Significant, Probably Not Important
74 Inferential Statistics Parametric Tests Normal distribution of the sample population Usually continuous-interval variables z Test Student s t Test
75 z Tests Test the difference in means / proportions Calculates the ratio of the difference between means to the SE Use when: Sample size is greater than 30 Standard deviation (SD) is known Example: Comparing your mean infection rate to NHSN mean rates
76 t Tests
77 t Tests Test the difference in means (one or two tailed) Use when: Sample size is less than 30 Assumes Independence of populations & values Variance is equal for both sets of data No confounding variables Types of t Tests: Independent sample (experiment vs. control) Paired sample (before and after)
78 t Tests
79 t Tests
80 Inferential Statistics Non-Parametric Tests Do not assume normal distribution Used with more types of data: Nominal, Ordinal, Interval, Discrete Chi Square (X 2 ) Compares observed values against expected values Example: Comparing SSI rates for Dr. X and Dr. Y quare/chiexcel.htm
81 Chi square
82 Role of Statistics in Infection Prevention Aid in organizing and summarizing data Can tell you what is happening in your facility Make inferences about data Possible causes of disease or effects of IP interventions Communicate findings Monitor the impact of interventions Prepare reports to share the information Make changes to improve outcomes
83 Useful Resources: APIC EpiGraphics: Statistics and Surveillance Tools for IPs APIC Manual, Chapter 5 Use of Statistics. (2009) PDQ Statistics. GR Norman & DL Streiner (2003). BC Decker. Fundamentals of Biostatistics. B Rosner (2000). Brooks/Cole. Epidemiology for Public Health Practice. RH Friis & TA Sellers (2004). Jones and Bartlett Publishers, Inc. Excel Hacks. DE Hawley (2007). O Reilly Free statistics calculators Free online epi training from North Carolina Center of Public Health Preparedness
84 Useful Resources:
85 Questions? Questions?
86 Images From: National prevalence of methicillin-resistant Staphylococcus aureus in inpatients at US health care facilities, W Jarvis et al. AJIC December 2007 National Healthcare Safety Network (NHSN) report: Data summary for 2006 through 2008, issued December Edwards et al. AJIC December PDQ Statistics. GR Norman & DL Streiner (2003). BC Decker. Pertussis: A Disease Affecting All Ages. DS Gregory. American Family Physician. August Summarizing Your Data. Science Buddies. marizing_data.shtml
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationCHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13
COMMON DESCRIPTIVE STATISTICS / 13 CHAPTER THREE COMMON DESCRIPTIVE STATISTICS The analysis of data begins with descriptive statistics such as the mean, median, mode, range, standard deviation, variance,
More informationBasic research methods. Basic research methods. Question: BRM.2. Question: BRM.1
BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects
More informationIntroduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
More informationStatistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
More informationSTA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
More informationMeasures of Central Tendency and Variability: Summarizing your Data for Others
Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :
More informationDescriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
More informationIntroduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More informationDATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationStatistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationMEASURES OF VARIATION
NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are
More informationPie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.
Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More informationCHAPTER THREE. Key Concepts
CHAPTER THREE Key Concepts interval, ordinal, and nominal scale quantitative, qualitative continuous data, categorical or discrete data table, frequency distribution histogram, bar graph, frequency polygon,
More informationWHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
More informationBiostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
More informationHow To Write A Data Analysis
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationBiostatistics: Types of Data Analysis
Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationWeek 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
More informationLesson 4 Measures of Central Tendency
Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central
More informationDESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1
DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationMeasurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement
Measurement & Data Analysis Overview of Measurement. Variability & Measurement Error.. Descriptive vs. Inferential Statistics. Descriptive Statistics. Distributions. Standardized Scores. Graphing Data.
More informationGuide to Biostatistics
MedPage Tools Guide to Biostatistics Study Designs Here is a compilation of important epidemiologic and common biostatistical terms used in medical research. You can use it as a reference guide when reading
More informationThis chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
More informationStat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
More informationThere are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month
More informationFoundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
More informationCertified in Public Health (CPH) Exam CONTENT OUTLINE
NATIONAL BOARD OF PUBLIC HEALTH EXAMINERS Certified in Public Health (CPH) Exam CONTENT OUTLINE April 2014 INTRODUCTION This document was prepared by the National Board of Public Health Examiners for the
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More informationChi-square test Fisher s Exact test
Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions
More informationSECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9.
SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION Friday 14 March 2008 9.00-9.45 am Attempt all ten questions. For each question, choose the
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More informationMidterm Review Problems
Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationCompetency 1 Describe the role of epidemiology in public health
The Northwest Center for Public Health Practice (NWCPHP) has developed competency-based epidemiology training materials for public health professionals in practice. Epidemiology is broadly accepted as
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationDATA COLLECTION AND ANALYSIS
DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss
More informationWeek 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
More information4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"
Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses
More informationMeans, standard deviations and. and standard errors
CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More informationGuided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.
Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the
More informationThe correlation coefficient
The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative
More informationDescriptive statistics parameters: Measures of centrality
Descriptive statistics parameters: Measures of centrality Contents Definitions... 3 Classification of descriptive statistics parameters... 4 More about central tendency estimators... 5 Relationship between
More informationTest Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table
ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live
More informationChapter 1: The Nature of Probability and Statistics
Chapter 1: The Nature of Probability and Statistics Learning Objectives Upon successful completion of Chapter 1, you will have applicable knowledge of the following concepts: Statistics: An Overview and
More informationMind on Statistics. Chapter 2
Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and cross-tabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More informationIntroduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
More informationNormal Range. Reference Values. Types of Reference Ranges. Reference Range Study. Reference Values
Advanced Laboratory Medicine Post-Analytic Issues: Reference Ranges and Interpretation Daniel J. Fink, MD, MPH Director, Core Laboratory New York Presbyterian Hospital Columbia University Medical Center
More informationDESCRIPTIVE STATISTICS & DATA PRESENTATION*
Level 1 Level 2 Level 3 Level 4 0 0 0 0 evel 1 evel 2 evel 3 Level 4 DESCRIPTIVE STATISTICS & DATA PRESENTATION* Created for Psychology 41, Research Methods by Barbara Sommer, PhD Psychology Department
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationDATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University
DATA ANALYSIS QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University Quantitative Research What is Statistics? Statistics (as a subject) is the science
More informationIS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION
Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
More informationInclusion and Exclusion Criteria
Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding
More informationOutline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test
The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation
More informationStatistical tests for SPSS
Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationUNIVERSITY OF NAIROBI
UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationCase-control studies. Alfredo Morabia
Case-control studies Alfredo Morabia Division d épidémiologie Clinique, Département de médecine communautaire, HUG Alfredo.Morabia@hcuge.ch www.epidemiologie.ch Outline Case-control study Relation to cohort
More informationSOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS
SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION SOLUTIONS 1. a. To calculate the mean, we just add up all 7 values, and divide by 7. In Xi i= 1 fancy
More informationDescriptive Statistics and Exploratory Data Analysis
Descriptive Statistics and Exploratory Data Analysis Dean s s Faculty and Resident Development Series UT College of Medicine Chattanooga Probasco Auditorium at Erlanger January 14, 2008 Marc Loizeaux,
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationProbability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce
More informationIntroduction; Descriptive & Univariate Statistics
Introduction; Descriptive & Univariate Statistics I. KEY COCEPTS A. Population. Definitions:. The entire set of members in a group. EXAMPLES: All U.S. citizens; all otre Dame Students. 2. All values of
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More informationAnalyzing Data in NHSN
Analyzing Data in NHSN Lindsey Weiner, MPH Epidemiologist Alabama Hospital Quality Initiative Webinar September 16, 2015 National Center for Emerging and Zoonotic Infectious Diseases Division of Healthcare
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationSimple Random Sampling
Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationSAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.
SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More information