Rearranging Stata s output for the analysis of epidemiological tables
|
|
|
- Amberlynn Skinner
- 8 years ago
- Views:
Transcription
1 Rearranging Stata s output for the analysis of epidemiological tables 5ª Reunión de Usuarios Españoles de Stata Barcelona, 12/09/2012 Aurelio Tobías IDAEA-CSIC, Barcelona [email protected]
2 Background In epidemiological research we usually investigate the relationship between a binary response (C/nC) and a binary exposure (E/nE) Risk Exposure Outcome
3 Background A major capability in Stata is the analysis of epidemiological tables by using any of the epitab commands These report measures of frequency (proportion or odds), association (risk difference, relative risk, or odds ratio) and impact on public health (attributable risks)
4 . use example 1, clear Contains data from example1.dta obs: 8 vars: storage display value variable name type format label variable label y byte %8.0g cc Response x byte %8.0g yn Exposure z byte %8.0g yn Other N int %8.0g Sample size list, noobs y x z N control no no 131 control no yes 234 control yes no 650 control yes yes 295 case no no 122 case no yes 78 case yes no 660 case yes yes expand N (2612 observations created)
5 . cs y x Exposure Exposed Unexposed Total Cases Noncases Total Risk Point estimate [95% Conf. Interval] Risk difference Risk ratio Attr. frac. ex Attr. frac. pop chi2(1) = Pr>chi2 =
6 . cc y x Proportion Exposed Unexposed Total Exposed Cases Controls Total Point estimate [95% Conf. Interval] Odds ratio Attr. frac. ex Attr. frac. pop chi2(1) = Pr>chi2 =
7 . cc y x Exposed Unexposed Total Cases Controls Total Odds Point estimate [95% Conf. Interval] Odds ratio Attr. frac. ex Attr. frac. pop chi2(1) = Pr>chi2 =
8 . tabodds y x x cases controls odds [95% Conf. Interval] no yes tabodds y x, or x Odds Ratio chi2 P>chi2 [95% Conf. Interval] no yes mhodds y x Maximum likelihood estimate of the odds ratio Comparing x==1 vs. x== Odds Ratio chi2(1) P>chi2 [95% Conf. Interval]
9 Stratified analysis But... there could be more variables involved in this relationship Risk? Exposure Outcome Third variable
10 Stratified analysis Analyse the relationship between the binary response (C/nC) and the binary exposure (E/nE) for each stratum taken by a third variable (M/F) Third variable Exposure M Exposure F Risk M Risk F Outcome M Outcome F
11 Stratified analysis Analyse the relationship between the binary response (C/nC) and the binary exposure (E/nE) for each stratum taken by a third variable (M/F) Third variable Exposure M Exposure F Risk M Risk F Outcome M Outcome F Do specific-stratum estimates look similar?
12 Effect modification Variation in the magnitude of measure of effect across stratums of a third variable (subgroups of population) Can be tested for by using an statistical test for homogeneity
13 How to conduct an stratified analysis? Crude analysis Stratified analysis Do stratum-specific estimates look similar? YES NO EFFECT MODIFICATION (Report estimates by stratum)
14 How to conduct an stratified analysis? Crude analysis Stratified analysis Do stratum-specific estimates look similar? YES NO EFFECT MODIFICATION (Report estimates by stratum)
15 Confounding Distortion in the of measure of effect because of a third variable Exposure Outcome Third variable Be associated with outcome, independently of exposure Be associated with exposure, without being the consequence of exposure
16 Confounding A confounder influences (bias) the effect of a exposure in the same way for everyone Needs to be adjusted for There is not statistical test to check for confounding, it is usually assessed by comparing crude and adjusted estimates
17 How to conduct an stratified analysis? Crude analysis Stratified analysis Do stratum-specific estimates look similar? YES Adjusted analysis Do adjusted estimates look similar to crude estimates? NO EFFECT MODIFICATION (Report estimates by stratum)
18 How to conduct an stratified analysis? Crude analysis Stratified analysis Do stratum-specific estimates look similar? YES Adjusted analysis Do adjusted estimates look similar to crude estimates? NO EFFECT MODIFICATION (Report estimates by stratum) YES NO CONFOUNDING (Report adjusted estimates)
19 Stratified analysis Any of the epitab commands jointly with the by() option allow to run stratified analyses reporting; specific stratum measures of association, a test of homogeneity, as well as the crude and adjusted estimates These allow to epidemiologists to check for effect modification or to assess for confounding However users are still being confused about how Stata reports stratified analyses
20 . cc y x, by(z) Other OR [95% Conf. Interval] M-H Weight no yes Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = Pr>chi2 =
21 . cc y x, by(z) Other OR [95% Conf. Interval] M-H Weight no yes Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = Pr>chi2 = Do stratum-specific estimates look similar? Misleading p-value
22 . cc y x, by(z) Other OR [95% Conf. Interval] M-H Weight no yes Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 = Test that combined OR = 1: H 0 : OR=1 Mantel-Haenszel chi2(1) = Pr>chi2 = Misleading p-value
23 . cc y x, by(z) Other OR [95% Conf. Interval] M-H Weight no yes Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = Pr>chi2 = Misleading p-values No p-values for specific-stratum and crude estimates Missing the relative change between the crude and adjusted estimates
24 . mhodds y x, by(z) Maximum likelihood estimate of the odds ratio Comparing x==1 vs. x==0 by z z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] no yes Mantel-Haenszel estimate controlling for z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] Test of homogeneity of ORs (approx): chi2(1) = Pr>chi2 =
25 . mhodds y x, by(z) Maximum likelihood estimate of the odds ratio Comparing x==1 vs. x==0 by z z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] no yes Mantel-Haenszel estimate controlling for z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] Test of homogeneity of ORs (approx): chi2(1) = Pr>chi2 = Do stratum-specific estimates look similar? Another misleading p-value
26 . mhodds y x, by(z) Maximum likelihood estimate of the odds ratio Comparing x==1 vs. x==0 by z z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] no yes Mantel-Haenszel estimate controlling for z Odds Ratio chi2(1) P>chi2 [95% Conf. Interval] Test of homogeneity of ORs (approx): chi2(1) = Pr>chi2 = Another misleading p-value Missing the crude estimate, and the relative change between the crude and adjusted estimates
27 Rearranging the output Rearrange the output of the epitab commands from the scheme of a classic epidemiological analysis Specific stratum estimates, with 95%CI and p-values Test of homogeneity (to check for effect modification) Crude and adjusted estimates, with 95%CI and p-values Relative change between them (to assess for confounding)
28 . mymhodds y x, by(z) Stratified analysis with test for effect modification z Odds Ratio [95% Conf. Interval] chi2(1) P>chi no yes Test of homogeneity of ORs (approx): chi2(1) = 48.86, P>chi2 = Adjusted analysis with assessment for confounding Odds Ratio [95% Conf. Interval] chi2(1) P>chi Crude Adjusted Adjusted/crude relative change = -0.31%
29 . use example2, clear. expand N (2612 observations created). mymhodds y x, by(z) Stratified analysis with test for effect modification z Odds Ratio [95% Conf. Interval] chi2(1) P>chi no yes Test of homogeneity of ORs (approx): chi2(1) = 0.00, P>chi2 = Adjusted analysis with assessment for confounding Odds Ratio [95% Conf. Interval] chi2(1) P>chi Crude Adjusted Adjusted/crude relative change = %
30 Rearranging the output Rearrange the output of the epitab commands from the scheme of a classic epidemiological analysis Specific stratum estimates, with 95%CI and p-values Test of homogeneity (to check for effect modification) Crude and adjusted estimates, with 95%CI and p-values Relative change between them (to assess for confounding) Furthermore, graphs are always helpful and nice!
31 . use example1, clear. expand N. mymhodds y x, by(z) graphs
32 . use example1, clear. expand N. mymhodds y x, by(z) graphs. use example2, clear. expand N. mymhodds y x, by(z) graphs
33 Conclusions Stata is mainly used in epidemiological research thanks to the epitab commands The cc command must also display the odds epitab commands with capabilities to run stratified analysis with the by() option (cs, cc, ir, and mhodds) must rearrange their output to address effect modification and confounding in a easiest way diagt (SJ4-4) command should also be included in epitab to cover all types of study designs in epidemiological research
34 Rearranging Stata s output for the analysis of epidemiological tables 5ª Reunión de Usuarios Españoles de Stata Barcelona, 12/09/2012 Aurelio Tobías IDAEA-CSIC, Barcelona [email protected]
How to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
Lecture 1: Introduction to Epidemiology
Lecture 1: Introduction to Epidemiology Lecture 1: Introduction to Epidemiology Dankmar Böhning Department of Mathematics and Statistics University of Reading, UK Summer School in Cesme, May/June 2011
Organizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
III. INTRODUCTION TO LOGISTIC REGRESSION. a) Example: APACHE II Score and Mortality in Sepsis
III. INTRODUCTION TO LOGISTIC REGRESSION 1. Simple Logistic Regression a) Example: APACHE II Score and Mortality in Sepsis The following figure shows 30 day mortality in a sample of septic patients as
Competing-risks regression
Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview
RATIOS, PROPORTIONS, PERCENTAGES, AND RATES
RATIOS, PROPORTIOS, PERCETAGES, AD RATES 1. Ratios: ratios are one number expressed in relation to another by dividing the one number by the other. For example, the sex ratio of Delaware in 1990 was: 343,200
Case-control studies. Alfredo Morabia
Case-control studies Alfredo Morabia Division d épidémiologie Clinique, Département de médecine communautaire, HUG [email protected] www.epidemiologie.ch Outline Case-control study Relation to cohort
Statistics 305: Introduction to Biostatistical Methods for Health Sciences
Statistics 305: Introduction to Biostatistical Methods for Health Sciences Modelling the Log Odds Logistic Regression (Chap 20) Instructor: Liangliang Wang Statistics and Actuarial Science, Simon Fraser
Quick Stata Guide by Liz Foster
by Liz Foster Table of Contents Part 1: 1 describe 1 generate 1 regress 3 scatter 4 sort 5 summarize 5 table 6 tabulate 8 test 10 ttest 11 Part 2: Prefixes and Notes 14 by var: 14 capture 14 use of the
Module 14: Missing Data Stata Practical
Module 14: Missing Data Stata Practical Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine www.missingdata.org.uk Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724
Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Confounding in health research
Confounding in health research Part 1: Definition and conceptual issues Madhukar Pai, MD, PhD Assistant Professor of Epidemiology McGill University [email protected] 1 Why is confounding so important
Introduction. Survival Analysis. Censoring. Plan of Talk
Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.
Directions for Frequency Tables, Histograms, and Frequency Bar Charts
Directions for Frequency Tables, Histograms, and Frequency Bar Charts Frequency Distribution Quantitative Ungrouped Data Dataset: Frequency_Distributions_Graphs-Quantitative.sav 1. Open the dataset containing
Comparison of Survival Curves
Comparison of Survival Curves We spent the last class looking at some nonparametric approaches for estimating the survival function, Ŝ(t), over time for a single sample of individuals. Now we want to compare
Section 12 Part 2. Chi-square test
Section 12 Part 2 Chi-square test McNemar s Test Section 12 Part 2 Overview Section 12, Part 1 covered two inference methods for categorical data from 2 groups Confidence Intervals for the difference of
Chapter 7: Effect Modification
A short introduction to epidemiology Chapter 7: Effect Modification Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand Chapter 8 Effect modification Concepts of interaction
P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data)
22S:101 Biostatistics: J. Huang 1 Bayes Theorem For two events A and B, if we know the conditional probability P (B A) and the probability P (A), then the Bayes theorem tells that we can compute the conditional
Lesson 14 14 Outline Outline
Lesson 14 Confidence Intervals of Odds Ratio and Relative Risk Lesson 14 Outline Lesson 14 covers Confidence Interval of an Odds Ratio Review of Odds Ratio Sampling distribution of OR on natural log scale
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to
Clinical Study Design and Methods Terminology
Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical
March 28 2011 ABSTRACT
March 28 2011 A registry based comparative cohort study in four Swedish counties of the risk for narcolepsy after vaccination with Pandemrix - A first and preliminary report, by the Medical Products Agency.
The first three steps in a logistic regression analysis with examples in IBM SPSS. Steve Simon P.Mean Consulting www.pmean.com
The first three steps in a logistic regression analysis with examples in IBM SPSS. Steve Simon P.Mean Consulting www.pmean.com 2. Why do I offer this webinar for free? I offer free statistics webinars
A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano
1 A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano Grupo Latinoamericano de Investigaciones Epidemiológicas, Organización Latinoamericana para el Fomento
Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY
Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY ABSTRACT: This project attempted to determine the relationship
Northumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
Chi-square test Fisher s Exact test
Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions
From this it is not clear what sort of variable that insure is so list the first 10 observations.
MNL in Stata We have data on the type of health insurance available to 616 psychologically depressed subjects in the United States (Tarlov et al. 1989, JAMA; Wells et al. 1989, JAMA). The insurance is
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Odds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
Handling missing data in Stata a whirlwind tour
Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled
Lecture 19: Conditional Logistic Regression
Lecture 19: Conditional Logistic Regression Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina
Introduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
Paper PO06. Randomization in Clinical Trial Studies
Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection
Confounding in Epidemiology
The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board. Confounding in Epidemiology Mona Baumgarten Department of Epidemiology
Two Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015
Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015 References: Long 1997, Long and Freese 2003 & 2006 & 2014,
Dealing with Missing Data
Dealing with Missing Data Roch Giorgi email: [email protected] UMR 912 SESSTIM, Aix Marseille Université / INSERM / IRD, Marseille, France BioSTIC, APHM, Hôpital Timone, Marseille, France January
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
TESTS OF 1 MHZ SIGNAL SOURCE FOR SPECTRUM ANALYZER CALIBRATION 7/8/08 Sam Wetterlin
TESTS OF 1 MHZ SIGNAL SOURCE FOR SPECTRUM ANALYZER CALIBRATION 7/8/08 Sam Wetterlin (Updated 7/19/08 to delete sine wave output) I constructed the 1 MHz square wave generator shown in the Appendix. This
Introduction to StatsDirect, 11/05/2012 1
INTRODUCTION TO STATSDIRECT PART 1... 2 INTRODUCTION... 2 Why Use StatsDirect... 2 ACCESSING STATSDIRECT FOR WINDOWS XP... 4 DATA ENTRY... 5 Missing Data... 6 Opening an Excel Workbook... 6 Moving around
Chapter 3. Sampling. Sampling Methods
Oxford University Press Chapter 3 40 Sampling Resources are always limited. It is usually not possible nor necessary for the researcher to study an entire target population of subjects. Most medical research
Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done?
272 Occup Environ Med 1998;55:272 277 Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done? Mary Lou Thompson, J E Myers, D Kriebel Department of Biostatistics,
Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
1 Intel Smart Connect Technology Installation Guide:
1 Intel Smart Connect Technology Installation Guide: 1.1 System Requirements The following are required on a system: System BIOS supporting and enabled for Intel Smart Connect Technology Microsoft* Windows*
Complex Survey Design Using Stata
Complex Survey Design Using Stata 2010 This document provides a basic overview of how to handle complex survey design using Stata. Probability weighting and compensating for clustered and stratified samples
Guide to Biostatistics
MedPage Tools Guide to Biostatistics Study Designs Here is a compilation of important epidemiologic and common biostatistical terms used in medical research. You can use it as a reference guide when reading
Audit Sampling 101. BY: Christopher L. Mitchell, MBA, CIA, CISA, CCSA [email protected]
Audit Sampling 101 BY: Christopher L. Mitchell, MBA, CIA, CISA, CCSA [email protected] BIO Principal KBA s Risk Advisory Services Team 15 years of internal controls experience within the following
Sampling and Sampling Distributions
Sampling and Sampling Distributions Random Sampling A sample is a group of objects or readings taken from a population for counting or measurement. We shall distinguish between two kinds of populations
Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
Tests for Two Proportions
Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics
CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS
TABLES, CHARTS, AND GRAPHS / 75 CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS Tables, charts, and graphs are frequently used in statistics to visually communicate data. Such illustrations are also a frequent
Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
FACILITATOR/MENTOR GUIDE
FACILITATOR/MENTOR GUIDE Descriptive analysis variables table shells hypotheses Measures of association methods design justify analytic assess calculate analysis problem stratify confounding statistical
Prospective, retrospective, and cross-sectional studies
Prospective, retrospective, and cross-sectional studies Patrick Breheny April 3 Patrick Breheny Introduction to Biostatistics (171:161) 1/17 Study designs that can be analyzed with χ 2 -tests One reason
Statistics for Sports Medicine
Statistics for Sports Medicine Suzanne Hecht, MD University of Minnesota ([email protected]) Fellow s Research Conference July 2012: Philadelphia GOALS Try not to bore you to death!! Try to teach
Childhood leukemia and EMF
Workshop on Sensitivity of Children to EMF Istanbul, Turkey June 2004 Childhood leukemia and EMF Leeka Kheifets Professor Incidence rate per 100,000 per year 9 8 7 6 5 4 3 2 1 0 Age-specific childhood
Module 223 Major A: Concepts, methods and design in Epidemiology
Module 223 Major A: Concepts, methods and design in Epidemiology Module : 223 UE coordinator Concepts, methods and design in Epidemiology Dates December 15 th to 19 th, 2014 Credits/ECTS UE description
Laboratory 3 Type I, II Error, Sample Size, Statistical Power
Laboratory 3 Type I, II Error, Sample Size, Statistical Power Calculating the Probability of a Type I Error Get two samples (n1=10, and n2=10) from a normal distribution population, N (5,1), with population
From the help desk: hurdle models
The Stata Journal (2003) 3, Number 2, pp. 178 184 From the help desk: hurdle models Allen McDowell Stata Corporation Abstract. This article demonstrates that, although there is no command in Stata for
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Examples of Data Representation using Tables, Graphs and Charts
Examples of Data Representation using Tables, Graphs and Charts This document discusses how to properly display numerical data. It discusses the differences between tables and graphs and it discusses various
Introduction; Descriptive & Univariate Statistics
Introduction; Descriptive & Univariate Statistics I. KEY COCEPTS A. Population. Definitions:. The entire set of members in a group. EXAMPLES: All U.S. citizens; all otre Dame Students. 2. All values of
Chapter 18. Effect modification and interactions. 18.1 Modeling effect modification
Chapter 18 Effect modification and interactions 18.1 Modeling effect modification weight 40 50 60 70 80 90 100 male female 40 50 60 70 80 90 100 male female 30 40 50 70 dose 30 40 50 70 dose Figure 18.1:
Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA
Basic of Epidemiology in Ophthalmology Rajiv Khandekar Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA Basics of Epidemiology in Ophthalmology Dr Rajiv Khandekar
How To Become A Clinical Epidemiologist
CHS Graduate Student & in Description Additional Admission MSc PhD MSc Healthcare MSc Clinical is the study of the distribution of diseases in populations and of factors that influence the occurrence of
Logit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
Tutorial for Tracker and Supporting Software By David Chandler
Tutorial for Tracker and Supporting Software By David Chandler I use a number of free, open source programs to do video analysis. 1. Avidemux, to exerpt the video clip, read the video properties, and save
Topic 3 - Survival Analysis
Topic 3 - Survival Analysis 1. Topics...2 2. Learning objectives...3 3. Grouped survival data - leukemia example...4 3.1 Cohort survival data schematic...5 3.2 Tabulation of events and time at risk...6
HITACHI INVERTER SJ/L100/300 SERIES PID CONTROL USERS GUIDE
HITACHI INVERTER SJ/L1/3 SERIES PID CONTROL USERS GUIDE After reading this manual, keep it for future reference Hitachi America, Ltd. HAL1PID CONTENTS 1. OVERVIEW 3 2. PID CONTROL ON SJ1/L1 INVERTERS 3
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
Using Big Datasets in Stata
Using Big Datasets in Stata Josh Klugman and Min Gong Department of Sociology Indiana University On the sociology LAN, Stata is configured to use 12 megabytes (MB) of memory, which means that the user
Algebra 2 C Chapter 12 Probability and Statistics
Algebra 2 C Chapter 12 Probability and Statistics Section 3 Probability fraction Probability is the ratio that measures the chances of the event occurring For example a coin toss only has 2 equally likely
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
Current Yield Calculation
Current Yield Calculation Current yield is the annual rate of return that an investor purchasing a security at its market price would realize. Generally speaking, it is the annual income from a security
Methods for Meta-analysis in Medical Research
Methods for Meta-analysis in Medical Research Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University
This chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
SUMAN DUVVURU STAT 567 PROJECT REPORT
SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.
Cool Tools for PROC LOGISTIC
Cool Tools for PROC LOGISTIC Paul D. Allison Statistical Horizons LLC and the University of Pennsylvania March 2013 www.statisticalhorizons.com 1 New Features in LOGISTIC ODDSRATIO statement EFFECTPLOT
DESCRIPTIVE RESEARCH DESIGNS
DESCRIPTIVE RESEARCH DESIGNS Sole Purpose: to describe a behavior or type of subject not to look for any specific relationships, nor to correlate 2 or more variables Disadvantages since setting is completely
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
TU e. Advanced Algorithms: experimentation project. The problem: load balancing with bounded look-ahead. Input: integer m 2: number of machines
The problem: load balancing with bounded look-ahead Input: integer m 2: number of machines integer k 0: the look-ahead numbers t 1,..., t n : the job sizes Problem: assign jobs to machines machine to which
Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual
Cohort Analysis for Genetic Epidemiology (C. A.G. E.) User Reference Manual CAGE is a UNIX based program, which calculates the standardized cancer incidence ratios (Observed / Expected) with 95% confidence
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
Chapter 6. Inequality Measures
Chapter 6. Inequality Measures Summary Inequality is a broader concept than poverty in that it is defined over the entire population, and does not only focus on the poor. The simplest measurement of inequality
Basics of STATA. 1 Data les. 2 Loading data into STATA
Basics of STATA This handout is intended as an introduction to STATA. STATA is available on the PCs in the computer lab as well as on the Unix system. Throughout, bold type will refer to STATA commands,
Analysis and Interpretation of Clinical Trials. How to conclude?
www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support
Statistics in Retail Finance. Chapter 2: Statistical models of default
Statistics in Retail Finance 1 Overview > We consider how to build statistical models of default, or delinquency, and how such models are traditionally used for credit application scoring and decision
