PREDICTION OF INDIVIDUAL CELL FREQUENCIES IN THE COMBINED 2 2 TABLE UNDER NO CONFOUNDING IN STRATIFIED CASE-CONTROL STUDIES

Size: px
Start display at page:

Download "PREDICTION OF INDIVIDUAL CELL FREQUENCIES IN THE COMBINED 2 2 TABLE UNDER NO CONFOUNDING IN STRATIFIED CASE-CONTROL STUDIES"

Transcription

1 International Journal of Mathematical Sciences Vol. 10, No. 3-4, July-December 2011, pp Serials Publications PREDICTION OF INDIVIDUAL CELL FREQUENCIES IN THE COMBINED 2 2 TABLE UNDER NO CONFOUNDING IN STRATIFIED CASE-CONTROL STUDIES A. K. Dixit & F. Ansari Abstract: In stratified case-control studies, the individual cell frequencies in the combined table are so predicted that the simple Odd Ratio (OR) computed from this table approximates Mantel-Haenszel odd ratio (ORMH). This provides useful information about the distribution of cell frequencies in the combined table under no-confounding, which is of much use in health promotion, planning and management. The procedure first nullifies the effective strength of association of the two components of the confounder and then uses linear regression between the cell counts of matched pathways drawn to connect attributes providing each cell frequency in combined table, through presence and absence of confounder. The procedure when applied in considered 30 case-control situations produced OR close to ORMH in terms of the ratio of ORMH to it. On an average, this ratio was (SD = 0.10). Underestimation was however noted (in cases, ORMH > 1); but with average of the considered ratio (SD = 0.08). Keywords: Case-control studies, Cell frequencies, Distribution, Confounding, Pathways, Prediction, Regression. 1. INTRODUCTION In a stratified case-control study, the distribution of cell frequencies in the combined 2 2 table (combined table), which is the 2 2 table with its cell frequencies as the sum of respective individual cell frequencies in the 2 2 tables formed across the strata, is their distribution under confounding. Though, the measure, such as Mantel-Haenszel Odd Ratio (ORMH), provides reasonable overall estimate of the Odd Ratio ( OR), addressing confounding through stratification [1] but such measures do not provide the distribution of cell frequencies in the combined table under no confounding. Today, when we talk more of health than disease, it becomes of much interest to know the distribution of cell frequencies in the combined table in absence of confounding. The health promotion targets, contrary to disease prevention targets not only aim at reducing the risk factor prevalence among the diseased but simultaneously to enhance absence of risk factor among the non diseased, e.g. in an example of health promotion [2], the targets set were to reduce tobacco consumption by 50% among the diseased as well as to increase nonsmoking to 80% among non diseased.

2 412 A. K. Dixit & F. Ansari Here for proper targets setting, it becomes important to assess beforehand, the distribution of cell frequencies in the combined table under no confounding. The paper describes a procedure to predict individual cell frequencies in the combined table such that the simply computed OR from this table approximates well the ORMH. As such, then it provides an idea of the distribution pattern of cell frequencies in the combined table, had there been no-confounding. The procedure derives first the effective strength of association of the two components of the confounder and nullifies it. The prediction of individual cell frequencies in the combined table then is done using linear regression between the cell counts of matched pathways drawn to connect attributes providing each cell frequency in the combined table, through presence/absence of the confounder. 2. MATERIAL AND METHOD 30 situations of case-control studies were considered, in which only two categories of a confounder was considered, for the purpose of demonstration. Thus, each study considered was described by the two given 2 2 tables pertaining to the two categories of a confounder. The cell counts in these tables, usually denoted by letters, a, b, c and d; read (R +, D + ), (R +, D ), (R, D + ) and (R, D ) combinations for the presence/absence of disease (D + /D ) and the risk factor (R + /R ). The thirty situations considered included examples of stratified case-control studies from the literature and also the simulated situations, so as to cover wide variation in ORMH values ( ), which Figure in all most all practical situations. First, the strength of association of the two components of the confounder in terms of OR was deduced from the given two 2 2 tables in each study, by constructing two more 2 2 tables; one to provide OR 2, the strength of association of the confounder with the disease and the other to provide OR 3, its strength of association with the risk factor. For this, one of the two given tables pertaining to two categories of the confounder was designated with presence of confounder (C + ), for which number of diseased (D + ) was more and then the other with absence of confounder (C ). The a, b, c, d counts of the first 2 2 table constructed, read respectively (C +, D + ), (C +, D ), (C, D + ), and (C, D ) combinations. Naturally, the first row of this constructed table is column totals of the given 2 2 table, designated with presence of confounder. The second row of this table then is the column totals of the given 2 2 table designated with absence of confounder. The OR of this constructed table provided OR 2. Similarly, for the other 2 2 table constructed, the usual a, b, c, d counts read respectively ( R +, C + ), ( R +, C ), ( R, C + ), and ( R, C ). The first column of the second constructed table read the row totals of the given table, designated with presence of confounder and the second column of it read the row totals of the given table, designated with absence of confounder. The OR of second constructed table provided OR 3.

3 Prediction of Individual Cell Frequencies in the Combined 2 2 Table Under In state of no-confounding, the two components of the confounder should be independent of their associations. Accordingly for no-confounding state, individual cell frequencies in the tables providing OR 2 and OR 3 must replace themselves with their expected frequencies under independence. With the replacement of cell frequencies, providing OR 2 and OR 3 by their expected number under independence; two more 2 2 tables were obtained as to provide respectively OR 2 and OR 3, both equal to unity. It was noted from the 2 2 tables, constructed to provide OR 2 and OR 3 that to reach from R + to D + (R + D + ) in the combined table, we have two pathways through the presence (C + ) and absence (C ) of the confounder, viz. R + C + D + and R + C D + and so is the case under the state of no-confounding. To get the predicted frequency of ( R +, D + ) count in the combined table, the similar pathways counts under confounding and noconfounding states; viz the count (R +, C + ) of the first constructed table (confounding) and then this count from the second constructed table (no confounding) were paired as ( x, y) pair and so also the counts (C +, D + ), (R +, C ) and (C, D + ) were paired. These pairs were then regressed under linear regression. To this regression, when the observed ( R +, D + ) value of the combined table was put as x value, the y value, it yielded was put as predicted (R +, D + ) frequency in the combined table. Likewise, the predicted frequencies of (R +, D ), (R, D + ), (R, D ) counts in the combined table were obtained. The simple OR of these predicted frequencies in the combined table approximated ORMH well. The procedure is illustrated with an example at the end. 3. RESULTS The simple OR computed from the predicted cell frequencies in the combined table closely estimated in each case the ORMH, in terms of the ratio of ORMH to it. The average of this ratio was found (SD = 0.10). This means, the relative odds provided by ORMH and its obtained estimate from the predicted cell frequencies have been close enough. In 57% of cases however, underestimation was noted but with average of the considered ratio (SD = 0.08). The underestimation was found more in case, ORMH > 1with average ratio of (SD = 0.09) compared to (SD = 0.03), in case, ORMH < DISCUSSIONS The procedure reported here predicts individual cell frequencies in the combined table, had there been no confounding and as such, the simple OR computed from this table approximates ORMH. Earlier, this table used to provide crude rate ratio (CRR) only. This amounts to say that the distribution of cell frequencies in the original combined table (under the state of confounding) is so changed through the predicted cell frequencies that

4 414 A. K. Dixit & F. Ansari it becomes the distribution under the state of no confounding. The approximation is looked in terms of the ratio of ORMH to its obtained estimate, as the concern has been the assessment of the strength of association of the risk factor with the disease by the two. The average value of the ratio of ORMH to its estimate is found to be (SD = 0.10), which shows the closeness between the two as far as the assessment of the strength of association of risk factor with the disease by the two is concerned. It is observed that there is a bit underestimation, in particular in case, when ORMH > 1 but the magnitude of underestimation is just 0.08 times only. The procedure thus reasonably approximates ORMH, in the described sense. Though, in the studied situations, there was no such situation in which the ratio of ORMH to its estimate lied beyond the interval ( ) but as the procedure uses regression, outliers may occur at times. The procedure described has various applications. An immediate is to comprehend the distribution pattern of the cell frequencies in the combined table, had there been noconfounding, which is of much use in health promotion, where the targets are simultaneously set to reduce the prevalence of the risk factor among the diseased and to enhance the absence of it among the non diseased. The assessment of the proportion of diseased with the risk factor (exposure), when there is no confounding can also be done from here, which is used in public health planning [3]. Like wise, the proportion of non- exposed with the disease could be assessed (in absence of confounding), which has its uses in risk management [4]. In obtaining the predicted cell frequencies in the combined table, the described procedure first derives the strength of association of the two components of the confounder, which give rise to confounding and then nullifies it. The linear regression is now used between the (x, y) values; where x relates to a path value with presence of operative strength of the confounder and y, the corresponding value, with its absence. As linear regression, in a way between state of confounding and no-confounding yields the results; it exhibits such a relationship between the two states. In literature also, regression is used in addressing confounding [5], but it does not address the operative strength of the two components of confounder, rather it directly regresses the disease-exposure responses along with values of confounder. As such, with an obtained OR, adjusting for confounding; we are unable to predict back the individual cell frequencies in the combined table, which may produce this OR. 5. CONCLUSION The procedure described predicts well the distribution pattern of the cell frequencies in the combined table in absence of confounding, as the simple computed OR approximates well the ORMH. Knowing this distribution is of much use to health planners.

5 Prediction of Individual Cell Frequencies in the Combined 2 2 Table Under ILLUSTRATION OF THE PROCEDURE An example of a stratified case-control study cited here [6] presents the two 2 2 tables of disease (D) and risk factor (R) association, pertaining to two strata as per two categories of a considered confounder. The (R +, D + ), (R +, D ), (R, D + ) and (R, D ) counts for the presence (+) and absence ( ) of D and R combinations read respectively 300,,50,25 for stratum I and 50, 25,, for stratum II as depicted in left of Diagram 1. The respective counts in the combined table (given beneath) then are 350, 100, 125, 100. The row totals in stratum I are 3, and in stratum II are, 150. The respective column totals are 350, 100 and 125, 100. The value of CRR computed is ( / ) = 2.8 and that of ORMH(1) is (300 25/ /225)/( 50/ /225) = 2.0, where 450 and 225 are the totals of all the cell counts in the respective 2 2 tables. Diagram 1 Schematic Presentation of Given and Constructed 2 2 Tables Stratum I (C + ) Constructed 2 2 tables Constructed 2 2 tables Given 2 2 tables (Confounding state) (No confounding state) D + D Total D + D Total D + D Total R C C R C C Total Total Total Stratum II. (C ) OR 2 = 2.8 OR 2 = 1 D + D Total D + D Total D + D Total R R R R 150 R R Total Total Total Combined table OR 3 = 10.0 OR 3 = 1 Combined table with predicted frequencies D + D Total D + D Total R R R R Total Total CRR = 2.8 Estimated ORMH = 1.95 ORMH = 2.0

6 416 A. K. Dixit & F. Ansari Two more 2 2 tables to get OR 2 and OR 3 are now constructed. As D + count is more in stratum I (350), compared to stratum II (125), the 2 2 table in stratum I, is designated with presence of confounder (C + ) and then the table in the stratum II, with the absence of the confounder ( C ). These tables are then accordingly marked as C + and C with in parentheses. The two constructed 2x2 tables are presented in the middle of Diagram 1. Among these; table which provides OR 2 reads the counts (C +, D + ), (C +, D ), (C, D + ), and (C, D ) which are obtained from the original two 2 2 tables (given in the left ofdiagram 1). The first row of it is 350, 100 which are column totals in given 2 2 table in stratum 1 and its second row is 125, 100 which are column totals in stratum 2 of this table. The other table to provide OR 3 reads the counts (R +, C + ), (R +, C - ), (R, C + ), and (R, C ) which are also obtained from the original 2 2 tables. The first column of it is 3, which are row totals in stratum 1 of given table and the second column is, 150 which are row totals in stratum 2 of given table. The OR of the newly constructed tables respectively provided OR 2 = 2.8 and OR 3 =10. The cell frequencies in constructed tables are then replaced by their expected frequencies under independence of association (no confounding). This replacement is shown in extreme right of diagram 1. The OR computed from these tables, OR 2 = OR 3 = 1. Diagram 2 Pathways Through Presence and Absence of Confounder Confounding state No confounding state C + C R + 3 D + R + D + C C 158 C C R + D R + D C C C + C R D + D + R C C C + C R R D D C C 67

7 Prediction of Individual Cell Frequencies in the Combined 2 2 Table Under In diagram 2, the pathways to reach from, R + to D +, R + to D, R to D + and R to D ; through the presence (C + ) and absence (C ) of the confounder, in case with confounding (in left) and in case with no confounding (right) are presented with respective cell counts marked. To reach from R + to D +, we have two pathways through the presence and absence of the confounder, viz. (a) R + C + D + (b) R + C D +. The count (R +, C + ) is 300, as read from table providing OR 3, which is marked on the path R + C +, and likewise we had all respective markings. To get the predicted (R +, D + ) frequency in the combined original 2 2 table, the similar pathways counts marked under confounding and no confounding i.e. the marked counts on similar pathways in left and right of diagram 2 are paired, viz. we paired (3, 300), (350, 317), (, 150) and (125, 158). These ( x, y) pairs were regressed using linear regression y = a + bx. The regression constants estimated were a = 98.20, b = 0.5. To the estimated regression, when (R +, D + ) count of the original combined table (350) was put as x, it yielded y = (approx 300) which is predicted (R +, D + ) cell count in the original combined table to be used to get ORMH. Following the procedure, the combined table with predicted cell frequencies is given in right of the combined table used to get CRR in diagram 1. The OR computed from the original combined table with predicted cell frequencies is 1.95, which is the estimated ORMH value obtained from the combined table. The ratio of ORMH to its estimated value obtained is 1.02; which is quite close to ORMH in the sense discussed in the text. REFERENCES [1] Mantel N., and Haenszel W., (1959), Statistical Aspects of the Analysis of Data from Retrospective Studies of Diseases. J Natl. Cancer Inst., 22: [2] Abelin T., Brzezinski Z. J., and Carstairs Vera D. L., (1987), Measurement in Health Promotion and Protection, WHO Regional Pub., European series No.22. [3] Saracci R., and Vineis P., (2007), Disease Proportions Attributable to Environment, Environmental Health., 6: 38. [4] Coggon D., Rose G., and Barker D., (1997), Epidemiology for the Uninitiated, (Fourth Ed.), BMJ Publishing Group. [5] McNamee R., (2005), Regression, Modelling and Other Methods to Control Confounding, Occup Environ Med., 62: [6] Ahlbom A., and Norell S., (1990), Introduction to Modern Epidemiology, (Second Ed.), Epidemiology Resources Inc. A. K. Dixit F. Ansari Desert Medicine Research Centre, Banasthali University, Jodhpur, Rajasthan, India. Rajasthan, India. dixitstat@ rediffmail.com firdoushansari@gmail.com

Chapter 7: Effect Modification

Chapter 7: Effect Modification A short introduction to epidemiology Chapter 7: Effect Modification Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand Chapter 8 Effect modification Concepts of interaction

More information

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Module 223 Major A: Concepts, methods and design in Epidemiology

Module 223 Major A: Concepts, methods and design in Epidemiology Module 223 Major A: Concepts, methods and design in Epidemiology Module : 223 UE coordinator Concepts, methods and design in Epidemiology Dates December 15 th to 19 th, 2014 Credits/ECTS UE description

More information

2. Filling Data Gaps, Data validation & Descriptive Statistics

2. Filling Data Gaps, Data validation & Descriptive Statistics 2. Filling Data Gaps, Data validation & Descriptive Statistics Dr. Prasad Modak Background Data collected from field may suffer from these problems Data may contain gaps ( = no readings during this period)

More information

Master of Public Health Program Competencies. Implemented Fall 2015

Master of Public Health Program Competencies. Implemented Fall 2015 Master of Public Program Competencies Implemented Fall 2015 Master of Public Core Competencies SPH Q501 Biostatistics 1. Describe the roles biostatistics serve in the discipline of public health. 2. Apply

More information

Laboratory 3 Type I, II Error, Sample Size, Statistical Power

Laboratory 3 Type I, II Error, Sample Size, Statistical Power Laboratory 3 Type I, II Error, Sample Size, Statistical Power Calculating the Probability of a Type I Error Get two samples (n1=10, and n2=10) from a normal distribution population, N (5,1), with population

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

4. CLASSICAL METHODS OF ANALYSIS OF GROUPED DATA

4. CLASSICAL METHODS OF ANALYSIS OF GROUPED DATA 4. CLASSICAL METHODS OF ANALYSIS OF GROUPED DATA 4.1 The Ille-et-Vilaine study of oesophageal cancer 4.2 Exact statistical inference for a single 2 x 2 table 4.3 Approximate statistical inference for a

More information

Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done?

Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done? 272 Occup Environ Med 1998;55:272 277 Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done? Mary Lou Thompson, J E Myers, D Kriebel Department of Biostatistics,

More information

Guide to Biostatistics

Guide to Biostatistics MedPage Tools Guide to Biostatistics Study Designs Here is a compilation of important epidemiologic and common biostatistical terms used in medical research. You can use it as a reference guide when reading

More information

Analysis of Regression Based on Sampling Weights in Complex Sample Survey: Data from the Korea Youth Risk Behavior Web- Based Survey

Analysis of Regression Based on Sampling Weights in Complex Sample Survey: Data from the Korea Youth Risk Behavior Web- Based Survey , pp.65-74 http://dx.doi.org/10.14257/ijunesst.2015.8.10.07 Analysis of Regression Based on Sampling Weights in Complex Sample Survey: Data from the Korea Youth Risk Behavior Web- Based Survey Haewon Byeon

More information

Does smoking impact your mortality?

Does smoking impact your mortality? An estimated 25% of the medically underwritten, assured population can be classified as smokers Does smoking impact your mortality? Introduction Your smoking habits influence the premium that you pay for

More information

Case-control studies. Alfredo Morabia

Case-control studies. Alfredo Morabia Case-control studies Alfredo Morabia Division d épidémiologie Clinique, Département de médecine communautaire, HUG Alfredo.Morabia@hcuge.ch www.epidemiologie.ch Outline Case-control study Relation to cohort

More information

Basic Study Designs in Analytical Epidemiology For Observational Studies

Basic Study Designs in Analytical Epidemiology For Observational Studies Basic Study Designs in Analytical Epidemiology For Observational Studies Cohort Case Control Hybrid design (case-cohort, nested case control) Cross-Sectional Ecologic OBSERVATIONAL STUDIES (Non-Experimental)

More information

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,

More information

Lecture 1: Introduction to Epidemiology

Lecture 1: Introduction to Epidemiology Lecture 1: Introduction to Epidemiology Lecture 1: Introduction to Epidemiology Dankmar Böhning Department of Mathematics and Statistics University of Reading, UK Summer School in Cesme, May/June 2011

More information

Working Capital Management of Small Scale Industries in Rajasthan

Working Capital Management of Small Scale Industries in Rajasthan Working Capital Management of Small Scale Industries in Rajasthan Bashar Matarneh Department of Business Administration, Delmon University Exhibition Avenue Manama, Kingdom of Bahrain, P.O. Box 2469 Tell:

More information

The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board.

The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board. The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board. Case Control Study Mark A. Kaelin Department of Health Professions

More information

The American Cancer Society Cancer Prevention Study I: 12-Year Followup

The American Cancer Society Cancer Prevention Study I: 12-Year Followup Chapter 3 The American Cancer Society Cancer Prevention Study I: 12-Year Followup of 1 Million Men and Women David M. Burns, Thomas G. Shanks, Won Choi, Michael J. Thun, Clark W. Heath, Jr., and Lawrence

More information

Lecture 19: Conditional Logistic Regression

Lecture 19: Conditional Logistic Regression Lecture 19: Conditional Logistic Regression Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina

More information

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study. Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National

More information

5. CLASSICAL METHODS OF ANALYSIS OF MATCHED DATA

5. CLASSICAL METHODS OF ANALYSIS OF MATCHED DATA 5. CLASSICAL METHODS OF ANALYSIS OF MATCHED DATA 5.1 Los Angeles retirement community study of endometrial cancer 5.2 Matched pairs: dichotomous exposures 5.3 1 : M matching: dichotomous exposures 5.4

More information

University of Colorado Campus Box 470 Boulder, CO 80309-0470 (303) 492-8230 Fax (303) 492-4916 http://www.colorado.edu/research/hughes

University of Colorado Campus Box 470 Boulder, CO 80309-0470 (303) 492-8230 Fax (303) 492-4916 http://www.colorado.edu/research/hughes Hughes Undergraduate Biological Science Education Initiative HHMI Tracking the Source of Disease: Koch s Postulates, Causality, and Contemporary Epidemiology Koch s Postulates In the late 1800 s, the German

More information

10 th POLISH SUDOKU CHAMPIONSHIP INSTRUCTION BOOKLET. February 22, 2015 IMPORTANT INFORMATION:

10 th POLISH SUDOKU CHAMPIONSHIP INSTRUCTION BOOKLET. February 22, 2015 IMPORTANT INFORMATION: 10 th POLISH SUDOKU CHAMPIONSHIP February 22, 2015 INSTRUCTION BOOKLET IMPORTANT INFORMATION: 1. Answer form can be sent many times, but only the last version will be considered. 2. In case of a tie, the

More information

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE The 7 issues to be addressed outlined in paragraph 9 of the cover

More information

03 The full syllabus. 03 The full syllabus continued. For more information visit www.cimaglobal.com PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

03 The full syllabus. 03 The full syllabus continued. For more information visit www.cimaglobal.com PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS 0 The full syllabus 0 The full syllabus continued PAPER C0 FUNDAMENTALS OF BUSINESS MATHEMATICS Syllabus overview This paper primarily deals with the tools and techniques to understand the mathematics

More information

Confounding in health research

Confounding in health research Confounding in health research Part 1: Definition and conceptual issues Madhukar Pai, MD, PhD Assistant Professor of Epidemiology McGill University madhukar.pai@mcgill.ca 1 Why is confounding so important

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

9 Hedging the Risk of an Energy Futures Portfolio UNCORRECTED PROOFS. Carol Alexander 9.1 MAPPING PORTFOLIOS TO CONSTANT MATURITY FUTURES 12 T 1)

9 Hedging the Risk of an Energy Futures Portfolio UNCORRECTED PROOFS. Carol Alexander 9.1 MAPPING PORTFOLIOS TO CONSTANT MATURITY FUTURES 12 T 1) Helyette Geman c0.tex V - 0//0 :00 P.M. Page Hedging the Risk of an Energy Futures Portfolio Carol Alexander This chapter considers a hedging problem for a trader in futures on crude oil, heating oil and

More information

Medicine and Big Data

Medicine and Big Data Medicine and Big Data A happy marriage? David Madigan Department of Sta8s8cs Columbia University & OMOP Evidence- Based Medicine as against??? How does the combina8on of Clinical Judgment and Evidence-

More information

Study Plan Master in Public Health ( Non-Thesis Track)

Study Plan Master in Public Health ( Non-Thesis Track) Study Plan Master in Public Health ( Non-Thesis Track) I. General Rules and conditions : 1. This plan conforms to the regulations of the general frame of the Graduate Studies. 2. Specialties allowed to

More information

University of Maryland School of Medicine Master of Public Health Program. Evaluation of Public Health Competencies

University of Maryland School of Medicine Master of Public Health Program. Evaluation of Public Health Competencies Semester/Year of Graduation University of Maryland School of Medicine Master of Public Health Program Evaluation of Public Health Competencies Students graduating with an MPH degree, and planning to work

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Elizabeth Comino Centre fo Primary Health Care and Equity 12-Aug-2015

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Elizabeth Comino Centre fo Primary Health Care and Equity 12-Aug-2015 PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (http://bmjopen.bmj.com/site/about/resources/checklist.pdf)

More information

training programme in pharmaceutical medicine Clinical Data Management and Analysis

training programme in pharmaceutical medicine Clinical Data Management and Analysis training programme in pharmaceutical medicine Clinical Data Management and Analysis 19-21 may 2011 Clinical Data Management and Analysis 19 21 MAY 2011 LocaL: University of Aveiro, Campus Universitário

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract

More information

Description of the Sample and Limitations of the Data

Description of the Sample and Limitations of the Data Section 3 Description of the Sample and Limitations of the Data T his section describes the 2007 Corporate sample design, sample selection, data capture, data cleaning, and data completion. The techniques

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

GENETICS AND HEALTH INSURANCE

GENETICS AND HEALTH INSURANCE GENETICS AND HEALTH INSURANCE Angus Macdonald Department of Actuarial Mathematics and Statistics and the Maxwell Institute for Mathematical Sciences Heriot-Watt University, Edinburgh www.maxwell.ac.uk

More information

CHAPTER VI ON PRIORITY SECTOR LENDING

CHAPTER VI ON PRIORITY SECTOR LENDING CHAPTER VI IMPACT OF PRIORITY SECTOR LENDING 6.1 PRINCIPAL FACTORS THAT HAVE DIRECT IMPACT ON PRIORITY SECTOR LENDING 6.2 ASSOCIATION BETWEEN THE PROFILE VARIABLES AND IMPACT OF PRIORITY SECTOR CREDIT

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Solving Simultaneous Equations and Matrices

Solving Simultaneous Equations and Matrices Solving Simultaneous Equations and Matrices The following represents a systematic investigation for the steps used to solve two simultaneous linear equations in two unknowns. The motivation for considering

More information

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data)

P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data) 22S:101 Biostatistics: J. Huang 1 Bayes Theorem For two events A and B, if we know the conditional probability P (B A) and the probability P (A), then the Bayes theorem tells that we can compute the conditional

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

Quantitative importance of asbestos as a cause of lung cancer in a Swedish industrial city: a case-referent study

Quantitative importance of asbestos as a cause of lung cancer in a Swedish industrial city: a case-referent study Eur Aesplr J, 1993, 6, 1271-1275 Printed in UK - all rights reserved Copyright EAS Joumals Ltd 1993 European Respiratory Joumat tssn 0903 1936 Quantitative importance of asbestos as a cause of lung cancer

More information

S weden as well as most other rich countries has a highly

S weden as well as most other rich countries has a highly 188 RESEARCH REPORT Class differences in the social consequences of illness? C Lindholm, B Burström, F Diderichsen... J Epidemiol Community Health 2002;56:188 192 See end of article for authors affiliations...

More information

Preliminary Report on. Hong Kong Assured Lives Mortality and Critical Illness. Experience Study 2000-2003

Preliminary Report on. Hong Kong Assured Lives Mortality and Critical Illness. Experience Study 2000-2003 Preliminary Report on Hong Kong Assured Lives Mortality and Critical Illness Experience Study 2000-2003 Actuarial Society of Hong Kong Experience Committee ASHK - Hong Kong Assured Lives Mortality and

More information

A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano

A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano 1 A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano Grupo Latinoamericano de Investigaciones Epidemiológicas, Organización Latinoamericana para el Fomento

More information

Utilizing Microsoft Excel for Human Resources Management By Palani Murugappan

Utilizing Microsoft Excel for Human Resources Management By Palani Murugappan Utilizing Microsoft Excel for Human Resources Management By Palani Murugappan Employee driven organization Far sighted employers will definitely agree that employees are the greatest asset of an organization.

More information

FOR MATCHED SETS 7. CONDITIONAL LOGISTIC REGRESSION

FOR MATCHED SETS 7. CONDITIONAL LOGISTIC REGRESSION 7. CONDITIONAL LOGISTIC REGRESSION FOR MATCHED SETS 7.1 Bias arising from the unconditional analysis of matched data 7.2 Multivariate analysis for matched 1 : M designs: general methodology 7.3 Matched

More information

Business Analytics using Data Mining

Business Analytics using Data Mining Business Analytics using Data Mining Project Report Indian School of Business Group A6 Bhushan Khandelwal 61410182 61410806 - Mahabaleshwar Bhat Mayank Gupta 61410659 61410697 - Shikhar Angra Sujay Koparde

More information

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Guizhou Hu, MD, PhD BioSignia, Inc, Durham, North Carolina Abstract The ability to more accurately predict chronic disease morbidity

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

Statistics 305: Introduction to Biostatistical Methods for Health Sciences

Statistics 305: Introduction to Biostatistical Methods for Health Sciences Statistics 305: Introduction to Biostatistical Methods for Health Sciences Modelling the Log Odds Logistic Regression (Chap 20) Instructor: Liangliang Wang Statistics and Actuarial Science, Simon Fraser

More information

Inequality, Mobility and Income Distribution Comparisons

Inequality, Mobility and Income Distribution Comparisons Fiscal Studies (1997) vol. 18, no. 3, pp. 93 30 Inequality, Mobility and Income Distribution Comparisons JOHN CREEDY * Abstract his paper examines the relationship between the cross-sectional and lifetime

More information

Case-Control Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University

Case-Control Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Sample size estimation is an important concern

Sample size estimation is an important concern Sample size and power calculations made simple Evie McCrum-Gardner Background: Sample size estimation is an important concern for researchers as guidelines must be adhered to for ethics committees, grant

More information

Estimating the effect of projected changes in the driving population on collision claim frequency

Estimating the effect of projected changes in the driving population on collision claim frequency Bulletin Vol. 29, No. 8 : September 2012 Estimating the effect of projected changes in the driving population on collision claim frequency Despite having higher claim frequencies than prime age drivers,

More information

GENETICS AND INSURANCE: QUANTIFYING THE IMPACT OF GENETIC INFORMATION

GENETICS AND INSURANCE: QUANTIFYING THE IMPACT OF GENETIC INFORMATION GENETICS AND INSURANCE: QUANTIFYING THE IMPACT OF GENETIC INFORMATION Angus Macdonald Department of Actuarial Mathematics and Statistics and the Maxwell Institute for Mathematical Sciences Heriot-Watt

More information

Categorical Data Analysis

Categorical Data Analysis Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods

More information

How to set the main menu of STATA to default factory settings standards

How to set the main menu of STATA to default factory settings standards University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be

More information

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

More information

Competency 1 Describe the role of epidemiology in public health

Competency 1 Describe the role of epidemiology in public health The Northwest Center for Public Health Practice (NWCPHP) has developed competency-based epidemiology training materials for public health professionals in practice. Epidemiology is broadly accepted as

More information

STATISTICAL METHODS IN CANCER RESEARCH

STATISTICAL METHODS IN CANCER RESEARCH WORLD HEALTH ORGANIZATION INTERNATIONAL AGENCY FOR RESEARCH ON CANCER STATISTICAL METHODS IN CANCER RESEARCH VOLUME 1 - The analysis of case-control studies BY N. E. BRESLOW & N. E. DAY TECHNICAL EDITOR

More information

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Exercise Answers. Exercise 3.1 1. B 2. C 3. A 4. B 5. A

Exercise Answers. Exercise 3.1 1. B 2. C 3. A 4. B 5. A Exercise Answers Exercise 3.1 1. B 2. C 3. A 4. B 5. A Exercise 3.2 1. A; denominator is size of population at start of study, numerator is number of deaths among that population. 2. B; denominator is

More information

Propensity score based data analysis

Propensity score based data analysis Propensity score based data analysis Susanne Stampf March 28, 2014 Abstract For some time, propensity score (PS) based methods have been frequently applied in the analysis of observational and registry

More information

Confounding in Epidemiology

Confounding in Epidemiology The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board. Confounding in Epidemiology Mona Baumgarten Department of Epidemiology

More information

This accuracy is open to question for the following reasons:

This accuracy is open to question for the following reasons: AN EXPERIMENT TO COMPARE ALTERNATIVE VERIFICATION PROCEDURES FOR MORTALITY MEDICAL CODING Kenneth W. Harris and Dwight K. French National Center for Health Statistics I. Background One of the most difficult

More information

Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th

Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Standard 3: Data Analysis, Statistics, and Probability 6 th Prepared Graduates: 1. Solve problems and make decisions that depend on un

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Time series analysis as a framework for the characterization of waterborne disease outbreaks

Time series analysis as a framework for the characterization of waterborne disease outbreaks Interdisciplinary Perspectives on Drinking Water Risk Assessment and Management (Proceedings of the Santiago (Chile) Symposium, September 1998). IAHS Publ. no. 260, 2000. 127 Time series analysis as a

More information

SYSTEMIC COMPLICATIONS AND THEIR RISK FACTORS AMONG TEHRANIAN BLOOD DONOR, 2005

SYSTEMIC COMPLICATIONS AND THEIR RISK FACTORS AMONG TEHRANIAN BLOOD DONOR, 2005 ORIGINAL REPORT SYSTEMIC COMPLICATIONS AND THEIR RISK FACTORS AMONG TEHRANIAN BLOOD DONOR, 2005 F. Majlessi *, S. Ghafari 2, A. RahimiForoushani 3 and M. Maghsoodlou 2 ) Department of Public Health Services,

More information

Survey Inference for Subpopulations

Survey Inference for Subpopulations American Journal of Epidemiology Vol. 144, No. 1 Printed In U.S.A Survey Inference for Subpopulations Barry I. Graubard 1 and Edward. Korn 2 One frequently analyzes a subset of the data collected in a

More information

the Sample 1. Nontaxable (including no alternative minimum tax) with adjusted gross income or expanded income of $200,000 or more.

the Sample 1. Nontaxable (including no alternative minimum tax) with adjusted gross income or expanded income of $200,000 or more. Section 2 Description of the Sample This section describes the sample design and selection, the method of estimation, the sampling variability of the estimates, and the methodology of computing confidence

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA natarajan.meghanathan@jsums.edu

More information

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015.

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015. Probability, statistics and football Franka Miriam Bru ckler Paris, 2015 Please read this before starting! Although each activity can be performed by one person only, it is suggested that you work in groups

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

Master of Public Health (MPH) SC 542

Master of Public Health (MPH) SC 542 Master of Public Health (MPH) SC 542 1. Objectives This proposed Master of Public Health (MPH) programme aims to provide an in depth knowledge of public health. It is designed for students who intend to

More information

Renal cell carcinoma and body composition:

Renal cell carcinoma and body composition: Renal cell carcinoma and body composition: Results from a case-control control study Ryan P. Theis, MPH Department of Epidemiology and Biostatistics College of Public Health and Health Professions University

More information

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA Basic of Epidemiology in Ophthalmology Rajiv Khandekar Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA Basics of Epidemiology in Ophthalmology Dr Rajiv Khandekar

More information

An Article Critique - Helmet Use and Associated Spinal Fractures in Motorcycle Crash Victims. Ashley Roberts. University of Cincinnati

An Article Critique - Helmet Use and Associated Spinal Fractures in Motorcycle Crash Victims. Ashley Roberts. University of Cincinnati Epidemiology Article Critique 1 Running head: Epidemiology Article Critique An Article Critique - Helmet Use and Associated Spinal Fractures in Motorcycle Crash Victims Ashley Roberts University of Cincinnati

More information

Analyzing Research Data Using Excel

Analyzing Research Data Using Excel Analyzing Research Data Using Excel Fraser Health Authority, 2012 The Fraser Health Authority ( FH ) authorizes the use, reproduction and/or modification of this publication for purposes other than commercial

More information

PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY

PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY ABSTRACT Keywords: Logistic. INTRODUCTION This paper covers some gotchas in SAS R PROC LOGISTIC. A gotcha

More information

Completeness of Physician Billing Claims for Diabetes Prevalence Estimation

Completeness of Physician Billing Claims for Diabetes Prevalence Estimation Completeness of Physician Billing Claims for Diabetes Prevalence Estimation Lisa M. Lix 1, John Paul Kuwornu 1, George Kephart 2, Khokan Sikdar 3, Hude Quan 4 1 University of Manitoba; 2 Dalhousie University;

More information

Quantitative relationship of sick building syndrome symptoms with ventilation rates

Quantitative relationship of sick building syndrome symptoms with ventilation rates Published in Indoor Air Journal, volume 19, 2009 Quantitative relationship of sick building syndrome symptoms with ventilation rates William J. Fisk *, Anna G. Mirer, and Mark J. Mendell Indoor Environment

More information

STATISTICAL DATA ANALYSIS

STATISTICAL DATA ANALYSIS STATISTICAL DATA ANALYSIS INTRODUCTION Fethullah Karabiber YTU, Fall of 2012 The role of statistical analysis in science This course discusses some statistical methods, which involve applying statistical

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

H. The Study Design. William S. Cash and Abigail J. Moss National Center for Health Statistics

H. The Study Design. William S. Cash and Abigail J. Moss National Center for Health Statistics METHODOLOGY STUDY FOR DETERMINING THE OPTIMUM RECALL PERIOD FOR THE REPORTING OF MOTOR VEHICLE ACCIDENTAL INJURIES William S. Cash and Abigail J. Moss National Center for Health Statistics I. Introduction

More information

SUGI 29 Statistics and Data Analysis

SUGI 29 Statistics and Data Analysis Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

the Sample 1. Nontaxable (including no alternative minimum tax) with adjusted gross income or expanded income of $200,000 or more.

the Sample 1. Nontaxable (including no alternative minimum tax) with adjusted gross income or expanded income of $200,000 or more. Section 2 Description of the Sample This section describes the sample design and selection, the method of estimation, the sampling variability of the estimates, and the methodology of computing confidence

More information

Time series forecasting

Time series forecasting Time series forecasting 1 The latest version of this document and related examples are found in http://myy.haaga-helia.fi/~taaak/q Time series forecasting The objective of time series methods is to discover

More information