Prospective, retrospective, and cross-sectional studies



Similar documents
Which Design Is Best?

Chapter 6. Examples (details given in class) Who is Measured: Units, Subjects, Participants. Research Studies to Detect Relationships

Mind on Statistics. Chapter 4

Two-sample inference: Continuous data

Basic Study Designs in Analytical Epidemiology For Observational Studies

Chapter 3. Sampling. Sampling Methods

TITLE AUTHOR. ADDRESS FOR CORRESPONDENCE (incl. fax and ) KEYWORDS. LEARNING OBJECTIVES (expected outcomes) SYNOPSIS

Stat 1040, Spring 2009 Homework 1 Solutions (with two extra problems at the end)

Clinical Study Design and Methods Terminology

Case-control studies. Alfredo Morabia

Introduction to study design

Chi-square test Fisher s Exact test

Types of Studies. Systematic Reviews and Meta-Analyses

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

University of Colorado Campus Box 470 Boulder, CO (303) Fax (303)

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board.

What are observational studies and how do they differ from clinical trials?

Pregnancy Intendedness

Association Between Variables

Case-Control Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES

Introduction to Observational studies Dr. Javaria Gulzar Clinical Research Associate SCRC.

Unit 14: The Question of Causation

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Is a monetary incentive a feasible solution to some of the UK s most pressing health concerns?

Q&A on Monographs Volume 116: Coffee, maté, and very hot beverages

Understanding Retrospective vs. Prospective Study designs

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University

Easy Read. How can we make sure everyone gets the right health care? How can we make NHS care better?

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.

Circle Of Life SM : Cancer Education and Wellness for American Indian and Alaska Native Communities

Study Design and Statistical Analysis

Section 12 Part 2. Chi-square test

Frequently asked questions about whooping cough (pertussis)

This chapter discusses some of the basic concepts in inferential statistics.

2. METHODS OF DATA COLLECTION. Types of Data. Some examples from Wainer, Palmer and Bradlow (Chance):

Appendix 1. CAHPS Health Plan Survey 4.0H Adult Questionnaire (Commercial)

Q&A on the carcinogenicity of the consumption of red meat and processed meat

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

Module Three. Risk Assessment

Biostatistics and Epidemiology within the Paradigm of Public Health. Sukon Kanchanaraksa, PhD Marie Diener-West, PhD Johns Hopkins University

What is whooping cough. (pertussis)? Information and Prevention. Ocument dn

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

Descriptive statistics; Correlation and regression

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Week 3&4: Z tables and the Sampling Distribution of X

2. Incidence, prevalence and duration of breastfeeding

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago

MRC Autism Research Forum Interventions in Autism

AP Stats- Mrs. Daniel Chapter 4 MC Practice

Topic 8. Chi Square Tests

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Thinking of getting pregnant?

p ˆ (sample mean and sample

Rubella. Questions and answers

Data Analysis, Research Study Design and the IRB

The International EMF Collaborative s Counter-View of the Interphone Study

Competency 1 Describe the role of epidemiology in public health

P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data)

Cohort Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA

BREAST CANCER. How to spot the signs and symptoms and reduce your risk. cruk.org

The Cross-Sectional Study:

Mind on Statistics. Chapter 12

Inclusion and Exclusion Criteria

Known Donor Questionnaire

Long-term impact of childhood bereavement

Why you and your Family Should Get the Flu Shot

How to choose an analysis to handle missing data in longitudinal observational studies

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MINISTRY OF HEALTH PANDEMIC INFLUENZA A / H1N VACCINE FREQUENTLY ASKED QUESTIONS

Survey on the Mortality from Malignant Tumors of Female Asbestos Spinning Workers

Certified in Public Health (CPH) Exam CONTENT OUTLINE

Oncology Nursing Society Annual Progress Report: 2008 Formula Grant

Body Mass Index as a measure of obesity

Test Your Breast Cancer Knowledge

Glossary of Methodologic Terms

APPENDIX I-A: INFORMED CONSENT BB IND Protocol CDC IRB #4167

Reduce Your Risk of Breast Cancer

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

Psychological Impacts of Oil Spills: The Exxon Valdez Disaster. Lawrence A. Palinkas, Ph.D. School of Social Work University of Southern California

Master of Public Health (MPH) SC 542

Childhood leukemia and EMF

COMMITTEE FOR VETERINARY MEDICINAL PRODUCTS GUIDELINE FOR THE CONDUCT OF POST-MARKETING SURVEILLANCE STUDIES OF VETERINARY MEDICINAL PRODUCTS

The Curriculum of Health and Nutrition Education in Czech Republic Jana Koptíková, Visiting Scholar

Odds ratio, Odds ratio test for independence, chi-squared statistic.

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

Transcription:

Prospective, retrospective, and cross-sectional studies Patrick Breheny April 3 Patrick Breheny Introduction to Biostatistics (171:161) 1/17

Study designs that can be analyzed with χ 2 -tests One reason that χ 2 -tests are so popular is that they can be used to analyze a wide variety of study designs In addition to controlled experiments, they are widely used in epidemiology, where investigators must conduct observational studies Broadly speaking, observational studies in epidemiology fall into three categories: prospective studies, retrospective studies, and cross-sectional studies Patrick Breheny Introduction to Biostatistics (171:161) 2/17

Study designs (cont d) χ 2 -tests and Fisher s exact test can be used to analyze all of these studies Best of all (some might say), you don t even need to think about your study design or where your data came from; if it can be expressed as a 2x2 contingency table, the χ 2 -test is always appropriate One caveat: there are a lot of tables in this world that have two rows and two columns; that does not necessarily make them contingency tables, so please don t try to perform χ 2 -tests on them! Patrick Breheny Introduction to Biostatistics (171:161) 3/17

Prospective studies We have said that the double-blind, randomized controlled trial is the gold standard of biomedical research When this is not possible (or ethical), the prospective study (also called a cohort study) is the next best thing In a prospective study, investigators collect a sample, classify individuals in some way, and then wait to see if the individuals develop a condition The classification is usually based on exposure to a risk factor such as smoking or obesity Patrick Breheny Introduction to Biostatistics (171:161) 4/17

Risk factors for breast cancer For example, the CDC tracked 6,168 women in the hopes of finding risk factors that led to breast cancer One risk factor they looked at was the age at which the woman gave birth to her first child: Cancer No Yes Before age 25 4475 65 25 or older 1597 31 Patrick Breheny Introduction to Biostatistics (171:161) 5/17

Risk factors for breast cancer (cont d) Performing a χ 2 -test on the data, we obtain p =.19 Thus, the evidence from this study is rather unconvincing as far as whether the risk of developing breast cancer depends on the age at which a woman gives birth to her first child In other words, the data is consistent with the null hypothesis that the two groups have the same risk of developing breast cancer Patrick Breheny Introduction to Biostatistics (171:161) 6/17

Not all researchers have the resources to follow thousands of people for decades to see if they develop a rare disease Instead, they often try the more feasible approach of collecting a sample of people with the condition of interest, a second sample of people without the condition of interest, and then ask them if they were exposed to a risk factor in the past For example, a much cheaper way to conduct the study of breast cancer risk factors would be to find 50 women with breast cancer, 50 women without breast cancer, and ask them when they had their first child This approach is called a retrospective, or case-control study Patrick Breheny Introduction to Biostatistics (171:161) 7/17

Fluoride poisoning in Alaska In 1992, an outbreak of illness occurred in an Alaskan community The CDC suspected fluoride poisoning from one of the town s water supplies Case Control Drank from supply 33 4 Didn t drink from supply 5 46 Patrick Breheny Introduction to Biostatistics (171:161) 8/17

Fluoride poisoning in Alaska Testing whether this could be due to chance, the χ 2 -test gives us p = 6 10 13 The observed association was certainly not due to chance But the association still may be due to factors besides fluoride poisoning Patrick Breheny Introduction to Biostatistics (171:161) 9/17

Recall bias Prospective studies For example, people who got sick may think much harder about what they ate and drank than people who didn t This is called recall bias, and it is an important source of bias in retrospective studies The extent to which recall bias is a concern certainly depends on the study: In the breast cancer example, it would not be much of a concern, since giving birth to a child is a major life event and a woman would know how old she was when it happened On the other hand, if the risk factor was something like diet or exercise, recall bias would be a huge concern, as people are notoriously unreliable at recalling these things Furthermore, because researchers must gather separate samples of cases and controls, these studies are more prone to sampling biases than prospective studies Patrick Breheny Introduction to Biostatistics (171:161) 10/17

Electromagnetic field example For example, retrospective studies have been performed investigating links between childhood leukemia and exposure to electromagnetic fields (EMF) Families with low socioeconomic status are more likely to live near electromagnetic fields Families with low socioeconomic status are also less likely to participate in studies as controls Socioeconomic status does not affect the participation of cases, however (cases are usually eager to participate) This results in an observed association between EMF and leukemia potentially arising entirely due to bias Patrick Breheny Introduction to Biostatistics (171:161) 11/17

The weakest type of observational study is the cross-sectional study In a cross-sectional study, the investigator simply gathers a single sample and cross-classifies them depending on whether they have the risk factor or not and whether they have the disease or not are the easiest to carry out, but are subject to all sorts of hidden biases Patrick Breheny Introduction to Biostatistics (171:161) 12/17

Circulatory disease and respiratory disease For example, one study surveyed 257 hospitalized individuals and determined whether each individual suffered from a disease of the respiratory system, a disease of the circulatory system, or both Their results: Respiratory Disease Yes No Circulatory Yes 7 29 Disease No 13 208 Could this association be due to chance? Not likely; χ 2 = 7.9, so p =.005 Patrick Breheny Introduction to Biostatistics (171:161) 13/17

Circulatory disease and respiratory disease (cont d) Okay, so it s probably not due to chance But does that mean that you are more likely to get a respiratory disease if you have a circulatory disease? The same study surveyed nonhospitalized individuals as well: Respiratory Disease Yes No Circulatory Yes 15 142 Disease No 189 2181 Patrick Breheny Introduction to Biostatistics (171:161) 14/17

Circulatory disease and respiratory disease (cont d) The evidence in favor of an association is now nonexistent: p =.48 What s going on? The issue isn t a poor sampling design: both samples were gathered carefully and are representative of their respective populations Instead, the issue is that cross-sectional studies are very susceptible to selection bias Patrick Breheny Introduction to Biostatistics (171:161) 15/17

Selection bias in cross-sectional studies In this example, the bias was that if a patient has both a circulatory disease and a respiratory disease, then he or she is much more likely to be hospitalized and to be included in the cross-sectional study There are many other examples: Suppose we obtained a cross-sectional sample of factory workers to see if they had developed asthma at a higher rate than non-factory workers Workers who developed asthma from working in the factory may be more likely to quit their job, and less likely to be included in our sample Suppose we notice an association between milk drinking and peptic ulcers Is it because milk drinking causes ulcers, or because ulcer sufferers like to drink milk in order to relieve their symptoms? Patrick Breheny Introduction to Biostatistics (171:161) 16/17

Prospective studies In conclusion, prospective studies are the most trustworthy observational study, but like any observational study, they are subject to confounding are often much more feasible, but potentially subject to recall bias and unrepresentative sampling Cross sectional studies provide a quick snapshot of an association, but need to be interpreted with care Patrick Breheny Introduction to Biostatistics (171:161) 17/17