Study Design and Statistical Analysis

Similar documents

Data Analysis, Research Study Design and the IRB

Sample Size Planning, Calculation, and Justification

Use advanced techniques for summary and visualization of complex data for exploratory analysis and presentation.

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Organizing Your Approach to a Data Analysis

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Design and Analysis of Phase III Clinical Trials

Biostatistics: Types of Data Analysis

Descriptive Statistics

Course Course Name # Summer Courses DCS Clinical Research 5103 Questions & Methods CORE. Credit Hours. Course Description

Guide to Biostatistics

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Biostatistics and Epidemiology within the Paradigm of Public Health. Sukon Kanchanaraksa, PhD Marie Diener-West, PhD Johns Hopkins University

School of Public Health and Health Services Department of Epidemiology and Biostatistics

Statistics Graduate Courses

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

A Population Based Risk Algorithm for the Development of Type 2 Diabetes: in the United States

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Research Methods & Experimental Design

Study Design. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D. Ellis Unger, M.D. Ghanshyam Gupta, Ph.D. Chief, Therapeutics Evaluation Branch

Scholars in HeAlth Research Program (SHARP)

BOSTON UNIVERSITY SCHOOL OF PUBLIC HEALTH PUBLIC HEALTH COMPETENCIES

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

Department of Behavioral Sciences and Health Education

University of Hawai i Human Studies Program. Guidelines for Developing a Clinical Research Protocol

Master of Public Health Program Competencies. Implemented Fall 2015

Introduction to Statistics and Quantitative Research Methods

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Clinical Study Design and Methods Terminology

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting

1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background:

University of Michigan Dearborn Graduate Psychology Assessment Program

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Mary B Codd. MD, MPH, PhD, FFPHMI UCD School of Public Health, Physiotherapy & Pop. Sciences

GRADUATE RESEARCH PROGRAMMES (MSc and PhD) AY2015/2016 MODULE DESCRIPTION

Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control

American Academy of Neurology Section on Neuroepidemiology Resident Core Curriculum

Methods for Meta-analysis in Medical Research

Statistics for Sports Medicine

Course Requirements for the Ph.D., M.S. and Certificate Programs

Additional sources Compilation of sources:

Glossary of Clinical Trial Terms

REGULATIONS FOR THE POSTGRADUATE DIPLOMA IN CLINICAL RESEARCH METHODOLOGY (PDipClinResMethodology)

Health Policy and Administration PhD Track in Health Services and Policy Research

Antipsychotic drugs are the cornerstone of treatment

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Guidance for Industry

U.S. Food and Drug Administration

How to Develop a Research Protocol

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics

Cancer Biostatistics Workshop Science of Doing Science - Biostatistics

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Chapter 1. Longitudinal Data Analysis. 1.1 Introduction

Masters of Science in Clinical Research (MSCR) Curriculum. Goal/Objective of the MSCR

BIOM611 Biological Data Analysis

Diabetes Prevention in Latinos

Scholars in HeAlth Research Program (SHARP)

Introduction to study design

Competency 1 Describe the role of epidemiology in public health

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Come scegliere un test statistico

The Harvard MPH in Epidemiology Online/On-Campus/In the Field. Two-Year, Part-Time Program. Sample Curriculum Guide

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University

Analyzing Research Data Using Excel

The MSCR Curriculum and Its Advantages

SPSS Explore procedure

Priority Program Translational Oncology Applicants' Guidelines

Traditional View of Diabetes. Are children with type 1 diabetes obese: What can we do? 8/9/2012. Change in Traditional View of Diabetes

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE

Supplementary webappendix

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA

II. DISTRIBUTIONS distribution normal distribution. standard scores

Following are detailed competencies which are addressed to various extents in coursework, field training and the integrative project.

Treatment of Low Risk MDS. Overview. Myelodysplastic Syndromes (MDS)

The Harvard MPH in Epidemiology Online/On-Campus/In the Field. Two-Year, Part-Time Program. Sample Curriculum Guide

LEVEL ONE MODULE EXAM PART ONE [Clinical Questions Literature Searching Types of Research Levels of Evidence Appraisal Scales Statistic Terminology]

Analysis Issues II. Mary Foulkes, PhD Johns Hopkins University

Module 223 Major A: Concepts, methods and design in Epidemiology

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

How To Write A Systematic Review

Statistical Rules of Thumb

A Guide To Producing an Evidence-based Report

The Product Review Life Cycle A Brief Overview

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD

Curriculum - Doctor of Philosophy

Summary and general discussion

BIO 226: APPLIED LONGITUDINAL ANALYSIS COURSE SYLLABUS. Spring 2015

Statistics in Applications III. Distribution Theory and Inference

Transcription:

Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing Clinical Research Studies Study objective Target population Evidence to support the study Study design How big the study should be? What data to collect Follow-up? Data management, monitoring, quality control Data analysis plan 1

What is the Study Objective? Comparing two treatments in disease control? Identifying prognostic factors for disease development? Appropriate research questions that can be answered Translated into hypotheses (primary and secondary hypotheses) Primary hypothesis : The most interested question Capable of being adequately answered What is the Target Population? Defined by the research questions (e.g., diabetes prevention) Ideal to study entire target population Valid if randomly select samples from the target Practically restricted to what s available Impact generalization of the study results Minimize possible bias associated with sample selection Example 1: TRIPOD Study (Diabetes, 2002) Objective: To test whether treatment of insulin resistance using TZD can delay or prevent type 2 diabetes Population: High-risk for diabetes (not diabetes yet) Hispanic women with prior GDM, at least 18 years old 2

Example 2: GDM cohort Study (Diabetes, 1999) Objective: To identify antepartum characteristics that predict the development of diabetes 11-26 months after index pregnancy in women diagnosed with GDM Population: Hispanic women with GDM who were not diabetic at postpartum visit What Evidence to Support the Study PubMed, MEDLINE, etc., literature search and review Pilot study Clinical evidence/experience Meta analysis results What Type of Study Design is Appropriate? Clinical trials? Epidemiological study? A clinical trial is an experimental study, require special consideration of ethics (DSMB) Epidemiological study is an observational study 3

Types of Clinical Study Design Clinical trials Randomized controlled trial Non-randomized controlled trial Historical controls/database Cross-over design Factorial design Single arm study: Phase I & II Clinical Trial Phases Phase I: Establish maximum dosage Phase II: Look for preliminary evidence of efficacy and side effect Phase III: Treatment is compared to control to demonstrate efficacy Phase IV: Post marketing surveillance Types of Clinical Study Design (continued) Observational studies Case-control study Cross-sectional study Longitudinal cohort study Case series Case report 4

Types of Clinical Study Design (continued) Meta Analysis A formal process with statistical methods to combine all the data from studies involving similar participants and similar objectives in a single analysis to reach a conclusion on the overall results Clinical Trials Demonstrating Equivalency Designed to demonstrate equivalence of two treatments In difference trials, failure to reject the null (e.g., p>0.05) Does not mean the two are equal Can only conclude that the evidence is inadequate to conclude they are unequal Need an equivalency trial to prove equivalency Clinical Trials Demonstrating Equivalency A good standard treatment must exist A new one can be as effective as the standard but with Less side effects Less cost Greater convenience Higher quality of life Sample size is much larger than difference trial because a very small difference is compared 5

Randomization for Clinical Trials Reduce bias Achieve balance Quantify error attributable to chance (random) Randomization Type Fixed allocation randomization Simple randomization Blocked randomization Stratified randomization Adaptive randomization Baseline adaptive randomization Response adaptive randomization Examples re-visit TRIPOD study: Randomized controlled clinical trial comparing two interventions (troglitazone vs. placebo) GDM cohort study: Epidemiological observational prospective cohort study to identify antepartum predictors for diabetes 11-26 months after pregnancy 6

How big the study should be? Key to the study power to detect the effect Small effect requires large sample size Power to detect the primary hypothesis should be > 80% Pilot study does not require 80% power, mainly for preliminary data and feasibility How big the study should be? Estimated based on three parameters: Type I error (false positive, 0.05) Type II error (false negative, 0.20) Effect size Effect size defined differently for different statistical methods Sample size is generally estimated by the primary hypothesis Design a Confirmative Phase III Trial Try to limit to one primary response variable Type I error increase if multiple responses, need to consider it in sample size estimation Hard to make conclusion if inconsistent results for multiple response variables 7

Examples re-visit TRIPOD study: The sample size was projected to provide power > 80% to detect a > 20% difference in cumulative diabetes incidence rates between treatment groups at a median follow-up of 42 month with a type I error of 0.05 (n=266) GDM cohort study: The sample size was estimated based on the relative risk 2-4 in developing diabetes comparing the highest to the lowest quartile of the baseline predictor variable What Data to Collect? Defined by research questions Should collect at least all the measurements needed to test each of the hypotheses, including prognostic and confounder measures Can include other measures if minimal impact to the study (for future hypothesis exploration) All measurements should be assessed unbiased (blinding data assessor and subjects if needed) Surrogate Measures What is a surrogate measure? Substitute measure to the clinical outcome When do we use it? For rare event (sample size) For difficult to measure clinical outcome A good surrogate exist Prefer a continuous measure 8

Surrogate Measures What is a good surrogate measure? Pathway to clinical event Changes is highly correlated with real clinical outcome of interest Can be assessed with little error Acceptable by scientific and medical communities Acceptable by subjects Cost-effective Example: IMT, CD4, PSA Follow-up? Needed for clinical trials Needed for observational prospective longitudinal studies May schedule intermediate visits to capture changes (more information) Need to monitor Attrition rate Missing visits Protocol compliance (clinical trials) Adverse event (clinical trials) Data Analysis Plan Needs to be considered during planning stage Defines the sample size estimation Collaborate with biostatistician to define the correct approach 9

Outline Designing Clinical Research Studies Statistical Data Analysis Data Analysis Questions for data analysis: I. Study design & analysis variables II. Analysis questions (hypotheses) III. What statistical test to use? I. Study Design & Analysis Variables Study design Know the study design and how data are collected Are observations independent, paired or repeated? For non-independent observations, data correlation needs to be adjusted (e.g., multiple observation per subject, family members) Analysis Variables Are the variables continuous, categorical or discrete? 10

Variables Types Continuous variables: can be measured as precisely as instrumentation permits (e.g., age, BMI, glucose) Discrete variables: countable (e.g., treatment group, diabetes, contraceptive methods) Continuous variables can be discretized (e.g., age < 50 vs > 50, for clinical interpretation) Discrete measures can be made continuous (e.g., scores ranked from 1 to 10) II. Analysis Questions Translate the research questions into statistical hypothesis (null vs. alternative) What are the dependent variables, independent variables, adjustment variables? 1 or 2-sided test? What type I error? Null vs. Alternative Null hypothesis: hypothesis that not of interest For difference trial: no difference, no effect, no change, etc For equivalency trial: has difference Alternative hypothesis: hypothesis of interest For difference trial: has effect, change, etc For equivalency trial: no difference 11

Dependent vs. Independent Variables Dependent variables: outcomes, endpoints, response (e.g., diabetes development) Independent variables: treatment, groups, risk factor, casual variables Adjusted variables: confounders, variables which could bias the estimate of the relationship between the dependent and independent variables 1 or 2-sided Test & Type I error 1-sided test: tested parameter under alternative hypothesis is either greater than or less than what under the null, but not both (1 direction only) 2-sided test: tested parameter under alternative hypothesis can be greater than or less than what under the null (both directions) Type I error: probability of rejecting the null when null is true (false positive rate), defines the significance cut point (commonly used 0.05) III. What statistical tests to use? Dependent variable is dichotomous Dependent variable is continuous Dependent variable is time to event 12

III. What statistical tests to use? Dependent variable is dichotomous (e.g., diabetes status) Independent variable is dichotomous (e.g. tx group): Chi-square, Fisher s exact, McNemar s test Independent variable is discrete > 2 categories: Chi-square, Fisher s exact, McNemar s test All nonparametric III. What statistical tests to use? Dependent variable is dichotomous (e.g., diabetes status) Independent variable is continuous (e.g., age): Logistic regression Covariate adjustment: Logistic regression III. What statistical tests to use? Dependent variable is continuous (e.g., glucose, HbA1c) Independent variable is dichotomous (e.g., tx group): T-test, Wilcoxon test Independent variable is discrete with > 2 categories: ANOVA, Kruskal-Wallis Assumption for t-test, ANOVA: normal distribution 13

III. What statistical tests to use? Dependent variable is continuous (e.g., glucose, HbA1c) Independent variable is continuous (e.g., age): Correlation, linear regression Covariate adjustment: General linear regression Assumption: normal distribution III. What statistical tests to use? Dependent variable is time to event (e.g., time to development of diabetes) Independent variable is dichotomous (e.g., group): Logrank test Independent variable is categorical (>2): Logrank test Log-rank test is nonparametric III. What statistical tests to use? Dependent variable is time to event (e.g., time to development of diabetes) Independent variable is continuous (e.g., age): Cox regression Covariate adjustment: Cox regression Cox regression is semi-parametric, assumption is proportional hazard 14

III. What statistical tests to use? Analysis approach for repeated measures (e.g., glucose change over time) Fixed effect (no variation in parameters between individuals): Generalized estimating equation (GEE) Random effect (has variation in parameters between individuals): Mixed-effect model Assumption: normal distribution for continuous Examples Re-visit TRIPOD study: Dependent variable: time to development of diabetes Independent variable: treatment group Adjusted covariates: baseline unbalanced variables and on-trial characteristics Analysis approach: logrank test & Cox regression Cumulative Probability of Diabetes 60% 40% 20% TRIPOD: Diabetes Rate Placebo Troglitazone RR=0.45 (p=0.009) 0% 0 10 20 30 40 50 60 Buchanan et al: Diabetes, 2002 Months on Study 15

Examples Re-visit GDM cohort study: Dependent variable: Diabetes status 11-26 months after index pregnancy Independent variable: Antepartum characteristics Adjusted covariates: Variables in the final model were adjusted for each other to assess independent prediction Analysis approach: logistic regression GDM cohort result: Final Model P value Adjusted OR* 1-h glucose, diagnostic OGTT 0.003 15.2 ß-cell compensation index 0.009 0.09 Basal glucose production rate 0.04 5.3 *Highest vs. lowest tertile Issues in Data Analysis Dropouts, non-compliance Intent-to-treat vs. response-to-treat analysis Missing data MCAR, MAR, non-ignorable Hard to test missing data mechanism Perform sensitivity analyses Interaction - Subgroup analysis Multiple comparison issue 16

Multiple Comparison Repeatedly testing using the same data will increase chance finding increase type I error In decreasing order of conservativeness: Bonferroni multiple comparison procedure Scheffe S test Tukey HSD (honestly significant difference) test Fisher s LSD test (least significant difference): Did not provide appropriate type I error protection Final Recommendation for Conducting Clinical Research Inclusion of the biostatistician from study design through manuscript preparation Good communication & collaboration between clinical investigators and biostatistician is essential Biostatistician should make effort to understand the disease under investigation Clinical investigator should also make effort to understand the biostatistics Course Offerings in Preventive Medicine PM510: Introduction to Biostatistics (Fall, Spring and Summer) PM512: Introduction to Epidemiology (Fall and Spring) PM523: Clinical Trials (Spring) PM599: Biomedical Informatics (Summer) Information: Mary Trujillo: 323 442-1810 17

MS in Clinical and Biomedical Investigations Provide training to medical students, fellows, faculty and health professionals to appropriately conduct medical and translational research One-year of didactic course work One year of mentor-supervised research leading to grant proposals or peer-review paper(s) MS in Clinical and Biomedical Investigations Research tracks depend on research interests of applicant Patient-oriented translational research Community-based intervention trials Design, conduct and analysis of clinical trials Epidemiology and disease etiology Molecular biology Cell biology Others 18