Paper PO06. Randomization in Clinical Trial Studies


David Shen, WCI, Inc.
Zaizai Lu, AstraZeneca Pharmaceuticals

ABSTRACT

Randomization is of central importance in clinical trials. It prevents selection bias and insures against accidental bias. It produces comparable groups and eliminates the source of bias in treatment assignments. Finally, it permits the use of probability theory to express the likelihood that chance is the source of the difference between outcomes. This paper discusses four common randomization methods. SAS implementation of randomization is provided with the RANUNI and RANNOR functions, PROC SURVEYSELECT and PROC PLAN.

INTRODUCTION

A good clinical trial minimizes the variability of the evaluation and provides an unbiased evaluation of the intervention by avoiding confounding from other factors. Randomization ensures that each patient has an equal chance of receiving any of the treatments under study, and it generates comparable intervention groups that are alike in all important aspects except for the intervention each group receives. It also provides a basis for the statistical methods used in analyzing the data.

WHY RANDOMIZATION

The basic benefits of randomization are that it:

1. Eliminates selection bias.
2. Balances arms with respect to prognostic variables (known and unknown).
3. Forms the basis for statistical tests: a basis for an assumption-free statistical test of the equality of treatments.

In general, a randomized trial is an essential tool for testing the efficacy of a treatment.

CRITERIA FOR RANDOMIZATION

1. Unpredictability
Each participant has the same chance of receiving any of the interventions. Allocation is carried out using a chance mechanism so that neither the participant nor the investigator knows in advance which treatment will be assigned.

2. Balance
Treatment groups are of a similar size and constitution; the groups are alike in all important aspects and differ only in the intervention each group receives.

3. Simplicity
The procedure is easy for the investigator and staff to implement.

METHODS OF RANDOMIZATION

The common types of randomization include (1) simple, (2) block, (3) stratified and (4) unequal randomization. Other methods, such as biased coin, minimization and response-adaptive methods, may be applied for specific purposes.

1. Simple Randomization

This method is equivalent to tossing a coin for each subject that enters a trial, such as Heads = Active, Tails = Placebo. In practice a random number generator is generally used. The method is simple and easy to implement, and treatment assignment is completely unpredictable. However, it can produce imbalanced treatment assignment, especially in smaller trials. Imbalanced randomization reduces statistical power. In a trial of 10 participants, the treatment effect variance for a 5-5 split relative to a 7-3 split is (1/5 + 1/5)/(1/7 + 1/3) = 0.84, so a 7-3 split is only 84% as efficient as a 5-5 split.

Even if treatment is balanced at the end of a trial, it may not be balanced at some time during the trial. For example, the trial may be balanced at the end with 100 participants, but the first 10 might be AAAATATATA. If the trial is monitored while it is in progress, we would like to have balance in the number of subjects on each treatment over time.

2. Block Randomization

Simple randomization does not guarantee balance in numbers during the trial. In particular, if patient characteristics change with time (e.g. early patients are sicker than later ones), early imbalances cannot be corrected. Block randomization is often used to fix this issue.

The basic idea of block randomization is to divide potential patients into m blocks of size 2n and randomize each block such that n patients are allocated to A and n to B. The blocks are then chosen randomly. This method ensures equal treatment allocation within each block, provided the complete block is used.
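The number of distinct balanced sequences within a single block can be counted directly. As a small check, assuming two treatments and a block of size 2n with n assignments to each treatment:

```latex
\[
\binom{2n}{n} = \frac{(2n)!}{n!\,n!},
\qquad
\binom{4}{2} = 6 \quad (n = 2),
\qquad
\binom{6}{3} = 20 \quad (n = 3).
\]
```

When a block is randomized, each of these sequences is equally likely, so a larger block offers more possible sequences and is harder to guess.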
Example: two treatments, A and B, and a block size of 2 x 2 = 4. The possible treatment allocations within each block are:

(1) AABB
(2) BBAA
(3) ABAB
(4) BABA
(5) ABBA
(6) BAAB

The block size depends on the number of treatments. It should be short enough to prevent imbalance and long enough to prevent the allocation from being guessed during the trial. The block size should be at least 2 x the number of treatments (ref. ICH E9). The block size is not stated in the protocol, so that the clinical staff and investigators are blind to it.

If blocking is not masked in an open-label trial, the sequence becomes somewhat predictable (e.g. with 2n = 4):

B A B ?   The next assignment must be A.
A A ? ?   The next two assignments must be B B.

This could lead to selection bias. The solutions to avoid selection bias are: (1) do not reveal the blocking mechanism; (2) use random block sizes.
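The second remedy, random block sizes, can be sketched in a DATA step. This is a minimal illustration rather than a validated schedule generator: it uses CALL STREAMINIT and the RAND function instead of the RANUNI function used elsewhere in this paper, and the seed value, the number of blocks, the choice of block sizes 4 and 6, and the data set names BLOCKS and SCHEDULE are all assumptions made for the example.

```sas
/* Sketch only: permuted blocks of random size (4 or 6) for treatments A and B. */
/* The seed, the number of blocks and all data set names are hypothetical.      */
data blocks;
   call streaminit(20061);                       /* fixed seed for reproducibility  */
   do block = 1 to 5;
      size = 4 + 2 * floor(2 * rand('uniform')); /* block size is 4 or 6            */
      do j = 1 to size;
         if j <= size / 2 then group = 'A';      /* first half of the block: A      */
         else group = 'B';                       /* second half of the block: B     */
         key = rand('uniform');                  /* random sort key within block    */
         output;
      end;
   end;
   drop j;
run;

proc sort data=blocks;                           /* shuffle treatments within block */
   by block key;
run;

data schedule;                                   /* number subjects in final order  */
   set blocks;
   subject = _n_;
   drop key;
run;
```

Each block still contains equal numbers of A and B, but because the block boundaries are unknown, an observer cannot reliably deduce the last assignments of a block from the earlier ones.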

If treatment is double blinded, selection bias is not likely. Note that if only one block is requested, the procedure produces a single sequence of random assignments, i.e. simple randomization.

3. Stratified Randomization

Imbalance in the number of subjects reduces statistical power, and imbalance in prognostic factors also makes the estimate of the treatment effect less efficient. A trial may not be valid if it is not well balanced across prognostic factors. For example, with 6 diabetics there is a 22% chance of a 5-1 or 6-0 split if the randomization does not take diabetic status into account. Stratified randomization is the solution for achieving balance within subgroups: use block randomization separately for diabetics and non-diabetics.

For example:
Age Group: < 40, 40-60, > 60
Sex: M, F
Total number of strata = 3 x 2 = 6

Stratification can balance subjects on baseline covariates. It tends to produce groups that are comparable with regard to certain characteristics (e.g. gender, age, race, disease severity) and thus supports valid statistical tests. The block size should be relatively small to maintain balance in small strata. Increasing the number of stratification variables, or the number of levels within a variable, leads to fewer patients per stratum. Subjects should have baseline measurements taken before randomization. Large clinical trials often do not use stratification, because a large randomized trial is unlikely to end up imbalanced in subject characteristics.

4. Unequal Randomization

Most randomized trials allocate equal numbers of patients to the experimental and control groups. This is the most statistically efficient randomization ratio, as it maximizes statistical power for a given total sample size. However, it may not be the most economically efficient, or the most feasible ethically or practically. When two or more treatments under evaluation differ in cost, it may be more economically efficient to randomize fewer patients to the expensive treatment and more to the cheaper one.
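The efficiency cost of an unequal split can be quantified with the same variance argument used for the 7-3 example under simple randomization. The following is a sketch under stated assumptions: a two-arm comparison of means with a common variance, where the symbols sigma^2 (common variance), n_1 and n_2 (arm sizes), N = n_1 + n_2 (total sample size) and r (allocation ratio r:1) are introduced here for illustration.

```latex
\[
\operatorname{Var}(\bar{x}_1 - \bar{x}_2)
   = \sigma^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right),
\qquad
\text{RE}
   = \frac{\sigma^2\,(2/N + 2/N)}{\sigma^2\,(1/n_1 + 1/n_2)}
   = \frac{4\,n_1 n_2}{N^2}
   = \frac{4r}{(1+r)^2}.
\]
\[
\text{7:3 split: } \frac{4 \cdot 7 \cdot 3}{10^2} = 0.84,
\qquad
\text{2:1: } \frac{4 \cdot 2}{3^2} \approx 0.89,
\qquad
\text{3:1: } \frac{4 \cdot 3}{4^2} = 0.75.
\]
```

Under these assumptions a 2:1 ratio retains about 89% of the efficiency of equal allocation for the same total sample size, while a 3:1 ratio retains 75%.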
Substantial cost savings can be achieved by adopting a modest randomization ratio such as 2:1, with only a modest loss in statistical power.

Unequal allocation may also be chosen on ethical grounds. In oncology trials, for example, one arm may save lives while the other (placebo or medical care only) does little to save them, so a subject's survival time depends on which treatment is received. A more extreme allocation may be used in such trials to place fewer patients in the placebo group. Generally, a randomization ratio of 3:1 loses considerable statistical power, and a ratio more extreme than 3:1 is not very useful because it leads to a much larger sample size.

SAS IMPLEMENTATION

1. SAS Random Number Generators

SAS provides several functions that work as random number generators:

RANUNI: generates random numbers between 0 and 1 that have a uniform distribution.
RANNOR: generates random numbers with a standard normal N(0, 1) distribution.
RANBIN: generates random numbers with a binomial distribution.

Random number generators are used to produce randomization schedules for clinical trials and to carry out simulation studies.

Suppose subjects are to receive either a drug or a placebo with equal probability. Let the variable GROUP represent the assignment: Group = 'A' or Group = 'P'. RANUNI generates a random number R between 0 and 1. If R is less than .5, the subject is assigned to Group = 'A'. If R is greater than or equal to .5, the subject is assigned to Group = 'P'. The code that does this is the following:

   data ONE;
      seed = 123;                     /* fixed positive seed: reproducible */
      do i = 1 to 100;
         r = ranuni(seed);            /* uniform random number in (0, 1)   */
         if r < .5 then group = 'A';
         else group = 'P';
         output;
      end;
   run;

   proc freq data=one;
      tables group;
   run;

The SEED for the random number generator determines the starting value. The same positive SEED always generates the same results. However, if SEED is 0 or a negative number, the result will be different each time, because SAS then uses the current computer clock time as the seed. The result is impossible to predict, and this is generally not recommended. You need to select a starting seed value so that you can reproduce the results with the same seed value at a later date. Otherwise you may have to wait thousands of years to get the same result.

Note that in this example the treatment assignments are unbalanced, as shown by PROC FREQ: there are 56 assignments to placebo (P) and only 44 assignments to active treatment (A). This is not an unusual imbalance. The following code puts the same number of subjects into each group by sorting on the random number and then assigning drug and placebo to the random sequence.

   data ONE;
      seed = 123;
      do i = 1 to 100;
         r = ranuni(seed);
         output;
      end;
   run;

   proc sort data=one;                /* random order of the 100 subjects */
      by r;
   run;

   data TWO;
      set ONE;
      if _n_ <= 50 then group = 'A';  /* first 50 in random order: active */
      else group = 'P';               /* remaining 50: placebo            */
   run;

What if we want to split 100 subjects into more than 2 treatment groups? PROC RANK can easily accomplish this.

   proc rank data=one groups=5 out=three;
      var r;
      ranks group;
   run;

PROC RANK categorizes the values of the numeric variable R in data set ONE and creates the new data set THREE. The new variable GROUP created by PROC RANK indicates which group each observation belongs to. In the option GROUPS=N, N is the number of groups to create; the groups are numbered from 0 to N-1.

RANNOR is another SAS random number generator. It produces random numbers that have a normal distribution with mean 0 and standard deviation 1. RANNOR is used in much the same way as RANUNI.

2. PROC SURVEYSELECT

This procedure was originally designed for situations in which the full data set is very large and the analyst wishes to work with a relatively small random sample. The SURVEYSELECT procedure provides a variety of methods for selecting probability-based random samples. It can select a simple random sample, or it can sample according to a complex multistage design that includes stratification and unequal randomization. The following is a simple randomization.

   data ONE;
      do i = 1 to 100;
         output;
      end;
   run;

   proc surveyselect data=one method=srs n=50 out=two;
   run;

METHOD=SRS specifies simple random sampling, in which each subject has an equal probability of selection and sampling is without replacement. The N=50 option specifies the sample size. The OUT= option names the data set that stores the sample. If we define the subjects in TWO as receiving active treatment, then the rest of the subjects in ONE will be treated with placebo.

The following information is displayed in the output and summarizes the sample selection.

   The SURVEYSELECT Procedure

   Selection Method         Simple Random Sampling
   Input Data Set           ONE
   Random Number Seed
   Sample Size              50
   Selection Probability    0.5
   Sampling Weight          2
   Output Data Set          TWO

The random number seed is reported in this output. Since the SEED= option is not specified in the PROC statement, the seed value is obtained from the computer's clock. You can specify SEED=56895 to reproduce this sample. It is recommended that a seed always be specified so that the sample can be replicated.

In the next example, data set ONE has 100 subjects, of which 20 are male. We would like to split them randomly into two treatment groups and also ensure that each group has an equal number of males, i.e. 10 males in each group.

   data ONE;
      do n = 1 to 100;
         if n <= 20 then sex = 'M';
         else sex = 'F';
         output;
      end;
   run;

   proc sort data=one;                /* input must be sorted by the STRATA variable */
      by sex;
   run;

   proc surveyselect data=one method=srs n=(40 10) out=two;
      strata sex;
   run;

Stratification is added to the sampling, and random samples are selected independently within the strata. N=(40 10) requests 40 subjects from the Female stratum and 10 subjects from the Male stratum. PROC SURVEYSELECT requires that the input data set be sorted by the STRATA variables. PROC FREQ with TABLES SEX displays the sampling result as expected.

   sex    Frequency    Percent
   F          40         80.00
   M          10         20.00

The N= option can alternatively be replaced by RATE=(0.5, 0.5). RATE is the proportion of observations to select from each stratum: 50% from Female and 50% from Male in this example. The rate can be adjusted for unequal randomization. The following randomization selects 25 subjects. Suppose that they are put into the placebo group and the rest of the subjects go into the active treatment group. The randomization ratio is then 1:3, and it is also stratified by SEX.

   proc surveyselect data=one method=srs rate=(0.25, 0.25) out=three;
      strata sex;
   run;

3. PROC PLAN

The PLAN procedure is designed specifically for more complex designs and randomization plans, such as factorial, nested and crossed experiments and Latin square designs. It can also be used for many basic randomization designs. The syntax is somewhat tricky, so care should be taken when using the procedure. The first example is a simple randomization that divides 12 subjects among 3 treatments.
   proc plan;
      factors Subject=12;
      treatments Group=12 cyclic ( );
      output out=ONE;
   quit;

   Simple Randomization with 3 Levels of Treatments

   Subject    Group

Once again, a SEED should be specified. Otherwise SAS generates its own seed, and this seed is displayed in the LOG:

   At the start of processing, random number seed=

Our next example is a block randomization design for 12 subjects: 2 treatments, A and B, a block size of 2 x 2 = 4, and 12/4 = 3 blocks.

   proc plan seed= ;
      factors Block=3 random Size=4 random;
      output out=C Size cvals=('A' 'A' 'B' 'B');
   run;

It can be seen that the two treatments are always balanced within each block.

   Block Randomization Design With 3 Blocks of Size 4, Treatments of A & B

   Obs    Block    Size
     1      1       B
     2      1       A
     3      1       B
     4      1       A
     5      2       A
     6      2       B
     7      2       B
     8      2       A
     9      3       B
    10      3       B
    11      3       A
    12      3       A

CONCLUSION

Randomization schedules for clinical trials are convenient to produce with SAS. The randomization numbers generated are stored in the central computer center (CORE) or put into sealed envelopes (opaque, not resealable). Each subject must have a unique identification number and keep that number throughout the study. A subject should be determined to be eligible by uniform and clear eligibility criteria and should have signed the informed consent form (ICF) before randomization. The subject's randomization number can be obtained by calling the randomization center through an interactive voice response system (IVRS) or by accessing the web-based central randomization system.

CONTACT INFORMATION

Zaizai Lu
zz_lu@hotmail.com
AstraZeneca Pharmaceuticals
Wilmington, Delaware

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies.

Study Designs. Simon Day, PhD Johns Hopkins University

Study Designs. Simon Day, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina Paper 267-27 Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina ABSTRACT Randomization as a method of experimental control

More information

Randomization in Clinical Trials

Randomization in Clinical Trials in Clinical Trials Versio.0 May 2011 1. Simple 2. Block randomization 3. Minimization method Stratification RELATED ISSUES 1. Accidental Bias 2. Selection Bias 3. Prognostic Factors 4. Random selection

More information

Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina

Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina Paper PO-21 Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina ABSTRACT Permuted-block randomization with varying block sizes using

More information

Design and Analysis of Phase III Clinical Trials

Design and Analysis of Phase III Clinical Trials Cancer Biostatistics Center, Biostatistics Shared Resource, Vanderbilt University School of Medicine June 19, 2008 Outline 1 Phases of Clinical Trials 2 3 4 5 6 Phase I Trials: Safety, Dosage Range, and

More information

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago 8 June 1998, Corrections 14 February 2010 Abstract Results favoring one treatment over another

More information

Inclusion and Exclusion Criteria

Inclusion and Exclusion Criteria Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding

More information

New SAS Procedures for Analysis of Sample Survey Data

New SAS Procedures for Analysis of Sample Survey Data New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many

More information

Chapter 11 Introduction to Survey Sampling and Analysis Procedures

Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW...149 SurveySampling...150 SurveyDataAnalysis...151 DESIGN INFORMATION FOR SURVEY PROCEDURES...152

More information

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?

The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY? The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health, ABSTRACT

More information

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE

More information

Selecting a Stratified Sample with PROC SURVEYSELECT Diana Suhr, University of Northern Colorado

Selecting a Stratified Sample with PROC SURVEYSELECT Diana Suhr, University of Northern Colorado Selecting a Stratified Sample with PROC SURVEYSELECT Diana Suhr, University of Northern Colorado Abstract Stratified random sampling is simple and efficient using PROC FREQ and PROC SURVEYSELECT. A routine

More information

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Education in Medicine Journal 2012, VOL 4 ISSUE 1 DOI:10.5959/eimj.v4i1.4

Education in Medicine Journal 2012, VOL 4 ISSUE 1 DOI:10.5959/eimj.v4i1.4 EDUCATIONAL RESOURCE Education in Medicine Journal Random sampling and allocation using SPSS Wan Nor Arifin Trainee Lecturer, Unit of Biostatistics and Research Methodology, School of Medical Sciences,

More information

Survey Analysis: Options for Missing Data

Survey Analysis: Options for Missing Data Survey Analysis: Options for Missing Data Paul Gorrell, Social & Scientific Systems, Inc., Silver Spring, MD Abstract A common situation researchers working with survey data face is the analysis of missing

More information

SUGI 29 Posters. Mazen Abdellatif, M.S., Hines VA CSPCC, Hines IL, 60141, USA

SUGI 29 Posters. Mazen Abdellatif, M.S., Hines VA CSPCC, Hines IL, 60141, USA A SAS Macro for Generating Randomization Lists in Clinical Trials Using Permuted Blocks Randomization Mazen Abdellatif, M.S., Hines VA CSPCC, Hines IL, 60141, USA ABSTRACT We developed a SAS [1] macro

More information

Clinical Study Design and Methods Terminology

Clinical Study Design and Methods Terminology Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical

More information

SAS and Clinical IVRS: Beyond Schedule Creation Gayle Flynn, Cenduit, Durham, NC

SAS and Clinical IVRS: Beyond Schedule Creation Gayle Flynn, Cenduit, Durham, NC Paper SD-001 SAS and Clinical IVRS: Beyond Schedule Creation Gayle Flynn, Cenduit, Durham, NC ABSTRACT SAS is the preferred method for generating randomization and kit schedules used in clinical trials.

More information

Imputing Missing Data using SAS

Imputing Missing Data using SAS ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Section 6.2 Definition of Probability

Section 6.2 Definition of Probability Section 6.2 Definition of Probability Probability is a measure of the likelihood that an event occurs. For example, if there is a 20% chance of rain tomorrow, that means that the probability that it will

More information

Chapter 4. Probability and Probability Distributions

Chapter 4. Probability and Probability Distributions Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

More information

Critical Appraisal of Article on Therapy

Critical Appraisal of Article on Therapy Critical Appraisal of Article on Therapy What question did the study ask? Guide Are the results Valid 1. Was the assignment of patients to treatments randomized? And was the randomization list concealed?

More information

Sampling. COUN 695 Experimental Design

Sampling. COUN 695 Experimental Design Sampling COUN 695 Experimental Design Principles of Sampling Procedures are different for quantitative and qualitative research Sampling in quantitative research focuses on representativeness Sampling

More information

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text)

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Key features of RCT (randomized controlled trial) One group (treatment) receives its treatment at the same time another

More information

Chapter 63 The SURVEYSELECT Procedure

Chapter 63 The SURVEYSELECT Procedure Chapter 63 The SURVEYSELECT Procedure Chapter Table of Contents OVERVIEW...3275 GETTING STARTED...3276 Simple Random Sampling...3277 StratifiedSampling...3279 Stratified Sampling with Control Sorting...3282

More information

Descriptive Methods Ch. 6 and 7

Descriptive Methods Ch. 6 and 7 Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational

More information

AP * Statistics Review. Designing a Study

AP * Statistics Review. Designing a Study AP * Statistics Review Designing a Study Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Paper PO12 Pharmaceutical Programming: From CRFs to Tables, Listings and Graphs, a process overview with real world examples ABSTRACT INTRODUCTION

Paper PO12 Pharmaceutical Programming: From CRFs to Tables, Listings and Graphs, a process overview with real world examples ABSTRACT INTRODUCTION Paper PO12 Pharmaceutical Programming: From CRFs to Tables, Listings and Graphs, a process overview with real world examples Mark Penniston, Omnicare Clinical Research, King of Prussia, PA Shia Thomas,

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

AP STATISTICS 2010 SCORING GUIDELINES

AP STATISTICS 2010 SCORING GUIDELINES 2010 SCORING GUIDELINES Question 4 Intent of Question The primary goals of this question were to (1) assess students ability to calculate an expected value and a standard deviation; (2) recognize the applicability

More information

Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper

Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper Paper 3485-2015 Health Services Research Utilizing Electronic Health Record Data: A Grad Student How-To Paper Ashley W. Collinsworth, ScD, MPH, Baylor Scott & White Health and Tulane University School

More information

2. Making example missing-value datasets: MCAR, MAR, and MNAR

2. Making example missing-value datasets: MCAR, MAR, and MNAR Lecture 20 1. Types of missing values 2. Making example missing-value datasets: MCAR, MAR, and MNAR 3. Common methods for missing data 4. Compare results on example MCAR, MAR, MNAR data 1 Missing Data

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

1. Overview of Clinical Trials

1. Overview of Clinical Trials 1. Overview of Clinical Trials 1.1. What are clinical trials? Definition A clinical trial is a planned experiment which involves patients and is designed to elucidate the most appropriate treatment of

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES RATIOS, PROPORTIOS, PERCETAGES, AD RATES 1. Ratios: ratios are one number expressed in relation to another by dividing the one number by the other. For example, the sex ratio of Delaware in 1990 was: 343,200

More information

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations

More information

We begin by presenting the current situation of women s representation in physics departments. Next, we present the results of simulations that
