S SG. Design of Experiment in Metabolomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical.

Similar documents
The Facts on Equine Drug Testing

Signal, Noise, and Detection Limits in Mass Spectrometry

Optimizing the Post-implementation Monitoring of a LC-MS/MS Method

3.4 Statistical inference for 2 populations based on two samples

HYPOTHESIS TESTING: POWER OF THE TEST

Electrospray Ion Trap Mass Spectrometry. Introduction

Analysis of the Vitamin B Complex in Infant Formula Samples by LC-MS/MS

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

Statistics Review PSY379

Performance and advantages of qnmr measurements

Functional Data Analysis of MALDI TOF Protein Spectra

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Tests for Two Proportions

NANOCOMPOSIX'S GUIDE TO ICP-MS MEASUREMENT

Identification of Arson Accelerants by Gas Chromatography

Analyzing Small Molecules by EI and GC-MS. July 2014

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Using Excel for inferential statistics

Integrated Data Mining Strategy for Effective Metabolomic Data Analysis

Chemistry Instrumental Analysis Lecture 1. Chem 4631

Name: Date: Use the following to answer questions 3-4:

CONFIRMATION OF ZOLPIDEM BY LIQUID CHROMATOGRAPHY MASS SPECTROMETRY

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Non-Inferiority Tests for Two Proportions

Pesticide Analysis by Mass Spectrometry

Hypothesis testing - Steps

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LC-MS/MS Method for the Determination of Docetaxel in Human Serum for Clinical Research

Chemistry 321, Experiment 8: Quantitation of caffeine from a beverage using gas chromatography

Sample Size Planning, Calculation, and Justification

Principles of Hypothesis Testing for Public Health

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March

NUSAGE - PAREXEL Postgraduate Certificate in Clinical Trial Management

Tests for One Proportion

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

Point Biserial Correlation Tests

VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY Q2(R1)

QUANTITATIVE INFRARED SPECTROSCOPY. Willard et. al. Instrumental Methods of Analysis, 7th edition, Wadsworth Publishing Co., Belmont, CA 1988, Ch 11.

Guidance for Industry

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Paper Chromatography: Separation and Identification of Five Metal Cations

Introduction to mass spectrometry (MS) based proteomics and metabolomics

MarkerView Software for Metabolomic and Biomarker Profiling Analysis

ANALYSIS OF VITAMIN C

Validation and Calibration. Definitions and Terminology

Analytical Test Method Validation Report Template

Inclusion and Exclusion Criteria

SAMPLE. Gas Chromatography/Mass Spectrometry Confirmation of Drugs; Approved Guideline Second Edition

Two-sample hypothesis testing, II /16/2004

Chapter 14. Modeling Experimental Design for Proteomics. Jan Eriksson and David Fenyö. Abstract. 1. Introduction

METHODS OF VITAMIN ANALYSIS

How To Test For Contamination In Large Volume Water

Background Information

LC-MS/MS for Chromatographers

Step-by-Step Analytical Methods Validation and Protocol in the Quality System Compliance Industry

Analysis of Vegetable Oils by High Performance Liquid Chromatography

Descriptive Statistics

AMD Analysis & Technology AG

Hypothesis Testing --- One Mean

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Mind on Statistics. Chapter 13

Alignment and Preprocessing for Data Analysis

1. Overview of Clinical Trials

Supplementary Materials and Methods (Metabolomics analysis)

Correlations Between Citrus Fruit Properties and Ascorbic Acid Content

Reversed Phase High Presssure Liquid Chromatograhphic Technique for Determination of Sodium Alginate from Oral Suspension

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

2 Precision-based sample size calculations

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Section 13, Part 1 ANOVA. Analysis Of Variance

ICH Topic Q 2 (R1) Validation of Analytical Procedures: Text and Methodology. Step 5

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Design of Experiments for Analytical Method Development and Validation

Selective Testosterone Analysis in Human Serum by LC-FAIMS-MS/MS

Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics

A feasibility study of the determination of B-group vitamins in food by HPLC with Mass Spectrometric detection. LGC/R/2011/181 LGC Limited 2011

Study Design and Statistical Analysis

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Waters Core Chromatography Training (2 Days)

Fractional Distillation and Gas Chromatography

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

Statistics in Medicine Research Lecture Series CSMC Fall 2014

In the United States and in most European countries, an

Fast, Reproducible LC-MS/MS Analysis of Dextromethorphan and Dextrorphan

GC METHODS FOR QUANTITATIVE DETERMINATION OF BENZENE IN GASOLINE

6: Introduction to Hypothesis Testing

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

HRMS in Clinical Research: from Targeted Quantification to Metabolomics

Appendix 2 Statistical Hypothesis Testing 1

Air Sampling for Gases and Vapors. Patrick N. Breysse, PhD, CIH Peter S.J. Lees, PhD, CIH Johns Hopkins University

AbsoluteIDQ p180 Kit. Targeted Metabolite Identifi cation and Quantifi cation. Bringing our targeted metabolomics expertise to your lab.

Introduction to Statistics and Quantitative Research Methods

Chapter 2 Probability Topics SPSS T tests

Analyzing Research Data Using Excel

Jennifer L. Simeone and Paul D. Rainville Waters Corporation, Milford, MA, USA A P P L I C AT ION B E N E F I T S INT RO DU C T ION

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Advantages of Polar, Reversed- Phase HPLC Columns for the Analysis of Drug Metabolites

Transcription:

S SG ection ON tatistical enetics Design of Experiment in Metabolomics Hemant K. Tiwari, Ph.D. Professor and Head Section on Statistical Genetics Department of Biostatistics School of Public Health Metabolomics: Bench to Bedside Suhre and Gieger (Nature Review Genetics, Vol 13, Nov 2012) 1

Design of Experiments/ Experimental Design A controlled experiment to either test a hypothesis or generate hypotheses In the design of experiments, the experimenter is usually interested in the effect of some process or intervention on some subjects History of Experimental Design In 1747, James Lind (a Scottish Physician) developed the theory that citrus fruits cured scurvy, while serving as surgeon on HMS Salisbury Ship of the Royal Navy. This was the first ever clinical trial conducted. Scurvy is disease resulting from a deficiency of Vitamin C. 2

Lind s Experiment Lind selected 12 men from the ship suffering from scurvy. He divided them into six pairs, giving each pair different supplements to their basic diet for two weeks. The treatments were all remedies that had been proposed: A quart of cider every day Twenty five drops of elixir vitriol (sulphuric acid) three times a day upon an empty stomach One half-pint of seawater every day A mixture of garlic, mustard, and horseradish in a lump the size of a nutmeg Two spoonful of vinegar three times a day Two oranges and one lemon every day Result: The men given citrus fruits recovered dramatically within a week. Source: Wikipedia Aims of experimental design To provide answers to research hypothesis or generate hypothesis Minimize the biological variance and technical or experimental variance since metabolome can change very rapidly in response to subtle changes in environment 3

How to avoid technical variance Randomization Normalization Selection of covariates in the analyses Selection of statistical method most appropriate for the research question Basic Steps in Experimental Design Formalize the question and study design The number of replicates to be used for an experiment Randomization Blocking (stratification) 4

Study Design for human study Pedigree data Twins Population data Case-control Case only GC/MS LC/MS NMR Choice of Platform 5

Choosing which metabolites to study Targeted vs. non-targeted Tissue Type Blood Urine Buckle Swab (?) Steps in metabolomics experiment Sample prep Separation method Detection Method Analysis At each step, the errors can occur Aim of the experiment is to minimize the errors 6

Aims of experimental design Minimize the variance of extraneous variable variance that may have impact on outcome variables Examples: Metabolites are sensitive to environmental influences such as sample storage Temperature Aliquoting Biobanking Measurement Issues in metabolomics To get the sample ready The sample is prepped and put onto wells on a silicon plate Each well s aliquot is subjected to gas and/or liquid chromatography After separation, the sample goes to a mass spectrometry 7

Measurement Issues Sources of errors at the prep stage Within subject variation Within tissue variation Contamination by cleaning solvents Calibrate uncertainty Evaporation of volatiles Errors at separation method Gas chromatography (GC) (GC creates ionized aerosol, each droplet evaporates to a single ion and this is separated by mass in the column, then ejected to the spectrometer): Imperfect evaporation Adhesion in the column Ion fragmentation 8

Measurement Issues at Detection Mass spectrometry (MS) is used to identify and quantify metabolites after separation by GC, HPLC (LC-MS) Baseline removal denoising Peak detection (the process of distinguishing peptide and noise peaks) Normalization Purpose Sample Size and Power Planning a study: number of individuals to recruit to test a research hypothesis Understand sample size implications of alternative study designs Sample was already collected and wants study using new technology GWAS was done, but wants to do metabolomics GWAS 9

Sample Size and Power Calculation Often the number of samples to be used for the experiments dictated by the reality of resources available, not science. How much money is available for the experiment What is the cost per sample Thus, sample size = $ available through NIH/ cost per sample Hypothesis Testing Power calculations are based on the principles of hypothesis testing A hypothesis is a statement about population parameter The two complemetary hypotheses in a hypothesis testing problem are called the null hypothesis (H 0 ) and alternate hypothesis (H 1 ) 10

Two types of errors in hypothesis testing H 0 : θ=0 versus H 1 : θ 0 Decision Accept H 0 Reject H 0 Truth H 0 Correct Decision Type I error (α) H 1 Type II error (β) Correct decision Reject H 0 when we should not Don t reject H 0 when we should Type I Error: Probability of finding a statistically significant effect when the truth is that there is no effect Type II Error: Probability of not finding a statistically significant effect when there is none. Power = 1- β, Significance level = α Goal is to minimize both types of errors Power depends on Design The method of analyzing the data The effect size Standard deviation of the effect of interest Measurement variability The chosen significance level (α) The sample size We usually use significance level of 5% and 80% power to estimate the sample size 11

Factors Affecting the sample size Effect size Variation of data Type I error rate Power Required sample size Required sample size Required sample size Required sample size Type I error rate (α) is kept fixed and becomes smaller as number of tests increase Effect size and variation of the data (σ 2 ) is either obtained through pilot study or vary to calculate different sample sizes. To calculate Sample Size Need to know level of significance (α) Statistical power (1- β) Effect size (expected difference) Standard deviation What statistical test we are going to use 12

Sample Size Formula for difference in means A sample size formula to test difference of means between two groups 1 1 σ 2 (Z β + Z α/2 ) 2 )/ (r Δ 2 ) where, 1 = size of the smaller group r = ratio of larger group to smaller group Z β = standard normal deviate corresponds to β Z α/2 = standard normal deviate corresponds to two-tailed significance level Δ= clinically meaningful difference in means of the outcome σ 2 =Variance of the trait or characteristic Common standard normal deviates Z α and Z β α or β One-sided test Z α or Z β Two-sided test Z α 0.001 3.09 3.29 0.005 2.58 2.81 0.01 2.33 2.58 0.025 1.96 2.24 0.05 1.64 1.96 0.10 1.28 1.64 0.20 0.84 1.28 13

Simple Example How many people would you need to sample in each group (assuming both groups of equal size) to achieve power of 80% if SD =σ=10, difference in mean is 5 with fixed α=0.05. So Z α/2 = Z 0.025 =1.96, Z β =0.84, and 1 1 σ 2 (Z β + Z α/2 ) 2 )/ (r Δ 2 ) = 2 (100) (.84+1.96) 2 / (5 2 ) = 62.72 ~63 63 per group implies 126 altogether. Sample Size Two-tailed test: When the investigator is interested in determining whether a treatment A is different from a treatment B Usually a 2 tailed test is performed with the risk of making a Type-1 error set at α / 2 in each tail. For a 2 tailed test at α =.05 and equal allocation of type-1 error to each tail Z α = 1.96 14

Sample Size Sometimes an investigator is only interested in a difference between treatments in one direction. This is appropriate when The hypothesis is to test whether a treatment A is better than treatment B For a 1 tailed test at α =.05 Z α = 1.65 Conclusions Experiment should be designed with consultation with the statistician and metabolomics assays provider Good design and good analytic methods can lead to reduced sample size and also lead to valid meaningful results 15