Applying Statistics Recommended by Regulatory Documents



Similar documents
Step-by-Step Analytical Methods Validation and Protocol in the Quality System Compliance Industry

Analytical Methods: A Statistical Perspective on the ICH Q2A and Q2B Guidelines for Validation of Analytical Methods

Guidance for Industry

VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY Q2(R1)

Validation and Calibration. Definitions and Terminology

ANALYTICAL METHODS INTERNATIONAL QUALITY SYSTEMS

Assay Qualification Template for Host Cell Protein ELISA

American Association for Laboratory Accreditation

Assay Development and Method Validation Essentials

GUIDELINES FOR THE VALIDATION OF ANALYTICAL METHODS FOR ACTIVE CONSTITUENT, AGRICULTURAL AND VETERINARY CHEMICAL PRODUCTS.

Validating Methods using Waters Empower TM 2 Method. Validation. Manager

Analytical Test Method Validation Report Template

QbD Approach to Assay Development and Method Validation

ICH Topic Q 2 (R1) Validation of Analytical Procedures: Text and Methodology. Step 5

To avoid potential inspection

ASSURING THE QUALITY OF TEST RESULTS

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Design of Experiments for Analytical Method Development and Validation

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Descriptive Statistics

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

International GMP Requirements for Quality Control Laboratories and Recomendations for Implementation

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Method Validation/Verification. CAP/CLIA regulated methods at Texas Department of State Health Services Laboratory

Regression Analysis: A Complete Example

DATA INTERPRETATION AND STATISTICS

Simple linear regression

Introduction to Regression and Data Analysis

Simple Predictive Analytics Curtis Seare

Implementing New USP Chapters for Analytical Method Validation

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

II. DISTRIBUTIONS distribution normal distribution. standard scores

Analytical Methods for Cleaning Validation

Using R for Linear Regression

IN VITRO BINDING BIOEQUIVALENCE STUDY SUMMARY TABLES AND SAS TRANSPORT FORMATTED TABLES FOR DATASET SUBMISSION

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

USING CLSI GUIDELINES TO PERFORM METHOD EVALUATION STUDIES IN YOUR LABORATORY

How Far is too Far? Statistical Outlier Detection

Factors affecting online sales

TEST REPORT: SIEVERS M-SERIES PERFORMANCE SPECIFICATIONS

Chapter 7: Simple linear regression Learning Objectives

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

SUMAN DUVVURU STAT 567 PROJECT REPORT

A Review of Statistical Outlier Methods

2. Simple Linear Regression

How To Run Statistical Tests in Excel

Guidance for Industry

Curve Fitting. Before You Begin

Example: Boats and Manatees

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

FOOD FOR THOUGHT Topical Insights from our Subject Matter Experts UNDERSTANDING WHAT IS NEEDED TO PRODUCE QUALITY DATA

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

5. Linear Regression

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

General and statistical principles for certification of RM ISO Guide 35 and Guide 34

Fairfield Public Schools

Research Methods & Experimental Design

Multiple Linear Regression in Data Mining

Tutorial on Using Excel Solver to Analyze Spin-Lattice Relaxation Time Data

Study Guide for the Final Exam

INTERNATIONAL JOURNAL OF PHARMACEUTICAL RESEARCH AND BIO-SCIENCE

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Data Analysis, Research Study Design and the IRB

Additional sources Compilation of sources:

Mouse Insulin ELISA. For the quantitative determination of insulin in mouse serum and plasma

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

The importance of graphing the data: Anscombe s regression examples

Normality Testing in Excel

AP Physics 1 and 2 Lab Investigations


Using Excel for inferential statistics

DECISION LIMITS FOR THE CONFIRMATORY QUANTIFICATION OF THRESHOLD SUBSTANCES

Assay Migration Studies for In Vitro Diagnostic Devices Guidance for Industry and FDA Staff

STATISTICA Formula Guide: Logistic Regression. Table of Contents

2.500 Threshold e Threshold. Exponential phase. Cycle Number

Evaluating Current Practices in Shelf Life Estimation

Analytical Procedures and Methods Validation for Drugs and Biologics

Simple Regression Theory II 2010 Samuel L. Baker

Definition of Minimum Performance Requirements for Analytical Methods of GMO Testing European Network of GMO Laboratories (ENGL)

Version 5.0. Regression Guide. Harvey Motulsky President, GraphPad Software Inc. GraphPad Prism All rights reserved.

Simple Linear Regression Inference

Univariate Regression

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Regression Modeling Strategies

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

MEASURES OF LOCATION AND SPREAD

4. Simple regression. QBUS6840 Predictive Analytics.

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data

VETERINARY SERVICES MEMORANDUM NO

Elements of statistics (MATH0487-1)

A Primer on Forecasting Business Performance

5. Multiple regression

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Nonlinear Regression Functions. SW Ch 8 1/54/

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

Transcription:

Applying Statistics Recommended by Regulatory Documents Steven Walfish President, Statistical Outsourcing Services steven@statisticaloutsourcingservices.com 301-325 325-31293129

About the Speaker Mr. Steven Walfish is the President of Statistical Outsourcing Services, a consulting company that provides statistical analysis and training to variety of industries. Prior to starting Statistical Outsourcing Services, Mr. Walfish was the Senior Manager Biostatistics, Non-clinical at Human Genome Sciences in Rockville MD. Prior to joining HGS, Mr. Walfish was a Senior Associate at PricewaterhouseCoopers specializing in the pharmaceutical industry. Mr. Walfish brings over 18 years of industrial expertise in the development and application of statistical methods for solving complex business issues including data collection, analysis and reporting. Mr. Walfish has held positions with Johnson & Johnson and Chiron Diagnostics where he worked with large data sets for monitoring process data. Mr. Walfish holds a Bachelors of Arts in Statistics from the University of Buffalo, Masters of Science in Statistics from Rutgers University and an Executive MBA from Boston University.

Learning Objectives Understand regulations that govern method validation. Using the correct statistical methods. How to Plan experiments

Agenda Regulatory Documents that Govern Method Validation Components of Method Validation Other Statistical Methods Integrating Statistical Methods with Regulatory Requirements Interactive Exercise: Case Study

Regulatory Documents USP General Chapter <1225>, Validation of compendial methods, United States Pharmacopeia XXIII, National Formulary, XVIII, Rockville, MD, The United States Pharmacopeial Convention, Inc, 1995, 1710 1612 ICH Q2A Q2B FDA Guidance on Bioanalytical Method Validation FDA Draft Guidance for Industry Analytical Procedures and Methods Validation; Chemistry, Manufacturing, and Controls Documentation

Compendial Methods All USP methods are considered validated according to the guidelines in place at the time they were first published. Many USP methods are several years old, and what may have passed for validation in the past has changed significantly. A minimal re-validation of an USP method to prove suitability for your product should be conducted.

ICH Guidelines Specificity Linearity Range Accuracy Precision Repeatability Intermediate precision Detection limit Quantification limit

Specificity Specificity is the degree of bias (or lack thereof) caused by expected sample components and common interferences, determined by measuring the analyte with and without anticipated interferences.

t-testtest The t-test is a statistical test for comparing two sample means. A common application of this process is to test if an interfering substance has a significant effect on the response variable (Specificity of the method). Compare the spiked samples to the neat samples.

t-testtest The t-test has the following general formula: t = X 1 s n 2 1 1 + where n 1 and n 2 are the sample sizes, X 1 and X 2 are the sample means, and S 2 1 and S2 2 are the sample variances. The degrees of freedom are n 1 +n 2-2 X s n 2 2 2 2

Dunnett s Test A procedure for comparing each experimental mean with the control mean. Dunnett test is more powerful than tests designed to compare each mean with each other mean.

Specificity Example Sample Response 0.5 1 0.5 1.02 0.5 1.04 0.5 1.04 0.5 1.04 0.25 0.95 0.25 0.97 0.25 0.93 0.25 0.93 0.25 0.97 0.125 0.84 0.125 0.86 0.125 0.84 0.125 0.84 0.125 0.85 0.0625 0.81 0.0625 0.83 0.0625 0.84 Sample is serially diluted from a neat sample of 0.5 mg/ml 0.25 is the first sample that is statistically different than 0.0625. (p<0.001) 0.125 sample p-value using Dunnett test is 0.185. 0.0625 0.83 0.0625 0.83

Linearity Linearity is the variance in the response with concentration, measured as the RSD of response factors or the correlation coefficient (R 2 ) from a linear regression fit.

Regression The ordinary least squares regression model has a single regressor x that has a straight line relationship with respect to y. The intercept (β 0 ) is the average y-value when x is equal to zero. The slope (β 1 ) is the change in y for a unit change in x. All models have some random error component (ε i ). β + β x + ε The simple linear model is: 0 i y = 1

Simple Regression Model Y=β 0 +β 1 X Y 1 Unit β 1 Units β o X

Hazards of Regression Extrapolating beyond the range of the x values. Influential observations or outliers give misleading models. Regression does not mean cause and effect. The regression of y=x is not the same as the regression of x=y.

Prediction Intervals A potential use for regression analysis is to predict future values of the response variable. The predicted values are computed by plugging the value of each x variable into the regression equation. An interval around the regression line can be calculated that gives a (1-α)% prediction interval for a new observation.

Linearity Example Y = -2.733333 + 1.048 X X Y 50 45 50 48 50 55 75 72 75 84 75 77 100 94 100 99 100 109 125 135 125 129 125 121 150 145 150 155 150 164 Y 175 150 125 100 75 50 25 25 50 75 100 125 150 175 X

Range Range is the concentration upper and lower levels which meets the linearity, precision, and accuracy performance characteristics.

Accuracy Accuracy is closeness to the true value, measured by % recovery of sample spikes or % error in the analysis of a reference sample.

Tolerance Intervals Statistical tolerance limits are limits within which we expect a stated proportion of the population to lie. What interval will contain p percent of the population measurements? Tolerance intervals take the form of: X ± ks k is a constant such that the interval will cover p percent of the population with at we are p percent of the population at a certain confidence.

Accuracy Example Target Actual Mean Std Dev % Recovery Lower CI Upper CI 50 45 50 48 50 55 49.33 5.13 98.67% 36.59 62.08 75 72 73% 113% 75 84 75 77 77.67 6.03 103.56% 62.69 92.64 100 94 84% 120% 100 99 100 109 100.67 7.64 100.67% 81.69 119.64 125 135 82% 110% 125 129 125 121 128.33 7.02 102.67% 110.89 145.78 150 145 89% 120% 150 155 150 164 154.67 9.50 103.11% 131.06 178.28 87% 109%

Precision Precision is the degree of agreement between replicate analyses of an homogenous sample, usually measured as the relative percent difference (RPD) between duplicates or the relative standard deviation (RSD) of a set of replicates. The ICH Guideline defines three precision measurements: repeatability or short term precision intermediate precision which is essentially the same as intra-lab robustness reproducibility which is essentially the same as inter-lab ruggedness.

Variance Components Variance components, or decomposition of variance is a method to partition the different sources of variation. Repeatability expresses the precision under the same operating conditions over a short interval of time. Repeatability is also termed intra-assay precision. Intermediate precision expresses within-laboratories variations: different days, different analysts, different equipment, etc. Q2A and Q2B talk about reproducibility being the lab to lab variability. Some might include it in the intermediate precision.

Variance Components Model for Assessing Precision Y i = μ + Level + Analyst + Day + i j k ε ijkl Average Intermediate Precision Error = Repeatability

Precision Example Analyst Run Rep Purity 1 1 1 99 1 1 2 96 1 1 3 95 1 2 1 95 1 2 2 89 1 2 3 89 1 3 1 92 1 3 2 91 1 3 3 90 2 1 1 88 2 1 2 92 2 1 3 88 2 2 1 86 2 2 2 85 2 2 3 83 2 3 1 84 2 3 2 86 2 3 3 86 Intermediate precision is analyst + run. Repeatability is rep. Mean Purity = 89.7% Variance Analyst = 17.92 Variance Run (within Analyst) = 7.07 Variance Rep = 4.39 Source Variance CV % Variance Analyst 17.92 4.7% 61.0% Run 7.07 3.0% 24.1% Error 4.39 2.3% 14.9%

Detection Limit Detection limit is the lowest concentration which can be detected with confidence (usually at the 99% confidence level), determined as three times the standard deviation (SD) of the background signal or the low level sample spikes.

Quantification Limit Quantification Limit is the concentration level above which the concentration can be determined with acceptable precision (usually RSD < 10 to 25%) and accuracy (usually 80-120% recovery), usually determined as ten times the SD of the background or low level sample spikes.

Robustness and Ruggedness Robustness is usually done in development to assess the performance of the test method after varying method parameters. Ruggedness is a measure of a test method s capacity to remain unaffected by small, but deliberate variations in method parameters and provides an indication of its reliability during normal usage Design of Experiments can help to assess the robustness and ruggedness of the method.

Non-Normal Normal Data How do we validate methods with nonnormal data? Use Non-parametric statistics. Cell based methods use logistic regression. Some nonlinear regression models are applicable.

Nonparametric Statistics Nonparametric statistical methods assume no underlying distribution. Median and range replace mean and standard deviation. Nonparametric methods are useful when: The measurements are only categorical; i.e., they are nominal or ordinal scaled The assumptions underlying the use of parametric methods cannot be met.

Advantage of Nonparametric Tests They may be used on all types of datacategorical data, which are nominally scaled or are in rank form, called ordinally scaled, as well as interval or ratio-scaled data. For small sample sizes they are easy to apply. They make fewer and less stringent assumptions than their parametric counterparts.

Disadvantages of Nonparametric Tests If the assumptions of the parametric methods can be met, it is generally more efficient to use them. For large sample sizes, data manipulations tend to become more laborious, unless computer software is available. Often special tables of critical values are needed for the test statistic, and these values cannot always be generated by computer software.

Nonlinear Regression Nonlinear models often occur as a result of a physical phenomenon such as exponential growth or decay. The nonlinear model CANNOT be linearized through transformation.

Example of a Nonlinear Model 1600 Y 1 2) 3 2 = β ( X β + β X + ε 1400 1200 1000 800 600 400 200 0 0 5 10 15 20 25 30 35 40 45 50

Estimation Techniques for Nonlinear Regression Nonlinear models cannot be solved explicitly, therefore iterative methods must be used. Convergence of the model might occur if the model is mis-specified or insufficient data. Similar to linear regression, model diagnostics can be applied to nonlinear regression.

Logistic Regression A bioassay is used to determine the strength or biological activity of a substance, such as a drug or hormone, by comparing its effects with those of a standard preparation on a culture of living cells or a test organism. 100000 10000 50% Ref Most bioassays follow a sigmoid dose response curve when plotted on a log-log scale. 1000 0.1 1 10 100 1000 The relative potency is the relationship between the reference curve and test curve.

Parallelism is essential to compute relative potency. Parallelism Parallelism or dilutional similarity means that the two dilutional curves are proportional throughout the range of the assay. In the linear regression case, it would be similar slopes with different intercepts.

4-Parameter Logistic Model The model for the 4-parameter fit is: Y = A + 1 + B conc EC 50 A Slope A = Lower Asymptote B = Upper Asymptote EC 50 = 50% binding concentration Slope = shape parameter of the model Conc = X variable

Issues With the Approach The model fit is iterative. Model does not always converge. Reference and test curves might not be parallel.

Combined Protocol Test specificity using a minimum of 6 replicates per interfering substance level. Usually use a single analyst for this study. Test LOD and LOQ 2 analysts, 2 days, 3 samples at 5 different concentrations. These 60 observations can be used for linearity, accuracy, precision and range.

Data Monitoring How should method validation and method qualification data be monitored and handled? Would like to assess if any outliers exists before the study is complete. System suitability testing accomplishes this without the chance of biasing the data.

Case Study 9 samples each with 6 repeats. Assess accuracy and precision for each sample. Assess linearity

Case Study 1 2 3 4 5 6 7 8 9 10 11 12 A B 1 7 C 2 D 3 8 E 4 F 5 9 G 6 H Degraded Samples

Case Study Any issues with the experiment? What can be improved in the plate layout? What additional analysis should be conducted?

Sample Cell Type Target Observed 1 B2 Normal 50 57 1 B3 Normal 50 43 1 B4 Normal 50 46 1 B5 Normal 50 52 1 B6 Normal 50 43 1 B7 Normal 50 50 2 C2 Normal 75 65 2 C3 Normal 75 63 2 C4 Normal 75 85 2 C5 Normal 75 55 2 C6 Normal 75 98 2 C7 Normal 75 84 3 D2 Normal 100 95 3 D3 Normal 100 129 3 D4 Normal 100 96 3 D5 Normal 100 95 3 D6 Normal 100 124 3 D7 Normal 100 88 4 E2 Normal 125 149 4 E3 Normal 125 126 4 E4 Normal 125 120 4 E5 Normal 125 107 4 E6 Normal 125 137 4 E7 Normal 125 119 5 F2 Normal 150 130 5 F3 Normal 150 162 5 F4 Normal 150 160 5 F5 Normal 150 148 5 F6 Normal 150 162 5 F7 Normal 150 178 6 G2 Degraded 75 65 6 G3 Degraded 75 62 6 G4 Degraded 75 65 6 G5 Degraded 75 55 6 G6 Degraded 75 67 6 G7 Degraded 75 71 7 B9 Degraded 100 95 7 B10 Degraded 100 89 7 B11 Degraded 100 95 7 C9 Degraded 100 94 7 C10 Degraded 100 94 7 C11 Degraded 100 87 8 D9 Degraded 125 118 8 D10 Degraded 125 106 8 D11 Degraded 125 109 8 E9 Degraded 125 107 8 E10 Degraded 125 106 8 E11 Degraded 125 109 9 F9 Degraded 150 130 9 F10 Degraded 150 132 9 F11 Degraded 150 130 9 G9 Degraded 150 137 9 G10 Degraded 150 131 9 G11 Degraded 150 138

Accuracy and Precision Sample Type Spiked Concentration Mean Std Dev % Recovery Degraded 75 64.2 5.4 85.6 Degraded 100 92.3 3.4 92.3 Degraded 125 109.2 4.5 87.3 Degraded 150 133.0 3.6 88.7 Normal 50 48.5 5.5 97.0 Normal 75 75.0 16.5 100.0 Normal 100 104.5 17.4 104.5 Normal 125 126.3 14.8 101.1 Normal 150 156.7 16.2 104.4

Linearity RP = -4.866667 + 1.0706667 Target Summary of Fit RSquare RSquare Adj Root Mean Square Error Mean of Response Observations (or Sum Wgts) 0.886613 0.882564 14.01215 102.2 30 200 150 RP 100 50 25 50 75 100 125 150 175 Target

Row Effect Samples 7, 8 and 9 are tested in separate rows. Can do testing to see if there is a row effect. Only 3 observations in each row, so the power is not very good. All t-tests show no significant row effects.

Final Thoughts Statistical design should be integrated into any method validation. Do not have to do one at a time experimentation. Precision analysis is the most important component to any study.