Quality System Guidance



Similar documents
LAB ID and/or LABORATORY NAME: ASSESSOR NAME: CONTINUOUS RADON MONITOR (CRM) by Proportional Counting. Method Number: ELAP method code 7037

APPENDIX N. Data Validation Using Data Descriptors

Control Charts and Trend Analysis for ISO Speakers: New York State Food Laboratory s Quality Assurance Team

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

American Association for Laboratory Accreditation

ASSURING THE QUALITY OF TEST RESULTS

MODULE 2D ENVIRONMENTAL MICROBIOLOGICAL LABORATORY ACCREDITATION PROGRAM (EMLAP) ADDITIONAL REQUIREMENTS

SPSS Guide: Regression Analysis

1 Quality Assurance and Quality Control Project Plan

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Validation and Calibration. Definitions and Terminology

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Variables Control Charts

How to Verify Performance Specifications

Descriptive Statistics

Data Quality. Tips for getting it, keeping it, proving it! Central Coast Ambient Monitoring Program

Ecology Quality Assurance Glossary

USING CLSI GUIDELINES TO PERFORM METHOD EVALUATION STUDIES IN YOUR LABORATORY

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

How to Design and Interpret a Multiple-Choice-Question Test: A Probabilistic Approach*

Z - Scores. Why is this Important?

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

5.5. Solving linear systems by the elimination method

AP Physics 1 and 2 Lab Investigations

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

Method To Solve Linear, Polynomial, or Absolute Value Inequalities:

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

INTRODUCTION TO MULTIPLE CORRELATION

Part 2: Analysis of Relationship Between Two Variables

Experiment #1, Analyze Data using Excel, Calculator and Graphs.

Section 1.1 Linear Equations: Slope and Equations of Lines

Module 3: Correlation and Covariance

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Confidence Intervals for the Difference Between Two Means

Acceptance Criteria for Durability Tests

To avoid potential inspection

PTA proficiency testing for metal testing laboratories

Applying Statistics Recommended by Regulatory Documents

Measurement with Ratios

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Session 7 Bivariate Data and Analysis

Introduction to Quantitative Methods

Non-Parametric Tests (I)

1. PURPOSE To provide a written procedure for laboratory proficiency testing requirements and reporting.

Study Guide for the Final Exam

Non-Inferiority Tests for Two Proportions

Correlation key concepts:

Chapter 5 Risk and Return ANSWERS TO SELECTED END-OF-CHAPTER QUESTIONS

Signal, Noise, and Detection Limits in Mass Spectrometry

Quality Assurance/Quality Control in Acid Deposition Monitoring

CALCULATIONS & STATISTICS

NPV Versus IRR. W.L. Silber We know that if the cost of capital is 18 percent we reject the project because the NPV

13: Additional ANOVA Topics. Post hoc Comparisons

Design of Experiments for Analytical Method Development and Validation

Evaluating Laboratory Data. Tom Frick Environmental Assessment Section Bureau of Laboratories

HOW ACCURATE ARE THOSE THERMOCOUPLES?

Two Correlated Proportions (McNemar Test)

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

NCSS Statistical Software

Credit Scorecards for SME Finance The Process of Improving Risk Measurement and Management

Terms concerned with internal quality control procedures

Chemical Kinetics. 2. Using the kinetics of a given reaction a possible reaction mechanism

Technical note I: Comparing measures of hospital markets in England across market definitions, measures of concentration and products

Mesh Discretization Error and Criteria for Accuracy of Finite Element Solutions

International Statistical Institute, 56th Session, 2007: Phil Everson

USE OF REFERENCE MATERIALS IN THE LABORATORY

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction

Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents

UNIVERSITY OF CALIFORNIA AT BERKELEY College of Engineering Department of Electrical Engineering and Computer Sciences. EE105 Lab Experiments

Definition 8.1 Two inequalities are equivalent if they have the same solution set. Add or Subtract the same value on both sides of the inequality.

Using Excel for inferential statistics

Analytical Methods: A Statistical Perspective on the ICH Q2A and Q2B Guidelines for Validation of Analytical Methods

Robust procedures for Canadian Test Day Model final report for the Holstein breed

Inventory Management and Risk Pooling. Xiaohong Pang Automation Department Shanghai Jiaotong University

Statistical Rules of Thumb

Lin s Concordance Correlation Coefficient

Zeros of a Polynomial Function

Factor Analysis. Chapter 420. Introduction

Technical Guidance Manual for the Implementation of the Hawai`i State Contingency Plan. Interim Final SECTION 10

Chapter 4 and 5 solutions

A three point formula for finding roots of equations by the method of least squares

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Analyses: Statistical Measures

Partial Estimates of Reliability: Parallel Form Reliability in the Key Stage 2 Science Tests

Data Analysis on the ABI PRISM 7700 Sequence Detection System: Setting Baselines and Thresholds. Overview. Data Analysis Tutorial

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13

3. What is the difference between variance and standard deviation? 5. If I add 2 to all my observations, how variance and mean will vary?

Method Validation/Verification. CAP/CLIA regulated methods at Texas Department of State Health Services Laboratory

1.0 XRF Analyses Performance Evaluation

Statistics. Measurement. Scales of Measurement 7/18/2012

AN ANALYSIS OF INSURANCE COMPLAINT RATIOS

PEARSON R CORRELATION COEFFICIENT

Descriptive Statistics

Normality Testing in Excel

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

MATH 60 NOTEBOOK CERTIFICATIONS

CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS

APPENDIX 7-B SUGGESTED OUTLINE OF A QUALITY ASSURANCE PROJECT PLAN

Transcription:

Page 1 of 13 The following is a discussion of statistical schemes that have been used and may be appropriate for evaluation of microbiological QC data (duplicates, replicates, daily reference slides for quantifiable analyses). In all cases, initial control limit values should be determined and then updated on a specified frequency as more data is collected. Use of control charts is recommended to facilitate trending. Consideration for All QC Evaluations Microbial and particulate loading within a sample will influence level of variability detected in repeat counts for many types of analyses. Samples that are either lightly or heavily loaded with microorganisms for a given matrix (i.e., bulk, air, swab, etc.) will typically display more variability in repeat counts than samples that have optimal loading ranges for counting, especially if only a portion of the sample preparation is assessed. Amount of sample assessed will also influence variability detected in repeat counts. Repeat counts on sample preparations that have been 100% completely assessed should yield closer numbers than repeat counts on samples that have had a smaller portion assessed (i.e., <100%). It should also be noted that statistical treatment may not be meaningful at the lowest concentrations for some analyses (e.g. indicating 1 count vs 3 counts or 1 versus 0 on a direct exam may not be a reason for failed QC whereas for air culture it might be). Statistical treatment should be limited to organism concentrations high enough to indicate true failures if calculations exceed acceptance criteria. After, becoming experienced with performing sample preparations and analyses for a given technique and getting an intuitive understanding of how loading affects the variation of repeat counts, it is important to do the following. 1. Decide whether or not, the technique requires statistics to be performed separately for different loading ranges (i.e., similar to NIOSH PCM 7400 method). 2. If decision to item 1 is yes, then define the concentration categories/ranges that statistics shall be performed on separately. Remember, category concentration ranges for a given technique are arbitrary, and they should be defined after some observations and empirical data have been obtained. As more QC data are obtained, the loading categories may have to be redefined and changed. 3. Perform the QC statistics for each concentration range of a given technique. QC statistics will be explained in the next section of this document.

Page 2 of 13 4. Decide how many data points (i.e., 10, 20, 30, etc.) within a concentration range of a technique are required to establish acceptance criteria that can be compared against to see if a given value or repeat count is acceptable. The number of data points required to establish a baseline comparison is arbitrary but should be at least 10; 20-30 points is better. 5. After determining statistics based on total counts/concentrations, decide if the same statistics should be performed on the individual dominant microorganisms. Statistical Comparisons for QC Evaluations The following are statistical approaches that may be considered and may be appropriate for your laboratory. Other approaches may be equally suitable. A. NIOSH PCM 7400 Model for Data Processing Ref 1 (may be most appropriate for air direct exam data, both reference slides and intra/inter analyst recounts) 1. A set of reference slides, containing a range of spore types and concentrations, are read daily by analysts (lab-generated slides containing only one spore type are no longer allowed as they are not representative of real samples and taxon ranking cannot be done). Data collected should be spores/trace and spores/m 3 for the number of spores of each type and total spores. After 10-20 values for each slide are obtained, acceptance criteria for at least total count can be calculated by determining the mean and standard deviation (StdDev) of each data set (Statistical evaluation of concentrations of dominant spore types may also be done but are not mandatory. See discussion of Organism ID and Ranking for requirements). Warning limits for each slide are typically mean +/-2 StdDev while control limits are mean +/-3 StdDev. 2. Variability or coefficient of variation (Cv), also referred to as relative standard deviation (Sr), can be calculated from the reference slide data (Cv = StdDev/Mean). Cv values should be evaluated to see if there is a correlation between Cv value and spore concentration. If there is, pooled Cv values may be calculated for various spore concentration ranges (e.g. 5-50, 50-200, >200 spores counted, or whatever ranges seem appropriate). If there is no obvious Cv correlation to spore concentration, one pooled Cv value can be used for the lab. Pooled Cv is calculated as follows: Cv pool = [(Cv 1 2 + Cv 2 2 + Cv n 2 )/(n)]

Page 3 of 13 where n = number of Cv values pooled 3. Cv values can then be used in the NIOSH 7400 formula for determining acceptability of the 5% duplicate (inter-analyst) and replicate (intra-analyst) analyses (total spore count/trace). The formula is as follows: X 1 - X 2 < 2.77(mean X)(pooled Cv/2) where mean X = ( X 1 + X 2 )/2 This formula can be simplified to: X 1 - X 2 < 0.69( X 1 + X 2 )(pooled Cv) As long as the left side of this equation is less than or equal to the right side, the duplicate/replicate results are considered acceptable. B. QC for Intra/Inter Analyst Comparisons The following are statistical approaches that may be considered and may be appropriate for your laboratory. Other approaches may be equally suitable. 1. The NIOSH 7400 formula from Step 3 above can be used for intra/inter analyst acceptability for all other quantifiable analyses as well (quantified direct exam swabs or tape lifts and all culturable analyses), provided a way is found to determine appropriate Cv values without the use of reference slides. One possible way is to determine mean, StdDev and Cv for each replicate or duplicate pair. The Cv values can then be pooled (total or by organism concentration) to produce Cv values to be used in the formula shown in Step 3 above. Duplicate and replicate values to use should be organisms/sample rather than final concentrations based on air volumes, areas, or weights unless consistent values are used for these parameters. 2. Calculations Based on Relative Percent Difference or RPD (Variations of this are used by many labs. This calculation can be used for any intra/inter analyst data set.) a. RPD is calculated as the absolute difference between a pair of readings divided by the mean and multiplied by 100 to change result to % or (( X 2 X 1 )/Mean)*100.

Page 4 of 13 b. There are correct and incorrect methods for determining RPD control limits, as shown in the Steve Lerman discussion attached at the end of this document as Appendix A. (Arbitrary selection of an acceptance limit (e.g. +/-100%) is not statistics based). c. The easiest correct approach is to keep the sign on the RPD value rather than using absolute values. This then allows for use of 2-sided 2 and 3StdDev limits (Steve s Figure 3). d. Keeping the sign on the RPD values when calculating the StdDev and then using 1-sided 2 and 3StdDev limits to track absolute value of RPD is also commonly used (Steve s Figure 4). e. The New York State Department of Health Environmental Laboratory Approval Program method of using a single control limit at 3.27 times the mean Std Dev is also acceptable (Steve s Figure 5). f. As with the Cv discussion above, there may be a concentration dependence whereby more than one RPD database and set of control limits may be needed if there is a wide range of appropriate RPD values. 3. Calculations Based on Standard Deviation a. There are statistical ways to estimate StdDev for a series of duplicate/replicate measurements. The following is perhaps easiest as it does not require use of outside tables Ref 2 : StdDev est = ( d 2 /2N) where d = difference between the 2 values in each pair N = number of pairs Acceptance criteria can then be set at StdDev sple < 3StdDev est. b. From the same statistics source, the following formula calculates uncertainty from the estimated StdDev: U = t(stddev est )/N where t = value from Student t table N = number of pairs used in Step b above 4. QC for Cultures

Page 5 of 13 a. Standard Methods Ref 3 uses the range criterion (Ŕ) to determine if duplicate or replicate counts are acceptable or not. The Ŕ is calculated from historical data by summing all differences between logarithms of each duplicate count and dividing by number of duplicate counts. After Ŕ is calculated and established, it can be used to determine whether subsequent duplicate or replicate counts are acceptable or not by using the following formula logx 1 -logx 2 > 3.27 Ŕ then reject acceptability where x 1 = 1st count x 2 = 2nd count b. The New York Department of health Environmental Laboratory Approval Certification Manual states that duplicate (intra-analyst) counts for cultures should not have more than 5% difference, and that replicate (inter-analyst) counts should not have more than 10% difference to be acceptable. This is not a statistical determination but may be a place to start as a default value. Protocols for Taxonomic ID and Ranking The following are approaches that may be considered and may be appropriate for your laboratory. Other approaches may be equally suitable. 1. At present a common approach is to require identification of some percentage of the prevalent organism types in a sample to agree (e.g. top 2, 2 out of top 3, 3 out of top 5, etc). Looking at statistical concentration comparison of top 1-2, in addition to total concentration of all organisms, may be appropriate, but is not mandatory with the current policies. 2. Another method of looking at organism ranking is use of the Spearman Rank Order Correlation Coefficient (r s ). The following formula is used to determine if rankings within 2 sets of results are similar or related: r s = 1 ((6( D 2 )/(N(N 2-1)) where D = difference in an organisms ranking in the 2 data sets N = number of organisms in each data set

Page 6 of 13 The closer r s is to 1, the closer the agreement in the rankings. A cutoff value for acceptable values for r s would need to be established (at least >0.5, probably higher). A discussion of Spearman Ranking can be found at http://www.mnstate.edu/wasson/ed602spearcorr.htm. Spearman ranking alone is probably not enough, but combined with statistical concentration comparison of top 1-2 organisms may be appropriate. References: 1. Abell, M.T., Shulman, S.A., and Baron, P.A. 1989. The Quality of Fiber Count Data, Appl Ind Hyg. Vol. 4 No. 11 pp. 273-285. 2. Taylor, John Keenan, 1987, Quality Assurance of Chemical Measurements, p22. 3. Standard Methods for the Examination of Water and Waste Water 20 th Edition, pp. 9-10.

Page 7 of 13 Appendix A STATISTICAL QUALITY CONTROL TECHNIQUES FOR DUPLICATE QC DATA Steve Lerman SIL Consulting, Inc. 3 Allan Gate Plainview, NY 11803-6112 (516) 433-3412 Most Laboratory QC Managers are well familiar with the techniques for generating traditional control charts for reference and spike samples by taking the mean and standard deviation of the data, and plotting warning and control limits at +2s and +3s respectively (see Figure 1 and Table 1). Please note that only six observations have been used for illustrative purposes. Statistically based limits should typically be established using a minimum of 20 to 30 observations collected under similar analytical conditions. Establishing and plotting control limits for duplicate data, however, is another matter entirely. Here, we see what is essentially the same approach - calculating the mean and standard deviation of the difference, or relative % difference (% RPD), between the two results, then setting up control limits as one would for reference and spike samples. The % RPD is the absolute value of the difference divided by the average, to allow data of different magnitudes to be statistically combined. Although this approach is widely used, some applications of this approach are incorrect. There is no significance to using the mean of the RPD as a mean - there is no trend line for duplicate data; if there is one, it would be zero. Instead, the RPD can be treated in the same manner as the difference between the values and the mean for reference and spike data, i. e., the data can be taken and calculated as a standard deviation. We will take the data in Table 1, and calculate it several ways to illustrate what we mean. Table 2 and Figure 2 show the data calculated and graphed using an erroneous approach. Although the appearance looks comforting, it is meaningless. There are no lower control limits for duplicate data, and the mean is not the desired target. Probably the simplest approach is the one recommended by Paul Britton, the Statistical Data Manager for EPA's proficiency program and Results Advisor for ASTM Committee D19 on

Page 8 of 13 water 1. His approach is similar to the erroneous one, except that the sign of the RPD is kept. Using this approach, the resulting distribution is Gaussian, and we can look for trends in the first vs. second result. The mean converges at zero, as we would expect. Table 3 and Figure 3 show this approach. There are other valid approaches. Taylor 2 uses the root-mean-square approach, where he takes the root-mean-square sum of the RPD value and uses that value as a standard deviation (this is the same as calculating a standard deviation). He then takes 2x and 3x that value as control limits - the trend line is zero. This produces a 1-way control chart, which is totally suitable for duplicate data. Figure 4 and Table 4 show such a graph using the data in Table 1. A similar approach is used by the New York State Department of Health Environmental Laboratory Approval Program 3. They calculate the mean of the RPD, but use that as a measure of standard deviation. They take 3.27 times this value as the single control limit, thus also producing a 1-way control chart, but with a single control limit. 2.51 times this value may also be used as a warning limit. Figure 5 and Table 5 show such a graph using the data in Table 1, and with both control limits. All of these protocols provide statistically similar and valid results, giving the Laboratory QC Manager a wide variety of options for establishing and plotting control limits for duplicate QC data. 1 ASTM Subcommittee D19.02 Meeting. 2 Taylor, John K., Quality Assurance of Chemical Measurement, pp. 22-23, Lewis Publishers, Chelsea, MI, 1987. 3 NY State DOH ELAP Certification Manual, Section 261, pp. 6-7, Albany, NY.

Page 9 of 13 Figure 1 70 60 50 Test Result 40 30 20 10 0 0 1 2 3 4 5 6 7 Sample # Table 1 Result # Result 1 1 41 2 37 3 47 4 28 5 26 6 29 Mean 34.7 Std. Dev. 8.4 +2s 51.4 +3s 59.7-2s 17.9-3s 9.6

Page 10 of 13 Figure 2 25.0 20.0 % RPD 15.0 10.0 5.0 0.0 0 1 2 3 4 5 6 Sample # 7 Table 2 Result # Result 1 Result 2 % RPD 1 41 37 10.3 2 37 42 12.7 3 47 51 8.2 4 28 24 15.4 5 26 30 14.3 6 29 27 7.1 Mean 11.3 Std. Dev. 3.3 +2s 18.0 +3s 21.3-2s 4.6-3s 1.3

Page 11 of 13 Figure 3 % RPD 50.0 40.0 30.0 20.0 10.0 0.0-10.0-20.0-30.0-40.0-50.0 0 1 2 3 4 5 6 Sample # 7 Table 3 Result # Result 1 Result 2 % RPD 1 41 37 10.3 2 37 42-12.7 3 47 51-8.2 4 28 24 15.4 5 26 30-14.3 6 29 27 7.1 Mean -0.4 Std. Dev. 12.8 +2s 25.3 +3s 38.1-2s -26.0-3s -38.9

Page 12 of 13 Figure 4 % RPD 40.0 35.0 30.0 25.0 20.0 15.0 10.0 5.0 0.0 0 1 2 3 4 5 6 7 Sample # Table 4 Result # Result 1 Result 2 % RPD 1 41 37 10.3 2 37 42 12.7 3 47 51 8.2 4 28 24 15.4 5 26 30 14.3 6 29 27 7.1 RMS 11.7 +2*RMS 23.4 +3*RMS 35.2

Page 13 of 13 Figure 5 % RPD 40.0 35.0 30.0 25.0 20.0 15.0 10.0 5.0 0.0 0 1 2 3 4 5 6 7 Sample # Table 5 Result # Result 1 Result 2 % RPD 1 41 37 10.3 2 37 42 12.7 3 47 51 8.2 4 28 24 15.4 5 26 30 14.3 6 29 27 7.1 Mean 11.3 +2.51*Mean 28.4 +3.27*Mean 37.0