Automated Quadratic Characterization of Flow Cytometer Instrument Sensitivity (flowqb Package: Introductory Processing Using Data NIH))



Similar documents
Using the BD TM Cytometer Setup and Tracking (CS&T) System for Instrument Characterization and Performance Tracking

Introduction to flow cytometry

CELL CYCLE BASICS. G0/1 = 1X S Phase G2/M = 2X DYE FLUORESCENCE

Instrument Characterization and Performance Tracking for Digital Flow Cytometers

CELL CYCLE BASICS. G0/1 = 1X S Phase G2/M = 2X DYE FLUORESCENCE

Using BD FACSDiva CST To. Evaluate Cytometer Performance, Create Custom Assay Settings. and

FACSCanto RUO Special Order QUICK REFERENCE GUIDE

Multicolor Bead Flow Cytometry Standardization Heba Degheidy MD, PhD, QCYM DB/OSEL/CDRH/FDA Manager of MCM Flow Cytometry Facility

Technical Bulletin. Threshold and Analysis of Small Particles on the BD Accuri C6 Flow Cytometer

BD FACSComp Software Tutorial

Spherotech, Inc Irma Lee Circle, Unit 101, Lake Forest, Illinois

Technical Bulletin. Standardizing Application Setup Across Multiple Flow Cytometers Using BD FACSDiva Version 6 Software. Abstract

BD FACSDiva TUTORIAL TSRI FLOW CYTOMETRY CORE FACILITY

Flow Cytometry A Basic Overview

APPLICATION INFORMATION

COMPENSATION MIT Flow Cytometry Core Facility

Flow cytometry basics fluidics, optics, electronics...

These particles have something in common

Univariate Regression

Instructions for Use. CyAn ADP. High-speed Analyzer. Summit G June Beckman Coulter, Inc N. Harbor Blvd. Fullerton, CA 92835

Zecotek S Light Projection Network Marketing

Katharina Lückerath (AG Dr. Martin Zörnig) adapted from Dr. Jörg Hildmann BD Biosciences,Customer Service

Compensation Basics - Bagwell. Compensation Basics. C. Bruce Bagwell MD, Ph.D. Verity Software House, Inc.

Application Information Bulletin: Set-Up of the CytoFLEX Set-Up of the CytoFLEX* for Extracellular Vesicle Measurement

Analyzing Flow Cytometry Data with Bioconductor

Three Rules for Compensation Controls

Analyzer Experiment setup guide for LSRII and Canto s

Evaluating System Suitability CE, GC, LC and A/D ChemStation Revisions: A.03.0x- A.08.0x

LEGENDplex Data Analysis Software

Immunophenotyping Flow Cytometry Tutorial. Contents. Experimental Requirements...1. Data Storage...2. Voltage Adjustments...3. Compensation...

flowtrans: A Package for Optimizing Data Transformations for Flow Cytometry

GETTING STARTED. 6. Click the New Specimen icon.

Introduction to General and Generalized Linear Models

Simple Linear Regression Inference

Selected Topics in Electrical Engineering: Flow Cytometry Data Analysis

123count ebeads Catalog Number: Also known as: Absolute cell count beads GPR: General Purpose Reagents. For Laboratory Use.

INSIDE THE BLACK BOX

Standardization, Calibration and Quality Control

DYNAMIC LIGHT SCATTERING COMMON TERMS DEFINED

CyAn : 11 Parameter Desktop Flow Cytometer

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

powerful. Intuitive. Customizable. The new star of benchtop flow cytometry

Technical Bulletin. An Introduction to Compensation for Multicolor Assays on Digital Flow Cytometers

Attune NxT Acoustic Focusing Cytometer

Fundamentals of modern UV-visible spectroscopy. Presentation Materials

Flow Cytometry for Everyone Else Susan McQuiston, J.D., MLS(ASCP), C.Cy.

Multicolor Flow Cytometry: Setup and Optimization on the BD Accuri C6 Flow Cytometer

5. Linear Regression

1. Instrument Layout pg Summit Overview pg Data Storage and Data Files pgs Saving Data pg Start Up pg 8

Stepcount. Product Description: Closed transparent tubes with a metal screen, including a white matrix at the bottom. Cat. Reference: STP-25T

DELPHI 27 V 2016 CYTOMETRY STRATEGIES IN THE DIAGNOSIS OF HEMATOLOGICAL DISEASES

FlowMergeCluster Documentation

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

MEASURABLE PARAMETERS: Flow cytometers are capable of measuring a variety of cellular characteristics such as:

Introduction to Flow Cytometry

Using CyTOF Data with FlowJo Version Revised 2/3/14

Establishing Optimum Baseline PMT Gains to Maximize Resolution on BD Biosciences Digital Flow Cytometers

VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY Q2(R1)

Introduction to Flow Cytometry

Analytical Test Method Validation Report Template

Robust procedures for Canadian Test Day Model final report for the Holstein breed

Introduction to Flow Cytometry:

CyFlow SL. Healthcare Immunology. Portable FCM System for 3-colour Immunophenotyping

Supplementary Material. Free-radical production after post-thaw incubation of ram spermatozoa is related to decreased in vivo fertility

Immunophenotyping peripheral blood cells

Application of Automated Data Collection to Surface-Enhanced Raman Scattering (SERS)

use. 3,5 Apoptosis Viability Titration Assays

Step-by-Step Analytical Methods Validation and Protocol in the Quality System Compliance Industry

Flow Data Analysis. Qianjun Zhang Application Scientist, Tree Star Inc. Oregon, USA FLOWJO CYTOMETRY DATA ANALYSIS SOFTWARE

Environmental Remote Sensing GEOG 2021

Regression Analysis: A Complete Example

2. Simple Linear Regression

CyFlow SL. Microbiology. Detection and Analysis of Microorganisms and Small Particles


Science is hard. Flow cytometry should be easy.


The accurate calibration of all detectors is crucial for the subsequent data

Geostatistics Exploratory Analysis

Cell Cycle Tutorial. Contents

Boundary-breaking acoustic focusing cytometry

Setting up and calibration of a flow cytometer for multicolor immunophenotyping

CyTOF2. Mass cytometry system. Unveil new cell types and function with high-parameter protein detection

Avalanche Photodiodes: A User's Guide

No-wash, no-lyse detection of leukocytes in human whole blood on the Attune NxT Flow Cytometer

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Econometrics Simple Linear Regression

Functional Data Analysis of MALDI TOF Protein Spectra

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

17. SIMPLE LINEAR REGRESSION II

Deep profiling of multitube flow cytometry data Supplemental information

1.1. Simple Regression in Excel (Excel 2010).

LSR II Users Guide Stanford Shared FACS Facility 06/10/13

Institut für Experimentelle Immunologie und Bildgebung

Guidance for Industry

Flow Cytometry 101. Jodi Moore, PhD Jan 2012 Goldenson 113

Fluorescence Compensation. Jennifer Wilshire, PhD

Linear Threshold Units

Social Media Mining. Data Mining Essentials

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Easy to use. Compact. Flexible. Sorting Chip

Transcription:

Automated Quadratic Characterization of Flow Cytometer Instrument Sensitivity (flowqb Package: Introductory Processing Using Data NIH)) October 14, 2013 1 Licensing Under the Artistic License, you are free to use and redistribute this software. 2 Background Flow cytometer sensitivity has been well defined in the community and relates to two functions: (1) How well a dim staining population is resolved from an unstained population; (2) How well various dim staining populations can be distinguished from each other. The most common terms for characterizing flow cytometer sensitivity are known as Q (detector efficiency) and B (background light level), which are calculated using beads acquired on a cytometer. Any routine method to measure Q and B must be rapid and sufficiently accurate to provide useful results, see the works in [1, 2, 3]. Manually gating multi-peak bead data to generate the MFIs and SDs from the beads is an extremely time-consuming process. We showed the feasibility of an automated approach based on the clustering algorithm Kmeans to detect the bead sub-populations and to generate the MFIs and SDs in an easy and fully automatic rapid fashion. Furthermore, we extended the standard linear formulation, used to derive coefficients for Q and B calculation, with a quadratic term that is taking into account intrinsic variance from both the instrument and the bead product, see the work in [4]. Consequently we have developed a fully automated R Bioconductor package that we call flowqb. This document is intended to provide full access to This project was supported by the Terry Fox Foundation and the Terry Fox Research Institute and by grant 700374 from the Canadian Cancer Society, Toronto. 1

the R implementation of the theoretical descriptions of Q and B calculation in our manuscript [4] and to explain the generic functions that enable R as an informatics research platform for flow cytometer sensitivity: ˆ To calculate automatically the detector efficiency (Q), optical background (B), and electronic noise. ˆ To determine the optimal voltages for each fluorescence parameter when a series of voltages are applied to the photomultiplier tube (PMT) to setup the optimal separation and sensitivity, see the work in [4]. ˆ Methods We propose a collection of R generic functions for automatic Q and B calculation. We have implemented these functions in the Bioconductor package flowqb. We illustrate their use with a number of case studies. ˆ Results We hope that these proposed R generic functions will become the base for the development of many tools for the Q and B calculation. ˆ keywords Flow Cytometry, High Throughput, Doublet, Instrument Sensitivity, Kmeans, Mean Fluorescence Intensity (MFI), Molecules of Equivalent Soluble Fluorochrome (MESF), linear and quadratic regressions, Q (detector efficiency), B (background light level). 3 R QB Generic Functions We define generic functions for our automatic Q and B calculations. A generic function is a standard R function with a special body to do an analytical calculation. We have four global generic functions to conduct an automatic Q and B calculation with the following utilities: ˆ Function ReadDD reads a given FCS file and remove doublet events for a given channel (ChanGiven), see section 4. ˆ Function KmeansMeanSD takes a given 2D array and generates a number of clusters and output their Means and SDs, see section 5. ˆ Function MFI2MESF converts the MFI means to MESF means. For instance, the MFI output of KmeansMeanSD are converted to MESF values. The SDs are also corrected for Illumination-Correction, see section 6. ˆ Functions lrmesf and qrmesf use the Means and the variances SD 2 in terms of the MESF values to conduct a linear and quadratic regression, see section 7. ˆ Function DiscriminantExamination uses the values in the output of the function qrmesf to estimate the discriminant of the resulting quadratic equation and can be used as an additional interpretation tool to aid in understanding cytometer sensitivity, see section 8. 2

ˆ An advanced processing uses logicle transformation to transform the data as previously described in [?]. To identify bead singlets, it used the approach of gating on FSC & SSC (see [3, 2] for more information on doublet discrimination effects on Q and B calculations). It performed Kmeans clustering as an automated gating procedure in order to measure the bead MFIs and SDs. Two strategies are used: (1) 2D Kmeans processing, the selected marker with SSC or (2) Multi-level gating of a set of selected channels. The functions gps and rps are for 2D Kmeans processing. gps function output Gaussian distribution fitting for peak statistics extraction, the fitting parameters means and standard deviations are the estimations of MFIs and SDs values. Estimations of MFIs and SDs values using Robust Statistics [?] is implemented in the function rps in flowqb. Estimations of MFIs and SDs values using multi level Kmeans is implemented in the functions MultilevelgPS and MultilevelrPS in flowqb. The function MultilevelrPS uses robust statsitics and MultilevelgPS uses Gaussian distribution fitting R software MASS package to extract the statistics for the peak found by multi-kmeans gating. We implemented the weighted least squares regression by using the function qrweightedmesf. The vignette Advancedf lowqb.rnw has a detailed illustration how to use these functions: gps, rps MultilevelgPS, MultilevelrPS and qrweightedmesf. 4 Doublet Discrimination Function The function ReadDD reads a given FCS file and remove doublet events for a given channel (ChanGiven). When spherical beads are run through the cytometer, sometimes two pass too close together which results in a doublet reading. To exclude doublets from further analysis, a step of doublet discrimination is applied before moving onto the auto-gating step. A standard approach for doublet discrimination is taken here, see our manuscript [4] for more information on doublet discrimination effects on Q and B calculations. Two channels Chan1DD and Chan2DD are used to detect the doublet events. For ( instance, the ratio of forward scatter height to forward scatter area is used FSC.H FSC.A). The further this ratio is from 1, the more likely it is that the event considered is a doublet. Let P [50, 100] be the percentage of the singlet events to keep; i.e, only events with forward scattering between P 100 F S.H F S.A 2 P 100 are extracted by the function ReadDD. The output is a 2D array (ChanGiven, ChanCompanion) having the positive fluorescent intensities of channel of interest ChanGiven and the companion channel ChanCompanion which will be used to facilitate the 2D Kmeans clustering, for instance side scattering. As the function ReadDD reads only a FCS file, we need to extract the FCS file from extdata folder: 3

> rm(list=ls(all=true)) > library(flowqb) > File= system.file("extdata","nih.fcs",package="flowqb") The function ReadDD reads first the FCS file NIH.fcs, using the read.fcs function in flowcore package. Second, the Forward Scattering Area index 1 and Forward Scattering Height index 2 are used to obtain singlet events with a Doublet Discrimination value equals to P=96. Finally, the processing returns a 2D singlet events for the channel of interest index 5 (B515), with the companion channel Side Scattering, index 3 (SS). > P <- 96 > MFI2D <- ReadDD(File,1,2,P,5,3) Note that MF I2D is now a 2D array. 5 Kmeans to Extract Means and SDs The function KmeansMeanSD takes a given 2D array, generates a number of clusters and outputs thr Means and SDs. For instance, the output of ReadDD is used by KmeansMeanSD to generate the required statistics for Q & B calcultaion. K-means clustering in 2D is used as an automated gating procedure in order to measure the bead MFIs and SDs. K-means clustering is a method which partitions a set of data points (or events) into K clusters. Intuitively, each data point is assigned to the cluster whose mean value is closest to that of the data point in question. More formally, it aims to find the partition which minimizes the within-cluster sum of errors. Function KmeansMeanSD returns the MFI Means and SDs of the 8 clusters. > MFIMeansSDs=KmeansMeanSD(MFI2D,8,300,300,1) Note that MF IMeansSDs is now a 2D array having MFI Means and SDs. The first dimension is associated to the MFI Means: > MFIMeansSDs[,1] [1] 8.647741e+00 4.175010e+01 9.177818e+02 4.071844e+03 2.074090e+04 [6] 5.064784e+04 1.106388e+05 2.097744e+05 The second dimension is associated to the MFI SDs: > MFIMeansSDs[,2] [1] 4.488664 19.995563 107.087508 233.277625 599.271541 1160.629743 [7] 2074.637101 3903.530374 4

6 Mean Fluorescence Intensity (MFI) to Molecules of Equivalent Soluble Fluorochrome (MESF) The function MFI2MESF converts the MFI means to MESF means. For instance, the MFI output of KmeansMeanSD are converted to MESF values. The SDs are also corrected for Illumination-Correction. The MFI associated with each micro-sphere population (cluster) is a function of the assigned MESF value for that micro-sphere population. Since the cytometer is a linear device, the recorded intensity of the fluorescence pulses (and hence the MFIs) should be correlated linearly with the MESF values, which in turn are proportional to the number of fluorophores on the micro-sphere, and hence to the fluorescence signal, see our maniscript [4]. This gives the relation MESF = p MF I, where p is a constant. The Q & B values will be defined in terms of the MESF values. For MESF calculation, the constant conversion between MFI and MESF is set to p = 357217.00 7102. MFIs are converted to MESFs without correcting the SDs as we set IllCorrCV = 0. > p <- 357217.00/7102 > MFI2MSEF=MFI2MESF(MFIMeansSDs,p,0) Note that MESF: MESF Mean and MESFV: MESF Variance (SD 2 or σ 2 ) and MF I2MSEF is now a 2D array having MESF Means and SDs. The first dimension is associated to the MESF Means: > MFI2MSEF[,1] [1] 4.349648e+02 2.099950e+03 4.616267e+04 2.048060e+05 1.043228e+06 [6] 2.547489e+06 5.564920e+06 1.055125e+07 The second dimension is associated to the MESF Variances: > MFI2MSEF[,2] [1] 4.095352e+04 9.647415e+05 2.894907e+07 1.376055e+08 9.084653e+08 [6] 3.407793e+09 1.088878e+10 3.854907e+10 7 Linear and Quadratic Regressions Let (c 0, c 1, c 2 ) = (σe 2, 1 K, σ2 S ) where σ2 E summarizes electronic sources of noise, K is the photon transfer constant for the PMTs and σs 2 incorporates intrinsic variance from both the cytometer instrument and the bead product, and any source of noise in general that depends linearly on the light signal. We need to estimate the coefficients (c 0, c 1, c 2 ) in the following quadratic model: SD 2 mesf = c 2 MESF 2 + c 1 MESF + c 0. (1) 5

The coefficients of this quadratic model form two quantities of interest: Q = 1 c 1 (detection efficiency) and B = c0 c 1 (optical background or background noise), see the work in [4]. The R function for regression analysis lm is used to fit linear regression in the function lrmesf and the quadratic regression in the function qrmesf. The functions lrmesf and qrmesf use the Means and SDs 2 in terms of the MESF values to generate estimate (c e 0, c e 1, c e 2). The Q & B are estimated as Q = 1 c and B = ce e 0 1 1 c. e 1 The MESF Means and SDs values in MF I2MSEF are used to compute the Q value using the two regression approaches: ˆ Linear Regression: Only bead categories Dim1 (Peak1), Dim2 (Peak2) and Dim3 (Peak3), their MESF Means and SDs values will be extracted from MF I2MSEF, are used to estimate Q & B values. SDs in MFI2MSEF should be correct for illumination. ˆ Quadratic regression: the bead categories Dim1 (Peak1), Dim2 (Peak2), Dim3 (Peak3) and Dim4 (Peak4), their MESF Means and SDs values will be extracted from MF I2MSEF, are used to estimate Q & B values. Note the Quadratic regression is processed without correcting the SDs. For quadratic Q and B calculation, the peaks of the cluster 3 to cluster 6 are extracted from MF I2MSEF and used in the quadratic regression: > QQB <- qrmesf(mfi2msef,3,6) > OV <- c(q=as.numeric(qqb[1]), B=as.numeric(QQB[2]), Rsquared=as.numeric(round(QQB[3], 2))) > OV[1] Q 0.001857138 > OV[2] B 15194.25 > OV[3] Rsquared 1 Note that c1 = 1 Q, c0 = B Q and c2 = sigmas2. For linear Q and B calculation, the peaks of the cluster 3 to cluster 5 are extracted from M F I2M SEF and used to compute the linear regression coefficients. As we need the illumination correction MFIs are converted to MESFs and SDs are corrected using the beads in cluster 8. > IllCorrCV <- MFIMeansSDs[8,2]/MFIMeansSDs[8,1] > MFI2MSEF <- MFI2MESF(MFIMeansSDs,p,IllCorrCV) > LQB <- lrmesf(mfi2msef,3,5) > OV <- c(q=as.numeric(lqb[1]), B=as.numeric(LQB[2]), Rsquared=as.numeric(round(LQB[3], 2))) > OV[1] 1 If c e 1 = 0, the functions will generate a stop( No linear term to calculate Q and B ) 6

Q 0.002003786 > OV[2] B 24735.08 > OV[3] Rsquared 1 > Note that c1 = 1 Q, c0 = B Q. 8 Discriminant Examination We examine the discriminant of the quadratic equation: SD 2 mesf = c e 2 MESF 2 + c e 1 MESF + c e 0. (2) Let = (c e 1) 2 4c e 0c e 2, there are two possible scenarios: 0 and < 0. The calculation of is implemented in the flowqb package, so that its estimate can be used as an additional interpretation tool to aid in understanding cytometer sensitivity: ˆ If 0, the larger the variation product of (σ E ) and (σ S ), the lower the upper bound on the detection efficiency Q. ˆ If < 0, the lower the variation product of (σ E ) and (σ S ), the greater the upper bound on the detection efficiency Q. For more details about the physical interpretation of values, see the work in [4]. Discriminant of the Quadratic Equation: > Coefs <- DiscriminantExamination(as.numeric(QQB[1]),as.numeric(QQB[2]),as.numeric(QQB[4])) > Coefs[1] c0 8181544 > Coefs[2] c1 538.4631 > Coefs[3] c2 0.0003124441 > Coefs[4] Delta 279717.4 7

> Delta <- Coefs[4] > if(delta >= 0) + { + cat(paste("the sign of the discriminant is positive with the value", round(delta,2), ". ") + cat("the larger the variation product of (sigmae2) and (sigmas2), ") + cat("the lower the upper bound on the detection efficiency Q. ") + } The sign of the discriminant is positive with the value 279717.38. The larger the variation product of (sigmae2) and (sigmas2), the lower the upper bound on the detection efficiency Q. > if(delta < 0) + { + cat(paste("the sign of the discriminant is negative with the value", round(delta,2), ". ") + cat("the lower the variation product of (sigmae2) and (sigmas2), ") + cat("the greater the upper bound on the detection efficiency Q. ") + } > 8

References [1] J. Wood, Fundamental Flow Cytometer Properties Governing Sensitivity and Resolution, Cytometry 33, (1998), p. 260-6. [2] R. Hoffman and J. Wood, Characterization of Flow Cytometer Instrument Sensitivity, Current Protocols in Cytometry, Chapter 1: Unit 1.20 (2007). [3] E. Chase and R. Hoffman, Resolution of Dimly Fluorescent Particles: a Practical Measure of Fluorescence Sensitivity, Cytometry 33 (1998), p. 267-279. [4] F. El Khettabi et al. 2013, Automated Quadratic Characterization of Flow Cytometer Instrument Sensitivit, to be submitted. 9