Analyzing Flow Cytometry Data with Bioconductor
|
|
|
- Elvin Tyler Edwards
- 9 years ago
- Views:
Transcription
1 Introduction Data Analysis Analyzing Flow Cytometry Data with Bioconductor Nolwenn Le Meur, Deepayan Sarkar, Errol Strain, Byron Ellis, Perry Haaland, Florian Hahne Fred Hutchinson Cancer Research Center 6 August 2007
2 Introduction Data Analysis Flow Cytometry ˆ High throughput, high volume, high dimensional data ˆ Software challenges ˆ implementing standard methods ˆ developing novel methods ˆ R + Bioconductor ˆ open source ˆ supports rapid prototyping ˆ large existing base of users
3 Introduction Data Analysis Bioconductor packages for FCM data ˆ Existing R packages: prada, rflowcyt ˆ underlying data structures different ˆ : attempt at standardization ˆ data input ˆ object model ˆ standard tools (gates, transformations) ˆ flowviz ˆ flexible visualization using infrastructure ˆ Potentially more packages ˆ building on top of the basic infrastructure ˆ implementing future developments
4 Introduction Data Analysis Goals of Lab Session ˆ Introduction to ˆ Some simple analysis and visualization
5 File formats ˆ Standard formats ˆ FCS 2, FCS 3 ˆ LMD (list mode) files ˆ can read all these formats ˆ read.fcs() single file ˆ read.fcsheader() header only ˆ read.flowset() collection of files
6 Reading in a file read.fcs() reads individual files > ff <- read.fcs("../data/ f06") > ff flowframe object with cells and 8 observables: <FSC-H> FSC-H <SSC-H> SSC-H <FL1-H> <FL2-H> <FL3-H> <FL1-A> <FL4- slot 'description' has 147 elements
7 > exprs(ff)[c(1:5, 9996:10000), c("fsc-h", "SSC-H", "FL1-H", "FL2-H", "Time")] FSC-H SSC-H FL1-H FL2-H Time [1,] [2,] [3,] [4,] [5,] [6,] [7,] [8,] [9,] [10,]
8 > pdata(parameters(ff)) name desc range $P1 FSC-H FSC-H 1024 $P2 SSC-H SSC-H 1024 $P3 FL1-H 1024 $P4 FL2-H 1024 $P5 FL3-H 1024 $P6 FL1-A <NA> 1024 $P7 FL4-H 1024 $P8 Time Time (51.20 sec.) 1024
9 > str(head(keyword(ff), 15)) List of 15 $ FCSversion: chr "2" $ $BYTEORD : chr "4,3,2,1" $ $DATATYPE : chr "I" $ $NEXTDATA : chr "0" $ $SYS : chr [1:4] "Macintosh" "System" "Software" "9.2.2" $ CREATOR : chr [1:2] "CELLQuest<aa>" "3.3" $ $TOT : chr "10000" $ $MODE : chr "L" $ $PAR : chr "8" $ $P1N : chr "FSC-H" $ $P1R : chr "1024" $ $P1B : chr "16" $ $P1E : chr "0,0" $ $P2N : chr "SSC-H" $ $P2R : chr "1024"
10 read.flowset() reads in a collection of files > fset <- read.flowset(...) > fset A flowset with 35 experiments. Reading in multiple files rownames: s5a01, s5a02,..., s10a07 (35 total) varlabels and varmetadata: Patient: Patient code Visit: Visit number...:... name: NA (5 total) column names: FSC-H SSC-H FL1-H FL2-H FL3-H FL2-A FL4-H Time
11 Individual frames can be accessed using [[ and $ > fset[[2]] flowframe object with 3405 cells and 8 observables: <FSC-H> FSC-Height <SSC-H> SSC-Height <FL1-H> CD15 FITC <FL2-H> C slot 'description' has 153 elements > fset$"s5a02" flowframe object with 3405 cells and 8 observables: <FSC-H> FSC-Height <SSC-H> SSC-Height <FL1-H> CD15 FITC <FL2-H> C slot 'description' has 153 elements
12 Subsets of a flowset can be extracted using [ > fset[1:5] A flowset with 5 experiments. rownames: s5a01, s5a02,..., s5a05 (5 total) varlabels and varmetadata: Patient: Patient code Visit: Visit number...:... name: NA (5 total) column names: FSC-H SSC-H FL1-H FL2-H FL3-H FL2-A FL4-H Time
13 fsapply(): applies function on all frames in a flowset > head(fsapply(fset, nrow)) [,1] s5a s5a s5a s5a s5a s5a
14 Phenodata manipulation > varmetadata(phenodata(fset)) labeldescription Patient Patient code Visit Visit number Days Days since graft Grade Grade (leukemia) name <NA> > varmetadata(phenodata(fset))["name", "labeldescription"] <- "File name" > phenodata(fset) rownames: s5a01, s5a02,..., s10a07 (35 total) varlabels and varmetadata: Patient: Patient code Visit: Visit number...:... name: File name (5 total)
15 Visualization ˆ has some basic plots ˆ flowviz has more flexible methods based on lattice
16 > plot(fset$"s10a06", c("ssc-h", "FSC-H")) FSC H SSC H
17 > xyplot(`fsc-h` ~ `SSC-H`, data = fset$"s10a06") FSC H SSC H
18 > xyplot(fset$"s10a06") Time FL1 H FL2 A FL2 H FL3 H FL4 H FSC H SSC H
19 > splom(fset$"s10a06") FL4 H FL2 A FL3 H FL2 H FL1 H SSC H FSC H Scatter Plot Matrix
20 Transformations ˆ Original scale often not very good for visualization ˆ The transform() function creates transformed versions > summary(transform(fset$"s10a06", asinh.fsc.h = asinh(`fsc-h`))) FSC-H SSC-H FL1-H FL2-H FL3-H FL2-A FL4-H Min st Qu Median Mean rd Qu Max asinh.fsc.h Min st Qu Median Mean rd Qu Max
21 Transformations > summary(transform("fsc-h" = asinh) %on% fset$s10a06) FSC-H SSC-H FL1-H FL2-H FL3-H FL2-A FL4-H Min st Qu Median Mean rd Qu Max
22 > s10a06 <- fset$"s10a06"[, 1:5] > splom(transform("fsc-h" = asinh, "SSC-H" = asinh, "FL1-H" = asinh, "FL2-H" = asinh, "FL3-H" = asinh) %on% s10a06) FL3 H FL2 H FL1 H SSC H FSC H Scatter Plot Matrix
23 { a x < a truncatetransform y = x x a scaletransform f (x) = x a b a lineartransform f (x) = a + bx quadratictransform f (x) = ax 2 + bx + c lntransform f (x) = log (x) r d Standard Transformations logtransform f (x) = log b (x) r d biexponentialtransform f 1 (x) = ae bx ce dx + f logicletransform A special form of the biexponential transform with parameters selected by the data. arcsinhtransform f (x) = asinh (a + bx) + c
24 Transforms can be applied on an entire flowframe > fset.trans <- transform("fsc-h" = asinh, "SSC-H" = asinh, "FL1-H" = asinh, "FL2-H" = asinh, "FL3-H" = asinh) %on% fset
25 Quality Assessment ˆ Conditional plots (lattice + flowviz) ˆ Numerical summaries
26 > ecdfplot(~ `FL2-H` Patient, fset.trans, groups = Visit) Empirical CDF FL2 H
27 > ecdfplot(~ `FL2-H` Visit, fset.trans, subset = Patient == "10") Empirical CDF FL2 H
28 Numerical quality measures > fsapply(fset.trans, each_col, IQR)[29:35, 1:4] FSC-H SSC-H FL1-H FL2-H s10a s10a s10a s10a s10a s10a s10a
29 > xyplot(`ssc-h` ~ `FL2-H` factor(days), fset.trans, subset = Patient == "10") SSC H FL2 H
30 Standard Filters (a.k.a. Gates) rectanglegate Cubic shape in one or more dimensions (e.g. interval gates) polygongate Two dimensional polygonal gate polytopegate Convex hull of a set of points, possibly in more than two dimensions ellipsoidgate Ellipsoidal region in two or more dimensions
31 Data-driven Filters norm2filter Finds a subregion that most resembles a bivariate Gaussian distribution kmeansfilter (Multiple) populations using one-dimensional k-means clustering
32 > xyplot(`ssc-h` ~ `FL2-H` factor(days), fset.trans, subset = Patient == "10", filter = norm2filter("ssc-h", "FL2-H")) SSC H FL2 H
33 > rect.gate <- rectanglegate("rangegate", "FL2-H" = c(4, 10)) > rect.results <- filter(fset.trans, rect.gate) > lapply(rect.results[29:35], summary) $s10a01 RangeGate: of (99.01%) $s10a02 RangeGate: 3609 of 4530 (79.67%) $s10a03 RangeGate: 2496 of 6765 (36.90%) $s10a04 RangeGate: 5402 of 9540 (56.62%) $s10a05 RangeGate: of (97.68%) $s10a06 RangeGate: of (95.70%)
34 > xyplot(`ssc-h` ~ `FL2-H` factor(days), subset = Patient == "10", data = Subset(fset.trans, rect.results), filter = norm2filter("ssc-h", "FL2-H", scale = 2)) SSC H FL2 H
THE BIOCONDUCTOR PACKAGE FLOWCORE, A SHARED DEVELOPMENT PLATFORM FOR FLOW CYTOMETRY DATA ANALYSIS IN R
THE BIOCONDUCTOR PACKAGE FLOWCORE, A SHARED DEVELOPMENT PLATFORM FOR FLOW CYTOMETRY DATA ANALYSIS IN R N. Le Meur 1,2, F. Hahne 1, R. Brinkman 3, B. Ellis 5, P. Haaland 4, D. Sarkar 1, J. Spidlen 3, E.
flowcore: data structures package for flow cytometry data
flowcore: data structures package for flow cytometry data N. Le Meur F. Hahne B. Ellis P. Haaland May 4, 2010 Abstract Background The recent application of modern automation technologies to staining and
flowcore: data structures package for flow cytometry data
flowcore: data structures package for flow cytometry data N. Le Meur F. Hahne B. Ellis P. Haaland August 20, 2015 Abstract Background The recent application of modern automation technologies to staining
Gates/filters in Flow Cytometry Data Visualization
Gates/filters in Flow Cytometry Data Visualization October 3, Abstract The flowviz package provides tools for visualization of flow cytometry data. This document describes the support for visualizing gates
FlowMergeCluster Documentation
FlowMergeCluster Documentation Description: Author: Clustering of flow cytometry data using the FlowMerge algorithm. Josef Spidlen, [email protected] Please see the gp-flowcyt-help Google Group (https://groups.google.com/a/broadinstitute.org/forum/#!forum/gpflowcyt-help)
COMPENSATION MIT Flow Cytometry Core Facility
COMPENSATION MIT Flow Cytometry Core Facility Why do we need compensation? 1) Because the long emission spectrum tail of dyes causes overlap like with the fluorophores FITC and PE. 2) For sensitivity reasons,
Automated Quadratic Characterization of Flow Cytometer Instrument Sensitivity (flowqb Package: Introductory Processing Using Data NIH))
Automated Quadratic Characterization of Flow Cytometer Instrument Sensitivity (flowqb Package: Introductory Processing Using Data NIH)) October 14, 2013 1 Licensing Under the Artistic License, you are
Using self-organizing maps for visualization and interpretation of cytometry data
1 Using self-organizing maps for visualization and interpretation of cytometry data Sofie Van Gassen, Britt Callebaut and Yvan Saeys Ghent University September, 2014 Abstract The FlowSOM package provides
flowtrans: A Package for Optimizing Data Transformations for Flow Cytometry
flowtrans: A Package for Optimizing Data Transformations for Flow Cytometry Greg Finak, Raphael Gottardo October 13, 2014 [email protected], [email protected] Contents 1 Licensing 2 2 Overview
Deep profiling of multitube flow cytometry data Supplemental information
Deep profiling of multitube flow cytometry data Supplemental information Kieran O Neill et al December 19, 2014 1 Table S1: Markers in simulated multitube data. The data was split into three tubes, each
GWAS Data Cleaning. GENEVA Coordinating Center Department of Biostatistics University of Washington. January 13, 2016.
GWAS Data Cleaning GENEVA Coordinating Center Department of Biostatistics University of Washington January 13, 2016 Contents 1 Overview 2 2 Preparing Data 3 2.1 Data formats used in GWASTools............................
DELPHI 27 V 2016 CYTOMETRY STRATEGIES IN THE DIAGNOSIS OF HEMATOLOGICAL DISEASES
DELPHI 27 V 2016 CYTOMETRY STRATEGIES IN THE DIAGNOSIS OF HEMATOLOGICAL DISEASES CLAUDIO ORTOLANI UNIVERSITY OF URBINO - ITALY SUN TZU (544 b.c. 496 b.c) SUN TZU (544 b.c. 496 b.c.) THE ART OF CYTOMETRY
INSIDE THE BLACK BOX
FLOW CYTOMETRY ESSENTIALS INSIDE THE BLACK BOX Alice L. Givan Englert Cell Analysis Laboratory of the Norris Cotton Cancer Center Dartmouth Medical School HOW NOT TO BE A FLOW CYTOMETRIST Drawing by Ben
CELL CYCLE BASICS. G0/1 = 1X S Phase G2/M = 2X DYE FLUORESCENCE
CELL CYCLE BASICS Analysis of a population of cells replication state can be achieved by fluorescence labeling of the nuclei of cells in suspension and then analyzing the fluorescence properties of each
Flow Cytometry for Everyone Else Susan McQuiston, J.D., MLS(ASCP), C.Cy.
Flow Cytometry for Everyone Else Susan McQuiston, J.D., MLS(ASCP), C.Cy. At the end of this session, the participant will be able to: 1. Describe the components of a flow cytometer 2. Describe the gating
Flow Data Analysis. Qianjun Zhang Application Scientist, Tree Star Inc. Oregon, USA FLOWJO CYTOMETRY DATA ANALYSIS SOFTWARE
Flow Data Analysis Qianjun Zhang Application Scientist, Tree Star Inc. Oregon, USA Flow Data Analysis From Cells to Data - data measurement and standard Data display and gating - Data Display options -
CELL CYCLE BASICS. G0/1 = 1X S Phase G2/M = 2X DYE FLUORESCENCE
CELL CYCLE BASICS Analysis of a population of cells replication state can be achieved by fluorescence labeling of the nuclei of cells in suspension and then analyzing the fluorescence properties of each
Science is hard. Flow cytometry should be easy.
Science is hard. Flow cytometry should be easy. TABLE OF CONTENTS 1 INTRODUCTION TO BD ACCURI C6 SOFTWARE... 1 1.1 Starting BD Accuri C6 Software... 1 1.2 BD Accuri C6 Software Workspace... 2 1.3 Opening
CyAn : 11 Parameter Desktop Flow Cytometer
CyAn : 11 Parameter Desktop Flow Cytometer Cyan ADP 3 excitation lines 488nm, 635nm, and UV or violet 11 simultaneous parameters FSC, SSC, and 7-9 colors with simultaneous width, peak, area, and log on
R Language Fundamentals
R Language Fundamentals Data Types and Basic Maniuplation Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Where did R come from? Overview Atomic Vectors Subsetting
Immunophenotyping Flow Cytometry Tutorial. Contents. Experimental Requirements...1. Data Storage...2. Voltage Adjustments...3. Compensation...
Immunophenotyping Flow Cytometry Tutorial Contents Experimental Requirements...1 Data Storage...2 Voltage Adjustments...3 Compensation...5 Experimental Requirements For immunophenotyping with FITC and
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
--Verity Software House, ModFit for Proliferation http://www.vsh.com/ offers online tutorials
Customers may want to use another analysis software package with data collected on Attune. Reason for this include: they already have the SW and are confortable with using it, their institution may provide
No-wash, no-lyse detection of leukocytes in human whole blood on the Attune NxT Flow Cytometer
APPLICATION NOTE Attune NxT Flow Cytometer No-wash, no-lyse detection of leukocytes in human whole blood on the Attune NxT Flow Cytometer Introduction Standard methods for isolating and detecting leukocytes
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
Instructions for Use. CyAn ADP. High-speed Analyzer. Summit 4.3. 0000050G June 2008. Beckman Coulter, Inc. 4300 N. Harbor Blvd. Fullerton, CA 92835
Instructions for Use CyAn ADP High-speed Analyzer Summit 4.3 0000050G June 2008 Beckman Coulter, Inc. 4300 N. Harbor Blvd. Fullerton, CA 92835 Overview Summit software is a Windows based application that
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
APPLICATION INFORMATION
DRAFT: Rev. D A-2045A APPLICATION INFORMATION Flow Cytometry 3-COLOR COMPENSATION Raquel Cabana,* Mark Cheetham, Jay Enten, Yong Song, Michael Thomas,* and Brendan S. Yee Beckman Coulter, Inc., Miami FL
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3
COMP 5318 Data Exploration and Analysis Chapter 3 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping
Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum
Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence Module Curriculum This document addresses the content related abilities, with reference to the module.
Classes and Methods for Spatial Data: the sp Package
Classes and Methods for Spatial Data: the sp Package Edzer Pebesma Roger S. Bivand Feb 2005 Contents 1 Introduction 2 2 Spatial data classes 2 3 Manipulating spatial objects 3 3.1 Standard methods..........................
Immunophenotyping peripheral blood cells
IMMUNOPHENOTYPING Attune Accoustic Focusing Cytometer Immunophenotyping peripheral blood cells A no-lyse, no-wash, no cell loss method for immunophenotyping nucleated peripheral blood cells using the Attune
BD FACSDiva 4.1 - TUTORIAL TSRI FLOW CYTOMETRY CORE FACILITY
BD FACSDiva 4.1 - TUTORIAL TSRI FLOW CYTOMETRY CORE FACILITY IMPORTANT NOTES BEFORE READING THIS TUTORIAL This is a very expensive piece of equipment so PLEASE treat it with respect! After you are done
GETTING STARTED. 6. Click the New Specimen icon.
BD FACSDiva Software Basic Experiment Guide Please note, this guide is not intended to take the place of in-person training but is to be used as a reference as needed after completing training with Unified
Technical Bulletin. Threshold and Analysis of Small Particles on the BD Accuri C6 Flow Cytometer
Threshold and Analysis of Small Particles on the BD Accuri C6 Flow Cytometer Contents 2 Thresholds 2 Setting the Threshold When analyzing small particles, defined as particles smaller than 3.0 µm, on the
Computation of the Aggregate Claim Amount Distribution Using R and actuar. Vincent Goulet, Ph.D.
Computation of the Aggregate Claim Amount Distribution Using R and actuar Vincent Goulet, Ph.D. Actuarial Risk Modeling Process 1 Model costs at the individual level Modeling of loss distributions 2 Aggregate
Compensation Controls Data Visualization Panel Development Gating Quantum Dots
Compensation Controls Data Visualization Panel Development Gating Quantum Dots FITC Single Stain Control FITC PE Argon Laser FL1 FL2 450 500 550 600 FITC Compensation Control Unwanted signal detected in
1 Descriptive statistics: mode, mean and median
1 Descriptive statistics: mode, mean and median Statistics and Linguistic Applications Hale February 5, 2008 It s hard to understand data if you have to look at it all. Descriptive statistics are things
FACSCanto RUO Special Order QUICK REFERENCE GUIDE
FACSCanto RUO Special Order QUICK REFERENCE GUIDE INSTRUMENT: 1. The computer is left on at all times. Note: If not Username: Administrator Password: BDIS 2. Unlock the screen with your PPMS account (UTSW
LEGENDplex Data Analysis Software
LEGENDplex Data Analysis Software Version 7.0 User Guide Copyright 2013-2014 VigeneTech. All rights reserved. Contents Introduction... 1 Lesson 1 - The Workspace... 2 Lesson 2 Quantitative Wizard... 3
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
Package MBA. February 19, 2015. Index 7. Canopy LIDAR data
Version 0.0-8 Date 2014-4-28 Title Multilevel B-spline Approximation Package MBA February 19, 2015 Author Andrew O. Finley , Sudipto Banerjee Maintainer Andrew
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
Multiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
Final Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA
Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information
Katharina Lückerath (AG Dr. Martin Zörnig) adapted from Dr. Jörg Hildmann BD Biosciences,Customer Service
Introduction into Flow Cytometry Katharina Lückerath (AG Dr. Martin Zörnig) adapted from Dr. Jörg Hildmann BD Biosciences,Customer Service How does a FACS look like? FACSCalibur FACScan What is Flow Cytometry?
DEA implementation and clustering analysis using the K-Means algorithm
Data Mining VI 321 DEA implementation and clustering analysis using the K-Means algorithm C. A. A. Lemos, M. P. E. Lins & N. F. F. Ebecken COPPE/Universidade Federal do Rio de Janeiro, Brazil Abstract
TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:
Table of contents: Access Data for Analysis Data file types Format assumptions Data from Excel Information links Add multiple data tables Create & Interpret Visualizations Table Pie Chart Cross Table Treemap
BD CellQuest Pro Software Analysis Tutorial
BD CellQuest Pro Analysis Tutorial This tutorial guides you through an analysis example using BD CellQuest Pro software. If you are already familiar with BD CellQuest Pro software on Mac OS 9, refer to
Normalization of RNA-Seq
Normalization of RNA-Seq Davide Risso Modified: April 27, 2012. Compiled: April 27, 2012 1 Retrieving the data Usually, an RNA-Seq data analysis from scratch starts with a set of FASTQ files (see e.g.
Classification and Regression by randomforest
Vol. 2/3, December 02 18 Classification and Regression by randomforest Andy Liaw and Matthew Wiener Introduction Recently there has been a lot of interest in ensemble learning methods that generate many
Introduction to the BioConductor framework Algorithmic Analysis of Flow Cytometry Data (Part 1)
Introduction to the BioConductor framework Algorithmic Analysis of Flow Cytometry Data (Part 1) Ryan Brinkman Senior Scientist, BC Cancer Agency Associate Professor, Department of Medical Genetics, UBC
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method
The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method Robert M. Freund February, 004 004 Massachusetts Institute of Technology. 1 1 The Algorithm The problem
Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees
Statistical Data Mining Practical Assignment 3 Discriminant Analysis and Decision Trees In this practical we discuss linear and quadratic discriminant analysis and tree-based classification techniques.
Analysis of Bead-summary Data using beadarray
Analysis of Bead-summary Data using beadarray Mark Dunning October 13, 2015 Contents 1 Introduction 2 2 feature and pheno data 3 3 Subsetting the data 4 4 Exploratory analysis using boxplots 7 4.1 A note
Fluorescence Compensation. Jennifer Wilshire, PhD [email protected]
Fluorescence Compensation Jennifer Wilshire, PhD [email protected] Outline 1. Compensation Basics 2. Practical Applications -selecting comp controls (cells, stains?) 3. Multicolor Issues -spreading due
Technical Bulletin. Standardizing Application Setup Across Multiple Flow Cytometers Using BD FACSDiva Version 6 Software. Abstract
arch 212 Standardizing Application Setup Across ultiple Flow Cytometers Using BD FACSDiva Version 6 Ellen einelt, ervi Reunanen, ark Edinger, aria Jaimes, Alan Stall, Dennis Sasaki, Joe Trotter Contents
Descriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
Microsoft Excel 2010 Charts and Graphs
Microsoft Excel 2010 Charts and Graphs Email: [email protected] Web Page: http://training.health.ufl.edu Microsoft Excel 2010: Charts and Graphs 2.0 hours Topics include data groupings; creating
123count ebeads Catalog Number: 01-1234 Also known as: Absolute cell count beads GPR: General Purpose Reagents. For Laboratory Use.
Page 1 of 1 Catalog Number: 01-1234 Also known as: Absolute cell count beads GPR: General Purpose Reagents. For Laboratory Use. Normal human peripheral blood was stained with Anti- Human CD45 PE (cat.
Selected Topics in Electrical Engineering: Flow Cytometry Data Analysis
Selected Topics in Electrical Engineering: Flow Cytometry Data Analysis Bilge Karaçalı, PhD Department of Electrical and Electronics Engineering Izmir Institute of Technology Outline Compensation and gating
An Application of Weibull Analysis to Determine Failure Rates in Automotive Components
An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department
Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode
Iris Sample Data Set Basic Visualization Techniques: Charts, Graphs and Maps CS598 Information Visualization Spring 2010 Many of the exploratory data techniques are illustrated with the Iris Plant data
Exploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
Anchorage School District/Alaska Sr. High Math Performance Standards Algebra
Anchorage School District/Alaska Sr. High Math Performance Standards Algebra Algebra 1 2008 STANDARDS PERFORMANCE STANDARDS A1:1 Number Sense.1 Classify numbers as Real, Irrational, Rational, Integer,
Structural Health Monitoring Tools (SHMTools)
Structural Health Monitoring Tools (SHMTools) Getting Started LANL/UCSD Engineering Institute LA-CC-14-046 c Copyright 2014, Los Alamos National Security, LLC All rights reserved. May 30, 2014 Contents
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis What is Hierarchical Clustering? Hierarchical clustering is used to group similar objects into clusters. In the beginning, each row and/or column is considered a cluster.
Deploying Predictive Analytics Solutions Dr. Stephan Gerali Lockheed Martin Dr. Rafael Pacheco SAP
Deploying Predictive Analytics Solutions Dr. Stephan Gerali Lockheed Martin Dr. Rafael Pacheco SAP SESSION CODE: BI1521 How Lockheed Martin Space Systems Uses Predictive Analytics to Forecast Supply Chain
Navios Quick Reference
Navios Quick Reference Purpose: The purpose of this Quick Reference is to provide a simple step by step outline of the information needed to perform various tasks on the system. We begin with basic tasks
These particles have something in common
These particles have something in common Blood cells Chromosomes Algae Protozoa Certain parameters of these particles can be measured with a flow cytometer Which parameters can be measured? the relative
R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol
R Graphics Cookbook Winston Chang Beijing Cambridge Farnham Koln Sebastopol O'REILLY Tokyo Table of Contents Preface ix 1. R Basics 1 1.1. Installing a Package 1 1.2. Loading a Package 2 1.3. Loading a
FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM
International Journal of Innovative Computing, Information and Control ICIC International c 0 ISSN 34-48 Volume 8, Number 8, August 0 pp. 4 FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT
BD FACSComp Software Tutorial
BD FACSComp Software Tutorial This tutorial guides you through a BD FACSComp software lyse/no-wash assay setup run. If you are already familiar with previous versions of BD FACSComp software on Mac OS
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS 1 AND ALGORITHMS Chiara Renso KDD-LAB ISTI- CNR, Pisa, Italy WHAT IS CLUSTER ANALYSIS? Finding groups of objects such that the objects in a group will be similar
POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.
Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression
MODULE 7: FINANCIAL REPORTING AND ANALYSIS
MODULE 7: FINANCIAL REPORTING AND ANALYSIS Module Overview Businesses running ERP systems capture lots of data through daily activity. This data, which reflects such things as the organization's sales
Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
Application Information Bulletin: Set-Up of the CytoFLEX Set-Up of the CytoFLEX* for Extracellular Vesicle Measurement
Application Information Bulletin: Set-Up of the CytoFLEX Set-Up of the CytoFLEX* for Extracellular Vesicle Measurement Andreas Spittler, MD, Associate Professor for Pathophysiology, Medical University
Scientific Visualization with ParaView
Scientific Visualization with ParaView Geilo Winter School 2016 Andrea Brambilla (GEXCON AS, Bergen) Outline Part 1 (Monday) Fundamentals Data Filtering Part 2 (Tuesday) Time Dependent Data Selection &
A survey analysis example
A survey analysis example Thomas Lumley November 20, 2013 This document provides a simple example analysis of a survey data set, a subsample from the California Academic Performance Index, an annual set
The NIAID Flow Cytometry Advisory Committee; the Guidelines Subcommittee
January, 1999 From: To: Subject: The NIAID Flow Cytometry Advisory Committee; the Guidelines Subcommittee NIAID DAIDS Flow Cytometry Laboratories Comparison study information for labs wishing to switch
Visualization and descriptive statistics. D.A. Forsyth
Visualization and descriptive statistics D.A. Forsyth What s going on here? Most important, most creative scientific question Getting answers Make helpful pictures and look at them Compute numbers in support
Package retrosheet. April 13, 2015
Type Package Package retrosheet April 13, 2015 Title Import Professional Baseball Data from 'Retrosheet' Version 1.0.2 Date 2015-03-17 Maintainer Richard Scriven A collection of tools
Linear Programming. March 14, 2014
Linear Programming March 1, 01 Parts of this introduction to linear programming were adapted from Chapter 9 of Introduction to Algorithms, Second Edition, by Cormen, Leiserson, Rivest and Stein [1]. 1
Graphs. Exploratory data analysis. Graphs. Standard forms. A graph is a suitable way of representing data if:
Graphs Exploratory data analysis Dr. David Lucy [email protected] Lancaster University A graph is a suitable way of representing data if: A line or area can represent the quantities in the data in
Data Exploration and Preprocessing. Data Mining and Text Mining (UIC 583 @ Politecnico di Milano)
Data Exploration and Preprocessing Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
Introduction to Data Visualization
Introduction to Data Visualization STAT 133 Gaston Sanchez Department of Statistics, UC Berkeley gastonsanchez.com github.com/gastonstat/stat133 Course web: gastonsanchez.com/teaching/stat133 Graphics
Contents WEKA Microsoft SQL Database
WEKA User Manual Contents WEKA Introduction 3 Background information. 3 Installation. 3 Where to get WEKA... 3 Downloading Information... 3 Opening the program.. 4 Chooser Menu. 4-6 Preprocessing... 6-7
From The Little SAS Book, Fifth Edition. Full book available for purchase here.
From The Little SAS Book, Fifth Edition. Full book available for purchase here. Acknowledgments ix Introducing SAS Software About This Book xi What s New xiv x Chapter 1 Getting Started Using SAS Software
