Functional Data Analysis of MALDI TOF Protein Spectra


1 Functional Data Analysis of MALDI TOF Protein Spectra Dean Billheimer Department of Biostatistics Vanderbilt University Vanderbilt Ingram Cancer Center FDA for MALDI TOF MS p.1/43
2 Outline
Overview of MALDI TOF Mass Spectrometry
Characteristics of Spectral Signals
Standard Analysis and Some Problems
Analysis of Spectra as Functions
Analysis of Glioma Proteins
Extending FDA for Mass Spectra (coming attractions)
Summary

3 MALDI TOF Mass Spectrometry
Emerging as a key technology in proteomics (Nobel Prize 2002).
Proposed for cancer screening, diagnosis, and treatment.
Tremendous promise for protein profiling.
Matrix Assisted Laser Desorption Ionization: a method of generating ions from large biomolecules (proteins!).
  A chemical matrix is added to the sample to enhance ion formation.
  Pulsed laser light vaporizes and ionizes biomolecules from the sample.
  An electric field accelerates the ions and directs them into the mass analyzer.
Time Of Flight: separates ions based on size (mass/charge).
  Small molecules are fast (short travel time).
  Large molecules are slow (long travel time).

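
The time-of-flight idea can be made concrete with the ideal linear TOF relation: an ion of mass m and charge ze accelerated through a potential V gains kinetic energy zeV = (1/2)mv^2, so its drift time over a length L is t = L * sqrt(m / (2zeV)). The sketch below evaluates that relation. The 20 kV accelerating voltage and 1 m drift length are illustrative assumptions, not values from the talk, and a real instrument adds delays and corrections that are ignored here.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge in coulombs
DALTON = 1.66053906660e-27   # one dalton in kilograms


def flight_time(mass_da, charge=1, voltage=20_000.0, length_m=1.0):
    """Ideal drift time (seconds) of an ion in a linear TOF analyzer.

    voltage and length_m are hypothetical instrument settings.
    """
    mass_kg = mass_da * DALTON
    energy = charge * E_CHARGE * voltage          # kinetic energy after acceleration
    velocity = math.sqrt(2.0 * energy / mass_kg)  # from E = (1/2) m v^2
    return length_m / velocity


for mass in (6_000, 66_000):
    print(f"{mass:>6} Da: {flight_time(mass) * 1e6:.1f} microseconds")
```

Because flight time grows with the square root of mass/charge, an 11-fold heavier ion takes only about 3.3 times longer to arrive, which is one reason resolution depends on mass.
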
4 MALDI TOF MS Schematic [Figure: laser, sample and matrix, ion beam, time-of-flight analyzer, detector.]
5 MALDI TOF Spectrum: Normal White Matter [Figure: intensity versus mass/charge.]
6 MALDI TOF Spectra: Normal White Matter [Figure: intensity versus mass/charge; legend entries: Normal 1, Normal.]
7 Pros/Cons of MALDI TOF MS
Advantages
  Can be used for tissue, serum, or other biological samples.
  Measures proteins directly.
  Proteins remain intact (vs. other methods).
  Allows measurement of many proteins simultaneously.
Disadvantages
  Signal can be complicated.
  Molecules are identified only by mass/charge.
  Ion detection is mass dependent: 10-fold more efficient at 6 kDa than at 66 kDa.
  Resolution is mass dependent.

8 Characteristics of Spectral Signals
Fundamental Premise: At a given mass/charge value, the mean intensity is proportional to the relative amount of protein at that mass/charge value. (see graph)
This may be difficult to detect in individual spectra because of nuisance variation:
  sample matrix heterogeneity (intensity)
  chemical noise, protein fragments, salts, fats (baseline)
  detector output characteristics and sensitivity
  other sources of error (noise)
Need good signal normalization! (see graph)

9 Statistical Issues of MALDI TOF Spectra
Highly multivariate!
Structured signal: intensity is a function of mass/charge.
Variance (and higher moments) related to intensity (and mass/charge).
Nuisance variation (for each spectrum)
  baseline adjustment
  intensity scaling
Model identification issues.
Incidental parameter problem (Neyman and Scott, 1948)

10 Survey of Standard Analysis of MALDI Spectra
Within each spectrum
  Smoothing (de-noising) and baseline correction
  Mass assignment (registration, calibration)
  Intensity normalization (nonlinear transformation)
  Peak detection from the smoothed spectrum to create a peak list
Across multiple spectra
  Peak binning: identify homologous peaks (nearby mass/charge values).
  Use binned peak list intensities in a classification/clustering algorithm to segregate (known) biological samples.
  Test the classifier on independent data to assess predictive performance.

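
The peak-list workflow described on this slide can be sketched in a few functions. This is a minimal illustration, not the software shipped with any spectrometer: the moving-average smoother, the fixed intensity threshold, the chained m/z tolerance used for binning, and the toy intensities are all assumptions chosen for brevity.

```python
def moving_average(values, half_window=2):
    """Smooth a list of intensities with a centered moving average."""
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half_window)
        hi = min(len(values), i + half_window + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed


def detect_peaks(mz, intensity, threshold):
    """Return (m/z, intensity) pairs for local maxima above a fixed threshold."""
    peaks = []
    for i in range(1, len(intensity) - 1):
        is_max = intensity[i - 1] < intensity[i] >= intensity[i + 1]
        if is_max and intensity[i] > threshold:
            peaks.append((mz[i], intensity[i]))
    return peaks


def bin_peaks(peak_lists, tolerance):
    """Group peaks from several spectra whose m/z values lie within tolerance.

    Each bin holds (m/z, intensity, spectrum index) triples.
    """
    tagged = sorted(
        (m, inten, idx)
        for idx, peaks in enumerate(peak_lists)
        for m, inten in peaks
    )
    bins = []
    for m, inten, idx in tagged:
        # Start a new bin when the gap to the previous peak exceeds the tolerance.
        if bins and m - bins[-1][-1][0] <= tolerance:
            bins[-1].append((m, inten, idx))
        else:
            bins.append([(m, inten, idx)])
    return bins


# Toy spectra on a shared m/z grid (hypothetical numbers).
mz = [7600 + 10 * i for i in range(9)]
spec_a = [1, 2, 9, 2, 1, 1, 6, 1, 1]
spec_b = [1, 1, 2, 8, 2, 1, 5, 1, 1]
peaks = [detect_peaks(mz, s, threshold=4) for s in (spec_a, spec_b)]
bins = bin_peaks(peaks, tolerance=15)
```

In the toy data the first peak sits one grid step apart in the two spectra, so whether it lands in a single bin depends entirely on the tolerance.
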
11 Concerns with Standard Analysis
Within a spectrum
  Mass registration is subject to error (magnitude increases with distance from control points).
  Smoothing goals and criteria are unclear (usually done by the software shipped with the spectrometer).
  What is baseline? (How is it defined?)
  Peak detection: how is a peak defined? Often based on S/N (but both of these change with mass/charge).
More fundamental concern
  Assumes all relevant information is captured by peak location and intensity: huge data reduction, loss of information. (see graph)

12 More Concerns...
Combining information across multiple spectra
  Errors in peak detection and/or mass assignment lead to binning problems. (see graph)
  Tends to omit small peaks that are consistently expressed.
Classification algorithm
  Ignores the ordering inherent in the data (mass/charge scale).
  Ignores all inference goals except classification/clustering.
Each step proceeds conditionally on all preceding steps (no acknowledgement of uncertainty).

13 Brief Introduction to Functional Data Analysis (Ramsay and Silverman, 1997)
Functional data: the fundamental unit of observation is a curve (function).
  a patient's hormone profile (through time)
  electrical potential of a neuron measured through time
  spectra (mass, Raman, fluorescence, and otherwise)
IDEA: We are measuring a function (often at discrete sample points), and would like to treat the function as the observation.
ADVANTAGE: We incorporate into the analysis structural constraints (e.g., continuity, smoothness) that are present in the data.

14 Steps in FDA
Data representation: convert sample points to functional form
  select a functional basis (e.g., B-spline, Fourier, wavelet)
  project sample points onto the basis space
  ensuing calculations involve the basis coefficients
  same methods as smoothing (but smoothing is not the goal)
Data registration or feature alignment
Data display
Calculation of summary statistics
Statistical modeling

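
The data representation step (choose a basis, project the sample points, keep the coefficients) can be sketched as follows. This is a small pure-Python illustration, assuming a clamped cubic B-spline basis evaluated with the Cox-de Boor recursion and an unpenalized least-squares fit through the normal equations. The knot vector, the helper names, and the test curve are hypothetical, and a real analysis would use a numerical library and far more basis functions.

```python
def bspline_basis(x, knots, degree):
    """Evaluate all B-spline basis functions at x (Cox-de Boor recursion)."""
    # Degree-0 functions are indicators of the half-open knot intervals.
    values = [
        1.0 if knots[i] <= x < knots[i + 1] else 0.0
        for i in range(len(knots) - 1)
    ]
    if x == knots[-1]:
        # Close the last non-empty interval so the right endpoint is covered.
        for i in range(len(knots) - 2, -1, -1):
            if knots[i] < knots[i + 1]:
                values[i] = 1.0
                break
    for d in range(1, degree + 1):
        new = []
        for i in range(len(knots) - d - 1):
            left_den = knots[i + d] - knots[i]
            right_den = knots[i + d + 1] - knots[i + 1]
            left = (x - knots[i]) / left_den * values[i] if left_den > 0 else 0.0
            right = (
                (knots[i + d + 1] - x) / right_den * values[i + 1]
                if right_den > 0 else 0.0
            )
            new.append(left + right)
        values = new
    return values


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def project(xs, ys, knots, degree):
    """Least-squares basis coefficients for the samples (xs, ys)."""
    phi = [bspline_basis(x, knots, degree) for x in xs]
    k = len(phi[0])
    gram = [[sum(row[i] * row[j] for row in phi) for j in range(k)] for i in range(k)]
    rhs = [sum(row[i] * y for row, y in zip(phi, ys)) for i in range(k)]
    return solve(gram, rhs)


knots = [0, 0, 0, 0, 0.5, 1, 1, 1, 1]          # clamped cubic, 5 basis functions
xs = [i / 20 for i in range(21)]
ys = [1 + 2 * x - x * x for x in xs]            # toy "spectrum"
coeffs = project(xs, ys, knots, 3)
fitted = [
    sum(c * b for c, b in zip(coeffs, bspline_basis(x, knots, 3))) for x in xs
]
```

After this step each curve is summarized by its coefficient vector, and the later calculations operate on those coefficients rather than on the raw sample points.
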
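15 Descriptive Statistics
Let $x_i(t)$, $i = 1, \ldots, N$, be an observed function.
The estimated mean function:
$$\bar{x}(t) = \frac{1}{N} \sum_{i=1}^{N} x_i(t)$$
The estimated variance function:
$$\mathrm{var}_X(t) = \frac{1}{N-1} \sum_{i=1}^{N} \left[ x_i(t) - \bar{x}(t) \right]^2$$
Covariance and correlation functions:
$$\mathrm{cov}_X(t_1, t_2) = \frac{1}{N-1} \sum_{i=1}^{N} \left[ x_i(t_1) - \bar{x}(t_1) \right] \left[ x_i(t_2) - \bar{x}(t_2) \right]$$
$$\mathrm{corr}_X(t_1, t_2) = \frac{\mathrm{cov}_X(t_1, t_2)}{\sqrt{\mathrm{var}_X(t_1)\, \mathrm{var}_X(t_2)}}$$
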
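
For curves sampled on a common grid, the mean, covariance, and correlation functions named on this slide reduce to pointwise sums. The sketch below assumes the N - 1 divisor used by Ramsay and Silverman and uses three made-up curves; the function names are illustrative only.

```python
def mean_function(curves):
    """Pointwise mean of curves sampled on a common grid."""
    n = len(curves)
    return [sum(col) / n for col in zip(*curves)]


def covariance_function(curves):
    """Sample covariance surface cov(t1, t2), using the N - 1 divisor."""
    n = len(curves)
    mean = mean_function(curves)
    centered = [[v - m for v, m in zip(curve, mean)] for curve in curves]
    k = len(mean)
    return [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(k)]
        for a in range(k)
    ]


def correlation_function(curves):
    """Correlation surface; the diagonal of the covariance is the variance."""
    cov = covariance_function(curves)
    k = len(cov)
    return [
        [cov[a][b] / (cov[a][a] * cov[b][b]) ** 0.5 for b in range(k)]
        for a in range(k)
    ]


curves = [[1, 2, 3], [2, 4, 4], [3, 6, 8]]   # hypothetical sampled curves
mean = mean_function(curves)
cov = covariance_function(curves)
corr = correlation_function(curves)
```

The diagonal of `cov` is the variance function, and `corr` is the kind of surface an autocorrelation plot of spectra displays. A grid point with zero variance would need special handling, which this sketch omits.
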
16 A Functional Linear Model
Usual linear model:
$$y = X\beta + \varepsilon$$
where $X$ is an $n \times p$ design matrix and $\beta$ is a vector of unknown coefficients. The usual parameter estimator is
$$\hat{\beta} = (X^{\top} X)^{-1} X^{\top} y$$
In a functional model (FANOVA):
$$y(t) = X\beta(t) + \varepsilon(t)$$
where $y$, $\beta$, and $\varepsilon$ are functions, but $X$ is the same as before.

17 Basis Function Representation
Represent the observations via a basis function expansion
$$y_i(t) = \sum_{k=1}^{K} c_{ik}\, \phi_k(t)$$
where the $\phi_k$ are basis functions covering the domain of $t$, and the $c_{ik}$ are coefficients. More compactly,
$$y(t) = C\phi(t)$$
where $C$ is the matrix of basis function coefficients. Now the FANOVA estimator is
$$\hat{\beta}(t) = (X^{\top} X)^{-1} X^{\top} C\phi(t)$$

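
For a one-way design (for example, tissue groups), the functional linear model can be examined point by point: at each grid value the fit reduces to ordinary one-way ANOVA, and the F statistic traces out a curve. The sketch below is a pointwise illustration on sampled curves, not the basis-coefficient computation itself; the function name and the toy data are assumptions.

```python
def pointwise_f(groups):
    """One-way ANOVA F statistic at each evaluation point.

    groups: list of groups, each a list of curves sampled on a common grid.
    """
    n_total = sum(len(g) for g in groups)
    n_groups = len(groups)
    n_points = len(groups[0][0])
    f_values = []
    for t in range(n_points):
        all_vals = [curve[t] for g in groups for curve in g]
        grand = sum(all_vals) / n_total
        ss_between = 0.0
        ss_within = 0.0
        for g in groups:
            vals = [curve[t] for curve in g]
            mean = sum(vals) / len(vals)
            ss_between += len(vals) * (mean - grand) ** 2
            ss_within += sum((v - mean) ** 2 for v in vals)
        ms_between = ss_between / (n_groups - 1)
        ms_within = ss_within / (n_total - n_groups)
        f_values.append(ms_between / ms_within)
    return f_values


# Two hypothetical groups of three curves, each sampled at two grid points.
groups = [
    [[1, 5], [2, 6], [3, 7]],
    [[4, 5], [5, 7], [6, 6]],
]
f_curve = pointwise_f(groups)
```

With g groups and n curves the statistic has (g - 1, n - g) degrees of freedom. A grid point where every group has zero within-group variation would divide by zero, which this sketch does not guard against.
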
18 Other (* easy *) Operations in FDA
Functional principal components analysis
Functional linear modeling
  Functional ANOVA: observations and parameters are functions (standard design matrix)
  Scalar response variable and functional independent variable
  All model terms are functional
Functional canonical correlation
Differential operators and analysis
** Thanks to Jim Ramsay for making available code for FDA.

19 Glioma Protein Analysis
Glioma is a type of tumor found in the brain's white matter (infiltrating tumor cells).
Four stages defined by tissue pathology.
Stage progression is not well understood.
Compare resected tumor tissue with normal white matter from lobectomy patients.
Interest in identifying protein markers of stage.

20 Analysis of Brain Tissue Mass Spectra
Data from normal and tumor tissue specimens.
Tissue cross section mounted to MALDI plate (IMS prep).
Mass (per charge) range from 2000 to [...] Da/z.
Focus on limited mass range: 7600 to 8000 Da/z.
35 patients (7 normal, 8 grade II, 9 grade III, 11 grade IV).
Use B-spline basis with 120 basis functions ([...] data values).
Thanks to Sarah Schwarz in Vanderbilt MSRC for providing data.

21 Spectrum Normalization
Piecewise linear baseline correction.
Scaling by regression against a standard spectrum.
Global Box-Cox transformation based on sampling replicate spectra.
In the normalization formula, [...] is the baseline correction, [...] is a scaling coefficient, and [...] is the Box-Cox parameter ([...] in the following analysis).

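
The three normalization ingredients named on this slide can be combined in a short sketch. The Box-Cox transform itself is standard; the order of operations (subtract baseline, divide by the scaling coefficient, then transform), the function names, and all numeric inputs are assumptions, since the slide's own formula did not survive transcription.

```python
import math


def box_cox(value, lam):
    """Box-Cox transform of a positive value; lam == 0 gives the log."""
    if value <= 0:
        raise ValueError("Box-Cox requires positive values")
    if lam == 0:
        return math.log(value)
    return (value ** lam - 1.0) / lam


def normalize(intensity, baseline, scale, lam):
    """Subtract the baseline, divide by the scale, then apply Box-Cox.

    This ordering is an assumed reading of the slide, not a confirmed formula.
    """
    return [box_cox((y - b) / scale, lam) for y, b in zip(intensity, baseline)]


# Hypothetical raw intensities, baseline estimate, scale, and parameter.
normalized = normalize([5.0, 10.0], [1.0, 2.0], scale=4.0, lam=1.0)
```

Because the transform requires positive inputs, any baseline estimate that exceeds the observed intensity has to be handled before this step.
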
22 Autocorrelation of Spectra [Figure.]
23 Functional Analysis of Variance [Figure: F statistic (3, 31) versus mass/charge.]
24 Group Means [Figure: normalized intensity versus mass/charge; legend: Normal, Grade 2, Grade 3, Grade 4.]
25 Key Points from Glioma Protein Spectra Analysis
Identify regions exhibiting differential protein expression.
Some of these regions would be difficult to find via peak selection.
Autocorrelation plot suggests a method for identifying different forms of a single protein.

26 Next New Thing
Currently the following steps are performed sequentially:
1. smooth (or de-noise) spectrum
2. estimate and remove baseline
3. normalize
4. peak selection
5. do actual analysis
Each step depends on all preceding steps:
  any error is propagated forward
  any uncertainty is ignored
Instead, try simultaneous modeling of the (believed) components of spectra.

27 Spectrum Decomposition [Figure: a spectrum decomposed into baseline, group-specific signal, and spectrum-specific signal.]
28 Spectrum Decomposition via Bayesian Inference
Baseline: nuisance background (in each spectrum)
  smoooooth
  monotone non-increasing
  non-negative
Group-Specific Signal: peaks common to a group of interest
  combine information across multiple spectra
  non-negative
  represent peaks when present, zero otherwise
Spectrum-Specific Signal: subject- or spectrum-specific unexplained variation
  no substantial prior information
  to aid identification, may prefer mean zero for each spectrum

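
One way to write the three components as a single model is the additive form below, for spectrum i in group j at mass/charge m. Additivity and every symbol here are assumptions made for illustration; the slides name the components and their constraints but do not give a formula.

```latex
\begin{align*}
y_{ij}(m) &= b_{ij}(m) + g_{j}(m) + s_{ij}(m), \\
b_{ij}(m) &\ge 0, \qquad b_{ij}(m_1) \ge b_{ij}(m_2) \ \text{for } m_1 < m_2, \\
g_{j}(m) &\ge 0, \\
\textstyle\int s_{ij}(m)\, dm &= 0 .
\end{align*}
```

Here b is the baseline (non-negative and non-increasing), g is the group-specific signal (non-negative), and s is the spectrum-specific term, with the mean-zero condition shown as the optional identification constraint.
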
29 MCMC Baseline Estimate of Mass Frauda [Figure: y versus x.]
30 Peaks and Spectrum Effects: Baseline Corrected Signal Estimate for MS Frauda [Figure: y versus x.]
31 Corrected Signal with Peaks [Figure: y versus x.]
32 Parallel Approaches to Inference
VAMPIRE: cluster of 110 Linux-based processors (Beowulf).
Currently: embarrassingly parallel problems.
  Code: combination of C, R, and job scheduling languages
  Pointwise mixed-model analysis (Bayesian inference, using MCMC)
Next steps:
  combine FDA with componentwise Bayesian model
  implement ScaLAPACK behind [...] language

33 Summary
Protein analysis by MS has tremendous potential for cancer screening, diagnosis, and treatment.
Functional data approach is a natural fit to MS data.
  identified expression differences that would be difficult to find with peak detection approaches
  inference limitations
  computational challenges
Good normalization is key to quantitative analysis.
  Theory of Normalization (w/ B. LaFleur)
Proteomics = Proteo-metrics
  All problems reduce to quantitation.
  Adherence to statistical principles is important!

34 Quantitation of MALDI Spectra [Figure: MALDI TOF MS calibration experiment (Bucknall, et al. 2002). Peak intensity ratio versus concentration of rat met GH (nmol). Fitted line: y = 1.17x [...] 0.14, r = [...].]
35 Unnormalized MALDI Spectra [Figure: MALDI TOF MS calibration experiment, no normalization (Bucknall, et al. 2002). Peak intensity versus concentration of rat met GH (nmol). Fitted line: y = 65.77x [...], r = 0.83.]
36 Spectrum 1 [Figure: intensity versus mass/charge.]
37 Spectrum 1 with Peak Detection [Figure: intensity versus mass/charge.]
38 Spectrum 1 Peaks Only [Figure: intensity versus mass/charge.]
39 Spectrum 1 [Figure: intensity versus mass/charge.]
40 Spectrum 1 with Peak Detection [Figure: intensity versus mass/charge.]
41 Spectrum 2 [Figure: intensity versus mass/charge.]
42 Spectrum 2 with Peak Detection [Figure: intensity versus mass/charge.]
43 Peaks from Spectra 1 and 2 [Figure: intensity versus mass/charge.]
More informationIntroduction to Fourier Transform Infrared Spectrometry
Introduction to Fourier Transform Infrared Spectrometry What is FTIR? I N T R O D U C T I O N FTIR stands for Fourier Transform InfraRed, the preferred method of infrared spectroscopy. In infrared spectroscopy,
More informationTutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF
Tutorial for Proteomics Data Submission Katalin F. Medzihradszky Robert J. Chalkley UCSF Why Have Guidelines? Largescale proteomics studies create huge amounts of data. It is impossible/impractical to
More informationIdentification algorithms for hybrid systems
Identification algorithms for hybrid systems Giancarlo FerrariTrecate Modeling paradigms Chemistry White box Thermodynamics System Mechanics... Drawbacks: Parameter values of components must be known
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationBetter decision making under uncertain conditions using Monte Carlo Simulation
IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics
More information泛 用 蛋 白 質 體 學 之 質 譜 儀 資 料 分 析 平 台 的 建 立 與 應 用 Universal Mass Spectrometry Data Analysis Platform for Quantitative and Qualitative Proteomics
泛 用 蛋 白 質 體 學 之 質 譜 儀 資 料 分 析 平 台 的 建 立 與 應 用 Universal Mass Spectrometry Data Analysis Platform for Quantitative and Qualitative Proteomics 2014 Training Course WeiHung Chang ( 張 瑋 宏 ) ABRC, Academia
More informationBackground Information
1 Gas Chromatography/Mass Spectroscopy (GC/MS/MS) Background Information Instructions for the Operation of the Varian CP3800 Gas Chromatograph/ Varian Saturn 2200 GC/MS/MS See the Cary Eclipse Software
More informationMultiple Regression YX1 YX2 X1X2 YX1.X2
Multiple Regression Simple or total correlation: relationship between one dependent and one independent variable, Y versus X Coefficient of simple determination: r (or r, r ) YX YX XX Partial correlation:
More informationA Beginner s Guide to ICPMS Part X Detectors
TUTORIAL A Beginner s Guide to ICPMS Part X Detectors Robert Thomas Robert Thomas has more than 30 years of experience in trace element analysis. He is the principal of his own freelance writing and consulting
More informationGed Ridgway Wellcome Trust Centre for Neuroimaging University College London. [slides from the FIL Methods group] SPM Course Vancouver, August 2010
Ged Ridgway Wellcome Trust Centre for Neuroimaging University College London [slides from the FIL Methods group] SPM Course Vancouver, August 2010 β β y X X e one sample ttest two sample ttest paired
More informationBEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
More informationJava Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Nonnormal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
More informationIntroduction. Chapter 12 Mass Spectrometry and Infrared Spectroscopy. Electromagnetic Spectrum. Types of Spectroscopy 8/29/2011
Organic Chemistry, 6 th Edition L. G. Wade, Jr. Chapter 12 Mass Spectrometry and Infrared Spectroscopy Introduction Spectroscopy is an analytical technique which helps determine structure. It destroys
More informationSpectrophotometry and the BeerLambert Law: An Important Analytical Technique in Chemistry
Spectrophotometry and the BeerLambert Law: An Important Analytical Technique in Chemistry Jon H. Hardesty, PhD and Bassam Attili, PhD Collin College Department of Chemistry Introduction: In the last lab
More informationChapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining BecerraFernandez, et al.  Knowledge Management 1/e  2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
More informationSoftware Approaches for Structure Information Acquisition and Training of Chemistry Students
Software Approaches for Structure Information Acquisition and Training of Chemistry Students Nikolay T. Kochev, Plamen N. Penchev, Atanas T. Terziyski, George N. Andreev Department of Analytical Chemistry,
More informationS SG. Design of Experiment in Metabolomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical.
S SG ection ON tatistical enetics Design of Experiment in Metabolomics Hemant K. Tiwari, Ph.D. Professor and Head Section on Statistical Genetics Department of Biostatistics School of Public Health Metabolomics:
More informationFundamentals of modern UVvisible spectroscopy. Presentation Materials
Fundamentals of modern UVvisible spectroscopy Presentation Materials The Electromagnetic Spectrum E = hν ν = c / λ 1 Electronic Transitions in Formaldehyde 2 Electronic Transitions and Spectra of Atoms
More informationA Introduction to Matrix Algebra and Principal Components Analysis
A Introduction to Matrix Algebra and Principal Components Analysis Multivariate Methods in Education ERSH 8350 Lecture #2 August 24, 2011 ERSH 8350: Lecture 2 Today s Class An introduction to matrix algebra
More informationHigh resolution mass spectrometry (HRMS*) in Graz
High resolution mass spectrometry (HRMS*) in Graz 10.2.2012 * Instruments with high resolution where exact mass measurements can be performed 1 HRMS in Graz s and Ionization Techniques 2 6 3 5 4 1 TU Graz
More informationMonitoring of Cerebral Blood Flow. Transcranial Doppler Laser Doppler Flowmetry Thermal dilution method (Hemedex)
Monitoring of Cerebral Blood Flow Transcranial Doppler Laser Doppler Flowmetry Thermal dilution method (Hemedex) Ultrasound in Tissue Some Facts: blood cell tissue probe ultrasound travels at a constant
More information