Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics



Similar documents
MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification

AB SCIEX TOF/TOF 4800 PLUS SYSTEM. Cost effective flexibility for your core needs

In-Depth Qualitative Analysis of Complex Proteomic Samples Using High Quality MS/MS at Fast Acquisition Rates

Increasing the Multiplexing of High Resolution Targeted Peptide Quantification Assays

Chapter 14. Modeling Experimental Design for Proteomics. Jan Eriksson and David Fenyö. Abstract. 1. Introduction

泛 用 蛋 白 質 體 學 之 質 譜 儀 資 料 分 析 平 台 的 建 立 與 應 用 Universal Mass Spectrometry Data Analysis Platform for Quantitative and Qualitative Proteomics

Aiping Lu. Key Laboratory of System Biology Chinese Academic Society

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF

Global and Discovery Proteomics Lecture Agenda

ProteinScape. Innovation with Integrity. Proteomics Data Analysis & Management. Mass Spectrometry

Session 1. Course Presentation: Mass spectrometry-based proteomics for molecular and cellular biologists

ProteinPilot Report for ProteinPilot Software

Effects of Intelligent Data Acquisition and Fast Laser Speed on Analysis of Complex Protein Digests

PeptidomicsDB: a new platform for sharing MS/MS data.

HRMS in Clinical Research: from Targeted Quantification to Metabolomics

Application Note # LCMS-81 Introducing New Proteomics Acquisiton Strategies with the compact Towards the Universal Proteomics Acquisition Method

Introduction to Proteomics 1.0

Introduction to mass spectrometry (MS) based proteomics and metabolomics

Preprocessing, Management, and Analysis of Mass Spectrometry Proteomics Data

Accurate Mass Screening Workflows for the Analysis of Novel Psychoactive Substances

Introduction to Proteomics

The Scheduled MRM Algorithm Enables Intelligent Use of Retention Time During Multiple Reaction Monitoring

Overview. Introduction. AB SCIEX MPX -2 High Throughput TripleTOF 4600 LC/MS/MS System

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics

Proteomic Analysis using Accurate Mass Tags. Gordon Anderson PNNL January 4-5, 2005

Building innovative drug discovery alliances. Evotec Munich. Quantitative Proteomics to Support the Discovery & Development of Targeted Drugs

Mass Spectrometry Based Proteomics

Learning Objectives:

Error Tolerant Searching of Uninterpreted MS/MS Data

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments

Challenges in Computational Analysis of Mass Spectrometry Data for Proteomics

Pinpointing phosphorylation sites using Selected Reaction Monitoring and Skyline

Methods for Protein Analysis

Research-grade Targeted Proteomics Assay Development: PRMs for PTM Studies with Skyline or, How I learned to ditch the triple quad and love the QE

MarkerView Software for Metabolomic and Biomarker Profiling Analysis

Guide to Reverse Phase SpinColumns Chromatography for Sample Prep

Introduction to Proteomics

Application Note # LCMS-62 Walk-Up Ion Trap Mass Spectrometer System in a Multi-User Environment Using Compass OpenAccess Software

LC-MS/MS for Chromatographers

OplAnalyzer: A Toolbox for MALDI-TOF Mass Spectrometry Data Analysis

Advantages of the LTQ Orbitrap for Protein Identification in Complex Digests

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

Statistical Analysis Strategies for Shotgun Proteomics Data

Mass Frontier Version 7.0

Unique Software Tools to Enable Quick Screening and Identification of Residues and Contaminants in Food Samples using Accurate Mass LC-MS/MS

Mass Spectra Alignments and their Significance

Master course KEMM03 Principles of Mass Spectrometric Protein Characterization. Exam

Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping

using ms based proteomics

Biopharmaceutical Glycosylation Analysis

STANFORD UNIVERSITY MASS SPECTROMETRY 333 CAMPUS DR., MUDD 175 STANFORD, CA

EKSIGENT EKSPERT NANOLC 400. Unmatched flexibility for low flow LC/MS

CUSTOM ANTIBODIES. Fully customised services: rat and murine monoclonals, rat and rabbit polyclonals, antibody characterisation, antigen preparation

DRUG METABOLISM. Drug discovery & development solutions FOR DRUG METABOLISM

MASCOT Search Results Interpretation

Overview. Triple quadrupole (MS/MS) systems provide in comparison to single quadrupole (MS) systems: Introduction

T cell Epitope Prediction

A Streamlined Workflow for Untargeted Metabolomics

for mass spectrometry calibration tools Thermo Scientific Pierce Controls and Standards for Mass Spectrometry

Monoclonal Antibody Fragment Separation and Characterization Using Size Exclusion Chromatography Coupled with Mass Spectrometry

Quantification of Multiple Therapeutic mabs in Serum Using microlc-esi-q-tof Mass Spectrometry

Shotgun Proteomic Analysis. Department of Cell Biology The Scripps Research Institute

MaxQuant User s Guide Version

Electrospray Ion Trap Mass Spectrometry. Introduction

Background Information

The Facts on Equine Drug Testing

Pesticide Analysis by Mass Spectrometry

How Mascot Integra helps run a Core Lab

Mascot Search Results FAQ

Simultaneous qualitative and quantitative analysis using the Agilent 6540 Accurate-Mass Q-TOF

Thermo Scientific GC-MS Data Acquisition Instructions for Cerno Bioscience MassWorks Software

Choices, choices, choices... Which sequence database? Which modifications? What mass tolerance?

Retrospective Analysis of a Host Cell Protein Perfect Storm: Identifying Immunogenic Proteins and Fixing the Problem

La Protéomique : Etat de l art et perspectives

Proteomic data analysis for Orbitrap datasets using Resources available at MSI. September 28 th 2011 Pratik Jagtap

Week 9: MS in Space and Proteomics

NUVISAN Pharma Services

Overview. Purpose. Methods. Results

Improving the Metabolite Identification Process with Efficiency and Speed: LightSight Software for Metabolite Identification

CPAS Overview. Josh Eckels LabKey Software

Mascot Integra: Data management for Proteomics ASMS 2004

Chapter 3. Protein Structure and Function

Standard Mixture. TOF/TOF Calibration Mixture. Calibration Mixture 1 (Cal Mix 1, 1:10) Calibration Mixture 1 (Cal Mix 1, 1:100)

Quan%ta%ve proteomics. Maarten Altelaar, 2014

Tutorial 9: SWATH data analysis in Skyline

WATERS QUANTITATIVE ANALYSIS solutions

Application of Structure-Based LC/MS Database Management for Forensic Analysis

Per Andrén Dept. of Pharmaceutical Biosciences Medical Mass Spectrometry, Uppsala University, Uppsala, Sweden

A Navigation through the Tracefinder Software Structure and Workflow Options. Frans Schoutsen Pesticide Symposium Prague 27 April 2015

High-Throughput 3-D Chromatography Through Ion Exchange SPE

Workshop IIc. Manual interpretation of MS/MS spectra. Ebbing de Jong. Center for Mass Spectrometry and Proteomics Phone (612) (612)

Quantitative proteomics background

Investigating Biological Variation of Liver Enzymes in Human Hepatocytes

Agilent 6000 Series LC/MS Solutions CONFIDENCE IN QUANTITATIVE AND QUALITATIVE ANALYSIS

Transcription:

Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics Ilan Beer Haifa Research Lab Dec 10, 2002

Pep-Miner s Location in the Life Sciences World The post-genome era - the age of proteome begins Proteins (rather than genes) are the real players in biology Fundamental activities Identifying/cataloguing proteins in various tissues Comparing cells in different conditions (e.g. healthy/diseased) Major goals: drug discovery, diagnostics Mass spectrometry: leading proteomics technology Rapidly replacing older technologies (Gel, Edman,...) Well-defined interface between wet biology and computing

Location (cont.) Pep-Miner: novel proteome analysis technology High-throughput algorithms for processing mass spec data Considerable data size reduction Improved analysis quality Better peptide identification and peptide mixture comparison

Mass Spectrometry Basics 1. Proteins are extracted from cells and digested into peptides; or peptides are directly extracted from cells Peptide mixture 2. Peptides elute from HPLC column ordered by hydrophobicity; mixture becomes simpler HPLC 3. Mixture is sampled every few seconds by mass spectrometer; peptide masses are shown as MS spectrum MS 300 m/z 2000 4. Peptides of selected masses are fragmented into sub-peptides; fragment masses are shown as MS/MS spectrum MS/MS 300 m/z 1000 5. Peptides are identified based on their MS and MS/MS spectra Identification program

Major Problem: Huge Amounts of Data A mass spectrometer produces a lot of data More than 1000 MS and 1000 MS/MS spectra/hour 10 100 MB/hour One day in a big pharma/biotech with 100 mass specs 100 x 24 x 1000 = 2.4 M MS/MS spectra to analyze 100 x 24 x 100 MB = 240 GB to store Problems Long analysis time: seconds to minutes per spectrum Enormous computing power is needed Limited long-term storage capability of valuable data Data is hard to manage

More Problems Most peptides remain unidentified Mass spectrometers have limited accuracy, sensitivity, resolution, mass range Chemical/biological contamination Unknown fragmentation rules (intensity) Incomplete protein/gene libraries Modifications and mutations Identification-based mixture comparison is rendered meaningless Pep-Miner technology addresses these problems

Pep-Miner: A Novel Proteome Analysis Technology Clustering of MS/MS data All similar spectra in all mass spec runs across a project are grouped into a cluster A cluster represents one peptide A representative spectrum is computed for each cluster (average), replacing the raw spectra Source for multiplicity A peptide may be fragmented more than once in a run Repeated runs of same material Many proteins, and hence peptides, are common to the various cell types and the various conditions

Cluster Example A. Member 1 Score:74 B. Member 2 Score:59 C. Member 3 Score:67 500 m/z 1000 500 m/z 1000 Three examples out of 52 MS/MS spectra that have been grouped into one cluster D. Combined Score:88 500 m/z 1000 500 m/z 1000 Combined spectrum of the cluster

Work Flow Read mass spectrometer files Find clusters Compute cluster representative spectra (reps); delete raw spectra Store cluster information and reps in database Send reps to identification and store results in database Show analysis results (database view) to user Read more mass spectrometer files, possibly as soon as they are created Try distributing new spectra among existing clusters; Create new clusters for the rest Compute reps Send only new reps to identification Show updated database view to user

Major Benefits Enormous saving in time and space Data size reduction (x40-x100) Allows to store larger amounts of valuable data for future analysis Reduced analysis time (x40-x100) Only cluster reps are analyzed Much of the new data is categorized rather than analyzed => saving increases as the project progresses Improved data quality Reps have better signal/noise ratio and accuracy than raw spectra Increased identification yield More accurate data -> faster analysis

More Benefits Peptide mixture comparison rendered meaningful No need for prior identification Rely on spectrum similarity only Further analysis can focus on interesting differences only Data management by humans becomes easy Whole project in one view 10 3 clusters to inspect rather than 10 5 raw spectra Knowledge acquisition and exploitation Biological knowledge is accumulated in a database Analysis quality improves as the project progresses the tool learns from the analyzed data (e.g. retention time prediction and sequence motifs)

Publications Analysis of endogenous peptides bound by soluble MHC class I molecules: a novel approach for identifying tumor-specific antigens, Barnea E, Beer I, Patoka R, Ziv T, Kessler O, Tzehoval E, Eisenbach L, Zavazava N, Admon A., European Journal of Immunology, January 2002 Application of Liquid Chromatography Tandem Mass-Spectrometry Data Clustering to Peptide Analysis, Beer I, Barnea E, Ziv T, Admon A, submitted