The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis
|
|
|
- Homer Wilcox
- 10 years ago
- Views:
Transcription
1 The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis Revised * (cf. 2D-LC)
2 Introduction There is a need for integrated proteomics expression databases and bioinformatic tools that help perform exploratory data analysis and data mining in the context of the large number of high-quality characterization, annotation, pathway and functional databases increasingly available on the Internet. Some of the biological problems addressed by these types of bioinformatic tools include aid in the detection and better understanding of post-translational modifications; helping in the discovery of biomarkers for diagnosis and monitoring of disease, detecting toxicity, and developing new drugs; analysis of coordinated expression of sets of proteins; and pathway elucidation. The Open2Dprot project is a community effort to create a fully open-source n-dimensional protein expression data analysis system that can be freely downloaded and used for data mining protein expression profiles across sets of n- dimensional data from research experiments (2D gels, 2D LC-MS, protein microarrays, n-dimensional LC-MS*MS*..., etc). The focus of Open2Dprot is to provide an integrated set of open source software tools for n-d database analysis that is hosted on the SourceForge.net repository. A pipeline control program called Open2Dprot analyzes, schedules, and runs the pipeline modules required to pre-process and create a Composite Sample Database used in the data mining. 2
3 Open2Dprot is being expanded to handle data from other protein separation methods. It uses the open source methodology modeled after our MAExplorer DNA microarray analysis software. The Open2Dprot goals and software development plan are described on Open2Dprot is being written in Java/R using XML and MySQL RDBMS. It is based partly on some refactored code from earlier C/Unix/X-windows 2D PAGE data-mining systems, in part on code from other open-source bioinformatic software projects (such as Bioconductor), and Java and R languages using code from MAExplorer, Flicker, and GELLAB-II. It is being extended with other 2D-proteomics analyses, mass spectrometry, protein microarray, and related proteomics software codes as well as developer efforts donated by the research community. It uses XML interchange formats and a SQL/schema modeled after the Protein Standards Initiative (PSI) MIAPE proteomics community data standard as then interface between stages of the analysis pipeline. This standardization allows for data sharing and alternate methods for 2D gel spot segmentation or 2D LC-MS peptide "spot" clusters, protein-arrays, spot pairing, data analysis methods, etc., could be made added. This will be critical when applying it to other types of 2D proteomics data. As pipeline components become usable, they are made available on the Web site (see 'Module list' for current status). 3
4 The Open2Dprot Project Open2Dprot is an open-source project for the development of n-dimensional proteomics exploratory data analysis bioinformatic tools. The tools can be used for analyzing quantified protein expression data across multiple n-d samples from research experiments. The tools could be adapted for use with a variety of quantified 2-D or n-dimensional protein separation sources of expression data. 4
5 Proteomic Separation Methods 2D-PAGE (P. O Farrell, 1975) pie vs Mm (mass), 2D-gels 2D LC-MS retention-times vs m/z (mass) 2D IPG-MS pie vs m/z (mass) 2D LC-LC pie vs RP-HPLC n-d (e.g., LC-MS*MS*MS ) Protein arrays (analytes vs antibodies) All share a common paradigm: proteins separated by orthogonal features Some of these methods are semi-quantitative Data represented as protein expression profiles lends itself to exploratory data analysis Open2Dprot could be used as part of a broader set of integrated tools 5
6 Composite Samples Database (CSD) Paradigm Proteomic composite samples database (CSD) consisting of a set of n samples G 1, G 2,,G n with representative sample G r = G 1 Expression profiles A,B,C,... O = present, X = missing A canonical sample database is a statistical representation of the CSD spot geometry and quantification that could be used for data mining Lemkin & Lester Clinical Chemistry,
7 n-dimensional Protein Expression Analysis PRE-PROCESSING PIPELINE: Samples are accessioned and processed in a data-reduction pipeline to construct a Composite Samples Database CSD merging paired spot lists from the set of samples Interactive and batch processing of samples DATABASE: CSD is created & maintained in RDBMS could be shared between groups XML files data interchange High-speed caches used for data mining DATA MINING: The CSD data is explored using exploratory data analysis techniques: statistics, clustering, classification, directmanipulation graphics and reports, etc. with access to Internet proteomic / genomic / PubMed / function / pathway databases Interactive and batch processing of samples 7
8 Pre-Processing Pipeline Steps Accession n-d sample images or n-d data and experiment data into database Quantify spots from sample images, 2D LC-MS peptide peak clusters, or protein arrays Pair spots between samples and reference samples Construct Composite Samples Database (CSD) for all sets of paired samples 8
9 Data-Mining Analyses Being Developed for the Composite Samples Database Manage replicate samples and condition sets of samples Manage subsets of proteins in the database Analyze expression profiles for multiple conditions Cluster proteins and cluster samples Classify samples by protein subsets Data-filter protein sets by statistics, clustering, set membership Direct-manipulation of data in graphics, spreadsheets Java and R language statistical, clustering, classifiers, class prediction, and other plug-in methods Access Internet proteomic/genomic/pubmed/function/pathway data bases during data mining of protein subsets 9
10 Why 2D-Gels Now? 2D-PAGE was not widely used until recently due to: - limitations in identifying spots differentially expressed - difficulty resolving and detecting specialized classes of proteins (e.g., basic proteins, membrane proteins, low abundance proteins) Today, 2D-PAGE is often used as prescreening stage for mass-spectrometry to identify excised spots found in differential analysis Improved resolution: zoom 2D-gels, new pre-fractionation methods There are other protein separation techniques that could use these 2D-gel and recent DNA-microarray database analysis paradigms including 2D LC-MS and protein arrays 10
11 2D-PAGE Gel Spot Segmentation and Quantification Original 2D-PAGE gel Segmented and quantified spots 11 * Images flipped horizontally from original data
12 2-D LC-MS Map of Spectral Data Intensity m/z Intensity Each spectrum has a retention time for when is was collected Retention times are reproducible measures of when peptides are released from the RP column Retention time and m/z can be used as coordinates Intensity values for each peak are mapped to a specific location in a 2-D space m/z Retention time Retention time m/z * John Lewis, Geo-Centers Inc, US Army Center for Environmental Health Res., Nov, 2003 preliminary data (with permission) 12
13 Protein Array Example of sensitivity and reproducibility analysis of the reverse phase protein microarrays. From Sheehan, K. M. (2005) Mol. Cell. Proteomics 4: With RP arrays, analytes are immobilized in solid-phase on the array. Copyright 2005 American Society for Biochemistry and Molecular Biology. With permission. 13
14 Why Open Source? The basic idea behind open source is very simple: When programmers can read, redistribute, and modify the source code for a piece of software, the software evolves. People improve it, people adapt it, people fix bugs. And this can happen at a speed that, if one is used to the slow pace of conventional software development, seems astonishing. We in the open source community have learned that this rapid evolutionary process produces better software than the traditional closed model, in which only a very few programmers can see the source and everybody else must blindly use an opaque block of bits. From the Open Source Initiative (OSI) 14
15 Why an Open-Source nd-data Proteomics Effort? An open-source project can be advantageous to the community at large, since there is a far greater likelihood of progress in algorithm design in an academic style collaboration than a closed-source business model. Researchers can more rapidly adapt new methods to existing software without waiting for release of commercial products Use contributed expertise and code of proteomics experts and bioinformaticians to help build and test open software Algorithms more transparent, so researchers can verify results more easily More opportunity to share data in standard non-proprietary 15 formats
16 Why Open Source Proteomics? (continued) No expensive software licenses required - reduces deployment costs within large organizations and small labs Using proper open-source licenses can encourage adoption and collaboration between industry, academic, and government interests (e.g., Linux, FireFox, Apache, Eclipse etc.) Many free open-source repositories available Repositories offer tools to support collaboration, software development, documentation, forums, and distribution 16
17 Open Source Repositories - E.g., SourceForge.Net Free code Repositories Developer, collaborator, user environments SourceForge.net Statistics Registered Projects: 107,096 Registered Users: 1,187,819 SourceForge.net is the world's largest Open Source software development website, with the largest repository of Open Source code and applications available on the Internet. SourceForge.net provides free services to Open Source developers
18 Open2Dprot - Project Goals An international community effort to create an open-source n-d quantitative data analysis system A stand-alone downloadable system that can connect to DBs Use for data mining protein expression of sets of samples from researcher s experiments to investigate and find significant protein expression differences from multiple experimental conditions Will provide integrated set of software tools, analysis methods and data structures for quantitative and system biology protein expression Will handle protein expression data from 2D-gel, 2D LC-MS, protein arrays, and other protein separation methods 18
19 Development Plan Open2Dprot is being written in Java and R languages using XML (MIAPE proteomics schema) and MySQL RDBMS - modern modular open-source technologies aiding portability and extensibility Open2Dprot was derived from new and refactored Java code from various projects including: MAExplorer, Flicker, GELLAB-II Data mining will use Java- and R-plugins derived from MAExplorer and R data-mining open-source proteomics (e.g., Bioconductor), as well as other bioinformatics data-mining software Will be extended with other open-source 2D-gel, LC-MS N and analysis related proteomics software codes with additional efforts by the research community 19
20 Using Open Source Resources Hosted and developed on SourceForge repository at open2dprot.sourceforge.net Web site discusses the Open2Dprot software development plan, and contains documentation and software distributions Uses the similar open-source development methodology used in our Java/R-based MAExplorer maexplorer.sourceforge.net DNA microarray data-mining software Open2Dprot could later reside as part of HUPO.org analysis or other reference database Web sites integrated with other tools relating to 2D gels, mass spectrometry, dye multiplexing, 20 protein arrays, Internet proteomic databases, etc.
21 n-d Protein Expression Analysis Pipeline 1. Accession samples, images or n-d data, and experiment information XML (processed data) 2. Quantify n-d data to by extracting spots for all samples (2Dgels, 2D-LC-MS peak clusters, protein-arrays, etc.) XML 3. Create landmark database between reference sample(s) and remaining samples for spot pairing algorithm (if required) interactive batch interactive XML 4. Pair spots between reference sample(s) and the rest of samples batch XML 5. Construct Composite Database, CSD, by merging paired spot lists batch CSD: RDBMS, XML and caches interactive 6. Explore the CSD data using exploratory data analysis techniques: statistics, clustering, direct-manipulation graphics and reports, etc. 21 batch
22 Pipeline Control Program Open2Dprot The pre-processing is controlled by the pipeline control program Open2Dprot Modules are assigned to Processing stages It determines what data exists, then from that dependency determines what data needs to be created from existing data, and creates the target data It then schedules and runs the required dynamically assigned modules in the pipeline to create the target data. Multiple processors could be used This is repeated until the desired data is created 22
23 Open2Dprot Pipeline Control Program User Open2Dprot pipeline control program Analyze dependencies Dependency analysis of what data exists and what needs to be created, schedule XML module processes Assign active modules Database of potential processing module descriptions Database of assigned processing modules Run pipeline Database - XML or RDBMS/cache for: accession/miape, landmark, SSF, SPF, CSD, etc. Launch processes and monitor pipeline processes 23
24 Data-Mining the Composite Sample Database The previous slide shows some of the types of tools that will be developed for Open2Dprot CSD data mining analysis as we have done previously for MAExplorer DNA microarray software using Javaand R-plugins In Open2Dprot, many of the R-plugins will use methods developed for or derived from Bioconductor (see bioconductor.org, DNA microarray analysis system written in the R language, r-project.org) 24
25 Early MIAPE (PEDRo) UML Schema n-d Data Classes that could be used with 2D-gels Additional fields / classes are needed for Open2Dprot in Taylor et.al., Nature Biotechnology, March PEDRo has been renamed MIAPE Minimal Information About a Proteomics Experiment (Oct. 2003, HUPO-II) by EMBL-EBI 25 See psidev.sf.net
26 MIAPE Minimal Information About a Proteomics Experiment psidev.sourceforge.net/gps/#miape As of
27 Home: In Table of Contents, see: Under Open2Dprot * Home * Development plan * Overview (PDF) * Sub projects * Participation 27
28 Open2Dprot Pipeline Subprojects - Status Additional alternative modules are being developed for all pipeline stages
29 Contributed Associated or Related Projects We added some additional non-pipeline open source projects that may use similar data or common software modules. They may be useful for performing other types of analysis on data used by Open2Dprot or provide other types of analyses
30 Summary of Open2Dprot Open2Dprot is a fully open-source n-d proteomics data-mining project for a variety of proteomic expression data sources and is being developed at It has a flexible pipeline-modules design using XML data interchange and /RDBMS-caches and portable Java and R using existing code where possible As parts of the project pipeline become usable, they are being released as stand-alone programs 30
Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
Introduction to Proteomics
Introduction to Proteomics Åsa Wheelock, Ph.D. Division of Respiratory Medicine & Karolinska Biomics Center [email protected] In: Systems Biology and the Omics Cascade, Karolinska Institutet, June 9-13,
Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments
Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, and Pierangelo Veltri University Magna Græcia of Catanzaro, 88100
Integrated Data Mining Strategy for Effective Metabolomic Data Analysis
The First International Symposium on Optimization and Systems Biology (OSB 07) Beijing, China, August 8 10, 2007 Copyright 2007 ORSC & APORC pp. 45 51 Integrated Data Mining Strategy for Effective Metabolomic
Global and Discovery Proteomics Lecture Agenda
Global and Discovery Proteomics Christine A. Jelinek, Ph.D. Johns Hopkins University School of Medicine Department of Pharmacology and Molecular Sciences Middle Atlantic Mass Spectrometry Laboratory Global
La Protéomique : Etat de l art et perspectives
La Protéomique : Etat de l art et perspectives Odile Schiltz Institut de Pharmacologie et de Biologie Structurale CNRS, Université de Toulouse, [email protected] Protéomique et Spectrométrie de Masse
Introduction to Proteomics 1.0
Introduction to Proteomics 1.0 CMSP Workshop Tim Griffin Associate Professor, BMBB Faculty Director, CMSP Objectives Why are we here? For participants: Learn basics of MS-based proteomics Learn what s
Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM [email protected]
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM [email protected] Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
Protein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
Introduction to mass spectrometry (MS) based proteomics and metabolomics
Introduction to mass spectrometry (MS) based proteomics and metabolomics Tianwei Yu Department of Biostatistics and Bioinformatics Rollins School of Public Health Emory University September 10, 2015 Background
Un (bref) aperçu des méthodes et outils de fouilles et de visualisation de données «omics»
Un (bref) aperçu des méthodes et outils de fouilles et de visualisation de données «omics» Workshop «Protéomique & Maladies rares» 25 th September 2012, Paris [email protected] CEA Grenoble irtsv
Tutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
Proteomics in Practice
Reiner Westermeier, Torn Naven Hans-Rudolf Höpker Proteomics in Practice A Guide to Successful Experimental Design 2008 Wiley-VCH Verlag- Weinheim 978-3-527-31941-1 Preface Foreword XI XIII Abbreviations,
Dr Alexander Henzing
Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander
Imaging and Bioinformatics Software
Imaging and Bioinformatics Software Software Overview 242 Gel Analysis Software 243 Ordering Information 246 Software Overview Software Overview See Also Imaging systems: pages 232 237. Bio-Plex Manager
#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf
Jenkins as a Scientific Data and Image Processing Platform Ioannis K. Moutsatsos, Ph.D., M.SE. Novartis Institutes for Biomedical Research www.novartis.com June 18, 2014 #jenkinsconf Life Sciences are
Increasing the Multiplexing of High Resolution Targeted Peptide Quantification Assays
Increasing the Multiplexing of High Resolution Targeted Peptide Quantification Assays Scheduled MRM HR Workflow on the TripleTOF Systems Jenny Albanese, Christie Hunter AB SCIEX, USA Targeted quantitative
MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics
MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics With Unique QTRAP and TripleTOF 5600 System Technology Targeted peptide quantification is a rapidly growing application
AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
Case Study: Using Open Source Solutions to Simplify The Network
I will be happy when I don t even know I have a computer network!" Danny McNicholl, Managing Director, 3Peaks Solutions Ltd Case Study: Using Open Source Solutions to Simplify The Network Summary An overly
泛 用 蛋 白 質 體 學 之 質 譜 儀 資 料 分 析 平 台 的 建 立 與 應 用 Universal Mass Spectrometry Data Analysis Platform for Quantitative and Qualitative Proteomics
泛 用 蛋 白 質 體 學 之 質 譜 儀 資 料 分 析 平 台 的 建 立 與 應 用 Universal Mass Spectrometry Data Analysis Platform for Quantitative and Qualitative Proteomics 2014 Training Course Wei-Hung Chang ( 張 瑋 宏 ) ABRC, Academia
ASMS Regulated Bioanalysis Interest Group (RBIG) Workshop. Antibody-Drug Conjugates (ADC) A Complex Problem in Regulated Bioanalysis.
ASMS Regulated Bioanalysis Interest Group (RBIG) Workshop Antibody-Drug Conjugates (ADC) A Complex Problem in Regulated Bioanalysis June 17, 2014 Presiding: Dr. Keyang Xu (Genentech) and Dr. Fabio Garofolo
Kinexus has an in-house inventory of lysates prepared from 16 human cancer cell lines that have been selected to represent a diversity of tissues,
Kinexus Bioinformatics Corporation is seeking to map and monitor the molecular communications networks of living cells for biomedical research into the diagnosis, prognosis and treatment of human diseases.
ProteinPilot Report for ProteinPilot Software
ProteinPilot Report for ProteinPilot Software Detailed Analysis of Protein Identification / Quantitation Results Automatically Sean L Seymour, Christie Hunter SCIEX, USA Pow erful mass spectrometers like
MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification
MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification Gold Standard for Quantitative Data Processing Because of the sensitivity, selectivity, speed and throughput at which MRM assays can
FACULTY OF MEDICAL SCIENCE
Doctor of Philosophy in Biochemistry FACULTY OF MEDICAL SCIENCE Naresuan University 73 Doctor of Philosophy in Biochemistry The Biochemistry Department at Naresuan University is a leader in lower northern
Introduction to Proteomics
Introduction to Proteomics Why Proteomics? Same Genome Different Proteome Black Swallowtail - larvae and butterfly Biological Complexity Yeast - a simple proteome 6,113 proteins = 344,855 tryptic peptides
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
Quantitative proteomics background
Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran
The Scheduled MRM Algorithm Enables Intelligent Use of Retention Time During Multiple Reaction Monitoring
The Scheduled MRM Algorithm Enables Intelligent Use of Retention Time During Multiple Reaction Monitoring Delivering up to 2500 MRM Transitions per LC Run Christie Hunter 1, Brigitte Simons 2 1 AB SCIEX,
Aiping Lu. Key Laboratory of System Biology Chinese Academic Society [email protected]
Aiping Lu Key Laboratory of System Biology Chinese Academic Society [email protected] Proteome and Proteomics PROTEin complement expressed by genome Marc Wilkins Electrophoresis. 1995. 16(7):1090-4. proteomics
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
DeCyder Extended Data Analysis module Version 1.0
GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics
Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics Ilan Beer Haifa Research Lab Dec 10, 2002 Pep-Miner s Location in the Life Sciences World The post-genome era - the age of proteome
Research-grade Targeted Proteomics Assay Development: PRMs for PTM Studies with Skyline or, How I learned to ditch the triple quad and love the QE
Research-grade Targeted Proteomics Assay Development: PRMs for PTM Studies with Skyline or, How I learned to ditch the triple quad and love the QE Jacob D. Jaffe Skyline Webinar July 2015 Proteomics and
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Effects of Intelligent Data Acquisition and Fast Laser Speed on Analysis of Complex Protein Digests
Effects of Intelligent Data Acquisition and Fast Laser Speed on Analysis of Complex Protein Digests AB SCIEX TOF/TOF 5800 System with DynamicExit Algorithm and ProteinPilot Software for Robust Protein
Preprocessing, Management, and Analysis of Mass Spectrometry Proteomics Data
Preprocessing, Management, and Analysis of Mass Spectrometry Proteomics Data M. Cannataro, P. H. Guzzi, T. Mazza, and P. Veltri Università Magna Græcia di Catanzaro, Italy 1 Introduction Mass Spectrometry
Management of Proteomics Data: 2D Gel Electrophoresis and Other Methods
Management of Proteomics Data: 2D Gel Electrophoresis and Other Methods Philip Andrews National Resource for Proteomics & Pathway Mapping Michigan Proteome Consortium University of Michigan Outline of
MarkerView Software 1.2.1 for Metabolomic and Biomarker Profiling Analysis
MarkerView Software 1.2.1 for Metabolomic and Biomarker Profiling Analysis Overview MarkerView software is a novel program designed for metabolomics applications and biomarker profiling workflows 1. Using
Building innovative drug discovery alliances. Evotec Munich. Quantitative Proteomics to Support the Discovery & Development of Targeted Drugs
Building innovative drug discovery alliances Evotec Munich Quantitative Proteomics to Support the Discovery & Development of Targeted Drugs Evotec AG, Evotec Munich, June 2013 About Evotec Munich A leader
LC-MS/MS for Chromatographers
LC-MS/MS for Chromatographers An introduction to the use of LC-MS/MS, with an emphasis on the analysis of drugs in biological matrices LC-MS/MS for Chromatographers An introduction to the use of LC-MS/MS,
A Streamlined Workflow for Untargeted Metabolomics
A Streamlined Workflow for Untargeted Metabolomics Employing XCMS plus, a Simultaneous Data Processing and Metabolite Identification Software Package for Rapid Untargeted Metabolite Screening Baljit K.
Thermo Scientific SIEVE Software for Differential Expression Analysis
m a s s s p e c t r o m e t r y Thermo Scientific SIEVE Software for Differential Expression Analysis Automated, label-free, semi-quantitative analysis of proteins, peptides, and metabolites based on comparisons
Biotechnology and Life Science Marketing Services Mailing List and Data Card Order Form
C H I Cambridge Healthtech Institute s Biotechnology and Life Science Marketing Services Mailing List and Data Card Order Form Over 800,000 names segmented by scientific interest Featuring U.S and International
Retrospective Analysis of a Host Cell Protein Perfect Storm: Identifying Immunogenic Proteins and Fixing the Problem
Retrospective Analysis of a Host Cell Protein Perfect Storm: Identifying Immunogenic Proteins and Fixing the Problem Kevin Van Cott, Associate Professor Dept. of Chemical and Biomolecular Engineering Nebraska
Medical Informatics II
Medical Informatics II Zlatko Trajanoski Institute for Genomics and Bioinformatics Graz University of Technology http://genome.tugraz.at [email protected] Medical Informatics II Introduction
Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF
Tutorial for Proteomics Data Submission Katalin F. Medzihradszky Robert J. Chalkley UCSF Why Have Guidelines? Large-scale proteomics studies create huge amounts of data. It is impossible/impractical to
Session 1. Course Presentation: Mass spectrometry-based proteomics for molecular and cellular biologists
Program Overview Session 1. Course Presentation: Mass spectrometry-based proteomics for molecular and cellular biologists Session 2. Principles of Mass Spectrometry Session 3. Mass spectrometry based proteomics
An Introduction to Genomics and SAS Scientific Discovery Solutions
An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!
BIOINFORMATICS Supporting competencies for the pharma industry
BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established
ProteinScape. Innovation with Integrity. Proteomics Data Analysis & Management. Mass Spectrometry
ProteinScape Proteomics Data Analysis & Management Innovation with Integrity Mass Spectrometry ProteinScape a Virtual Environment for Successful Proteomics To overcome the growing complexity of proteomics
MassHunter for Agilent GC/MS & GC/MS/MS
MassHunter for Agilent GC/MS & GC/MS/MS Next Generation Data Analysis Software Presented by : Terry Harper GC/MS Product Specialist 1 Outline of Topics Topic 1: Introduction to MassHunter Topic 2: Data
Application Note # LCMS-81 Introducing New Proteomics Acquisiton Strategies with the compact Towards the Universal Proteomics Acquisition Method
Application Note # LCMS-81 Introducing New Proteomics Acquisiton Strategies with the compact Towards the Universal Proteomics Acquisition Method Introduction During the last decade, the complexity of samples
DATA MINING ALPHA MINER
DATA MINING ALPHA MINER AlphaMiner is developed by the E-Business Technology Institute (ETI) of the University of Hong Kong under the support from the Innovation and Technology Fund (ITF) of the Government
Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping
Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping For Conclusive Characterization of Biologics Deep Protein Characterization Is Crucial Pharmaceuticals have historically been small
OplAnalyzer: A Toolbox for MALDI-TOF Mass Spectrometry Data Analysis
OplAnalyzer: A Toolbox for MALDI-TOF Mass Spectrometry Data Analysis Thang V. Pham and Connie R. Jimenez OncoProteomics Laboratory, Cancer Center Amsterdam, VU University Medical Center De Boelelaan 1117,
ProteinQuest user guide
ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for
MiSeq: Imaging and Base Calling
MiSeq: Imaging and Page Welcome Navigation Presenter Introduction MiSeq Sequencing Workflow Narration Welcome to MiSeq: Imaging and. This course takes 35 minutes to complete. Click Next to continue. Please
AB SCIEX TOF/TOF 4800 PLUS SYSTEM. Cost effective flexibility for your core needs
AB SCIEX TOF/TOF 4800 PLUS SYSTEM Cost effective flexibility for your core needs AB SCIEX TOF/TOF 4800 PLUS SYSTEM It s just what you expect from the industry leader. The AB SCIEX 4800 Plus MALDI TOF/TOF
A Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
In-Depth Qualitative Analysis of Complex Proteomic Samples Using High Quality MS/MS at Fast Acquisition Rates
In-Depth Qualitative Analysis of Complex Proteomic Samples Using High Quality MS/MS at Fast Acquisition Rates Using the Explore Workflow on the AB SCIEX TripleTOF 5600 System A major challenge in proteomics
Mass Frontier Version 7.0
Mass Frontier Version 7.0 User Guide XCALI-97349 Revision A February 2011 2011 Thermo Fisher Scientific Inc. All rights reserved. Mass Frontier, Mass Frontier Server Manager, Fragmentation Library, Spectral
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives [email protected] 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför
Industry Perspective: Advantages of Open Access and Walkup LC/ MS Supporting Protein Drug Discovery and Development
Industry Perspective: Advantages of Open Access and Walkup LC/ MS Supporting Protein Drug Discovery and Development Dawn Stickle, Agilent Technologies Originally presented by Eric Fang, Novartis Overview
Data, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
Data processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
Investigating Biological Variation of Liver Enzymes in Human Hepatocytes
Investigating Biological Variation of Liver Enzymes in Human Hepatocytes MS/MS ALL with SWATH Acquisition on the TripleTOF Systems Xu Wang 1, Hui Zhang 2, Christie Hunter 1 1 AB SCIEX, USA, 2 Pfizer, USA
RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE. Luigi Grimaudo 178627 Database And Data Mining Research Group
RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE Luigi Grimaudo 178627 Database And Data Mining Research Group Summary RapidMiner project Strengths How to use RapidMiner Operator
BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, [email protected]) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices
overview Pipeline Pilot Enterprise Server Pipeline Pilot Enterprise Server (PPES) is a powerful client-server platform that streamlines the integration and analysis of the vast quantities of data flooding
using ms based proteomics
quantification using ms based proteomics lennart martens Computational Omics and Systems Biology Group Department of Medical Protein Research, VIB Department of Biochemistry, Ghent University Ghent, Belgium
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
Per Andrén Dept. of Pharmaceutical Biosciences Medical Mass Spectrometry, Uppsala University, Uppsala, Sweden
Lung Imaging in Inhaled Product Development Novartis, Horsham, UK, 2011-11-09 Mapping Drug Deposition using Imaging Mass Spectrometry Per Andrén Dept. of Pharmaceutical Biosciences Medical Mass Spectrometry,
PperCHIP. High-Content Peptide Microarrays. Epitope Mapping & Serum Profiling Services
PepperChip High-Content Peptide Microarrays PepperMap Epitope Mapping & Serum Profiling Services PperCHIP access to custom and dard peptide microarrays up to 9,000 features per in a uniquely flexible and
Biomarker Discovery and Data Visualization Tool for Ovarian Cancer Screening
, pp.169-178 http://dx.doi.org/10.14257/ijbsbt.2014.6.2.17 Biomarker Discovery and Data Visualization Tool for Ovarian Cancer Screening Ki-Seok Cheong 2,3, Hye-Jeong Song 1,3, Chan-Young Park 1,3, Jong-Dae
Novell ZENworks 10 Configuration Management SP3
AUTHORIZED DOCUMENTATION Software Distribution Reference Novell ZENworks 10 Configuration Management SP3 10.3 November 17, 2011 www.novell.com Legal Notices Novell, Inc., makes no representations or warranties
REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])
820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor
BIOS 6660: Analysis of Biomedical Big Data Using R and Bioconductor, Fall 2015 Computer Lab: Education 2 North Room 2201DE (TTh 10:30 to 11:50 am)
BIOS 6660: Analysis of Biomedical Big Data Using R and Bioconductor, Fall 2015 Computer Lab: Education 2 North Room 2201DE (TTh 10:30 to 11:50 am) Course Instructor: Dr. Tzu L. Phang, Assistant Professor
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
SYLLABUSES FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (applicable to students admitted in the academic year 2015-2016 and thereafter)
MSc(CompSc)-1 (SUBJECT TO UNIVERSITY S APPROVAL) SYLLABUSES FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (applicable to students admitted in the academic year 2015-2016 and thereafter) The curriculum
Software and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University
Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation
Exercise with Gene Ontology - Cytoscape - BiNGO
Exercise with Gene Ontology - Cytoscape - BiNGO This practical has material extracted from http://www.cbs.dtu.dk/chipcourse/exercises/ex_go/goexercise11.php In this exercise we will analyze microarray
DMPK: Experimentation & Data
DMPK: Experimentation & Data Interpretation Mingshe Zhu, Mike S. Lee, Naidong Weng, and Mark Hayward Prerequisite: Entry-level scientists with hands on experience in LC/MS as well as advanced students
CPAS Overview. Josh Eckels LabKey Software [email protected]
CPAS Overview Josh Eckels LabKey Software [email protected] CPAS Web-based system for processing, storing, and analyzing results of MS/MS experiments Key goals: Provide a great analysis front-end for
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La
SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La References Alejandro Cruz-Marcelo, Rudy Guerra, Marina Vannucci, Yiting Li, Ching C. Lau, and Tsz-Kwong Man. Comparison of algorithms for pre-processing
