Databases and platforms for data analysis from NGS of MTB
|
|
|
- Charla Thompson
- 10 years ago
- Views:
Transcription
1 Databases and platforms for data analysis from NGS of MTB Derrick Crook MMM Consortium
2 MMM Consortium Linking Clinical record systems and NHS databases Translating next generation sequencing for patient benefit Sequenced > 25,000 isolates: 7,500 C. difficile 7,500 S. aureus 2,600 TB 2,500 E. coli/klebsiella spp 2000 Grp B Streptococcus ~ 3000 other (including viruses)
3 Storing, searching and analysing the data locally Growing problems with managing sequence data and linking it to meta-data (quality statistics and organism/patient specific data) Obstacles to automated processing on a large and even national scale Just SQL database storage system not suitable Replaced with a non-sql system using Casandra hosted on an Amazon cloud like technology (i.e. Eucalyptus) including Hadoop and MapReduce Store minimal de-identified metadata on the genomics datastore and have architecture to link back to clinical records Plan to follow the model being proposed by Genome England (GeL) for the genomes project
4 Open access and sharing of data? 1. What data to deposit 2. When to deposit 3. Where to deposit How to use the publically accessible data Some questions need little data (these may be the most important now) Other questions need rich data (these are more important in the future) There is good progress on depositing next generation sequencing data through established portals
5 A group assembled by the FDA and DTU are doing what we need About Global Microbial Identifier The genomic epidemiological database for global identification of microorganisms or global identifier of microorganisms is a platform for storing whole genome sequencing (WGS) data of microorganisms, for the identification of relevant genes and for the comparison of genomes to detect outbreaks and emerging pathogens. The database holds two types of information: 1) genomic information of microorganisms, linked to, 2) metadata of those microorganism such as epidemiological details.
6 Three funded international archives GenBank NIH genetic sequence database (NCBI) ENA European Nucleotide Archive (EMBL EBI) DDBJ DNA Data Bank of Japan International Nucleotide Sequence Database Collaboration NCBI is developing a automated uploader of data with meta-data; will be incorporated by ENA and DDBJ
7 NCBI prototypic schema sample name attribute package unique ID for the sample Indicate the type of pathogen. Allowed values are "clinical or host-associated pathogen" or "environmental, food or other pathogen". Value provided in this field drives validation of other fields. organism strain collection_date collected-by isolation-source geo_loc_name lat_lon specific_host host-disease scientific name of the organism that provided the sequenced genetic material- expect genus species strain/isolate from which sequence was obtained Date of sampling, in "DD-Mmm-YYYY", "Mmm-YYYY" or "YYYY" format (single instance, eg., 05-Oct-1990, Oct or 1990) or ISO 8601 standard "YYYY-mm-dd" or "YYYY-mm-ddThh:mm:ss" (eg or T14:41:36) Name of the person or lab who collected the sample. Describes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived. Geographical origin of the sample Report values in decimal degrees and in WGS84 system Required for 'clinical or host-associated pathogen' sample type- Taxid or organism name of host Required for 'clinical or host-associated pathogen' sample type- Name of relevant disease, e.g. Salmonella gastroenteritis. Controlled vocabulary, or Bill Klimke is the contact person: Klimke, Bill (NIH/NLM/NCBI) [E]
8 NCBI prototypic schema
9 NCBI prototypic schema
10 One possible approach Source data for upload to archive when published or other agreed stage with minimal set of metadata Research group 1 International Nucleotide Sequence Database Collaboration Research group 2 Industry e.g. Whole UK National/International agency
11 Platforms for analysis Simple objectives e.g. : Species identification Resistance prediction Relatedness (genomic match) Each needs a knowledge base Each will have a design, which will vary according to e.g. question, sequencing platform, method of assembly (mapped or de novo), method of querying the knowledge base etc.
12 Platforms for analysis These, at present, will be software needing high performance computing (assembly, statistical genetic and machine learning methodologies) Research vs Service Three key endeavours for success for public health Resistance prediction with high sensitivity and specificity which is continuously updated Rapid identification of transmission chains Rapid cheap processing using light weight compute
13 More complex questions Strain specific factors determining disease manifestation e.g. Latent vs active Pulmonary vs other Species adaptation (e.g. M. bovis) Will need development of new statistical genetic methodologies Clinical response etc Analysing vast amounts of data i.e. many thousands or even millions of genomes This moves into the field of Big Data
14 Acknowledgements PHE and Gel Jim Davies (Director informatics Gel) Philip Monk (lead for TB genomes project) Grace Smith (head TB reference lab Birmingham) International advisor/s Stefan Niemann Oxford University Analysis: Zamin Iqbal Daniel Wilson David Clifton Sarah Walker Tim Walker Tim Peto High performance computing: Jim Davies Charles Crichton
Establishing a Whole Genome Sequence-Based national network for the detection and traceback of Foodborne Pathogens
Establishing a Whole Genome Sequence-Based national network for the detection and traceback of Foodborne Pathogens Marc W. Allard Ph.D. Microbiologist, Division of Microbiology, ORS, Center for Food Safety
Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk
Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes
Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory. Kathie Grant Gastrointestinal Bacteria Reference Unit
Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory Kathie Grant Gastrointestinal Bacteria Reference Unit 16 th June 2014 Planning for Implementation of WGS 2011-2014
Module 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
NIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1
MEMORANDUM TO: Principal Investigators and Research Staff DATE: 2/22/15 FROM: Anne Klibanski, MD, Partners Chief Academic Officer (CAO) Paul Anderson, MD, PhD, BWH CAO Harry Orf, PhD, MGH Sr. Vice President-Research
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility Report of a meeting organized by the Wellcome Trust and held on 14 15 January 2003 at Fort Lauderdale,
Molecular typing of VTEC: from PFGE to NGS-based phylogeny
Molecular typing of VTEC: from PFGE to NGS-based phylogeny Valeria Michelacci 10th Annual Workshop of the National Reference Laboratories for E. coli in the EU Rome, November 5 th 2015 Molecular typing
Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools.
Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools. Empowering microbial genomics. Extensive methods. Expansive possibilities. In microbiome studies
The 100,000 genomes project
The 100,000 genomes project Tim Hubbard @timjph Genomics England King s College London, King s Health Partners Wellcome Trust Sanger Institute ClinGen / Decipher Washington DC, 26 th May 2015 The 100,000
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection
EFSA Scientific Colloquium n 20 Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection Parma, Italy, 16-17 June 2014 Why WGS based approach Infectious diseases are responsible
Typing in the NGS era: The way forward!
Typing in the NGS era: The way forward! Valeria Michelacci NGS course, June 2015 Typing from sequence data NGS-derived conventional Multi Locus Sequence Typing (University of Warwick, 7 housekeeping genes)
Genome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome
Module 2 Genome Viewing Using Genome Browsers to View Annotation of the Human Genome Bert Overduin, Ph.D. PANDA Coordination & Outreach EMBL - European Bioinformatics Institute Wellcome Trust Genome Campus
Core Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: [email protected]
Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action
Les Rencontres de L INRA Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action E Albina (CIRAD) / S Guyomard(Institut Pasteur) Guadeloupe The era
Provisioning robust automated analytical pipelines for whole genome-based public health microbiological typing
Provisioning robust automated analytical pipelines for whole genome-based public health microbiological typing Anthony Underwood Bioinformatics Unit, Infectious Disease Informatics, Microbiological Services,
The National Antimicrobial Resistance Monitoring System (NARMS)
The National Antimicrobial Resistance Monitoring System (NARMS) Strategic Plan 2012-2016 Table of Contents Background... 2 Mission... 3 Overview of Accomplishments, 1996-2011... 4 Strategic Goals and Objectives...
General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List
General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List GSA Schedule 66 Scientific Equipment and Services SIN 66-1000 Professional Scientific Services IHRC,
Workshop Rapid NGS for Public Health Microbiology
Medical Microbial Genomics Centre Workshop Rapid NGS for Public Health Microbiology March 4 th 8 th, 2013; Univ. Münster Sponsors In co-operation with MMGC 2013 1 Objectives and Organization Objectives
Putting Genomes in the Cloud with WOS TM. ddn.com. DDN Whitepaper. Making data sharing faster, easier and more scalable
DDN Whitepaper Putting Genomes in the Cloud with WOS TM Making data sharing faster, easier and more scalable Table of Contents Cloud Computing 3 Build vs. Rent 4 Why WOS Fits the Cloud 4 Storing Sequences
Data Registry Workshop Report
Data Registry Workshop Report Background A Joint Working Group on Data Sharing and Archiving (JWG), representing major professional societies that publish ecology, evolution, and organismal biology journals,
Use of Whole Genome. of food-borne pathogens for public health protection. Efsa Scientific Colloquium Summary Report. 16-17 June 2014, Parma, Italy
20 Efsa Scientific Colloquium Summary Report ISSN 2363-2240 Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection 16-17 June 2014, Parma, Italy EFSA Scientific Colloquium
Implementation of a near real-time phylogenetic monitoring program for HIV transmission outbreaks
Implementation of a near real-time phylogenetic monitoring program for HIV transmission outbreaks Art FY Poon 1,2, Conan K Woods 1, Susan Shurgold 1, Guillaume Colley 1, Robert S Hogg 1,3, Mel Krajden
Managing and Conducting Biomedical Research on the Cloud Prasad Patil
Managing and Conducting Biomedical Research on the Cloud Prasad Patil Laboratory for Personalized Medicine Center for Biomedical Informatics Harvard Medical School SaaS & PaaS gmail google docs app engine
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
Next Generation Sequencing in Public Health Laboratories. 2014 Survey Results
Next Generation Sequencing in Public Health Laboratories 2014 Survey Results MAY 2015 This project was 100% funded with federal funds from a federal program of $215,972. This publication was supported
A Service for Data-Intensive Computations on Virtual Clusters
A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King [email protected] Planets Project Permanent
Survey of clinical data mining applications on big data in health informatics
Survey of clinical data mining applications on big data in health informatics Matthew Herland, Taghi M. Khoshgoftaar, and Randall Wald 劉 俊 成 Survey of clinical data mining applications on big data in health
A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
OpenCB a next generation big data analytics and visualisation platform for the Omics revolution
OpenCB a next generation big data analytics and visualisation platform for the Omics revolution Development at the University of Cambridge - Closing the Omics / Moore s law gap with Dell & Intel Ignacio
Lessons from the Stanford HIV Drug Resistance Database
1 Lessons from the Stanford HIV Drug Resistance Database Bob Shafer, MD Department of Medicine and by Courtesy Pathology (Infectious Diseases) Stanford University Outline 2 Goals and rationale for HIVDB
European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute
European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute Justin Paschall Team Leader Genetic Variation / EGA ! European Genome-phenome
NIH s Genomic Data Sharing Policy
NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific
FACULTY OF MEDICAL SCIENCE
Doctor of Philosophy Program in Microbiology FACULTY OF MEDICAL SCIENCE Naresuan University 171 Doctor of Philosophy Program in Microbiology The time is critical now for graduate education and research
P. ramorum diagnostics - update. USDA APHIS PPQ CPHST March, 2006
P. ramorum diagnostics - update USDA APHIS PPQ CPHST March, 2006 The Times (UK) 11/17/05 Provisional Approval Process Protocol still evolving and improving > 20 labs in system 7 approved: CDFA, ODA, Oregon
Twister4Azure: Data Analytics in the Cloud
Twister4Azure: Data Analytics in the Cloud Thilina Gunarathne, Xiaoming Gao and Judy Qiu, Indiana University Genome-scale data provided by next generation sequencing (NGS) has made it possible to identify
A review of the evidence for pathogenic contamination in the nonhospital. implications for infection transmission
A review of the evidence for pathogenic contamination in the nonhospital environment and implications for infection transmission Elizabeth Scott Simmons College, Boston Simmons Center for Hygiene and Health
Personalized Medicine and IT
Personalized Medicine and IT Data-driven Medicine in the Age of Genomics www.intel.com/healthcare/bigdata Ketan Paranjape General Manager, Life Sciences Intel Corp. @Portlandketan 1 The Central Dogma of
Big Data Analytics and Healthcare
Big Data Analytics and Healthcare Anup Kumar, Professor and Director of MINDS Lab Computer Engineering and Computer Science Department University of Louisville Road Map Introduction Data Sources Structured
CCR Biology - Chapter 9 Practice Test - Summer 2012
Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible
Committee on WIPO Standards (CWS)
E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
Bioinformatics: course introduction
Bioinformatics: course introduction Filip Železný Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab http://ida.felk.cvut.cz
Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/
Cloud-Based Big Data Analytics in Bioinformatics
Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large
Delivering the power of the world s most successful genomics platform
Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE
Hadoopizer : a cloud environment for bioinformatics data analysis
Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) [email protected], INRIA/Irisa, Campus de Beaulieu, 35042,
NIAID Genomics and Bioinformatics Programs
NIAID Genomics and Bioinformatics Programs Vivien G. Dugan, PhD Office of Genomics and Advanced Technologies (OGAT) Division of Microbiology and Infectious Diseases NIAID, NIH, DHHS [email protected]
Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
Big Data a threat or a chance?
Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but
SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications
Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each
Biotechnology: DNA Technology & Genomics
Chapter 20. Biotechnology: DNA Technology & Genomics 2003-2004 The BIG Questions How can we use our knowledge of DNA to: diagnose disease or defect? cure disease or defect? change/improve organisms? What
Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm
Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm Peter Gerner-Smidt, MD, ScD Enteric Diseases Laboratory Branch EFSA Scientific Colloquium N o
6 ELIXIR Domain Specific Services
6 ELIXIR Domain Specific Services Work stream leads: Alfonso Valencia (ES), Inge Jonassen (NO), Jose Leal (PT) Work stream members: Nils-Peder Willassen (NO), Finn Drablos (NO), Mark Viant (UK), Ferran
2009 Atlanta Conference on Science, Technology and Innovation Policy. The Role of Information Technology in Medical Research
2009 Atlanta Conference on Science, Technology and Innovation Policy The Role of Information Technology in Medical Research Daniel Castro October 2009 2009 IEEE. This material is posted here with permission
Multi-locus sequence typing (MLST) of C. jejuni infections in the United States Patrick Kwan, PhD
Multi-locus sequence typing (MLST) of C. jejuni infections in the United States Patrick Kwan, PhD National Campylobacter and Helicobacter Reference Laboratory Enteric Diseases Laboratory Branch Centers
Cloud Computing for Scientific Research
Cloud Computing for Scientific Research The NIH Nephele Project for Microbiome Analysis On behalf of: Yentram Huyen, Ph.D., Chief Nick Weber, Scientific Computing Project Manager Bioinformatics and Computational
School of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
UCLA Team Sequences Cell Line, Puts Open Source Software Framework into Production
Page 1 of 6 UCLA Team Sequences Cell Line, Puts Open Source Software Framework into Production February 05, 2010 Newsletter: BioInform BioInform - February 5, 2010 By Vivien Marx Scientists at the department
Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh
1 Hadoop: A Framework for Data- Intensive Distributed Computing CS561-Spring 2012 WPI, Mohamed Y. Eltabakh 2 What is Hadoop? Hadoop is a software framework for distributed processing of large datasets
Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology
Lecture 13: DNA Technology DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology DNA Sequencing determine order of nucleotides in a strand of DNA > bases = A,
Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344
Where We Are Introduction to Data Management CSE 344 Lecture 25: DBMS-as-a-service and NoSQL We learned quite a bit about data management see course calendar Three topics left: DBMS-as-a-service and NoSQL
Innovations in Molecular Epidemiology
Innovations in Molecular Epidemiology Molecular Epidemiology Measure current rates of active transmission Determine whether recurrent tuberculosis is attributable to exogenous reinfection Determine whether
Principles of Disease and Epidemiology. Copyright 2010 Pearson Education, Inc.
Principles of Disease and Epidemiology Pathology, Infection, and Disease Disease: An abnormal state in which the body is not functioning normally Pathology: The study of disease Etiology: The study of
Course Specification
1 Course Specification Program on which the course is given: Department offering the program: Department offering the course: Academic year /level: Date of specification approval: 2008/2009 Masters Degree
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future Daniel Masys, MD Professor and Chair Department of Biomedical Informatics Professor of Medicine Vanderbilt
GenomeTrakr: A Pathogen Database
GenomeTrakr: A Pathogen Database Marc W. Allard, PhD Senior Biomedical Research Services Officer Division of Microbiology Eric W. Brown, PhD Director Division of Microbiology FAO Expert workshop to develop
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料
Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置
The Human Genome Project. From genome to health From human genome to other genomes and to gene function Structural Genomics initiative
The Human Genome Project From genome to health From human genome to other genomes and to gene function Structural Genomics initiative June 2000 What is the Human Genome Project? U.S. govt. project coordinated
Searching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
DATAOPT SOLUTIONS. What Is Big Data?
DATAOPT SOLUTIONS What Is Big Data? WHAT IS BIG DATA? It s more than just large amounts of data, though that s definitely one component. The more interesting dimension is about the types of data. So Big
GenBank: A Database of Genetic Sequence Data
GenBank: A Database of Genetic Sequence Data Computer Science 105 Boston University David G. Sullivan, Ph.D. An Explosion of Scientific Data Scientists are generating ever increasing amounts of data. Relevant
Immunology Ambassador Guide (updated 2014)
Immunology Ambassador Guide (updated 2014) Immunity and Disease We will talk today about the immune system and how it protects us from disease. Also, we ll learn some unique ways that our immune system
A Scalable Data Transformation Framework using the Hadoop Ecosystem
A Scalable Data Transformation Framework using the Hadoop Ecosystem Raj Nair Director Data Platform Kiru Pakkirisamy CTO AGENDA About Penton and Serendio Inc Data Processing at Penton PoC Use Case Functional
European Medicines Agency
European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein
Intro to Bioinformatics
Intro to Bioinformatics Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Sarah A Pendergrass, PhD Research Associate
Challenges in data acquisition, storage and processing for NIH funded studies
UAB Research Computing Day September 15, 2011 Challenges in data acquisition, storage and processing for NIH funded studies Stephen Barnes, PhD Department of Pharmacology & Toxicology and the Targeted
