Public Health Laboratory Workforce Development Bioinformatics

Size: px
Start display at page:

Download "Public Health Laboratory Workforce Development Bioinformatics"

Transcription

1 Public Health Laboratory Workforce Development Bioinformatics Templates for Course Development Contents Overview... 1 Going Beyond the Introductory Courses... 1 Course Templates... 3 Template 1: Introduction to Bioinformatics for Public Health Laboratorians... 3 Day Day Template 2: Hands-On Introduction to Linux for PHL Bioinformaticians... 5 Template 3: Introduction to Next-Generation Sequencing and Bioinformatics for Epidemiologists... 6 Overview This document is provided as an example of curricula for introductory courses on bioinformatics for public health laboratory and epidemiology staff. Public health departments planning training for their staff may wish to use this as a basis for developing their own course series. The materials included here are intended to be used as templates as examples of how introductory courses might be organized. It is anticipated that health departments choosing to use these will adapt them to their own setting and to meet the special needs of their own staff. Other health departments may wish to develop their own curricula from scratch. CDC doesn t endorse a particular approach to curriculum development, except to emphasize that it needs to be tailored to account for the existing skill level of the staff and the available resources for teaching the course. Going Beyond the Introductory Courses For individual learning, there are numerous online courses, some paid, some for free, in a wide variety of formats. Results of a 2015 survey of online options is available on CDC s AMD website ( 1

2 In addition, CDC encourages health departments to work with local academic and other institutions to develop course options customized to the needs of their staff. For examples of potential topics for courses, refer to the IHRC web site ( which shows 2- and 4-day courses developed by IHRC and the faculty at Georgia Tech for trainings at CDC [note: CDC is including the website as an example and is NOT endorsing the use of this or any other specific vendor to develop the training]. 2

3 Course Templates Template 1: Introduction to Bioinformatics for Public Health Laboratorians Format: two days, lectures The table below provides an example curriculum for a 2-day introduction to bioinformatics. It is aimed at microbiologists who have little or no bioinformatics training and is based on experience at CDC in offering such training for laboratory staff. Note that the two days are somewhat intense, and consist entirely of lectures. While a 2-day course is not unreasonable, where feasible, some groups may wish to divide this into two separate one-day courses. Another option is to divide the second day into two half-days, using the rest of the time for another purpose, such as hands-on training. As with all templates in this document, this is meant to be modified as much as needed to meet local needs. Time 75m Day 1 Topic Basic Pathogen Genomics have a basic understanding of the pathogen genomics to be used throughout the course Sequencing Technology--Library prep to raw NGS data have a basic understanding of sequencing technologies, including both Sanger and NGS understand the major advantages and disadvantages of each technology have an understanding of basic workflows for DNA-seq, RNA-seq, metagenomics sequencing have a sense library prep technologies for NGS (single-end and paired-end) know the file formats generated by these technologies know terminology commonly used in NGS Genome Assembly raw NGS data to assembled contigs be able to explain different metrics used to assess NGS sequence quality be able to explain why trimming and filtering raw NGS data is necessary and how it can affect final assembly be able to describe the process of assembling reads into contigs be able to describe what metrics are used to assess assembly quality (coverage, N50, etc.) Lunch Databases and Searches be able to describe the basic algorithm behind BLAST (words, HSP, e-value) be able to list commonly used databases for dna, protein, functional annotation, metabolic pathways) Sequence Alignments be able to describe methods used to align 2 sequences and how that method can be extended to multiple sequences (DNA and Protein) 3

4 90m Genomics be able to explain what an alignment is and what SNPs and InDels are be able to list and describe current molecular typing techniques (PFGE, MLST, MLVA, VNTR, SNP, wgmlst) explain (traditional) MLST, and wgmlst how Kmer based analysis is used to find SNPs and what advantage/disadvantages it has over mgmlst Day 2 Metagenomics be able to explain different types of metagenomics approaches be able to describe several tools used to analyze metagenomic data Phylogenetic Reconstruction be able to explain different type of trees, both in terms of the data underlying the trees and the tree topology know how to interpret key aspects of trees topology, branch lengths, bootstrap values, etc. understand implications of tree structure and how to interpret clustering within a tree Gene Finding and Function Assignments be able to describe the two main methods for gene finding be able to list some examples of these methods be able to describe methods for functional assignments be able to list several of the databases used in functional annotations Sequence Visualization be able to describe examples of genome browsers commonly used be able to list the types of data that can be displayed on genome browsers Lunch Phenotypic Inference understand how phenotype is inferred from sequence be able to cite examples of phenotypic characteristics that can be inferred from sequence understand the limitations of phenotypic inference Molecular Epidemiology (case studies/examples) be able to describe some examples of how CDC and the PHLs are using molecular epidemiology to detect, investigate, and track outbreaks Commonly Used Bioinformatics Software be able to list some of the commonly used bioinformatics tools for different analytical tasks 4

5 Template 2: Hands-On Introduction to Linux for PHL Bioinformaticians Format: one day, hands-on training using the Linux OS This course is designed as a hands-on training for staff who understand the basics of bioinformatics and wish to gain some facility with more advanced software, much of which is open source and freely available, but which requires an understanding of Linux and the use of the command line. This is typically not a course for all laboratory staff, but is an excellent option for those who want or need to have access to more sophisticated analytic tools. Each student in the course will need access to a computer with Linux OS (most common distributions should work) and, if the HPC section is included, will need remote access to an HPC cluster. For this reason, a computer training laboratory would be an excellent site for the training. As an alternative to this course, health departments that have licensed commercial bioinformatics software might consider training for that software. 90m 90m Basic LINUX be able to execute simple LINUX commands (directory listings, create a directory, change directories, move, copy, and view files) be able to get the manual pages for LINUX commands be able to describe what a linux desktop environment is move files from PC to LINUX use a simple text editor be able to describe the difference between relative and absolute paths Intermediate LINUX be able to demonstrate the use of file permissions and how to change them be able to demonstrate how to redirect and pipe output be able to demonstrate the use of wildcards be able to demonstrate the use of grep have a basic understanding of the LINUX system file structure be able to describe what an environment variable is be able to tar and compress a directory and contents Lunch Scripting be able to write a basic bash script and execute be able to describe how scripts are used in building analysis pipelines 5

6 90m HPC Cluster Computing be able to describe how cluster computing environments can used in analysis applications be able to describe what a queueing system is and how they work be able to describe how applications submit jobs to queueing systems and how to retrieve results from the analysis be able to submit a job to a queue using qsub be able to check status, delete, and list details of their jobs using various q commands be able to create a simple qsub script and execute Template 3: Introduction to Next-Generation Sequencing and Bioinformatics for Epidemiologists Format: one day, lectures and structured case studies The table below shows a one-day course for epidemiologists. The course is based on a curriculum developed by microbiologists and epidemiologists at CDC and is scheduled to be offered several times in Because this course is taught by CDC staff, slides should be available for use elsewhere. The template includes case studies during the afternoon. Ideally, these should be based on local experience. Time Introduction Basic Pathogen Genomics have a basic understanding of the pathogen genomics to be used throughout the course DNA and RNA Typical genome sizes and characteristics o Viruses o Bacteria o Eukaryotes Special topics o Plasmids o Phages o Other mobile elements o Recombination Other important technologies o DNA/RNA extraction o PCR; real-time PCR 6

7 45m Sequencing Technology--Library prep to raw NGS data have a basic understanding of sequencing technologies, including both Sanger and NGS understand the major advantages and disadvantages of each technology have a sense library prep technologies for NGS know terminology commonly used in NGS Sanger sequencing how it works, limitations of the technology NGS technology how it works; advantages and disadvantages of different technologies o Illumina (Note: plan to spend most of time on this, given its role at CDC and in state labs), including how samples are multiplexed on a run o Other technologies Ion Torrent PacBio Oxford Nanopore Library prep for NGS o Enzymatic vs physical o Walk-through of typical enzymatic prep o Terminology Genome Assembly raw NGS data to assembled contigs be able to explain different methods for processing raw NGS data into contigs have a sense of how bioinformaticians accomplish this assembly Consensus sequence vs. variants Sequence Comparison understand what an alignment is be able to explain (traditional) MLST and wgmlst understand what a SNP analysis is and some of the considerations that go into SNP analysis What is an alignment? MLST What it is and why it was used for bacterial classification wgmlst o How this differs from traditional MLST o Automated vs curated o Flavors of wgmlst (cgmlst, etc.) SNP analysis o What is a SNP? o What needs to be masked out in a SNP analysis o Effect of reference sequence on results SNPs vs wgmlst advantages and disadvantages 7

8 Phenotypic Inference understand how phenotype is inferred from sequence be able to cite examples of phenotypic characteristics that can be inferred from sequence understand the limitations of phenotypic inference Common types of phenotypic inference o antibiotic and antiviral resistance o serotype o virulence How phenotypic inference is done (i.e., generally by comparisons with databases, not by heuristics) Limitations of phenotypic inference need for validation Lunch 45m Trees and Clusters be able to explain different type of trees, both in terms of the data underlying the trees and the tree topology know how to interpret key aspects of trees topology, branch lengths, etc. understand implications of tree structure and how to interpret clustering within a tree Basics of trees o Sequence-based vs distance-based o Rooted vs. unrooted o Formats: rectangular, diagonal, polar, radial, etc. Distance-based trees o Nucleotide substitution models (what they are and how they affect trees; not the details) o How neighbor-joining trees work Sequence-based tree example: maximum likelihood tree Branch lengths what do they mean? Numbers at nodes what do those mean? How to interpret Special case: Bayesian tree, and what to infer from one Case Studies 1 8

9 NGS: applications, pathogens and paradigms be able to explain what the AMD program is understand that the application of NGS goes beyond cluster detection be able to cite several examples of how NGS is being used in public health be able to explain what capacity state and local health departments are developing in NGS The AMD program what it is Examples of applications of NGS at CDC o Flu inferring phenotype to improve efficiency and improve vaccine strain selection o Streptococcus inferring phenotype to improve efficiency and to make phenotyping (including serotyping) more widely available o Hepatitis C the importance of quasispecies o Pertussis understanding emergence of a pathogen o HIV understanding transmission, integrating genomic and epi data o URDO Seq a new paradigm for solving outbreaks NGS in state health departments what pathogens will states be typing in the near future Case Studies 2 Other Technologies and Omics be familiar with a few other important AMD technologies be able to name several other important -omics fields Other AMD technologies o MALDI-TOF o Optical Mapping o Synthetic long-reads Metagenomics o What is metagenomics? o The microbiome o Metagenomics for diagnosis (examples: TB, foodborne) Other Omics (give several examples with very brief explanation) Case Studies 3 Wrap-up, evaluation Include state health department perspective on this how will this technology impact state health lab operations, what pathogens do they see as priorities 9

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology Programme Structure - the MSc outcome will require 180 credits total (full-time only) - 60

More information

Molecular typing of VTEC: from PFGE to NGS-based phylogeny

Molecular typing of VTEC: from PFGE to NGS-based phylogeny Molecular typing of VTEC: from PFGE to NGS-based phylogeny Valeria Michelacci 10th Annual Workshop of the National Reference Laboratories for E. coli in the EU Rome, November 5 th 2015 Molecular typing

More information

Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection

Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection EFSA Scientific Colloquium n 20 Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection Parma, Italy, 16-17 June 2014 Why WGS based approach Infectious diseases are responsible

More information

Next Generation Sequencing in Public Health Laboratories. 2014 Survey Results

Next Generation Sequencing in Public Health Laboratories. 2014 Survey Results Next Generation Sequencing in Public Health Laboratories 2014 Survey Results MAY 2015 This project was 100% funded with federal funds from a federal program of $215,972. This publication was supported

More information

Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory. Kathie Grant Gastrointestinal Bacteria Reference Unit

Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory. Kathie Grant Gastrointestinal Bacteria Reference Unit Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory Kathie Grant Gastrointestinal Bacteria Reference Unit 16 th June 2014 Planning for Implementation of WGS 2011-2014

More information

Bacterial Next Generation Sequencing - nur mehr Daten oder auch mehr Wissen? Dag Harmsen Univ. Münster, Germany dharmsen@uni-muenster.

Bacterial Next Generation Sequencing - nur mehr Daten oder auch mehr Wissen? Dag Harmsen Univ. Münster, Germany dharmsen@uni-muenster. Bacterial Next Generation Sequencing - nur mehr Daten oder auch mehr Wissen? Dag Harmsen Univ. Münster, Germany dharmsen@uni-muenster.de Commercial Disclosure Dag Harmsen is co-founder and partial owner

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E

More information

2 Short biographies and contact information of the workshop organizers

2 Short biographies and contact information of the workshop organizers 1 Title of the workshop from sequence to surveillance 2 Short biographies and contact information of the workshop organizers Dr Peter Durr - peter.durr@csiro.au Veterinary epidemiologist, Australian Animal

More information

Phylogenetic Trees Made Easy

Phylogenetic Trees Made Easy Phylogenetic Trees Made Easy A How-To Manual Fourth Edition Barry G. Hall University of Rochester, Emeritus and Bellingham Research Institute Sinauer Associates, Inc. Publishers Sunderland, Massachusetts

More information

Typing in the NGS era: The way forward!

Typing in the NGS era: The way forward! Typing in the NGS era: The way forward! Valeria Michelacci NGS course, June 2015 Typing from sequence data NGS-derived conventional Multi Locus Sequence Typing (University of Warwick, 7 housekeeping genes)

More information

Delivering the power of the world s most successful genomics platform

Delivering the power of the world s most successful genomics platform Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT

Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT Building Bioinformatics Capacity in Africa Nicky Mulder CBIO Group, UCT Outline What is bioinformatics? Why do we need IT infrastructure? What e-infrastructure does it require? How we are developing this

More information

Next Generation Sequencing

Next Generation Sequencing Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977

More information

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing

More information

E. coli plasmid and gene profiling using Next Generation Sequencing

E. coli plasmid and gene profiling using Next Generation Sequencing E. coli plasmid and gene profiling using Next Generation Sequencing Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Introduction General

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment

Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment Tutorial for Windows and Macintosh Preparing Your Data for NGS Alignment 2015 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) 1.734.769.7249

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

An introduction to bioinformatic tools for population genomic and metagenetic data analysis, 2.5 higher education credits Third Cycle

An introduction to bioinformatic tools for population genomic and metagenetic data analysis, 2.5 higher education credits Third Cycle An introduction to bioinformatic tools for population genomic and metagenetic data analysis, 2.5 higher education credits Third Cycle Faculty of Science; Department of Marine Sciences The Swedish Royal

More information

Nazneen Aziz, PhD. Director, Molecular Medicine Transformation Program Office

Nazneen Aziz, PhD. Director, Molecular Medicine Transformation Program Office 2013 Laboratory Accreditation Program Audioconferences and Webinars Implementing Next Generation Sequencing (NGS) as a Clinical Tool in the Laboratory Nazneen Aziz, PhD Director, Molecular Medicine Transformation

More information

Introduction to next-generation sequencing data

Introduction to next-generation sequencing data Introduction to next-generation sequencing data David Simpson Centre for Experimental Medicine Queens University Belfast http://www.qub.ac.uk/research-centres/cem/ Outline History of DNA sequencing NGS

More information

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List GSA Schedule 66 Scientific Equipment and Services SIN 66-1000 Professional Scientific Services IHRC,

More information

Next Generation Sequencing: Technology, Mapping, and Analysis

Next Generation Sequencing: Technology, Mapping, and Analysis Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took

More information

LifeScope Genomic Analysis Software 2.5

LifeScope Genomic Analysis Software 2.5 USER GUIDE LifeScope Genomic Analysis Software 2.5 Graphical User Interface DATA ANALYSIS METHODS AND INTERPRETATION Publication Part Number 4471877 Rev. A Revision Date November 2011 For Research Use

More information

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6

Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 In the last lab, you learned how to perform basic multiple sequence alignments. While useful in themselves for determining conserved residues

More information

Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools.

Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools. Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools. Empowering microbial genomics. Extensive methods. Expansive possibilities. In microbiome studies

More information

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

NGS data analysis. Bernardo J. Clavijo

NGS data analysis. Bernardo J. Clavijo NGS data analysis Bernardo J. Clavijo 1 A brief history of DNA sequencing 1953 double helix structure, Watson & Crick! 1977 rapid DNA sequencing, Sanger! 1977 first full (5k) genome bacteriophage Phi X!

More information

Module 1. Sequence Formats and Retrieval. Charles Steward

Module 1. Sequence Formats and Retrieval. Charles Steward The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.

More information

Analysis of ChIP-seq data in Galaxy

Analysis of ChIP-seq data in Galaxy Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers

More information

Use of Whole Genome. of food-borne pathogens for public health protection. Efsa Scientific Colloquium Summary Report. 16-17 June 2014, Parma, Italy

Use of Whole Genome. of food-borne pathogens for public health protection. Efsa Scientific Colloquium Summary Report. 16-17 June 2014, Parma, Italy 20 Efsa Scientific Colloquium Summary Report ISSN 2363-2240 Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection 16-17 June 2014, Parma, Italy EFSA Scientific Colloquium

More information

SMRT Analysis v2.2.0 Overview. 1. SMRT Analysis v2.2.0. 1.1 SMRT Analysis v2.2.0 Overview. Notes:

SMRT Analysis v2.2.0 Overview. 1. SMRT Analysis v2.2.0. 1.1 SMRT Analysis v2.2.0 Overview. Notes: SMRT Analysis v2.2.0 Overview 100 338 400 01 1. SMRT Analysis v2.2.0 1.1 SMRT Analysis v2.2.0 Overview Welcome to Pacific Biosciences' SMRT Analysis v2.2.0 Overview 1.2 Contents This module will introduce

More information

DNA Sequencing and Personalised Medicine

DNA Sequencing and Personalised Medicine DNA Sequencing and Personalised Medicine Mick Watson Director of ARK-Genomics The Roslin Institute PERSONALISED MEDICINE What is personalised medicine? Personalized Medicine refers to the tailoring of

More information

An introduction to bioinformatic tools for metagenetic and population genomic data analysis, 2.0 higher education credits

An introduction to bioinformatic tools for metagenetic and population genomic data analysis, 2.0 higher education credits An introduction to bioinformatic tools for metagenetic and population genomic data analysis, 2.0 higher education credits Course period: 3-7 November 2014 Course leaders / Addresses for applications: Pierre

More information

The University is comprised of seven colleges and offers 19. including more than 5000 graduate students.

The University is comprised of seven colleges and offers 19. including more than 5000 graduate students. UNC CHARLOTTE A doctoral, research-intensive university, UNC Charlotte is the largest institution of higher education in the Charlotte region. The University is comprised of seven colleges and offers 19

More information

HP-UX Essentials and Shell Programming Course Summary

HP-UX Essentials and Shell Programming Course Summary Contact Us: (616) 875-4060 HP-UX Essentials and Shell Programming Course Summary Length: 5 Days Prerequisite: Basic computer skills Recommendation Statement: Student should be able to use a computer monitor,

More information

Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University

Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary

More information

CLOSHA MANUAL ver1.1. KOBIC (Korean Bioinformation Center) kogun82@kribb.re.kr 2016-05-08. Bioinformatics Workflow management System in Bio-Express

CLOSHA MANUAL ver1.1. KOBIC (Korean Bioinformation Center) kogun82@kribb.re.kr 2016-05-08. Bioinformatics Workflow management System in Bio-Express CLOSHA MANUAL ver1.1 Bioinformatics Workflow management System in Bio-Express Cloud Services for Massive Sequencing Data Analysis KOBIC (Korean Bioinformation Center) kogun82@kribb.re.kr 2016-05-08 1 1.

More information

FACULTY OF MEDICAL SCIENCE

FACULTY OF MEDICAL SCIENCE Doctor of Philosophy Program in Microbiology FACULTY OF MEDICAL SCIENCE Naresuan University 171 Doctor of Philosophy Program in Microbiology The time is critical now for graduate education and research

More information

Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm

Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm Peter Gerner-Smidt, MD, ScD Enteric Diseases Laboratory Branch EFSA Scientific Colloquium N o

More information

Provisioning robust automated analytical pipelines for whole genome-based public health microbiological typing

Provisioning robust automated analytical pipelines for whole genome-based public health microbiological typing Provisioning robust automated analytical pipelines for whole genome-based public health microbiological typing Anthony Underwood Bioinformatics Unit, Infectious Disease Informatics, Microbiological Services,

More information

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each

More information

Overview sequence projects

Overview sequence projects Overview sequence projects Bioassist NGS meeting 15-01-2010 Barbera van Schaik KEBB - Bioinformatics Laboratory b.d.vanschaik@amc.uva.nl NGS at the Academic Medical Center Sequence facility Laboratory

More information

Final Project Report

Final Project Report CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes

More information

PROGRAMMING FOR BIOLOGISTS. BIOL 6297 Monday, Wednesday 10 am -12 pm

PROGRAMMING FOR BIOLOGISTS. BIOL 6297 Monday, Wednesday 10 am -12 pm PROGRAMMING FOR BIOLOGISTS BIOL 6297 Monday, Wednesday 10 am -12 pm Tomorrow is Ada Lovelace Day Ada Lovelace was the first person to write a computer program Today s Lecture Overview of the course Philosophy

More information

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:

More information

Introduction to NGS data analysis

Introduction to NGS data analysis Introduction to NGS data analysis Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Sequencing Illumina platforms Characteristics: High

More information

Next generation sequencing (NGS)

Next generation sequencing (NGS) Next generation sequencing (NGS) Vijayachitra Modhukur BIIT modhukur@ut.ee 1 Bioinformatics course 11/13/12 Sequencing 2 Bioinformatics course 11/13/12 Microarrays vs NGS Sequences do not need to be known

More information

Practical Solutions for Big Data Analytics

Practical Solutions for Big Data Analytics Practical Solutions for Big Data Analytics Ravi Madduri Computation Institute (madduri@anl.gov) Paul Dave (pdave@uchicago.edu) Dinanath Sulakhe (sulakhe@uchicago.edu) Alex Rodriguez (arodri7@uchicago.edu)

More information

CD-HIT User s Guide. Last updated: April 5, 2010. http://cd-hit.org http://bioinformatics.org/cd-hit/

CD-HIT User s Guide. Last updated: April 5, 2010. http://cd-hit.org http://bioinformatics.org/cd-hit/ CD-HIT User s Guide Last updated: April 5, 2010 http://cd-hit.org http://bioinformatics.org/cd-hit/ Program developed by Weizhong Li s lab at UCSD http://weizhong-lab.ucsd.edu liwz@sdsc.edu 1. Introduction

More information

Core Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1

Core Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1 Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat

More information

UMass High Performance Computing Center

UMass High Performance Computing Center .. UMass High Performance Computing Center University of Massachusetts Medical School October, 2014 2 / 32. Challenges of Genomic Data It is getting easier and cheaper to produce bigger genomic data every

More information

Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms

Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Introduction Mate pair sequencing enables the generation of libraries with insert sizes in the range of several kilobases (Kb).

More information

Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12 June, 2015 Istituto Superiore di Sanità, SIDBAE Training Room

Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12 June, 2015 Istituto Superiore di Sanità, SIDBAE Training Room EU Reference Laboratory for E. coli Department of Veterinary Public Health and Food Safety Unit of Foodborne Zoonoses Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12

More information

Next generation DNA sequencing technologies. theory & prac-ce

Next generation DNA sequencing technologies. theory & prac-ce Next generation DNA sequencing technologies theory & prac-ce Outline Next- Genera-on sequencing (NGS) technologies overview NGS applica-ons NGS workflow: data collec-on and processing the exome sequencing

More information

Bioinformatics and its applications

Bioinformatics and its applications Bioinformatics and its applications Alla L Lapidus, Ph.D. SPbAU, SPbSU, St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

A data management framework for the Fungal Tree of Life

A data management framework for the Fungal Tree of Life Web Accessible Sequence Analysis for Biological Inference A data management framework for the Fungal Tree of Life Kauff F, Cox CJ, Lutzoni F. 2007. WASABI: An automated sequence processing system for multi-gene

More information

Bio-Informatics Lectures. A Short Introduction

Bio-Informatics Lectures. A Short Introduction Bio-Informatics Lectures A Short Introduction The History of Bioinformatics Sanger Sequencing PCR in presence of fluorescent, chain-terminating dideoxynucleotides Massively Parallel Sequencing Massively

More information

NECC History. Karl V. Steiner 2011 Annual NECC Meeting, Orono, Maine March 15, 2011

NECC History. Karl V. Steiner 2011 Annual NECC Meeting, Orono, Maine March 15, 2011 NECC History Karl V. Steiner 2011 Annual NECC Meeting, Orono, Maine March 15, 2011 EPSCoR Cyberinfrastructure Workshop First regional NENI (now NECC) Workshop held in Vermont in August 2007 Workshop heldinkentucky

More information

HPC Wales Skills Academy Course Catalogue 2015

HPC Wales Skills Academy Course Catalogue 2015 HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses

More information

UF EDGE brings the classroom to you with online, worldwide course delivery!

UF EDGE brings the classroom to you with online, worldwide course delivery! What is the University of Florida EDGE Program? EDGE enables engineering professional, military members, and students worldwide to participate in courses, certificates, and degree programs from the UF

More information

PreciseTM Whitepaper

PreciseTM Whitepaper Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis

More information

Workshop on Methods for Isolation and Identification of Campylobacter spp. June 13-17, 2005

Workshop on Methods for Isolation and Identification of Campylobacter spp. June 13-17, 2005 Workshop on Methods for Isolation and Identification of Campylobacter spp June 13-17, 2005 Goal: build capacity within the state public health laboratories to effectively identify Campylobacter species

More information

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data The Illumina TopHat Alignment and Cufflinks Assembly and Differential Expression apps make RNA data analysis accessible to any user, regardless

More information

Quality Assurance and Validation of Next Generation Sequencing

Quality Assurance and Validation of Next Generation Sequencing Quality Assurance and Validation of Next Generation Sequencing Amy Gargis, Ph.D. IHRC Inc. BioDefense Research and Development Laboratory Laboratory Preparedness and Response Branch Next Generation Sequencing:

More information

Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action

Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action Les Rencontres de L INRA Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action E Albina (CIRAD) / S Guyomard(Institut Pasteur) Guadeloupe The era

More information

Vector NTI Advance 11 Quick Start Guide

Vector NTI Advance 11 Quick Start Guide Vector NTI Advance 11 Quick Start Guide Catalog no. 12605050, 12605099, 12605103 Version 11.0 December 15, 2008 12605022 Published by: Invitrogen Corporation 5791 Van Allen Way Carlsbad, CA 92008 U.S.A.

More information

Protein Sequence Analysis - Overview -

Protein Sequence Analysis - Overview - Protein Sequence Analysis - Overview - UDEL Workshop Raja Mazumder Research Associate Professor, Department of Biochemistry and Molecular Biology Georgetown University Medical Center Topics Why do protein

More information

An example of bioinformatics application on plant breeding projects in Rijk Zwaan

An example of bioinformatics application on plant breeding projects in Rijk Zwaan An example of bioinformatics application on plant breeding projects in Rijk Zwaan Xiangyu Rao 17-08-2012 Introduction of RZ Rijk Zwaan is active worldwide as a vegetable breeding company that focuses on

More information

The Power of Next-Generation Sequencing in Your Hands On the Path towards Diagnostics

The Power of Next-Generation Sequencing in Your Hands On the Path towards Diagnostics The Power of Next-Generation Sequencing in Your Hands On the Path towards Diagnostics The GS Junior System The Power of Next-Generation Sequencing on Your Benchtop Proven technology: Uses the same long

More information

Lectures 1 and 8 15. February 7, 2013. Genomics 2012: Repetitorium. Peter N Robinson. VL1: Next- Generation Sequencing. VL8 9: Variant Calling

Lectures 1 and 8 15. February 7, 2013. Genomics 2012: Repetitorium. Peter N Robinson. VL1: Next- Generation Sequencing. VL8 9: Variant Calling Lectures 1 and 8 15 February 7, 2013 This is a review of the material from lectures 1 and 8 14. Note that the material from lecture 15 is not relevant for the final exam. Today we will go over the material

More information

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) 820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor

More information

UGENE Quick Start Guide

UGENE Quick Start Guide Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.

More information

Next Generation Sequencing; Technologies, applications and data analysis

Next Generation Sequencing; Technologies, applications and data analysis ; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome Technology Center,

More information

Computational Genomics. Next generation sequencing (NGS)

Computational Genomics. Next generation sequencing (NGS) Computational Genomics Next generation sequencing (NGS) Sequencing technology defies Moore s law Nature Methods 2011 Log 10 (price) Sequencing the Human Genome 2001: Human Genome Project 2.7G$, 11 years

More information

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated

More information

Visualization of Phylogenetic Trees and Metadata

Visualization of Phylogenetic Trees and Metadata Visualization of Phylogenetic Trees and Metadata November 27, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

BioHPC Web Computing Resources at CBSU

BioHPC Web Computing Resources at CBSU BioHPC Web Computing Resources at CBSU 3CPG workshop Robert Bukowski Computational Biology Service Unit http://cbsu.tc.cornell.edu/lab/doc/biohpc_web_tutorial.pdf BioHPC infrastructure at CBSU BioHPC Web

More information

OMU350 Operations Manager 9.x on UNIX/Linux Advanced Administration

OMU350 Operations Manager 9.x on UNIX/Linux Advanced Administration OMU350 Operations Manager 9.x on UNIX/Linux Advanced Administration Instructor-Led Training For versions 9.0, 9.01, & 9.10 OVERVIEW This 5-day instructor-led course focuses on advanced administration topics

More information

4.2.1. What is a contig? 4.2.2. What are the contig assembly programs?

4.2.1. What is a contig? 4.2.2. What are the contig assembly programs? Table of Contents 4.1. DNA Sequencing 4.1.1. Trace Viewer in GCG SeqLab Table. Box. Select the editor mode in the SeqLab main window. Import sequencer trace files from the File menu. Select the trace files

More information

Version 5.0 Release Notes

Version 5.0 Release Notes Version 5.0 Release Notes 2011 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074 (fax) www.genecodes.com

More information

BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS

BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics

More information

Institutional Partnership Program

Institutional Partnership Program GENEWIZ Outsourcing Services Institutional Partnership Program Solid Science. Superior Service. DNA Sequencing Partners to Fuel Your Success Institutions whose success depends on significant life science

More information

Cisco Networking Academy Program Curriculum Scope & Sequence. Fundamentals of UNIX version 2.0 (July, 2002)

Cisco Networking Academy Program Curriculum Scope & Sequence. Fundamentals of UNIX version 2.0 (July, 2002) Cisco Networking Academy Program Curriculum Scope & Sequence Fundamentals of UNIX version 2.0 (July, 2002) Course Description: Fundamentals of UNIX teaches you how to use the UNIX operating system and

More information

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing www.imcfamos.com imc FAMOS at a glance Four editions to Optimize

More information

NIAID Genomics and Bioinformatics Programs

NIAID Genomics and Bioinformatics Programs NIAID Genomics and Bioinformatics Programs Vivien G. Dugan, PhD Office of Genomics and Advanced Technologies (OGAT) Division of Microbiology and Infectious Diseases NIAID, NIH, DHHS duganv@niaid.nih.gov

More information

Bioruptor NGS: Unbiased DNA shearing for Next-Generation Sequencing

Bioruptor NGS: Unbiased DNA shearing for Next-Generation Sequencing STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAAC GTGCAC GTGAAC Wouter Coppieters Head of the genomics core facility GIGA center, University of Liège Bioruptor NGS: Unbiased DNA

More information

-> Integration of MAPHiTS in Galaxy

-> Integration of MAPHiTS in Galaxy Enabling NGS Analysis with(out) the Infrastructure, 12:0512 Development of a workflow for SNPs detection in grapevine From Sets to Graphs: Towards a Realistic Enrichment Analy species: MAPHiTS -> Integration

More information

Towards Integrating the Detection of Genetic Variants into an In-Memory Database

Towards Integrating the Detection of Genetic Variants into an In-Memory Database Towards Integrating the Detection of Genetic Variants into an 2nd International Workshop on Big Data in Bioinformatics and Healthcare Oct 27, 2014 Motivation Genome Data Analysis Process DNA Sample Base

More information

Deep Sequencing Data Analysis

Deep Sequencing Data Analysis Deep Sequencing Data Analysis Ross Whetten Professor Forestry & Environmental Resources Background Who am I, and why am I teaching this topic? I am not an expert in bioinformatics I started as a biologist

More information

Molecular and Cell Biology Laboratory (BIOL-UA 223) Instructor: Ignatius Tan Phone: 212-998-8295 Office: 764 Brown Email: ignatius.tan@nyu.

Molecular and Cell Biology Laboratory (BIOL-UA 223) Instructor: Ignatius Tan Phone: 212-998-8295 Office: 764 Brown Email: ignatius.tan@nyu. Molecular and Cell Biology Laboratory (BIOL-UA 223) Instructor: Ignatius Tan Phone: 212-998-8295 Office: 764 Brown Email: ignatius.tan@nyu.edu Course Hours: Section 1: Mon: 12:30-3:15 Section 2: Wed: 12:30-3:15

More information

How Sequencing Experiments Fail

How Sequencing Experiments Fail How Sequencing Experiments Fail v1.0 Simon Andrews simon.andrews@babraham.ac.uk Classes of Failure Technical Tracking Library Contamination Biological Interpretation Something went wrong with a machine

More information

New generation sequencing: current limits and future perspectives. Giorgio Valle CRIBI - Università di Padova

New generation sequencing: current limits and future perspectives. Giorgio Valle CRIBI - Università di Padova New generation sequencing: current limits and future perspectives Giorgio Valle CRIBI Università di Padova Around 2004 the Race for the 1000$ Genome started A few questions... When? How? Why? Standard

More information

BIOL 3200 Spring 2015 DNA Subway and RNA-Seq Data Analysis

BIOL 3200 Spring 2015 DNA Subway and RNA-Seq Data Analysis BIOL 3200 Spring 2015 DNA Subway and RNA-Seq Data Analysis By the end of this lab students should be able to: Describe the uses for each line of the DNA subway program (Red/Yellow/Blue/Green) Describe

More information

OpenCB a next generation big data analytics and visualisation platform for the Omics revolution

OpenCB a next generation big data analytics and visualisation platform for the Omics revolution OpenCB a next generation big data analytics and visualisation platform for the Omics revolution Development at the University of Cambridge - Closing the Omics / Moore s law gap with Dell & Intel Ignacio

More information

A demonstration of the use of Datagrid testbed and services for the biomedical community

A demonstration of the use of Datagrid testbed and services for the biomedical community A demonstration of the use of Datagrid testbed and services for the biomedical community Biomedical applications work package V. Breton, Y Legré (CNRS/IN2P3) R. Météry (CS) Credits : C. Blanchet, T. Contamine,

More information