Keeping up with DNA technologies
|
|
|
- Suzanna Gilbert
- 10 years ago
- Views:
Transcription
1 Keeping up with DNA technologies Mihai Pop Department of Computer Science Center for Bioinformatics and Computational Biology University of Maryland, College Park
2 The evolution of DNA sequencing Since Technology Read length Throughput/run Throughput/hour cost/run Sanger sequencing > 1000bp 4hr kbp 100 kbp $ pyrosequencing bp 4hr Mbp Mbp $13, Illumina/Solexa bp 3 days 2-3 Gbp ABI SOLiD 35-50bp 3 days 6-20 Gbp Mbp $3, Mbp est. $3-5, Helicos single molecule bp 8 days 10 Gbp ~50 Mbp est. 1Gbp/hour ~$18,000 TBA (2010) Pacific Biosciences single molecule kbp???
3 From DNA to images Image from Helicos through
4 What? X Image from Tree of Life Web Project Sage/RNAseq Image from wikipedia
5 Not too long ago 1995 Haemophilus influenzae 2000 Drosophila melanogaster 2001 Homo sapiens
6 Assembly Given a collection of short DNA pieces from a genome Reconstruct the original genome sequence Related to Shortest Common Superstring given a set of strings S, find the shortest string that contains all the strings in S as a substring. NP-hard SHOTGUN SEQUENCING shearing original DNA (hopefully) sequencing assembly
7 Challenges Note: Research on alignment/assembly algorithms is quite old and the basic concepts are unchanged by technology Large amounts of data even data storage/transfer is a problem Special error characteristics most very short reads (25-35 bp instead of 1000s) 454 homopolymer stutter ACAAGAAATGT ACAAGA--TGT ACAAGAA-TGT Helicos high error rates, same DNA fragment sequenced multiple times Pacific Biosciences very long reads, possibly high error rate Special data ABI/SOLiD color space sequencing
8 Short-read assembly at the CBCB De novo assembly Minimus-SR (w/ Dan Sommer) all-pair alignments performed fast (~2 minutes for 8.6 million reads) key use bit-wise operations (e.g. XOR to test equality) very good results (better than most short read assemblers) Comparative assembly AMOScmp (w/ Adam Phillippy, Daniela Puiu, S. Salzberg) originally developed for Sanger data works for short reads too ABBA (Dan Sommer, Steven Salzberg) comparative assembly with a protein reference (resequencing of genes) Short read alignment Bowtie (w/ Ben Langmead and Steven Salzberg) uses Burrows-Wheeler index of a genome (only 1.1GB for human) up to fold faster than other aligners
9 Optical mapping The better restriction mapping output: ordered list of sizes Restriction Fragment Size(kb) Standard Deviation (kb) Map-map alignment and contig-map alignment easier relatively simple dynamic programming
10 SOMA scaffolding with optical maps Dynamic programming O(m 2 n 2 ) S [i, j]=max 0 k i,0 l j C r i k l j s=k penalty for mismatched sites Bootstrapped computation of p-values (how likely is a match) Scheduling algorithm for near-optimal placement of contigs with multiple good matches. Approx % of a genome can be placed in one scaffold i c s t=l j 2 t=l t j o t 2 chi-squared score of alignment btwn fragments 2 best /n best F-test to asses equally good matches 2 other /n other S [k 1, l 1] Nagarajan, Read, Pop. Bioinformatics 2008
11 Opportunities for collaborations We are driven by data and technology Let us know if you have interesting data-sets... interesting DNA technologies... useful computational technologies (in particular parallel/grid infrastructure and high performance storage) Examples of collaborations with UM School of Medicine study of diarrhea in 3rd world countries with USDA sequencing of cow and turkey with Google/IBM/NSF applications of Cloud Computing to bioinformatics with Opgen Inc. - new ways to analyze optical mapping data... and many more
12 CBCB Steven Salzberg Art Delcher Carl Kingsford Daniela Puiu Acknowledgments Optical mapping Tim Read Zheng Wang Dave Schwartz Opgen, Inc. Pop group Niranjan Nagarajan Mohammad Ghodsi Sergey Koren Ben Langmead Bo Liu Dan Sommer James White Funding NSF NIH Bill & Melinda Gates Foundation Henry Jackson Foundation OpenOffice.org
CRAC: An integrated approach to analyse RNA-seq reads Additional File 3 Results on simulated RNA-seq data.
: An integrated approach to analyse RNA-seq reads Additional File 3 Results on simulated RNA-seq data. Nicolas Philippe and Mikael Salson and Thérèse Commes and Eric Rivals February 13, 2013 1 Results
July 7th 2009 DNA sequencing
July 7th 2009 DNA sequencing Overview Sequencing technologies Sequencing strategies Sample preparation Sequencing instruments at MPI EVA 2 x 5 x ABI 3730/3730xl 454 FLX Titanium Illumina Genome Analyzer
A Tutorial in Genetic Sequence Classification Tools and Techniques
A Tutorial in Genetic Sequence Classification Tools and Techniques Jake Drew Data Mining CSE 8331 Southern Methodist University [email protected] www.jakemdrew.com Sequence Characters IUPAC nucleotide
Automated DNA sequencing 20/12/2009. Next Generation Sequencing
DNA sequencing the beginnings Ghent University (Fiers et al) pioneers sequencing first complete gene (1972) first complete genome (1976) Next Generation Sequencing Fred Sanger develops dideoxy sequencing
Introduction to NGS data analysis
Introduction to NGS data analysis Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Sequencing Illumina platforms Characteristics: High
Computational Genomics. Next generation sequencing (NGS)
Computational Genomics Next generation sequencing (NGS) Sequencing technology defies Moore s law Nature Methods 2011 Log 10 (price) Sequencing the Human Genome 2001: Human Genome Project 2.7G$, 11 years
Next generation sequencing (NGS)
Next generation sequencing (NGS) Vijayachitra Modhukur BIIT [email protected] 1 Bioinformatics course 11/13/12 Sequencing 2 Bioinformatics course 11/13/12 Microarrays vs NGS Sequences do not need to be known
NGS data analysis. Bernardo J. Clavijo
NGS data analysis Bernardo J. Clavijo 1 A brief history of DNA sequencing 1953 double helix structure, Watson & Crick! 1977 rapid DNA sequencing, Sanger! 1977 first full (5k) genome bacteriophage Phi X!
Introduction to Bioinformatics 3. DNA editing and contig assembly
Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 [email protected]
Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center
Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing
Guidelines for Establishment of Contract Areas Computer Science Department
Guidelines for Establishment of Contract Areas Computer Science Department Current 07/01/07 Statement: The Contract Area is designed to allow a student, in cooperation with a member of the Computer Science
New generation sequencing: current limits and future perspectives. Giorgio Valle CRIBI - Università di Padova
New generation sequencing: current limits and future perspectives Giorgio Valle CRIBI Università di Padova Around 2004 the Race for the 1000$ Genome started A few questions... When? How? Why? Standard
Next Generation Sequencing: Technology, Mapping, and Analysis
Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University [email protected] http://tandem.bu.edu/ The Human Genome Project took
Reading DNA Sequences:
Reading DNA Sequences: 18-th Century Mathematics for 21-st Century Technology Michael Waterman University of Southern California Tsinghua University DNA Genetic information of an organism Double helix,
NECC History. Karl V. Steiner 2011 Annual NECC Meeting, Orono, Maine March 15, 2011
NECC History Karl V. Steiner 2011 Annual NECC Meeting, Orono, Maine March 15, 2011 EPSCoR Cyberinfrastructure Workshop First regional NENI (now NECC) Workshop held in Vermont in August 2007 Workshop heldinkentucky
Analyze Human Genome Using Big Data
Analyze Human Genome Using Big Data Poonm Kumari 1, Shiv Kumar 2 1 Mewar University, Chittorgargh, Department of Computer Science of Engineering, NH-79, Gangrar-312901, India 2 Co-Guide, Mewar University,
BLAST. Anders Gorm Pedersen & Rasmus Wernersson
BLAST Anders Gorm Pedersen & Rasmus Wernersson Database searching Using pairwise alignments to search databases for similar sequences Query sequence Database Database searching Most common use of pairwise
Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)
Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) A typical RNA Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
An Overview of DNA Sequencing
An Overview of DNA Sequencing Prokaryotic DNA Plasmid http://en.wikipedia.org/wiki/image:prokaryote_cell_diagram.svg Eukaryotic DNA http://en.wikipedia.org/wiki/image:plant_cell_structure_svg.svg DNA Structure
Version 5.0 Release Notes
Version 5.0 Release Notes 2011 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074 (fax) www.genecodes.com
Overview of Next Generation Sequencing platform technologies
Overview of Next Generation Sequencing platform technologies Dr. Bernd Timmermann Next Generation Sequencing Core Facility Max Planck Institute for Molecular Genetics Berlin, Germany Outline 1. Technologies
4.2.1. What is a contig? 4.2.2. What are the contig assembly programs?
Table of Contents 4.1. DNA Sequencing 4.1.1. Trace Viewer in GCG SeqLab Table. Box. Select the editor mode in the SeqLab main window. Import sequencer trace files from the File menu. Select the trace files
Focusing on results not data comprehensive data analysis for targeted next generation sequencing
Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes
GeneSifter: Next Generation Data Management and Analysis for Next Generation Sequencing
for Next Generation Sequencing Dale Baskin, N. Eric Olson, Laura Lucas, Todd Smith 1 Abstract Next generation sequencing technology is rapidly changing the way laboratories and researchers approach the
HPC-MAQ : A PARALLEL SHORT-READ REFERENCE ASSEMBLER
HPC-MAQ : A PARALLEL SHORT-READ REFERENCE ASSEMBLER Veeram Venkata Siva Prasad 1 and Gunisetti Loshma 2 1 M.Tech(CSE), Sri Vasavi Engineering college, Tadepalligudem, West Godavari District, Andhra Pradesh,
Searching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille
Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille Journées SUCCES Stéphane Le Crom (UPMC IBENS) [email protected] Paris November 2013 The Sanger DNA sequencing method Sequencing
Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing
Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing James D. Jackson Philip J. Hatcher Department of Computer Science Kingsbury Hall University of New Hampshire Durham,
Deep Sequencing Data Analysis
Deep Sequencing Data Analysis Ross Whetten Professor Forestry & Environmental Resources Background Who am I, and why am I teaching this topic? I am not an expert in bioinformatics I started as a biologist
Cloud-Based Big Data Analytics in Bioinformatics
Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large
Microbial Oceanomics using High-Throughput DNA Sequencing
Microbial Oceanomics using High-Throughput DNA Sequencing Ramiro Logares Institute of Marine Sciences, CSIC, Barcelona 9th RES Users'Conference 23 September 2015 Importance of microbes in the sunlit ocean
Cloud-enabling Sequence Alignment with Hadoop MapReduce: A Performance Analysis
2012 4th International Conference on Bioinformatics and Biomedical Technology IPCBEE vol.29 (2012) (2012) IACSIT Press, Singapore Cloud-enabling Sequence Alignment with Hadoop MapReduce: A Performance
Large-scale DNA Sequence Analysis in the Cloud: A Stream-based Approach
Large-scale DNA Sequence Analysis in the Cloud: A Stream-based Approach Romeo Kienzler 1, Rémy Bruggmann 2, Anand Ranganathan 3, Nesime Tatbul 1 1 Department of Computer Science, ETH Zurich, Switzerland
The NGS IT notes. George Magklaras PhD RHCE
The NGS IT notes George Magklaras PhD RHCE Biotechnology Center of Oslo & The Norwegian Center of Molecular Medicine University of Oslo, Norway http://www.biotek.uio.no http://www.ncmm.uio.no http://www.no.embnet.org
Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format
, pp.91-100 http://dx.doi.org/10.14257/ijhit.2014.7.4.09 Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format Jingjing Zheng 1,* and Ting Wang 1, 2 1,* Parallel Software and Computational
Modified Genetic Algorithm for DNA Sequence Assembly by Shotgun and Hybridization Sequencing Techniques
International Journal of Electronics and Computer Science Engineering 2000 Available Online at www.ijecse.org ISSN- 2277-1956 Modified Genetic Algorithm for DNA Sequence Assembly by Shotgun and Hybridization
Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment
Tutorial for Windows and Macintosh Preparing Your Data for NGS Alignment 2015 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) 1.734.769.7249
Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms
Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Introduction Mate pair sequencing enables the generation of libraries with insert sizes in the range of several kilobases (Kb).
Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes
Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes 2.1 Introduction Large-scale insertional mutagenesis screening in
An FPGA Acceleration of Short Read Human Genome Mapping
An FPGA Acceleration of Short Read Human Genome Mapping Corey Bruce Olson A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Electrical Engineering University
NGS Technologies for Genomics and Transcriptomics
NGS Technologies for Genomics and Transcriptomics Massimo Delledonne Department of Biotechnologies - University of Verona http://profs.sci.univr.it/delledonne 13 years and $3 billion required for the Human
Next Generation Sequencing for DUMMIES
Next Generation Sequencing for DUMMIES Looking at a presentation without the explanation from the author is sometimes difficult to understand. This document contains extra information for some slides that
Analysis of NGS Data
Analysis of NGS Data Introduction and Basics Folie: 1 Overview of Analysis Workflow Images Basecalling Sequences denovo - Sequencing Assembly Annotation Resequencing Alignments Comparison to reference
Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
14.3 Studying the Human Genome
14.3 Studying the Human Genome Lesson Objectives Summarize the methods of DNA analysis. State the goals of the Human Genome Project and explain what we have learned so far. Lesson Summary Manipulating
Next Generation Sequencing
Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977
Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University
Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary
Bioinformatics I, WS 09-10, D. Huson, January 27, 2010 145
Bioinformatics I, WS 09-10, D. Huson, January 27, 2010 145 10 DNA sequencing This exposition is very closely based on the following sources, which are all recommended reading: 1. Clyde A. Hutchison, DNA
Introduction to next-generation sequencing data
Introduction to next-generation sequencing data David Simpson Centre for Experimental Medicine Queens University Belfast http://www.qub.ac.uk/research-centres/cem/ Outline History of DNA sequencing NGS
Hadoopizer : a cloud environment for bioinformatics data analysis
Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) [email protected], INRIA/Irisa, Campus de Beaulieu, 35042,
How long is long enough?
How long is long enough? - Modeling to predict genome assembly performance - Hayan Lee@Schatz Lab Feb 26, 2014 Quantitative Biology Seminar 1 Outline Background Assembly history Recent sequencing technology
De Novo Assembly Using Illumina Reads
De Novo Assembly Using Illumina Reads High quality de novo sequence assembly using Illumina Genome Analyzer reads is possible today using publicly available short-read assemblers. Here we summarize the
DNA Sequencing & The Human Genome Project
DNA Sequencing & The Human Genome Project An Endeavor Revolutionizing Modern Biology Jutta Marzillier, Ph.D Lehigh University Biological Sciences November 13 th, 2013 Guess, who turned 60 earlier this
High Performance Computing for DNA Sequence Alignment and Assembly
High Performance Computing for DNA Sequence Alignment and Assembly Michael C. Schatz May 18, 2010 Stone Ridge Technology Outline 1.! Sequence Analysis by Analogy 2.! DNA Sequencing and Genomics 3.! High
Algorithms in Computational Biology (236522) spring 2007 Lecture #1
Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office
Final Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
MAKING AN EVOLUTIONARY TREE
Student manual MAKING AN EVOLUTIONARY TREE THEORY The relationship between different species can be derived from different information sources. The connection between species may turn out by similarities
Next Generation Sequencing; Technologies, applications and data analysis
; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome Technology Center,
CD-HIT User s Guide. Last updated: April 5, 2010. http://cd-hit.org http://bioinformatics.org/cd-hit/
CD-HIT User s Guide Last updated: April 5, 2010 http://cd-hit.org http://bioinformatics.org/cd-hit/ Program developed by Weizhong Li s lab at UCSD http://weizhong-lab.ucsd.edu [email protected] 1. Introduction
Bursting to a Hybrid Cloud for Services OFC 2015
Bursting to a Hybrid Cloud for Services OFC 2015 Big Data applications Big Compute in the cloud Why burst to the cloud? Opportunities 2 Big Data Apps Need Big Compute Life Sciences Bioinformatics Next
Genomic Applications on Cray supercomputers: Next Generation Sequencing Workflow. Barry Bolding. Cray Inc Seattle, WA
Genomic Applications on Cray supercomputers: Next Generation Sequencing Workflow Barry Bolding Cray Inc Seattle, WA 1 CUG 2013 Paper Genomic Applications on Cray supercomputers: Next Generation Sequencing
Welcome to Pacific Biosciences' Introduction to SMRTbell Template Preparation.
Introduction to SMRTbell Template Preparation 100 338 500 01 1. SMRTbell Template Preparation 1.1 Introduction to SMRTbell Template Preparation Welcome to Pacific Biosciences' Introduction to SMRTbell
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
Data Analysis for Ion Torrent Sequencing
IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page
Nazneen Aziz, PhD. Director, Molecular Medicine Transformation Program Office
2013 Laboratory Accreditation Program Audioconferences and Webinars Implementing Next Generation Sequencing (NGS) as a Clinical Tool in the Laboratory Nazneen Aziz, PhD Director, Molecular Medicine Transformation
DNA Mapping/Alignment. Team: I Thought You GNU? Lars Olsen, Venkata Aditya Kovuri, Nick Merowsky
DNA Mapping/Alignment Team: I Thought You GNU? Lars Olsen, Venkata Aditya Kovuri, Nick Merowsky Overview Summary Research Paper 1 Research Paper 2 Research Paper 3 Current Progress Software Designs to
Bioinformatics: course introduction
Bioinformatics: course introduction Filip Železný Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab http://ida.felk.cvut.cz
Comparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
Next generation DNA sequencing technologies. theory & prac-ce
Next generation DNA sequencing technologies theory & prac-ce Outline Next- Genera-on sequencing (NGS) technologies overview NGS applica-ons NGS workflow: data collec-on and processing the exome sequencing
Genomics Services @ GENterprise
Genomics Services @ GENterprise since 1998 Mainz University spin-off privately financed 6-10 employees since 2006 Genomics Services @ GENterprise Sequencing Service (Sanger/3730, 454) Genome Projects (Bacteria,
How-To: SNP and INDEL detection
How-To: SNP and INDEL detection April 23, 2014 Lumenogix NGS SNP and INDEL detection Mutation Analysis Identifying known, and discovering novel genomic mutations, has been one of the most popular applications
Figure 1: Genome sizes of different organisms.
How big are genomes? Genomes are now being sequenced at such a rapid rate that it is fair to say that it is becoming routine. As a result, there is a growing interest in trying to understand the meaning
FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem
FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem Elsa Bernard Laurent Jacob Julien Mairal Jean-Philippe Vert September 24, 2013 Abstract FlipFlop implements a fast method for de novo transcript
Analysis of ChIP-seq data in Galaxy
Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers
History of DNA Sequencing & Current Applications
History of DNA Sequencing & Current Applications Christopher McLeod President & CEO, 454 Life Sciences, A Roche Company IMPORTANT NOTICE Intended Use Unless explicitly stated otherwise, all Roche Applied
Lecture 4: Exact string searching algorithms. Exact string search algorithms. Definitions. Exact string searching or matching
COSC 348: Computing for Bioinformatics Definitions A pattern (keyword) is an ordered sequence of symbols. Lecture 4: Exact string searching algorithms Lubica Benuskova http://www.cs.otago.ac.nz/cosc348/
Fast. Integrated Genome Browser & DAS. Easy. Flexible. Free. bioviz.org/igb
bioviz.org/igb Integrated Genome Browser & DAS Free tools for visualizing, sharing, and publishing genomes and genome-scale data. Easy Flexible Fast Free Funding: National Science Foundation Arabidopsis
Databases and mapping BWA. Samtools
Databases and mapping BWA Samtools FASTQ, SFF, bax.h5 ACE, FASTG FASTA BAM/SAM GFF, BED GenBank/Embl/DDJB many more File formats FASTQ Output format from Illumina and IonTorrent sequencers. Quality scores:
The Chinese University of Hong Kong igem 2010 Bacterial based storage and encryp2on device
The Chinese University of Hong Kong igem 2010 Bacterial based storage and encryp2on device Bacterial based informaaon storage device Bancro4 s group (2001) Mount Sinai School of Meducube Yachie s group
Typing in the NGS era: The way forward!
Typing in the NGS era: The way forward! Valeria Michelacci NGS course, June 2015 Typing from sequence data NGS-derived conventional Multi Locus Sequence Typing (University of Warwick, 7 housekeeping genes)
Two scientific papers (1, 2) recently appeared reporting
On the sequencing of the human genome Robert H. Waterston*, Eric S. Lander, and John E. Sulston *Genome Sequencing Center, Washington University, Saint Louis, MO 63108; Whitehead Institute Massachusetts
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
2 The Human Genome Project
2 The Human Genome Project LAP CHEE TSUI STEVE W. SCHERER Toronto, Canada 1 Introduction 42 2 Chromosome Maps 42 2.1 Genetic Maps 43 2.2 Physical Maps 44 3 DNA Sequencing 45 3.1 cdna Sequencing 47 3.2
Next Generation Sequencing data Analysis at Genoscope. Jean-Marc Aury
Next Generation Sequencing data Analysis at Genoscope Jean-Marc Aury Introduction Presentation of Genoscope and NGS activities Overview of sequencing technologies Sequencing and assembly of prokaryotic
Overview of Genome Assembly Techniques
6 Overview of Genome Assembly Techniques Sun Kim and Haixu Tang The most common laboratory mechanism for reading DNA sequences (e.g., gel electrophoresis) can determine sequence of up to approximately
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility Report of a meeting organized by the Wellcome Trust and held on 14 15 January 2003 at Fort Lauderdale,
New solutions for Big Data Analysis and Visualization
New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina [email protected] http://bioinfo.cipf.es/imedina Head of the Computational Biology
2.3 Identify rrna sequences in DNA
2.3 Identify rrna sequences in DNA For identifying rrna sequences in DNA we will use rnammer, a program that implements an algorithm designed to find rrna sequences in DNA [5]. The program was made by
