Introduction to Illumina Next Generation Sequencing Technology

The Nancy and Stephen Grand Israel National Center for Personalized Medicine (G-INCPM)
Introduction to Illumina Next Generation Sequencing Technology
Shmulik Motola, PhD
March 2016

DNA Sequencing
A process of determining the precise order of nucleotides (A, C, G, T) within a DNA molecule.
The order, or sequence, of nucleotides determines the genetic information available for building and maintaining an organism.
Sequence variation:
- Natural polymorphism
- Mutation

Sanger Sequencing (Est. 1975)
Frederick Sanger (1918-2013)
- DNA template (cloned and isolated from an E. coli colony)
- Sequencing primer
- Replication products are separated by electrophoresis

The Human Genome Project Used Sanger Sequencing
- International effort involving 20 research centers
- Lasted 13 years (completed in 2003)
- Cost: about $3,000,000,000
- Facilitated the discovery of more than 1,800 disease-associated genes

[Figure: Cost per human genome sequence, Sanger sequencing vs. next generation sequencing. Source: van Dijk EL et al. Trends Genet. 2014 Sep;30(9):418-26]

DNA Sequencing Methods: From Sanger to Next Generation Sequencing (NGS)
Feature | Sanger Seq. | Next Generation Seq.
Throughput | Low | High
Prior knowledge of template DNA sequence? | Required, for PCR and seq. primer design | Not required; PCR and seq. primers are universal
DNA template to be sequenced | Single DNA region | May map anywhere on the genome (in case of whole genome seq.)
Quantitative or gene expression assay? | Not supported | Supported
Source: http://support.illumina.com/training/online-courses/sequencing.html

Schematic View of Illumina NGS Technology
NGS resolves hundreds of millions of DNA sequences in a single run!
Complex DNA sample -> Attachment to solid surface -> Parallel sequencing of all DNA fragments -> Data analysis

Common NGS Applications
- Whole Transcriptome (RNA-Seq)
- Whole Genome Sequencing
- Whole Exome Sequencing
- 16S Microbiome
- Small-RNA Seq
- DNA Methylation Analysis
- Chromatin Immunoprecipitation (ChIP)-Seq

Illumina Sequencing Overview
FOR RESEARCH USE ONLY

Illumina Sequencing Workflow
1. Library Preparation
2. Cluster Generation: cBot, MiSeq, HiSeq 2500
3. Sequencing: HiSeq, MiSeq, NextSeq 500
4. Data Analysis: ICS/RTA, CASAVA, MSR, BaseSpace

Sample ("Library") Preparation Overview
Aim: obtaining nucleic acid fragments with adapters attached on both ends
1. Start with nucleic acid (DNA/RNA)
2. Modify to proper insert size
3. Add adapters with sites for:
   - Flow cell binding
   - Sequencing primer binding
Same general template architecture regardless of application

Sample ("Library") Preparation Overview: Sample Indexing
Index = a known short DNA sequence, included in the DNA adapter, which labels all DNA molecules of a particular sample
(Adapted from Illumina)

Single vs. Dual-Indexed NGS Libraries
[Figure: single-indexed libraries carry one index sequence and dual-indexed libraries carry two; both have P5 and P7 ends]
The number of samples pooled determines the need for single vs. dual indexing
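
To illustrate how an index labels the reads of one sample in a pool, here is a minimal Python sketch of assigning reads to samples by their index read. The index sequences, sample names, and the one-mismatch tolerance are made-up assumptions for illustration; this is not Illumina's demultiplexing software.

```python
from collections import defaultdict

# Hypothetical sample sheet: index sequence -> sample name (made-up values).
SAMPLE_INDEXES = {
    "ATCACG": "sample_A",
    "CGATGT": "sample_B",
    "TTAGGC": "sample_C",
}


def hamming(a, b):
    """Count mismatching positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def assign_sample(index_read, sample_indexes, max_mismatches=1):
    """Return the sample name if exactly one index matches within the tolerance, else None."""
    hits = [
        name
        for idx, name in sample_indexes.items()
        if len(idx) == len(index_read) and hamming(idx, index_read) <= max_mismatches
    ]
    return hits[0] if len(hits) == 1 else None


def demultiplex(reads, sample_indexes, max_mismatches=1):
    """Group (insert_read, index_read) pairs by sample; unmatched reads go to 'undetermined'."""
    bins = defaultdict(list)
    for insert_read, index_read in reads:
        sample = assign_sample(index_read, sample_indexes, max_mismatches)
        bins[sample or "undetermined"].append(insert_read)
    return dict(bins)


# Made-up reads: (insert sequence, index read).
reads = [
    ("ACGTACGTAC", "ATCACG"),  # exact match to sample_A
    ("GGGTTTAAAC", "CGATGA"),  # one mismatch from sample_B's index
    ("TTTTCCCCAA", "GGGGGG"),  # matches no index
]
binned = demultiplex(reads, SAMPLE_INDEXES)
```

For a dual-indexed library the same idea would apply to the pair (Index1, Index2) instead of a single index, which is what allows more samples to be distinguished in one pool.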

Cluster Generation: Aims
- Attachment of DNA molecules to the flow cell
- Amplification of single DNA molecules into clonal clusters
[Image: flow cell (HiSeq High Output)]

What is a Flow Cell?
- Cluster generation occurs on a flow cell
- A flow cell is a thick glass slide with channels or lanes
- Each lane is randomly coated with a lawn of oligos that are complementary to library adapters

Instrumentation
Single DNA library -> amplified clonal cluster (cBot) -> sequencer

Hybridize Fragment & Extend
- Surface of the flow cell is coated with a lawn of oligo pairs
- Single-stranded DNA libraries are hybridized to the primer lawn through their adapter sequence
- Bound libraries are then extended by polymerases (3' extension)

Denature Double-Stranded DNA
- Double-stranded molecule is denatured
- Original template is washed away and discarded
- Newly synthesized strand is covalently attached to the flow cell surface

Single-Stranded DNA
NOTE: Single molecules bind to the flow cell in a random pattern

Bridge Amplification
- Single-stranded molecule flips over and forms a bridge by hybridizing to an adjacent, complementary primer
- Hybridized primer is extended by polymerases

Bridge Amplification
- Double-stranded bridge is formed

Denature Double-Stranded Bridge
- Double-stranded bridge is denatured
- Result: two copies of covalently bound single-stranded templates

Bridge Amplification
- Single-stranded molecules flip over to hybridize to adjacent primers
- Hybridized primer is extended by polymerase

Bridge Amplification
- Bridge amplification cycle is repeated until multiple bridges are formed

Linearization
- dsDNA bridges are denatured

Reverse Strand Cleavage
- Reverse strands are cleaved and washed away, leaving a cluster with forward strands only

Blocking
- Free 3' ends are blocked to prevent unwanted DNA priming

Read 1 Primer Hybridization
- Sequencing primer is hybridized to the adapter sequence

Sequencing By Synthesis
1. Add 4 Fl-NTPs + polymerase
2. Incorporated Fl-NTP is imaged
3. Terminator and fluorescent dye are cleaved from the Fl-NTP
Repeat for 36-251 cycles

Reversible Terminator Chemistry
- All 4 labeled nucleotides in 1 reaction
- Higher accuracy
- No problems with homopolymer repeats
Cycle: Incorporation -> Detection -> Deblock and fluor removal -> Next cycle

Clusters (of DNA molecules sequenced)
Cluster intensities are collected following every base addition
[Image: clusters on the flow cell; scale bar 100 microns]
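
The idea that one base is read per cycle from the cluster's fluorescence can be sketched as a toy base caller in Python. The intensity values and the A, C, G, T channel order are made-up assumptions, and picking the brightest channel is a deliberate simplification of what real-time analysis software actually does.

```python
BASES = "ACGT"  # assumed channel order for this toy example


def call_bases(cycle_intensities):
    """For each cycle, call the base whose channel has the highest intensity."""
    read = []
    for intensities in cycle_intensities:
        brightest = max(range(len(BASES)), key=lambda i: intensities[i])
        read.append(BASES[brightest])
    return "".join(read)


# Made-up intensities for one cluster over four cycles, ordered A, C, G, T.
cluster = [
    (950, 40, 30, 25),
    (20, 35, 880, 60),
    (15, 910, 45, 30),
    (30, 25, 40, 870),
]
called = call_bases(cluster)
```

Because the terminator allows only one nucleotide to be incorporated per cycle, each cycle contributes exactly one position to the read, which is why the read length equals the number of cycles.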

Data Analysis Overview
Analysis Type | Software | Outputs
Sequencing | ICS/RTA | Images/TIFF files
Primary Analysis | ICS/RTA | Intensities, base calling
Secondary Analysis | HiSeq Analysis Software | Alignments and variant detection

Paired End Sequencing

Single End Sequencing

Paired End Sequencing

Paired End Sequencing
Reference:    This is really the best way to do sequencing
Single-reads: "This is", "is really", "really the", "the best", "sequencing"
Paired-reads: "This is" (----100 characters----) "sequencing"
Assembly becomes easier!
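
A minimal Python sketch of the same idea on DNA: a paired-end run reads both ends of one fragment, with the second read coming from the opposite strand. The fragment sequence and read length are made up for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def paired_end_reads(fragment, read_length):
    """Read 1 starts at the 5' end of the forward strand; read 2 starts at the 5' end of the reverse strand."""
    read1 = fragment[:read_length]
    read2 = reverse_complement(fragment)[:read_length]
    return read1, read2


fragment = "ACGTTGCAAGGCTTAACCGG"  # made-up 20 bp insert
read_length = 5
r1, r2 = paired_end_reads(fragment, read_length)

# Bases in the middle of the fragment that neither read covers.
unsequenced_gap = len(fragment) - 2 * read_length
```

The two reads are known to come from the same fragment and to lie roughly one insert length apart, and that distance constraint is what helps alignment and assembly.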

Paired End Sequencing
- Sequenced strand is stripped off
- 3'-ends of template strands and lawn primers are unblocked

Paired End Sequencing
- Single-stranded template loops over to form a bridge by hybridizing with a lawn primer (bridge formation)
- 3'-end of the lawn primer is extended (3' extension)

Paired End Sequencing
- Double-stranded DNA

Paired End Sequencing
- Bridges are linearized and the original forward template is cleaved

Paired End Sequencing
- Free 3' ends of the reverse template and lawn primers are blocked to prevent unwanted DNA priming
- Sequencing primer is hybridized to the adapter sequence of the reverse strand template

Sequencing By Synthesis, 2nd Read
1. Add 4 Fl-NTPs + polymerase
2. Incorporated Fl-NTP is imaged
3. Terminator and fluorescent dye are cleaved from the Fl-NTP
Repeat for 36-251 cycles

Sequencing Paired End Libraries with Single Index Read
[Figure labels: DNA insert, index]

Paired End Sequencing of Single-Indexed Libraries
Utilizes 3 sequencing reads:
1. Read 1 (Read 1 Seq Primer, HP6)
2. Index read (Index Seq Primer, HP8)
Paired-end turnaround
3. Read 2 (Read 2 Seq Primer, HP7)

Sequencing Paired End Libraries with Dual Index Read
[Figure labels: DNA insert, Index2, Index1]

Paired End Sequencing of Dual-Indexed Libraries
Utilizes 4 sequencing reads (1-4), with a paired-end turnaround

Questions?

Part II: NGS Library Preparation and Quality Control

[Figure: workflow stages labeled by responsibility (user; Illumina; user / Illumina). Taken from: http://rnaseq.uoregon.edu/library_prep.html]

Common NGS Applications
- RNA-Seq
- DNA-Seq (whole genome, ChIP-Seq)

5-step procedure, with steps separated by bead-based size selection

Step 1: DNA/RNA Fragmentation
Physical fragmentation
- Acoustic shearing: breaks DNA into 100 bp-5 kb fragments (Covaris)
- Sonication: shears chromatin and DNA into 150 bp-1 kb fragments (Bioruptor)
Enzymatic fragmentation (DNA endonucleases, transposase)
- Considered consistent, but less random than physical DNA-shearing methods
Chemical fragmentation
- Heat and a divalent metal cation (Mg2+/Zn2+): used to break up RNA molecules
- Ideally results in 115-350 nt RNA molecules

Step 2: End repair and bead-based size selection

Step 3: Adenylate 3' ends

Step 4: Ligate indexed paired-end adapters

Step 5: PCR-enrich ligation product

RNA-Seq library preparation protocol (TruSeq RNA v2, Illumina)
(Similar to a DNA-Seq library prep procedure)
1. Total RNA
2. Purify and fragment mRNA
3. cDNA synthesis (first and second strand)
4. End repair
5. Adenylate 3' ends
6. Ligate indexed paired-end adapters
7. PCR amplification

Library Validation: Critical for Successful Sequencing
Sample Preparation -> Library Validation -> Cluster Generation -> Sequencing -> Data Analysis
Library validation covers:
- Accurate quantification
- Library size and quality
Cluster generation: cBot, MiSeq
Sequencing: HiSeq, HiScan SQ, GA IIx, MiSeq

Accurate Library Quantification (Why?): It Maximizes Data Quality and Quantity
Optimized flow cell clustering determines data quality and overall data yield
[Images: flow cell clustering at 20 pM, 10 pM, 5 pM and 1 pM]
Overclustering can result in:
- Loss of data quality and data output
- Loss of focus
- Reduced base calls and Q30 scores
- Complete run failure
Underclustering can result in:
- Loss of time and money
- Loss of focus
- Complete run failure

Accurate Quantification Is Critical When Multiplexing
Calculated concentration is 10X higher for one library in the pool:
Sample | Expected Output | Actual Output
1 | 16% | 20%
2 | 16% | 20%
3 | 16% | 20%
4 | 16% | 20%
5 | 16% | 20%
6 | 16% | 2%
Calculated concentration is 10X lower for one library in the pool:
Sample | Expected Output | Actual Output
1 | 16% | 66%
2 | 16% | 6%
3 | 16% | 6%
4 | 16% | 6%
5 | 16% | 6%
6 | 16% | 6%
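
The percentages in the two tables can be reproduced with a few lines of Python. The sketch assumes six libraries pooled in equal amounts according to their calculated concentrations, with read output proportional to the amount actually loaded; the function name and parameters are illustrative only.

```python
def pool_output(n_libraries, error_factor, bad_index=-1):
    """Fraction of reads per library when one library's calculated concentration
    is `error_factor` times its true concentration.

    An overestimated library (error_factor > 1) is under-loaded, so it yields
    fewer reads; an underestimated library (error_factor < 1) is over-loaded.
    """
    # Relative amount actually loaded; correctly quantified libraries contribute 1.0.
    loaded = [1.0] * n_libraries
    loaded[bad_index] = 1.0 / error_factor
    total = sum(loaded)
    return [amount / total for amount in loaded]


# Calculated concentration 10X higher than true for one library.
overestimated = pool_output(6, 10)
# Calculated concentration 10X lower than true for one library.
underestimated = pool_output(6, 0.1)

for label, fractions in (("10X higher", overestimated), ("10X lower", underestimated)):
    print(label, [f"{100 * f:.1f}%" for f in fractions])
```

In the first case the mis-quantified library receives about 2% of the output and the others about 20% each; in the second it takes about two thirds of the run and leaves roughly 6-7% for each of the others.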

Quantification Methods of NGS Libraries
UV spectrophotometer (NanoDrop)
- Detects nucleic acids nonspecifically
- Contaminants elevate values
- Should not be used for input or library quantification
Bioanalyzer 2100
- Accuracy highly dependent on dilution and sample handling
- Recommended for quality control only
Fluorescence-based dsDNA assay (Qubit or PicoGreen)
- Specifically detects double-stranded DNA
- Does not discriminate incomplete libraries
qPCR
- Specifically measures full-length libraries
- Detection very sensitive

Library Quantification using qPCR

Library qPCR Overview
- qPCR is designed to quantify only cluster-forming fragments in the samples
- Uses primers complementary to adapters to mimic amplification on the flow cell
- Only amplifies and quantifies library fragments with proper adapters at both ends

Steps for Quantifying Libraries with qPCR
Step 1: Create a control standard curve using a control template of known concentration
Step 2: Run qPCR on the control template standard curve and the unknown libraries
Step 3: Extrapolate the concentration of the unknown libraries from the standard curve
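
The three steps can be sketched in Python as a straight-line fit of quantification cycle (Cq) against log10 concentration, followed by reading an unknown off that line. The dilution series, Cq values, and function names below are made-up assumptions, not values from any kit or protocol.

```python
import math


def fit_standard_curve(concentrations, cq_values):
    """Least-squares fit of Cq = slope * log10(concentration) + intercept."""
    xs = [math.log10(c) for c in concentrations]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cq_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cq_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_cq(cq, slope, intercept):
    """Invert the standard curve to estimate concentration from a measured Cq."""
    return 10 ** ((cq - intercept) / slope)


# Steps 1 and 2: made-up control dilution series (pM) and measured Cq values.
standards_pm = [20.0, 2.0, 0.2, 0.02]
standards_cq = [10.0, 13.3, 16.6, 19.9]
slope, intercept = fit_standard_curve(standards_pm, standards_cq)

# Step 3: estimate an unknown (diluted) library from its Cq.
unknown_pm = concentration_from_cq(14.95, slope, intercept)
```

In practice the library is diluted before the qPCR run, so the estimate would still need to be multiplied by the dilution factor to get the stock concentration.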

Assessing Library Quality with Bioanalyzer

Agilent Bioanalyzer 2100: Overview
[Image from "Bioanalyzer Applications for Next-Gen Sequencing: updates and tips", Agilent Technologies]

Understanding a Bioanalyzer Trace
[Figure labels: lower marker, upper marker, sample peak, baseline]

Understanding a Bioanalyzer Report
[Figure labels: summary page, sample details]

Bioanalyzer Details
- Region can be set in 2100 Expert software
- Average library size
- Don't use to quantify

Calculation of Library Molar Concentration
Library concentration (ng/µl) (fluorometric assay: Qubit, qPCR)
+ Average library size (bp) (Bioanalyzer/TapeStation)
-> Library molar concentration
-> Optimized flow cell clustering and seq. data
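
A minimal Python sketch of that calculation, assuming the commonly used approximation of about 660 g/mol per base pair of double-stranded DNA (this constant is an assumption here, not a value from the slides). The example inputs are made up.

```python
AVG_BP_MASS = 660.0  # assumed average molar mass of one dsDNA base pair, g/mol


def library_molarity_nm(conc_ng_per_ul, avg_size_bp):
    """Convert a mass concentration (ng/µl) and average library size (bp) to nM.

    ng/µl equals 1e-3 g/L; dividing by the fragment's molar mass (g/mol) gives mol/L,
    and multiplying by 1e9 converts to nM, hence the overall factor of 1e6.
    """
    return conc_ng_per_ul / (AVG_BP_MASS * avg_size_bp) * 1e6


# Made-up example: 2 ng/µl library with a 400 bp average size.
molarity = library_molarity_nm(2.0, 400)
print(f"{molarity:.2f} nM")
```

The same mass concentration corresponds to a lower molarity for longer libraries, which is why both the concentration and the average size are needed before diluting to a loading concentration.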

Summary
- Accurate quantitation is critical for maximizing high quality data output
- Library quantitation is especially critical when pooling indexed libraries
Library validation:
- Use the recommended method to quantify final libraries prior to sequencing
- Check library quality using a Bioanalyzer 2100

Garbage In, Garbage Out
Bad sample -> bad library -> bad sequencing data

[Figure: workflow stages labeled by responsibility (user; Illumina; user / Illumina). Taken from: http://rnaseq.uoregon.edu/library_prep.html]

RNA Handling Best Practices: Avoid RNA Degradation
- Harvest RNA quickly
- Use filter pipette tips
- Treat work area and equipment with RNase decontamination solution
- Use RNase-free plastics and solutions
- Store RNA by freezing
- Wear gloves

Best Practices Summary
- Follow the protocol as written
- Take care when adding viscous reagents
- Complete all wash steps
- Follow magnetic bead best practices
- Heat the thermocycler lid during incubations
- Don't over-amplify libraries!
- Validate your libraries for quality and quantity

Questions?