A Brief Guide to Interpreting the DNA Sequencing Electropherogram Version 3.0



Similar documents
Sanger Sequencing. Troubleshooting Guide. Failed sequence

Introduction. Preparation of Template DNA

Procedures For DNA Sequencing

Dye-Blob message: Example: Generally, this is due to incomplete excess dye removal of the cycle sequence reaction.

DNA SEQUENCING SANGER: TECHNICALS SOLUTIONS GUIDE

DNA Sequencing Troubleshooting Guide

Sequencing Guidelines Adapted from ABI BigDye Terminator v3.1 Cycle Sequencing Kit and Roswell Park Cancer Institute Core Laboratory website

DNA Sequencing Troubleshooting Guide.

Troubleshooting Sequencing Data

DNA Sequencing Handbook

DNA Sequence Analysis

DNA Sequencing Setup and Troubleshooting

Sanger Sequencing and Quality Assurance. Zbigniew Rudzki Department of Pathology University of Melbourne

DNA Core Facility: DNA Sequencing Guide

1/12 Dideoxy DNA Sequencing

Troubleshooting Overview

CHAPTER 5 Troubleshooting DNA Sequencing Data

PyroPhage 3173 DNA Polymerase, Exonuclease Minus (Exo-)

Concepts and methods in sequencing and genome assembly

PicoMaxx High Fidelity PCR System

Welcome to Pacific Biosciences' Introduction to SMRTbell Template Preparation.

Troubleshooting Guide for DNA Electrophoresis

BacReady TM Multiplex PCR System

Recombinant DNA & Genetic Engineering. Tools for Genetic Manipulation

ZR-96 DNA Sequencing Clean-up Kit Catalog Nos. D4052 & D4053

RevertAid Premium First Strand cdna Synthesis Kit

PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR. Results Interpretation Guide

PrimeSTAR HS DNA Polymerase

ZR DNA Sequencing Clean-up Kit

SERVICES CATALOGUE WITH SUBMISSION GUIDELINES

Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology

For information regarding shipping specifications, please refer to the following link:

DNA Detection. Chapter 13

Troubleshooting for PCR and multiplex PCR

Introduction to next-generation sequencing data

Essentials of Real Time PCR. About Sequence Detection Chemistries

Taq98 Hot Start 2X Master Mix

DNA Separation Methods. Chapter 12

Aurora Forensic Sample Clean-up Protocol

Wizard DNA Clean-Up System INSTRUCTIONS FOR USE OF PRODUCT A7280.

An Overview of DNA Sequencing

RT rxns. RT rxns TRANSCRIPTME Enzyme Mix (1) 40 µl 2 x 50 µl 5 x 40 µl

SYBR Green Realtime PCR Master Mix -Plus-

Introduction To Real Time Quantitative PCR (qpcr)

CCR Biology - Chapter 9 Practice Test - Summer 2012

MICB ABI PRISM 310 SEQUENCING GUIDE SEQUENCING OF PLASMID DNA

The Techniques of Molecular Biology: Forensic DNA Fingerprinting

July 7th 2009 DNA sequencing

Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual

HiPer RT-PCR Teaching Kit

DNA FRAGMENT ANALYSIS by Capillary Electrophoresis

ISOLATE II PCR and Gel Kit. Product Manual

Forensic DNA Testing Terminology

GenScript BloodReady TM Multiplex PCR System

Genomic DNA Clean & Concentrator Catalog Nos. D4010 & D4011

Real-Time PCR Vs. Traditional PCR

All-in-One mirna qrt-pcr Reagent Kits For quantitative detection of mature mirna

First Strand cdna Synthesis

Application Guide... 2

DNA Replication in Prokaryotes

ab Hi-Fi cdna Synthesis Kit

Next Generation Sequencing

PCR and Sequencing Reaction Clean-Up Kit (Magnetic Bead System) 50 preps Product #60200

GENOTYPING ASSAYS AT ZIRC

mircute mirna qpcr Detection Kit (SYBR Green)

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

1. Molecular computation uses molecules to represent information and molecular processes to implement information processing.

DNA Sequencing Overview

Cloning Blunt-End Pfu DNA Polymerase- Generated PCR Fragments into pgem -T Vector Systems

Objectives: Vocabulary:

Artisan Scientific is You~ Source for: Quality New and Certified-Used/Pre:-awned ECJuiflment

360 Master Mix. , and a supplementary 360 GC Enhancer.

Mitochondrial DNA Analysis

Genolution Pharmaceuticals, Inc. Life Science and Molecular Diagnostic Products

Automated High Throughput Purification of BigDye TM Terminator Fluorescent DNA Sequencing Reactions Using Wizard MagneSil TM Paramagnetic Particles

WESTERN BLOTTING TIPS AND TROUBLESHOOTING GUIDE TROUBLESHOOTING GUIDE

HighPure Maxi Plasmid Kit

Thermo Scientific Phusion Site-Directed Mutagenesis Kit #F-541

Terra PCR Direct Polymerase Mix User Manual

STRUCTURES OF NUCLEIC ACIDS

TIANquick Mini Purification Kit

AffinityScript QPCR cdna Synthesis Kit

Beginner s Guide to Real-Time PCR

Electrophoresis, cleaning up on spin-columns, labeling of PCR products and preparation extended products for sequencing

Application Note. Biotechnology Explorer Crime Scene Investigator PCR Basics. Kit: A Real-Time PCR Extension

The Biotechnology Education Company

Co Extra (GM and non GM supply chains: Their CO EXistence and TRAceability) Outcomes of Co Extra

RT-PCR: Two-Step Protocol

PreciseTM Whitepaper

Creating Standard Curves with Genomic DNA or Plasmid DNA Templates for Use in Quantitative PCR

SEQUENCING SERVICES. McGill University and Génome Québec Innovation Centre JANUARY 21, Version 3.4

IIID 14. Biotechnology in Fish Disease Diagnostics: Application of the Polymerase Chain Reaction (PCR)

VLLM0421c Medical Microbiology I, practical sessions. Protocol to topic J10

1. Location matters: Primers should flank the DNA you want to amplify

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

Biotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College

- In , Allan Maxam and walter Gilbert devised the first method for sequencing DNA fragments containing up to ~ 500 nucleotides.

Validating Microarray Data Using RT 2 Real-Time PCR Products

DyNAmo cdna Synthesis Kit for qrt-pcr

DNA sequencing. Dideoxy-terminating sequencing or Sanger dideoxy sequencing

Transcription:

A Brief Guide to Interpreting the DNA Sequencing Electropherogram Version 3.0 Plant-Microbe Genomics Facility The Ohio State University 484 W.12 th Ave., Columbus, OH 43210 Ph: 614/247-6204 FAX: 614/247-8696 pmgf@osu.edu www.biosci.ohio-state.edu/~pmgf/ This guide includes an example of high quality sequence as well as many different problems that occur with DNA sequences attained from the 3700 DNA Analyzer in the Plant-Microbe Genomics Facility. For those figures that demonstrate a problem there is a (1) description of the problem, (2) the most likely cause(s), and (3) one or more solutions. If you have additional questions or comments about the figures below, then please do not hesitate to contact the facility. Figure 1 Electropherogram with high quality sequence. 2 Figure 2 Electropherogram that demonstrates the limit of the resolution of the 3700 DNA Analyzer _3 Figure 3 Electropherogram with unincorporated nucleotide peaks. 4 Figure 4 Electropherogram with mobility errors. 5 Figure 5 Electropherogram that demonstrates a lack of extension products, i. e. no bands. 6 Figure 6 Electropherogram that has multiple sequences. 7 Figure 7 Electropherograms that have homon slippage. 8 Figure 8 Electropherogram that has a strong stop. 9 Figure 9 Electropherogram that has a micro-air bubble or debris. 10 Figure 10 Electropherogram that has "the spread". 11 Figure 11 Electropherogram that has Primer N-1. 12 mrz; 9-03 1

Figure 1 Electropherogram with high quality sequence. DNA sequence of high quality is characterized by sharp peaks and little to no background as seen below. DNA sequences of high quality typically result in a read length of 650 to 750 bases with an accuracy of 99%, but in some cases the read length can exceed 900 bases. At this facility the positive control reaction has read length of 720 bases with an accuracy of 99%. 2

Figure 2 Electropherogram that demonstrates the limit of the resolution of the 3700 DNA Analyzer The electropherogram below demonstrates how the bands for the extension products eventually become too wide for proper interpretation by the sequencing analysis software. When the width of the base of the peak begins to approach 1/4 of the peak height, then resolution maybe lost. Even though the absolute sequence is not always correct the sequence is indicative of the bases present. For example the T s (underline and lower case) at 782 and 830 were called as N s by the sequencing program. The program often times inserts bases to accommodate a broad peak, for example the sequence from 801 to 810 should be CTTCTGAG and the sequence from 815 to 820 should be AGTGG. The best solution for this limitation is to manually edit the sequence. 3

Figure 3 Electropherogram with unincorporated nucleotide peaks. The electropherogram below has very large peaks at the beginning of the sequence (scan lines 0 to 320), and these peaks are unincorporated nucleotides that can mask the true sequence of the template. The true sequence for bases 1 to 25 is TAGATTCGGGTACCTTAGTGA. The intensity of the unincorporated peaks varies depending upon the efficiency of the sequencing reaction and the efficiency of the sequencing reaction cleanup procedure (gel filtration or solid phase extraction) after the reaction. The cleanup procedure is performed to remove salts and unincorporated nucleotides prior to electrophoresis. The best solution to this limitation is to edit the sequence manually by looking at the peaks underneath the unincorporated nucleotide peaks. 4

Figure 4 Electropherogram with mobility errors. Mobility errors only occur in the first 100 bases, and are characterized by peaks that are to close together or overlapping. This problem is sequence dependent and caused by limitations in the software s inability to accurately judge the spacing between the peaks in the electropherogram. For example, bases 10 through 16 are CTNNCT due to overlap of the peaks, but the sequence is actually CTCACT. Also, the N s at base 47 and 56 should not be there, but the software is expecting a peak since the previous peaks are shifted to the left. The best solution is to edit the sequence manually. 5

Figure 5 Electropherogram that demonstrates a lack of extension products, i. e. no bands. Large initial peaks from the unincorporated nucleotides followed by a nearly flat line with all 4 colors mixed together characterize the electropherogram below. The causes of this result are numerous including template or primer that is: low quality, low concentration, or degraded. Another cause could be the lack the primer binding site. Also, this may be the result of a simple error on the part of the operator, such as not adding a reagent to the reaction, or a malfunction by the 3700 DNA Analyzer, such as an air bubble in the transfer syringe. The solutions are as varied as the problems described above, e.g. repeating the reaction, purifying the template again or redesigning the primer. 6

Figure 6 Electropherogram that has multiple sequences. The electropherogram demonstrates multiple peaks for each position, and the peaks are in phase with each other. For example, base 34 has a T peak under the G peak. This problem can be due to multiple templates, multiple priming sites or multiple primers. Below is a special case, multiple inserts, in which there are two different plasmids present that share the same vector (bases 1 33), but they have different inserts (bases 34-160). The solution is to separate the plasmids and resequence, e.g. restreaking the bacterial culture that contains the plasmids and then purify the plasmid again. 7

Figure 7 Electropherograms that have homon slippage. The sequence is characterized by a stretch of 5 or more As, Ts, Gs, or Cs that results in poor sequence three prime of this homon area. The problem is caused by the DNA separating and reannealing incorrectly at either base +1, -1, +2, -2, etc. resulting in a distribution of peaks around each base. A mild case homot produces a few N s (a) whereas a severe case of homot results in the sequence appearing as waves (b). The best solution is to sequence the complementary strand in this region. a) b) 8

Figure 8 Electropherogram that has a strong stop. This sequence is characterized by a rapid decline in signal strength across 5 to 15 bases with the sequence three prime of the region at a significantly lower signal. Presumably this is due to some secondary structure that is inhibiting the modified Taq polymerase in the sequencing reaction kit. The problem is commonly associated with G/C rich or G/T rich regions. The best solution is to sequence the complementary strand if possible. Alternative solutions to this problem include (1) using the dgtp sequencing kit, (2) adding DMSO to the sequencing reaction, (3) raising the annealing temperature, (4) extending the denaturing step, (5) using single stranded DNA or (6) any combination of the above. From experience at this facility in many cases none of these solutions were successful. A weak stop is characterized by the same rapid drop in signal strength, but the sequence three prime of the stop is still reliable. 9

Figure 9 Electropherogram that has a micro-air bubble or debris. The sequence is characterized by a large spike of all four colors and is caused by either a micro-air bubble or small debris migrating out of the capillary into the laser path and therefore scattering the light. For example, the spike is at base 450 and is masking the true base at this location. The best solution is to run the sample again on the DNA Analyzer. 10

Figure 10 Electropherogram that has "the spread". Bands that gradually spread out and therefore become irresolvable characterize the sequence that has the spread. This problem is characterized by delayed migration of the extension products, i.e. the wider the first peak the longer the delay. The first peak that is too broad (irresolvable) can be from base 1 to as late as base 400. The problem is caused by a small, ionic contaminant in the DNA preparation. Most commonly the contaminant is associated with plasmids that were purified with a solid phase extraction kit, e.g. Qiaprep or Wizard. This is a common problem that is not well understood. To solve or minimize the problem the facility will add EDTA to the sequencing reaction, repeat the cleanup procedure, and then analyze the reaction again. When this does not work we repeat the reaction with different reaction conditions to minimize the amount of contaminant and maximize the amount of extension products. At the moment we know of no confirmed methods the customer can do to minimize the contaminant in their plasmid preparation. Since there is little the customer can do to address this problem, then the reactions with this problem are automatically rerun without the need for customers to request that the reaction be repeated. 11

Figure 11 Electropherogram that has Primer N-1. The sequence is characterized by having multiple peaks at all of the locations and these peaks are arranged such that the same sequence is present but shifted by one, two or more positions. For example, the C at position 289 is represented by a smaller peak at base 288 and an even smaller peak at base 287. This problem is caused by a significant percentage of the primers being short by 1, 2 or more bases. For example, the primer for the reactions below was 20 bases in length, but many molecules are 19 bases long and a few are 18 bases long. The best solution to the problem is to use another primer and do another sequencing reaction. Another solution is to purify the primer, or order a primer that is purified followed by repeating the sequencing reaction. 12