Serial Analysis of Gene Expression (SAGE)

Similar documents
Recombinant DNA & Genetic Engineering. Tools for Genetic Manipulation

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

Nucleic Acid Techniques in Bacterial Systematics

Essentials of Real Time PCR. About Sequence Detection Chemistries

Biotechnology: DNA Technology & Genomics

Introduction To Real Time Quantitative PCR (qpcr)

PreciseTM Whitepaper

Recombinant DNA and Biotechnology

restriction enzymes 350 Home R. Ward: Spring 2001

How many of you have checked out the web site on protein-dna interactions?

Introduction to SAGEnhaft

HiPer RT-PCR Teaching Kit

1. Molecular computation uses molecules to represent information and molecular processes to implement information processing.

Co Extra (GM and non GM supply chains: Their CO EXistence and TRAceability) Outcomes of Co Extra

Technical Note. Roche Applied Science. No. LC 18/2004. Assay Formats for Use in Real-Time PCR

Real-Time PCR Vs. Traditional PCR

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Mir-X mirna First-Strand Synthesis Kit User Manual

Biotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College

Next Generation Sequencing

REAL TIME PCR USING SYBR GREEN

First Strand cdna Synthesis

Gene Expression Assays

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

Concepts and methods in sequencing and genome assembly

RevertAid Premium First Strand cdna Synthesis Kit

CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA

DNA Sequence Analysis

Microarray Technology

Genetic Analysis. Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium

How is genome sequencing done?

Real-time qpcr Assay Design Software

Validating Microarray Data Using RT 2 Real-Time PCR Products

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

New Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.

Next Generation Sequencing

Table of Contents. I. Description II. Kit Components III. Storage IV. 1st Strand cdna Synthesis Reaction... 3

quantitative real-time PCR, grain, simplex DNA extraction: PGS0426 RT-PCR: PGS0494 & PGS0476

STUDIES ON SEED STORAGE PROTEINS OF SOME ECONOMICALLY MINOR PLANTS

Real-time PCR: Understanding C t

Creating Standard Curves with Genomic DNA or Plasmid DNA Templates for Use in Quantitative PCR

Global MicroRNA Amplification Kit

Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual

Mir-X mirna First-Strand Synthesis and SYBR qrt-pcr

HCS Exercise 1 Dr. Jones Spring Recombinant DNA (Molecular Cloning) exercise:

Reverse Transcription System

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Illumina Sequencing Technology

Introduction to next-generation sequencing data

Molecular Biology Techniques: A Classroom Laboratory Manual THIRD EDITION

Analysis of DNA methylation: bisulfite libraries and SOLiD sequencing

CompleteⅡ 1st strand cdna Synthesis Kit

An Overview of DNA Sequencing

PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR. Results Interpretation Guide

Recombinant DNA Technology

MicroRNA formation. 4th International Symposium on Non-Surgical Contraceptive Methods of Pet Population Control

OriGene Technologies, Inc. MicroRNA analysis: Detection, Perturbation, and Target Validation

QPCR Applications using Stratagene s Mx Real-Time PCR Platform

Molecular Cloning, Product Brochure

Real-time quantitative RT -PCR (Taqman)

Frequently Asked Questions Next Generation Sequencing

Real time and Quantitative (RTAQ) PCR. so I have an outlier and I want to see if it really is changed

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

Transfection-Transfer of non-viral genetic material into eukaryotic cells. Infection/ Transduction- Transfer of viral genetic material into cells.

Genome Editing TOOLS TO SUPPORT CRISPR/CAS9 APPLICATIONS

CCR Biology - Chapter 9 Practice Test - Summer 2012

Forensic DNA Testing Terminology

Single Nucleotide Polymorphisms (SNPs)

Next generation DNA sequencing technologies. theory & prac-ce

PrimeSTAR HS DNA Polymerase

Recombinant DNA Unit Exam

qstar mirna qpcr Detection System

Introduction to Quantitative PCR

Description: Molecular Biology Services and DNA Sequencing

PicoMaxx High Fidelity PCR System

New generation sequencing: current limits and future perspectives. Giorgio Valle CRIBI - Università di Padova

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

QUANTITATIVE RT-PCR. A = B (1+e) n. A=amplified products, B=input templates, n=cycle number, and e=amplification efficiency.

Next Generation Sequencing for DUMMIES

2.500 Threshold e Threshold. Exponential phase. Cycle Number

Welcome to Pacific Biosciences' Introduction to SMRTbell Template Preparation.

Transcription in prokaryotes. Elongation and termination

Methods and Application Guide. Introduction to Quantitative PCR

All-in-One First-Strand cdna Synthesis Kit

Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology

4. DNA replication Pages: Difficulty: 2 Ans: C Which one of the following statements about enzymes that interact with DNA is true?

Bacterial Transformation and Plasmid Purification. Chapter 5: Background

IMBB Genomic DNA purifica8on

Data Analysis for Ion Torrent Sequencing

Protocol. Introduction to TaqMan and SYBR Green Chemistries for Real-Time PCR

TITLE: The Single Prostate Cell Transcriptome as Biological Assay

European Medicines Agency

Factors Influencing Multiplex Real-Time PCR

Rapid Acquisition of Unknown DNA Sequence Adjacent to a Known Segment by Multiplex Restriction Site PCR

Beginner s Guide to Real-Time PCR

Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG)

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data

Universidade Estadual de Maringá

Microarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008

Transcription:

Serial Analysis of Gene Expression (SAGE) SAGE analysis is a method derived to provide a read-out, via sequencing, of the spectrum of genes being expressed in a cell. A method for comprehensive analysis of gene expression patterns Serial sequencing of 15-bp tags unique to each gene Each sequencing run can generate sequence for up to 50 high quality tags At least 50,000 tags are required per sample to approach saturation Costs upward of $5000 per sample Unbiased relative to microarrays Choice of reference sample Hybridization artifacts Clone representation Three principles underlie the SAGE methodology 1. A short sequence tags (10-15 bp) contains sufficient information to uniquely identify a transcript provided that the tag is obtained from a unique position within each transcript 2. Sequence tags can be linked together to form long serial molecules that can be cloned and sequenced 3. Quantification of the number of times a particular tag is observed provides the expression level of the corresponding transcript An individual 9bp tag is sufficient to describe all the genes in humans, since 4 9 equals 262,144, which is much larger than any of the proposed number of genes in the human genome (20,000-40,000). Extra stringency step that facilitates gene identification is that the tag must include the 3 most anchoring site in a predicted transcript. A fraction of genes will have multiple tags due to alternative splicing near the 3 end, or use of alternative polyadenylation sites; but for the most part these can be identified. The number of times a specific tag is found in the SAGE sequences reflects its abundance in the mrna population. Unlike array and chip methods, you do not have to make cdnas and ESTs. The expression information derives from SAGE tags, which are produced as part of the analysis. Sequence information is required to assign the tags to individual ORFs. However, unassigned SAGE tags are also useful (in species for which the complete genomes have not been sequenced, unassigned tags will be encountered frequently). They can be used to pull out promoters from genomic clones, to provide information about coordinated gene regulation, and to identify previously unknown genes. Quantitative comparison of SAGE samples is not always easy to interpret. A tag present in four copies in one sample of 50,000 tags and two copies in another may actually be twofold induced or the difference is due to random sampling.

Profile of the genes expressed in the human peripheral retina, macula, and retinal pigment epithelium determined through serial analysis of gene expression (SAGE) Dror Sharon, Seth Blackshaw, Constance L. Cepko, and Thaddeus P. Dryja PNAS 2002 99(1): 315-320

Use of Long SAGE Gene discovery by hybridization of SAGE tags to large genomes requires that these tags be unique to transcripts Used Type II restriction enzymes that generated 21 bp tags >99.8% of these should occur only once in the human genome Serial Analysis of Nuclear Transcript Accumulation (SANTA) Purify nuclei by flow sorting. Extract polya+ RNA. Subject this to SAGE analysis. drawback of this technique is that the process is costly in terms of sequencing sufficient ditags for in-depth genome coverage. Microbeads or LYNX technology Combines the flexibility of hybridization based procedures with the precision offered by tag sequencing. LYNX (www.lynxgen.com) technologies are microbeads (MegaClone), MegaSort ( fluid microarray ) massively parallel signature sequencing (MPSS) (see Brenner et al. 2000, PNAS 97: 1665-1670)/ This technology comprises three components: In vitro cloning and analysis of complex mixtures of DNA on microbeads. Physical separation of differentially expressed cdna molecules. Massively-parallel DNA sequencing. Methodology Construction of a cdna library on the surface of microbeads Every microbead displays on its surface about 100,000 copies of a specific cdna Unlike bacterial clones, these clones cannot be propagated, but each microbead can be isolated and the nucleic acid attached to it can be analyzed.

Principle of the method Fluorophore PCR handle cdna Vector Address tag Amplify and expose address tag Address tag Each nucleic acid molecule first is labeled with an oligonucleotide address tag within a library. The resulting library tag molecules are amplified using PCR. After amplification, a sample of n molecules yields a library containing n families of clones, each identified by a unique tag. BEAD This permits the cdna-tag molecules to be collected on a microbead that carries the complement (antitag) to a family s unique tag.

cdna mrna Convert to cdna The bulk mrna is converted to cdna using oligo -dt primers The 3 sequences of the mrna with their polya + tails, are then cloned into a plasmid library of address tags. cdna Vector Address tag PCR handle Amplify Expose address tag The entire library is amplified with common PCR primers, one of which introduces a fluorophore at the end of the cdna. Attach to microbeads Ligate Finally, the address tags are rendered single stranded and annealed to the complementary capture oligonucleotide on the surface of microbeads. Ligation attaches one strand of the DNA permanently to the beads During the 72 h hybridization process in the last step, each cdnaaddress tag conjugate becomes anchored to their host microbead. The first step in the construction of this library is the attachment of a different 32-mer capture oligonucleotide to the surface of each microbead by combinatorial synthesis. They used a special DNA language with a vocabulary made up of eight four-base words : TTAC, AATC, TACT, ATCA, ACAT, TCTA, CTTT, and CAAA. Each tag consists of eight words (4 x 8 = 32 bp). Each word uses only three (A,T, and C) of the four DNA bases, and differs from all the other words in three of the four bases.

All words form, with the respective complement, one G:C and three A:T base pairs. This eliminates self-complementarity within any sequence made up of the word, and prevents their cleavage by any restriction enzyme with symmetrical recognition sites for G and C. Example 5 -TACT.TTAC.ACAT.ATCA.CTTT.CAAA.AATC 3 3 -ATGA.AATG.TGTA.TAGT.GAAA.GTTT.TTAG 5 Because all the eight-word tag/anti-tag duplexes have the same base pair composition (24 A:T and eight G:C), all the duplexes should have about the same T m. Because all tag sequences differ from their nearest neighbors by at least one word, or at least three base mismatches, there is a good discrimination between perfect matches and one word mismatches. A total of 16,777,218 eight word tags are available which is far in excess of the number of unique transcripts fragments (usually about 50,000). Consequently each transcript molecule receives a unique signature, and each type of transcript will be represented by a number of different clones (microbeads) in proportion to its abundance in the transcriptome. Tags were prepared by eight rounds of combin-atorial synthesis wherein each word, w1-w8, is added from a separate column of a DNA synthesizer. After the eighth round, a portion of the microbeads were separated for further synthesis of a 5 primer binding site, followed by cleavage, amplification, and insertion into the vector

Differential expression ( fluid microarrays ) This approach physically extracts clones that are differentially represented in two libraries After melting off the non-covalently attached DNA strand from the microbeads, the remaining strands are available for competitive hybridization with two probes labeled with different fluorophores, to produce fluid microarrays of millions of microbeads. The ratio of probes hybridizing to each bead can easily be measured using Fluorescent- Activated Cell Sorter (FACS). The ability to generate >10 6 microbeads carrying different cdna clones allows identification and selection of sequences differentially regulated at very low levels Massively Parallel Signature Sequencing (MPSS) It utilizes tagged cdnas cloned on microbeads: The fluid microarray is analyzed in a flow cell. A novel signature sequencing approach is used: Type II restriction endonucleases. Ligation of adapters. Fluorescent molecules bound to decoders. Does not require gels. Does not require DNA fragment sizing. Reading is done by a multispectral optical imager. The process is statistically robust. Immobilization of the microbeads in the flow cell Reagents are pumped in and out through a microbead monolayer

The flow cell is mounted on a confocal microscope Sequencing does not require physical separation of fragments Signatures are 16-20 bp in length Very highly parallel (many millions of sequences at one time)