Introduction to DNA microarray technologies. Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang

Similar documents
Microarray Technology

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Data Acquisition. DNA microarrays. The functional genomics pipeline. Experimental design affects outcome data analysis

Software and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University

How many of you have checked out the web site on protein-dna interactions?

Basic Analysis of Microarray Data

REAL TIME PCR USING SYBR GREEN

Introduction To Real Time Quantitative PCR (qpcr)

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey

Recombinant DNA and Biotechnology

HiPer RT-PCR Teaching Kit

Biotechnology: DNA Technology & Genomics

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources

Measuring gene expression (Microarrays) Ulf Leser

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Row Quantile Normalisation of Microarrays

Essentials of Real Time PCR. About Sequence Detection Chemistries

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM

Biotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College

Core Facility Genomics

Microarray Data Analysis. A step by step analysis using BRB-Array Tools

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

OriGene Technologies, Inc. MicroRNA analysis: Detection, Perturbation, and Target Validation

Becker Muscular Dystrophy

Recombinant DNA & Genetic Engineering. Tools for Genetic Manipulation

Frequently Asked Questions Next Generation Sequencing

Quality Assessment of Exon and Gene Arrays

Molecular Biology Techniques: A Classroom Laboratory Manual THIRD EDITION

Genetic Analysis. Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Translation Study Guide

A Primer of Genome Science THIRD

New Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.

Microarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008

Illumina Sequencing Technology

Web-based Tools for the Analysis of DNA Microarrays. End of Project Report. Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G.

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

Structure and Function of DNA

CHROMOSOMES Dr. Fern Tsien, Dept. of Genetics, LSUHSC, NO, LA

restriction enzymes 350 Home R. Ward: Spring 2001

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

Next Generation Sequencing: Technology, Mapping, and Analysis

G E N OM I C S S E RV I C ES

PreciseTM Whitepaper

Real-Time PCR Vs. Traditional PCR

Nucleic Acid Techniques in Bacterial Systematics

Gene mutation and molecular medicine Chapter 15

Next Generation Sequencing

Real time and Quantitative (RTAQ) PCR. so I have an outlier and I want to see if it really is changed

REAL TIME PCR SYBR GREEN

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

Technical Note. Roche Applied Science. No. LC 18/2004. Assay Formats for Use in Real-Time PCR

Activity 7.21 Transcription factors

Real-time qpcr Assay Design Software

Gene Expression Analysis

ncounter Gene Expression Assay Manual Total RNA and Cell Lysate Protocols

Analysis of Illumina Gene Expression Microarray Data

How To Get A Cell Print

Correlation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York

SmartFlare RNA Detection Probes: Principles, protocols and troubleshooting

Validating Microarray Data Using RT 2 Real-Time PCR Products

Interaktionen von RNAs und Proteinen

First Strand cdna Synthesis

Real-time PCR: Understanding C t

CCR Biology - Chapter 9 Practice Test - Summer 2012

Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual

micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved

MUTATION, DNA REPAIR AND CANCER

Viruses. Viral components: Capsid. Chapter 10: Viruses. Viral components: Nucleic Acid. Viral components: Envelope

Illumina TruSeq DNA Adapters De-Mystified James Schiemer

Forensic DNA Testing Terminology

DNA Sequence Analysis

Arabidopsis. A Practical Approach. Edited by ZOE A. WILSON Plant Science Division, School of Biological Sciences, University of Nottingham

Molecular Genetics. RNA, Transcription, & Protein Synthesis

Products - Microarray Scanners - SpotLight Two-Color Microarray Fluorescence Scanners New! SpotLight 2 now available!

Real-time quantitative RT -PCR (Taqman)

Genetics Lecture Notes Lectures 1 2

European Medicines Agency

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

sirna Duplexes & RNAi Explorer

Basic Concepts of DNA, Proteins, Genes and Genomes

PNA BRAF Mutation Detection Kit

MeDIP-chip service report

QPCR Applications using Stratagene s Mx Real-Time PCR Platform

March 19, Dear Dr. Duvall, Dr. Hambrick, and Ms. Smith,

Frequently Asked Questions (FAQ)

User Manual/Hand book. qpcr mirna Arrays ABM catalog # MA003 (human) and MA004 (mouse)

1. Molecular computation uses molecules to represent information and molecular processes to implement information processing.

FACULTY OF MEDICAL SCIENCE

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

bitter is de pil Linos Vandekerckhove, MD, PhD

MiSeq: Imaging and Base Calling

Gene Therapy. The use of DNA as a drug. Edited by Gavin Brooks. BPharm, PhD, MRPharmS (PP) Pharmaceutical Press

2.1.2 Characterization of antiviral effect of cytokine expression on HBV replication in transduced mouse hepatocytes line

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Control of Gene Expression

Formalin fixation at low temperature better preserves nucleic acid integrity. Gianni Bussolati. University of Turin

RNA Viruses. A Practical Approac h. Alan J. Cann

IIID 14. Biotechnology in Fish Disease Diagnostics: Application of the Polymerase Chain Reaction (PCR)

Transcription and Translation of DNA

Transcription:

Introduction to DNA microarray technologies Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang

Outline Basic principles cdna microarrays Affymetrix oligonucleotide chips

DNA microarrays

DNA microarrays DNA microarrays rely on the hybridization properties of nucleic acids to monitor DNA or RNA abundance on a genomic scale in different types of cells. The ancestor of cdna microarrays: the Northern blot.

Hybridization Hybridization refers to the annealing of two nucleic acid strands following the base-pairing rules. Nucleic acid strands in a duplex can be separated, or denatured, by heating to destroy the hydrogen bonds.

Hybridization

Hybridization

Gene expression assays The main types of gene expression assays: Serial analysis of gene expression (SAGE); Short oligonucleotide arrays (Affymetrix); Long oligonucleotide arrays (Agilent Inkjet); Fibre optic arrays (Illumina); cdna arrays (Brown/Botstein).

Transcriptome mrna or transcript levels sensitively reflect the state of a cell. Measuring protein levels (translation) would be more direct but more difficult.

Transcriptome The transcriptome reflects Tissue source: cell type, organ. Tissue activity and state: Stage of development, growth, death. Cell cycle. Disease vs. healthy. Response to therapy, stress.

Applications of microarrays Cancer research: Molecular characterization of tumors on a genomic scale more reliable diagnosis and effective treatment of cancer. Immunology: Study of host genomic responses to bacterial infections; reversing immunity.

Applications of microarrays Compare mrna (transcript) levels in different types of cells, i.e., vary Tissue: liver vs. brain; Treatment: drugs A, B, and C; State: tumor vs. non-tumor, development; Organism: different yeast strains; Timepoint; etc.

cdna microarrays

cdna microarrays Prepare cdna target Hybridize target to microarray

cdna microarrays The relative abundance of a spotted DNA sequence in two DNA or RNA samples may be assessed by monitoring the differential hybridization of these two samples to the sequence on the array. Probes: DNA sequences spotted on the array, immobile substrate. Targets: Nucleic acid samples hybridized to the array, mobile substrate.

cdna microarrays The ratio of the red and green fluorescence intensities for each spot is indicative of the relative abundance of the corresponding DNA probe in the two nucleic acid target samples.

cdna microarrays M = log 2 R/G = log 2 R - log 2 G M < 0, gene is over-expressed in greenlabeled sample compared to red-labeled sample. M = 0, gene is equally expressed in both samples. M > 0, gene is over-expressed in red-labeled sample compared to green-labeled sample.

The process Building the microarray: MASSIVE PCR PCR PURIFICATION AND PREPARATION PREPARING SLIDES PRINTING RNA preparation: CELL CULTURE AND HARVEST Hybing the array: POST PROCESSING RNA ISOLATION ARRAY HYBRIDIZATION AND SCANNING cdna PRODUCTION TARGET LABELING DATA ANALYSIS

The arrayer Ngai Lab arrayer, UC Berkeley Print-head

Print-tips collect cdna from wells Print-tip group 1 cdna clones Glass slide 96-well plate Contains cdna probes Array of bound cdna probes 4x4 blocks = 16 print-tip-groups Print-tip group 7

Sample preparation

Hybridization slip cover Hybridize for 5-12 hours Binding of cdna target samples to cdna probes on the slide

Hybridization chamber 3XSSC HYB CHAMBER ARRAY LIFTER SLIP SLIDE LABEL SLIDE LABEL Humidity Temperature Formamide (Lowers the Tmp)

Scanning Detector PMT Image Cy5: 635nm Cy3: 532nm Duplicate spots

RGB overlay of Cy3 and Cy5 images

Raw data E.g. Human cdna arrays ~43K spots; 16 bit TIFFs: ~ 20Mb per channel; ~ 2,000 x 5,500 pixels per image; Spot separation: ~ 136um; For a typical array, the spot area has mean = 43 pixels, med = 32 pixels, SD = 26 pixels.

Oligonucleotide chips

Probe sets Each gene is represented by 16-20 oligonucleotides of 25 base-pairs, i.e., 25-mers. Perfect match probe, PM: A 25-mer complementary to the reference sequence. Mismatch probe, MM: same as PM but with a single homomeric base change for the middle (13 th ) base. Probe pair. A (PM,MM) pair. Probe set. 16-20 probe pairs. The purpose of the MM probe design is to measure non-specific binding and background noise.

Probe sets

Oligonucleotide chips 030 5!74-077, -7 / 0/!74-00 $ 3 0897,3/0/,-0 0/#9,7 09 43:. 049 /0 574-0 * * * * * M2.2 43841.45 0841,850. 1. 4 43:. 049 /0 574-0 / 1107039.425 02039,7 574-08 2, 041-7 / 0/!74-077, 425 2039841 07 4 /

Oligonucleotide chips The probes are synthesized in situ, using combinatorial chemistry and photolithography. Probe cells are square-shaped features on the chip containing millions of copies of a single 25-mer probe. Sides are 18-50 microns.

Oligonucleotide chips The manufacturing of GeneChip probe arrays is a combination of photolithography and combinational chemistry.

Image analysis About 100 pixels per probe cell. These intensities are combined to form one number representing the expression level for the probe cell oligo. CEL file with PM or MM intensity for each cell.

Expression measures Most expression measures are based on differences of PM-MM. The intention if to correct for background and non-specific binding. E.g. MarrayArray Suite (MAS) v. 4.0 uses Average Difference Intensity (ADI) or AvDiff = average of PM-MM. Problem: MM may also measure signal. More on this in lecture Pre-processing in DNA microarray experiments.

What is the evidence? Lockhart et. al. Nature Biotechnology 14 (1996)

Biological question Statistics and Microarrays Experimental design Microarray experiment Image analysis Normalization Estimation Testing Clustering Discrimination Biological verification and interpretation

Statistical computing Everywhere for statistical design and analysis: pre-processing, estimation, testing, clustering, prediction, etc. for integration with biological information resources (in house and external databases) gene annotation (GenBank, LocusLink); literature (PubMed); graphical (pathways, chromosome maps).

Integration of biological metadata Expression, sequence, structure, annotation, literature. Integration will depend on our using a common language and will rely on database methodology as well as statistical analyses. This area is largely unexplored.

WWW resources Complete guide to microarraying http://cmgm.stanford.edu/pbrown/mguide/ http://www.microarrays.org Parts and assembly instructions for printer and scanner; Protocols for sample prep; Software; Forum, etc. cdna microarray animation http://www.bio.davidson.edu/courses/genomics/chip/c hip.html Affymetrix http://www.affymetrix.com

Next Pre-processing in DNA microarray experiments cdna microarrays Image analysis; Normalization. Affymetrix oligonucleotide chips Image analysis; Normalization; Expression measures.