Microarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008
|
|
- Adrian Horn
- 8 years ago
- Views:
Transcription
1 Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Custom arrays and Probe design Probe design in a pangenomic world Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5 C6 K GlnR C1 C4 C7
2 Why custom microarrays? Tailored specifically to the relevant organism(s) Better coverage than traditional microarray Compare experiments with different isolates Comparative Genomic Hybridization (CGH) Phylogeny, Stronger than e.g. MLST Identification of infection sources Faster and cheaper than sequencing (for now)
3 Probe design for microarrays What is a Probe Different Probe Types What is a good Probe? Cross Hybridization and Complexity Affinity Position Array Design OligoWiz Custom designed pangenomic arrays 32 genomes on one chip!
4 An Ideal Probe Discriminate well between its intended target and all other targets in the target pool Detect concentration differences under the hybridization conditions
5 Probe Type comparisons Advantages Disadvantages PCR products Inexpensive Linkers can be applied Handling problems Hard to design to avoid cross-hybridization Unequal amplification Oligos Can be designed for many criteria Easy to handle Normalized concentrations Linkers can be applied Expensive Affymetrix GeneChip High quality data Standardized arrays Fast to set up Multiple probes per gene Expensive Arrays available for limited number of species
6 How to avoid cross-hybridization From Kane et al. (2000) we learn that a 50 mer probe can detect significant false signal from a target that has >75-80% homology to a 50 mer oligo or a continuous stretch of >15 complementary bases If we have substantial sequence information on the given organism, we can try to avoid this by choosing oligos that are not similar to any other expressed sequences.
7 Probe specificity by %-Mismatch Hughes et al. 2001
8 Regions without similarity to other transcripts 5 The Sequence we want to design a probe for 3 BLAST hits >75% & longer than 15bp 50 bp Regions suitable for probes
9 Filtering self-detecting BLAST hits out The Sequence we want to design a oligo for 5 3 BLAST hits >75% & longer than 15bp Sequence identical or very similar to the query sequence 50 bp Therefore no BLAST hits with homology > 97% and with a hit length vs. query length ratio > 0.8, are considered.
10 Cross-hybridization as a homology score Only BLAST hits that passed filtering are considered If m is the number of BLAST hits considered in position i. Let h=(h1 i,...,hm i) be the BLAST hits in position i in the oligo BLAST max score = i 1 n = 100 max( h1 i,..., hmi ) n 100 n Where n is the length of the oligo Oligo BLAST hits { Max hit in pos. i 100% 0
11 Similar affinity for all oligos Another way of ensuring an optimal discrimination between target and non-target under hybridization is to design all the oligos on an array with similar affinity for their targets Optimal hybridization conditions for all oligos by choosing the right hybridization temperature and salt concentration Commonly, Melting Temperature (Tm) is used as a measure for DNA:DNA or RNA:DNA hybrid affinity
12 Tm distributions for 30 mers and 50 mers
13 ΔTm Distribution for oligo length intervals
14 Melting temperature difference 1000ΔH Tm( i) = log[ Na+ ] Ct A + ΔS + R ln( ) 4 Where ΔH (Kcal/mol) is the sum of the nearest neighbor enthalpy, A is a constant for helix initiation corrections, ΔS is the sum of the nearest neighbor entropy changes, R is the Gas Constant (1.987 cal deg-1 mol-1) and Ct is the total molar concentration of strands. ΔTm score = Tm(i) - 1 N Tm(i) N Where N is all oligos in all sequences.
15 Self-annealing oligos Probes that form strong hybrids with itself should be avoided But, accurate folding algorithms like the one employed by mfold or RNAfold, is too time consuming, for large scale folding of oligos. Time consumption: mfold ~2 sec / 30 mer Pr. gene (500bp) ~16 min.
16 { { { Folding an oligonucleotide: an approximation The alignment is based on dinucleotides Substitution matrix is based on binding energies { { { AT TG CT...CG GT TT AT TG CT...CG GT TT Dynamic programming: alignment to inverted self Minimal loop size border
17 A fast heuristic implementation AT TG CT...CG GT TT AT TG CT...CG GT TT Full dynamic programming calculation for first probe Dynamic programming calculation for second etc. probe Minimal loop size border Last probe Super-alignment matrix
18 Folding prediction compared to mfold OligoWiz vs mfold (DNA folding energy) mfold: folding energy (kcal/mol) OligoWiz: folding energy (kcal/mol)
19 Common sequences result in unspecific signal If the sub-fractions of an oligo are very common we define it as low-complex Oligo with low-complexity: AAAAAAAGGAGTTTTTTTTCAAAAAACTTTTTAAAAAAGCTTTAGGTTTTTA (Human) Oligo without low-complexity: CGTGACTGACAGCTGACTGCTAGCCATGCAACGTCATAGTACGATGACT (Human)
20 Low-complexity For a given transcriptome a list of information content from all words with length wl (8bp) is calculated: I(w) f(w) f(w) = log 2 tf(w) tf ( w) 4 wl Where f(w) is the number of occurrences of a pattern and tf(w) is the total number of patterns of length wl. A low-complexity score for a given oligo is defined as: Low-complexity = 1-norm d i 1 ( = L-wl+ 1 I(w )) i Where norm is a function that normalizes to between 1 and 0, L is the length of the oligo and Wi is the pattern in position i.
21 Location of Oligo within transcript Labeling include reverse transcription of the mrna and is sensitive to: RNA degradation Premature termination of cdna synthesis Premature termination of crna transcription (IVT) A Position Score reflecting this (eukaryotes): Position score= (1-drp)Δ3 end Where drp is the chance of labeling termination pr. base
22 OligoWiz: A Tool for flexible probe design
23 About OligoWiz How and Who OligoWiz 2.0 is a client-server application for designing oligonucleotides for microarrays The OligoWiz client (the graphical interface) is written in Java 1.4 and runs on virtually all platforms The OligoWiz Server performs the heavy-duty computation and is hosted on a multi-cpu Altix server at CBS. OligoWiz is created by Henrik Bjørn Nielsen and Rasmus Wernersson both at the Center for Biological Sequence Analysis at the Technical University of Denmark.
24 About the OligoWiz scores All scores are normalized to a value between 0.0 (worst) and 1.0 (best). All scores are independent and is assigned a user-adjustable weight. A total score is calculated as the sum of all weighted scores and is normalized to a value between 0.0 and 1.0.
25 Extracting annotation from GenBank files -FeatureExtract server -
26 Sequence Features Intron/Exon structure, UTR regions etc. Special purpose arrays Example: Detecting Differential splicing Exon Intron Exon Exon Exon
27 Species databases The species databases are built from complete genomic sequences or UniGene collections in the case of Vertebrates. The databases are used for: Cross hybridization Low-complexity Several hundred species currently available
28 A microarray from 32 genomes Characterization of probiotic E. coli isolates using a novel pangenome microarray H Willenbrock, PF Hallin, T Wassanar and DW Ussery Background: Based on 32 Escherichia coli and Shigella genome sequences, we have developed an E. coli pan-genome microarray. Publicly available genomes were annotated in a consistent manor to define all currently known genes potentially present in the species. The chip design was evaluated by hybridization of DNA from two sequenced E. coli strains, K-12 MG1655 (a commensal) and O157:H7 EDL933 (an enterotoxigenic E. coli). A dual channel and single channel analysis approach was compared for the comparative genomic hybridization experiments. Moreover, the microarray was used to characterize four unsequenced probiotic E. coli strains, currently marketed for beneficial effects on the human gut flora. Conclusion: This high-density microarray provides an excellent tool for characterizing either DNA content or gene expression from unknown E. coli strains.
29 E. coli core genes
30 Genes added to the E. coli pangenome
31 Designing Probes False positives/false negatives Cover all versions of same gene Distinguish between different genes Conservation is not that well-defined Usually related to the function of the product Genetic mutation is (often) gradual
32 Versions of the same gene or different?
33 Designing Probes False positives/false negatives Cover all versions of same gene Distinguish between different genes Conservation is not that well-defined Usually related to the function of the product Genetic mutation is (often) gradual Distinct versions of the same gene! Mutations giving rise to different phenotypes Bias in the types of strains sequenced Sequencing pathogens seems highly fashionable
34 Benchmarking the chips
35 Out with the bad
36 Realistic design principles Version of the same or different? Let the probes decide! If probes can distinguish, then so do we >90% overall identity, no 15-20bp unique stretch We will never achieve 100% coverage Compromising probe quality gives false signal Filtering probes may eliminate false signals, but does so at the expense of coverage Major remaining issue is speed!
37 To build a web-service An extension of OligoWiz Pangenomics Metagenomics You bring your genome(s) Whole genomes Collection of genes You get a list of probes Ready to put on array
Comparative genomic hybridization Because arrays are more than just a tool for expression analysis
Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Carsten Friis ( with several slides from
More informationJust the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)
Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) A typical RNA Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
More informationRETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
More informationAnalysis of gene expression data. Ulf Leser and Philippe Thomas
Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:
More informationMicroarray Technology
Microarrays And Functional Genomics CPSC265 Matt Hudson Microarray Technology Relatively young technology Usually used like a Northern blot can determine the amount of mrna for a particular gene Except
More informationSystematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals
Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh
More informationNetPrimer Manual. PREMIER Biosoft International. 3786 Corina Way, Palo Alto, CA 94303-4504 Tel: 650-856-2703 FAX: 650-618-1773
NetPrimer Manual PREMIER Biosoft International 3786 Corina Way, Palo Alto, CA 94303-4504 Tel: 650-856-2703 FAX: 650-618-1773 E-mail: sales@premierbiosoft.com 1 Copyright 2009 by PREMIER Biosoft International.
More informationEssentials of Real Time PCR. About Sequence Detection Chemistries
Essentials of Real Time PCR About Real-Time PCR Assays Real-time Polymerase Chain Reaction (PCR) is the ability to monitor the progress of the PCR as it occurs (i.e., in real time). Data is therefore collected
More informationBasic Analysis of Microarray Data
Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.
More informationMolecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
More informationShouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center
Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing
More informationCore Facility Genomics
Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray
More informationIntroduction To Real Time Quantitative PCR (qpcr)
Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors
More informationHow many of you have checked out the web site on protein-dna interactions?
How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss
More informationPrimePCR Assay Validation Report
Gene Information Gene Name Gene Symbol Organism Gene Summary Gene Aliases RefSeq Accession No. UniGene ID Ensembl Gene ID papillary renal cell carcinoma (translocation-associated) PRCC Human This gene
More informationNew Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.
New Technologies for Sensitive, Low-Input RNA-Seq Clontech Laboratories, Inc. Outline Introduction Single-Cell-Capable mrna-seq Using SMART Technology SMARTer Ultra Low RNA Kit for the Fluidigm C 1 System
More informationQPCR Applications using Stratagene s Mx Real-Time PCR Platform
QPCR Applications using Stratagene s Mx Real-Time PCR Platform Dan Schoeffner, Ph.D Field Applications Scientist Dan.Schoeffner@Stratagene.com Tech. Services 800-894-1304 Polymerase Chain Reaction Melt
More informationReal-time quantitative RT -PCR (Taqman)
Real-time quantitative RT -PCR (Taqman) Author: SC, Patti Lab, 3/03 This is performed as a 2-step reaction: 1. cdna synthesis from DNase 1-treated total RNA 2. PCR 1. cdna synthesis (Advantage RT-for-PCR
More informationREAL TIME PCR USING SYBR GREEN
REAL TIME PCR USING SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM
More informationAppendix 2 Molecular Biology Core Curriculum. Websites and Other Resources
Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold
More informationCorrelation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York
Correlation of microarray and quantitative real-time PCR results Elisa Wurmbach Mount Sinai School of Medicine New York Microarray techniques Oligo-array: Affymetrix, Codelink, spotted oligo-arrays (60-70mers)
More informationFrequently Asked Questions Next Generation Sequencing
Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided
More informationPrimePCR Assay Validation Report
Gene Information Gene Name sorbin and SH3 domain containing 2 Gene Symbol Organism Gene Summary Gene Aliases RefSeq Accession No. UniGene ID Ensembl Gene ID SORBS2 Human Arg and c-abl represent the mammalian
More informationGenetic Analysis. Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis
Genetic Analysis Phenotype analysis: biological-biochemical analysis Behaviour under specific environmental conditions Behaviour of specific genetic configurations Behaviour of progeny in crosses - Genotype
More informationMaterials and Methods. Blocking of Globin Reverse Transcription to Enhance Human Whole Blood Gene Expression Profiling
Application Note Blocking of Globin Reverse Transcription to Enhance Human Whole Blood Gene Expression Profi ling Yasmin Beazer-Barclay, Doug Sinon, Christopher Morehouse, Mark Porter, and Mike Kuziora
More informationFinal Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
More informationTyping in the NGS era: The way forward!
Typing in the NGS era: The way forward! Valeria Michelacci NGS course, June 2015 Typing from sequence data NGS-derived conventional Multi Locus Sequence Typing (University of Warwick, 7 housekeeping genes)
More informationIntroduction to Genome Annotation
Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT
More informationData Acquisition. DNA microarrays. The functional genomics pipeline. Experimental design affects outcome data analysis
Data Acquisition DNA microarrays The functional genomics pipeline Experimental design affects outcome data analysis Data acquisition microarray processing Data preprocessing scaling/normalization/filtering
More informationIntroduction to Quantitative PCR
Introduction to Quantitative PCR Methods and Applications Guide Introduction to Quantitative PCR Methods and Applications Guide IN 70200 D US and Canada Orders: 800-227-9770 x3 Technical Service: 800-227-9770
More informationPreciseTM Whitepaper
Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis
More informationData Analysis for Ion Torrent Sequencing
IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page
More informationThe world of non-coding RNA. Espen Enerly
The world of non-coding RNA Espen Enerly ncrna in general Different groups Small RNAs Outline mirnas and sirnas Speculations Common for all ncrna Per def.: never translated Not spurious transcripts Always/often
More informationTo be able to describe polypeptide synthesis including transcription and splicing
Thursday 8th March COPY LO: To be able to describe polypeptide synthesis including transcription and splicing Starter Explain the difference between transcription and translation BATS Describe and explain
More informationSoftware and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University
Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation
More informationThermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual
Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual F- 470S 20 cdna synthesis reactions (20 µl each) F- 470L 100 cdna synthesis reactions (20 µl each) Table of contents 1. Description...
More informationStandards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium
Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable
More informationCurrent Motif Discovery Tools and their Limitations
Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.
More informationFocusing on results not data comprehensive data analysis for targeted next generation sequencing
Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes
More informationMATCH Commun. Math. Comput. Chem. 61 (2009) 781-788
MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 61 (2009) 781-788 ISSN 0340-6253 Three distances for rapid similarity analysis of DNA sequences Wei Chen,
More informationMeasuring gene expression (Microarrays) Ulf Leser
Measuring gene expression (Microarrays) Ulf Leser This Lecture Gene expression Microarrays Idea Technologies Problems Quality control Normalization Analysis next week! 2 http://learn.genetics.utah.edu/content/molecules/transcribe/
More informationHow Sequencing Experiments Fail
How Sequencing Experiments Fail v1.0 Simon Andrews simon.andrews@babraham.ac.uk Classes of Failure Technical Tracking Library Contamination Biological Interpretation Something went wrong with a machine
More informationBioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
More informationTranscription and Translation of DNA
Transcription and Translation of DNA Genotype our genetic constitution ( makeup) is determined (controlled) by the sequence of bases in its genes Phenotype determined by the proteins synthesised when genes
More informationBiotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College
Biotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College Primary Source for figures and content: Eastern Campus Tortora, G.J. Microbiology
More informationReal-Time PCR Vs. Traditional PCR
Real-Time PCR Vs. Traditional PCR Description This tutorial will discuss the evolution of traditional PCR methods towards the use of Real-Time chemistry and instrumentation for accurate quantitation. Objectives
More informationTechnical Note. Roche Applied Science. No. LC 18/2004. Assay Formats for Use in Real-Time PCR
Roche Applied Science Technical Note No. LC 18/2004 Purpose of this Note Assay Formats for Use in Real-Time PCR The LightCycler Instrument uses several detection channels to monitor the amplification of
More informationPairwise Sequence Alignment
Pairwise Sequence Alignment carolin.kosiol@vetmeduni.ac.at SS 2013 Outline Pairwise sequence alignment global - Needleman Wunsch Gotoh algorithm local - Smith Waterman algorithm BLAST - heuristics What
More informationNGS data analysis. Bernardo J. Clavijo
NGS data analysis Bernardo J. Clavijo 1 A brief history of DNA sequencing 1953 double helix structure, Watson & Crick! 1977 rapid DNA sequencing, Sanger! 1977 first full (5k) genome bacteriophage Phi X!
More informationMeDIP-chip service report
MeDIP-chip service report Wednesday, 20 August, 2008 Sample source: Cells from University of *** Customer: ****** Organization: University of *** Contents of this service report General information and
More informationInteraktionen von RNAs und Proteinen
Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 Studying RNA-protein interactions Given: target protein known to bind to RNA problem: find binding partners and binding sites experimental
More informationGene Expression Assays
APPLICATION NOTE TaqMan Gene Expression Assays A mpl i fic ationef ficienc yof TaqMan Gene Expression Assays Assays tested extensively for qpcr efficiency Key factors that affect efficiency Efficiency
More informationIntroduction to NGS data analysis
Introduction to NGS data analysis Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Sequencing Illumina platforms Characteristics: High
More informationStructure and Function of DNA
Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four
More informationBLAST. Anders Gorm Pedersen & Rasmus Wernersson
BLAST Anders Gorm Pedersen & Rasmus Wernersson Database searching Using pairwise alignments to search databases for similar sequences Query sequence Database Database searching Most common use of pairwise
More informationQuality Assessment of Exon and Gene Arrays
Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such
More informationNext Generation Sequencing
Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977
More informationMolecular typing of VTEC: from PFGE to NGS-based phylogeny
Molecular typing of VTEC: from PFGE to NGS-based phylogeny Valeria Michelacci 10th Annual Workshop of the National Reference Laboratories for E. coli in the EU Rome, November 5 th 2015 Molecular typing
More informationGenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
More informationGenetics Lecture Notes 7.03 2005. Lectures 1 2
Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several
More informationSearching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
More informationCompleteⅡ 1st strand cdna Synthesis Kit
Instruction Manual CompleteⅡ 1st strand cdna Synthesis Kit Catalog # GM30401, GM30402 Green Mountain Biosystems. LLC Web: www.greenmountainbio.com Tel: 800-942-1160 Sales: Sales@ greenmountainbio.com Support:
More informationA Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
More informationUCHIME in practice Single-region sequencing Reference database mode
UCHIME in practice Single-region sequencing UCHIME is designed for experiments that perform community sequencing of a single region such as the 16S rrna gene or fungal ITS region. While UCHIME may prove
More informationPATHOGEN DETECTION SYSTEMS BY REAL TIME PCR. Results Interpretation Guide
PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR Results Interpretation Guide Pathogen Detection Systems by Real Time PCR Microbial offers real time PCR based systems for the detection of pathogenic bacteria
More informationTeaching Bioinformatics to Undergraduates
Teaching Bioinformatics to Undergraduates http://www.med.nyu.edu/rcr/asm Stuart M. Brown Research Computing, NYU School of Medicine I. What is Bioinformatics? II. Challenges of teaching bioinformatics
More informationBiotechnology: DNA Technology & Genomics
Chapter 20. Biotechnology: DNA Technology & Genomics 2003-2004 The BIG Questions How can we use our knowledge of DNA to: diagnose disease or defect? cure disease or defect? change/improve organisms? What
More informationGenetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism )
Biology 1406 Exam 3 Notes Structure of DNA Ch. 10 Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Proteins
More informationGene Models & Bed format: What they represent.
GeneModels&Bedformat:Whattheyrepresent. Gene models are hypotheses about the structure of transcripts produced by a gene. Like all models, they may be correct, partly correct, or entirely wrong. Typically,
More informationComparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
More informationBioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
More informationHow To Understand How Gene Expression Is Regulated
What makes cells different from each other? How do cells respond to information from environment? Regulation of: - Transcription - prokaryotes - eukaryotes - mrna splicing - mrna localisation and translation
More informationHierarchical Bayesian Modeling of the HIV Response to Therapy
Hierarchical Bayesian Modeling of the HIV Response to Therapy Shane T. Jensen Department of Statistics, The Wharton School, University of Pennsylvania March 23, 2010 Joint Work with Alex Braunstein and
More information2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.
1. True or False? A typical chromosome can contain several hundred to several thousand genes, arranged in linear order along the DNA molecule present in the chromosome. True 2. True or False? The sequence
More informationReal-time qpcr Assay Design Software www.qpcrdesign.com
Real-time qpcr Assay Design Software www.qpcrdesign.com Your Blueprint For Success Informational Guide 2199 South McDowell Blvd Petaluma, CA 94954-6904 USA 1.800.GENOME.1(436.6631) 1.415.883.8400 1.415.883.8488
More informationProtein Synthesis How Genes Become Constituent Molecules
Protein Synthesis Protein Synthesis How Genes Become Constituent Molecules Mendel and The Idea of Gene What is a Chromosome? A chromosome is a molecule of DNA 50% 50% 1. True 2. False True False Protein
More informationCRAC: An integrated approach to analyse RNA-seq reads Additional File 3 Results on simulated RNA-seq data.
: An integrated approach to analyse RNA-seq reads Additional File 3 Results on simulated RNA-seq data. Nicolas Philippe and Mikael Salson and Thérèse Commes and Eric Rivals February 13, 2013 1 Results
More informationGenBank: A Database of Genetic Sequence Data
GenBank: A Database of Genetic Sequence Data Computer Science 105 Boston University David G. Sullivan, Ph.D. An Explosion of Scientific Data Scientists are generating ever increasing amounts of data. Relevant
More informationPROC. CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE 2006 1. E-mail: msm_eng@k-space.org
BIOINFTool: Bioinformatics and sequence data analysis in molecular biology using Matlab Mai S. Mabrouk 1, Marwa Hamdy 2, Marwa Mamdouh 2, Marwa Aboelfotoh 2,Yasser M. Kadah 2 1 Biomedical Engineering Department,
More information13.4 Gene Regulation and Expression
13.4 Gene Regulation and Expression Lesson Objectives Describe gene regulation in prokaryotes. Explain how most eukaryotic genes are regulated. Relate gene regulation to development in multicellular organisms.
More informationDiscovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat
Bioinformatique et Séquençage Haut Débit, Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat 1 RNA Transcription to RNA and subsequent
More informationWelcome to the Plant Breeding and Genomics Webinar Series
Welcome to the Plant Breeding and Genomics Webinar Series Today s Presenter: Dr. Candice Hansey Presentation: http://www.extension.org/pages/ 60428 Host: Heather Merk Technical Production: John McQueen
More informationMORPHEUS. http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix.
MORPHEUS http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix. Reference: MORPHEUS, a Webtool for Transcripton Factor Binding Analysis Using
More informationGenetics Module B, Anchor 3
Genetics Module B, Anchor 3 Key Concepts: - An individual s characteristics are determines by factors that are passed from one parental generation to the next. - During gamete formation, the alleles for
More informationA greedy algorithm for the DNA sequencing by hybridization with positive and negative errors and information about repetitions
BULLETIN OF THE POLISH ACADEMY OF SCIENCES TECHNICAL SCIENCES, Vol. 59, No. 1, 2011 DOI: 10.2478/v10175-011-0015-0 Varia A greedy algorithm for the DNA sequencing by hybridization with positive and negative
More informationGene Expression Analysis
Gene Expression Analysis Jie Peng Department of Statistics University of California, Davis May 2012 RNA expression technologies High-throughput technologies to measure the expression levels of thousands
More informationInfluence of GSM and UMTS on the Blood Brain Barrier in vitro additional results
Influence of GSM and UMTS on the Blood Brain Barrier in vitro additional results Intl. Workshop on long term effects, München, 11.-12. Okt. 2007 Dr. rer. nat. Helmut Franke Klinik und Poliklinik für Neurologie
More informationBecker Muscular Dystrophy
Muscular Dystrophy A Case Study of Positional Cloning Described by Benjamin Duchenne (1868) X-linked recessive disease causing severe muscular degeneration. 100 % penetrance X d Y affected male Frequency
More informationDNA Sequencing Overview
DNA Sequencing Overview DNA sequencing involves the determination of the sequence of nucleotides in a sample of DNA. It is presently conducted using a modified PCR reaction where both normal and labeled
More informationLecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides
Lecture Series 7 From DNA to Protein: Genotype to Phenotype Reading Assignments Read Chapter 7 From DNA to Protein A. Genes and the Synthesis of Polypeptides Genes are made up of DNA and are expressed
More informationDescription: Molecular Biology Services and DNA Sequencing
Description: Molecular Biology s and DNA Sequencing DNA Sequencing s Single Pass Sequencing Sequence data only, for plasmids or PCR products Plasmid DNA or PCR products Plasmid DNA: 20 100 ng/μl PCR Product:
More information2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three
Chem 121 Chapter 22. Nucleic Acids 1. Any given nucleotide in a nucleic acid contains A) two bases and a sugar. B) one sugar, two bases and one phosphate. C) two sugars and one phosphate. D) one sugar,
More informationSingle Nucleotide Polymorphisms (SNPs)
Single Nucleotide Polymorphisms (SNPs) Additional Markers 13 core STR loci Obtain further information from additional markers: Y STRs Separating male samples Mitochondrial DNA Working with extremely degraded
More informationRecombinant DNA and Biotechnology
Recombinant DNA and Biotechnology Chapter 18 Lecture Objectives What Is Recombinant DNA? How Are New Genes Inserted into Cells? What Sources of DNA Are Used in Cloning? What Other Tools Are Used to Study
More informationName Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.
13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both
More informationRow Quantile Normalisation of Microarrays
Row Quantile Normalisation of Microarrays W. B. Langdon Departments of Mathematical Sciences and Biological Sciences University of Essex, CO4 3SQ Technical Report CES-484 ISSN: 1744-8050 23 June 2008 Abstract
More informationSICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE
AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,
More informationGene expression analysis. Ulf Leser and Karin Zimmermann
Gene expression analysis Ulf Leser and Karin Zimmermann Ulf Leser: Bioinformatics, Wintersemester 2010/2011 1 Last lecture What are microarrays? - Biomolecular devices measuring the transcriptome of a
More informationCONCEPTS ON MICROARRAY DESIGN FOR GENOME AND TRANSCRIPTOME ANALYSES
CHAPTER 11 CONCEPTS ON MICROARRAY DESIGN FOR GENOME AND TRANSCRIPTOME ANALYSES HELDER I. NAKAYA, EDUARDO M. REIS, SERGIO VERJOVSKI-ALMEIDA Departamento de Bioquimica, Instituto de Quimica, Universidade
More information