Transcription factor omics
|
|
- Prosper Campbell
- 8 years ago
- Views:
Transcription
1 Transcription factor omics Comparative transcriptomics and DNA-protein interactomics disclose novel insights into transcription factor omics. Factor I C A G A G T A G T A G Factor II C A G A G T A G C TFx TFx A A A T T TFx TBP T A A Pol II TFx cis-element cis-element TATA-Box Dierk Wanke dierk.wanke@zmbp.uni-tuebingen.de 1
2 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Metabolomics Proteome DNA RNA Protein 2 (modified from Pevsner 2003)
3 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Metabolomics Proteome Besides the four classical omes, a number of derived omes exist: with Interactome being probably the most important one! There is high interdependency between the different omes 3
4 Individual history Developmental stage Previous stress Diurnal changes and circadian rhythm Tissue / organ specificity Output interpretation Omics analysis Condition laboratory / environmental setting quantitative OMICS Organism responses Documentation Network analysis Species level responses Genetic engineering Kilian et al. (2011)
5 Quantitative vs qualitative analyses Qualitative Analysis empirical data Yes / No / Maybe Quantitative Analysis mathematical data significance 5
6 Quantitative vs qualitative analyses Acquiring of quantitative data of subcellular processes with high spatiotemporal resolution in living (plant) cells in their native tissue environment Quantitative data Dynamics same direction, but different speed! 6
7 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Proteome Metabolomics qualitative data only! invariable given population (basic population) e.g. number of genes, number of bp easy to define a background model relatively easy to analyze f(x) = observed P expected P significance can be analyzed by Hypergeometric p 7
8 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Proteome Metabolomics first real omics mostly qualitative information (e.g. different from control? Yes/No) data is highly variable depending on type experiment / researcher / method of detection a background model can not be defined significance can be inferred but not measured by values 8
9 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Proteome Metabolomics qualitative and quantitative readout data is highly variable depending on type experiment / researcher / method of extraction analysis is driven by researcher s objectives due to high variability and data close to detection limit, metabolomes are very hard to analyze by statistics 9
10 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transcriptome Metabolomics Proteome qualitative and quantitative readout data is highly variable depending on type experiment / researcher / method of detection various background models can be defined, which all end up with different results! - Analysis is driven by researcher s objectives various values can be computed to support significance of results 10
11 Principle of functional genomics Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transkriptome Proteome Metabolomics Interestingly, due to the quantitative nature of the data types, both methods cope with similar problems e.g.: accession with low abundance, are sometimes below detection limit and have high variability large standard deviation / error bars 11 accession with high abundance, are so frequent that they compete with others or they saturate detection small error bars, due to signal saturation
12 Transcriptome Transcriptome Protein mrna Transcript transcription-start DNA ATG Introns * cis cis TATA cis cis distal proximal Exons Promoter gene sequence Transcriptome: all transcripts at a given time 12 2 current methods in the field: - microarrays - Next Generation Sequencing (NGS)
13 Microarray Transcriptome 13 more than 50 Mio. probe sets on 2 cm x 2 cm
14 Microarray Transcriptome Oligonucleotide-Array (Genechip-Arrays): PerfectMatch (PM) AGATGATAGACAGAGCAGATGCTTG MisMatch (MM) AGATGATAGACAGACCAGATGCTTG PM MM Signal Ratio
15 Microarray Transcriptome Oligonucleotide-Array (Genechip-Arrays):
16 Microarray At2g19200 FRK1 (At2g19190) At2g19180 GeneChip Array Exon Array Tiling Array qualitative and quantitative readout BUT only the GeneChip Arrays have internal controls (mismatch probes) Exon- and Tiling Array are not assigned to ONE specific target (similar sequences in the genomes will cross-hybridize and add background) Data between different types of arrays is of low comparability
17 Transcriptome Transcriptome Protein mrna Transcript transcription-start DNA ATG Introns * cis cis TATA cis cis distal proximal Exons Promoter gene sequence Transcriptome: all transcripts at a given time and tissue sample 2 current methods dominate the field: - microarrays - Next Generation Sequencing (NGS) 17
18 Next Generation Sequencing (NGS) Transcriptome
19 Next Generation Sequencing (NGS) At2g19200 FRK1 (At2g19190) At2g19180 Genome-Seq no sequence low coverage no informative data talk by Prof. Cole very high sequence no sequence coverage low coverage over-sampling, no informative data satuartion
20 Next Generation Sequencing (NGS) At2g19200 FRK1 (At2g19190) At2g19180 Genome-Seq RNA-Seq expression signal, but there is no annotated gene badly annotated, microrna-gene, differential splicing
21 Next Generation Sequencing (NGS) At2g19200 FRK1 (At2g19190) At2g19180 Genome-Seq low expression signal, (low coverage?) badly annotated, RNA-Seq cryptic intron in gene model, differential splicing
22 Transcriptome: What does ANALYSIS mean? Transcriptome Challenge: Find differentially expressed genes! Mine for meaningful gene sets! - fold difference from control (2x; 3x, 4x...) [simple data with control-treatments only] - LIMMA (Linear Models for Microarray Data) [all data, but linear modeling only] - probabilistic dynamical model - machine-learning approaches
23 Transcriptome: What does ANALYSIS mean? Transcriptome Challenge: Find differentially expressed genes! Mine for meaningful gene sets! Cluster analysis - Pearson correlation - k-means clustering - hierarchical clustering Problem: house-keeping genes!!!
24 Why searching for co-expression Transcriptome Co-expressed genes encode for proteins that act in the same signaling cascades, are enrolled in the same enzymatic pathways or represent interacting partners that form higher order complexes. Additionally, genes with similar expression profiles in microarray experiments might be regulated by the same cis-regulatory elements (CREs).
25 Related to photosynthesis Grundlagen der funktionellen Genomanalyse Related to aging
26 Transcriptome Co-expression reveals logic connections Co expression network reveals protein interaction network Transcriptome vs. Interactome Tanz et al. 2012
27 Transcriptome vs. metabolome Transcriptome Metabolome Combined analysis of transcriptomics and metabolomics in barley research simple Pearson correlation Mangelsen et al. 2010, Plant Phys.
28 sugars and genes exhibit diurnal fluctuations Transcriptome Correlation coefficients Metabolome 10 Fructose Kestose Median normalized (log scale) Glucose Sucrose several genes might act as potential sugar signaling targets
29 Transcriptome: What does ANALYSIS mean? meaningful Transcriptome data microarray next generation sequencing (RNA seq) quantitative data + relatively high comparability + identification of DE genes + co-expression analysis + gene regulatory network analysis - low comparability of data - difficult to identify DE genes - co-expression analysis impossible - no network analysis mainly qualitative data - requires known genome - weak on differential splicing events - weak on ChIP data + applicable to any organism + good on differential splicing events + very good on ChIP data
30 Interactome Chromatin-ImmunoPrecipitation-NGS (ChIP-seq) Identification of in vivo binding sites for DNA binding proteins Immunoprecipitation of specific DNA-protein-complexes with antibodies De-cross-link DNA-protein complexes Ligate adapters to DNA, use as template for sequencing (ChIP-seq)
31 Histone modification regulates expression Interactome transcriptional active [Euchromatin] ON Ac Ac Ac Ac Ac mono-acetylation TrxG PcGs H3K27me3 Me Me Me Ac H3K4me3 tri-methylation Me Me Me Me Me Me Me Me Me Me Me Me Me Me Me transcriptional inactive [Heterochromatin] OFF Histone-modifications are important for gene activation or repression Position of Histones on the DNA can be identified by Chromatin-Immunoprecipitation (ChIP) and subsequent NGS
32 Transcriptome (mrna-seq) + Epigenome (ChIP-Seq) At2g19200 FRK1 (At2g19190) At2g19180 low OFFsignal, high ONsignal high expression, good detection of mrna RNA-Seq (transcriptome) ChIP-Seq (H3K27me 3 ) OFF ChIP-Seq (H3K4me 3 ) ON
33 Transcriptome (mrna-seq) + Epigenome (ChIP-Seq) At2g19200 FRK1 (At2g19190) At2g19180 RNA-Seq (transcriptome) OFFsignal, but still mrna ChIP-Seq (H3K27me 3 ) OFF ChIP-Seq (H3K4me 3 ) ON
34 Research interest: Gene expression control
35 Research interest: Gene expression control mrna TF cis cis TATA distal proximal transcription-start ATG cis Introns * Exons cis Promoter Rapid gene expression response in plant - How are rapid (stress) responses controlled? - How is specificity of this response mediated? laboratory experiments bioinformatics synthetic promoters TF Motif - Interaction - How do TFs find their cis-element? - What is the specificity of this interaction? biotechnology spectro-microscopy 35
36 Transcription factors (TFs) act as molecular switches bzip BAM ARR WRKY BPC GATA 36
37 Research interest: Gene expression control Central Dogma of Molecular Biology DNA RNA Protein Metabolism Central Dogma of functional Genomics Physiology Genome Transkriptome Metabolome Proteome promoter analysis microarray analysis Interactome Protein DNAinteractome 37
38 Genome Is there a cryptic code in non-coding regulatory sequence? transcription-start cis cis TATA ATG cis Introns * cis distal proximal Exons Promoter 400 No. of hits frequency position ATG 38 Berendzen et al., 2006
39 Identification of putative cis-elements by genomic enrichment Genome transcription-start cis cis TATA ATG cis Introns * cis 1600 distal proximal Exons Promoter No. of hits frequency distribution curves for the TATATA motif No. of hits Berendzen et al., 2006
40 Genome Only ~10% of known DNA-binding proteins have a known DNA-binding motif We don t have knowledge on ~90 % of TF-function? What if these behave differently? 2340 ~ % 415 <200 <10 % 40 AGRIS TAIR TRANSFAC Arabidopsis average eukaryote Schröder et al., Plos One 2010
41 Binding specificity of TFs to their DNA motif Interactome DNA-protein interaction ELISA 2 pmol bio dsdna per well epitope tagged protein of interest in crude E. coli extract colorimetric detection DNA-Protein Interaction-ELISA provides quantitative data! 41 Brand et al., 2010
42 Development of a DPI-ELISA screen Interactome Binding site analysis by screening of all 6-mer binding motifs. 42 Brand et al., 2013a Brand et al., 2013b
43 A DPI-ELISA screen with stress responsive WRKY-TFs Interactome WRKY11 DBD Compare different WRKY binding affinities Characterization / identification of binding consensi 43 Brand et al., 2013a Brand et al., 2013b
44 Homology modeling of the WRKY protein at the binding motif Interactome WRKY WRKY WRKY33: One WRKY-protein with 2 WRKY domains! 44 Brand et al., 2013a Brand et al., 2013b
45 Homology modeling of the WRKY protein at the binding motif Interactome linker region variable library region WRKY-protein at the binding motif Homology modeling allows us to built hypotheses about binding of proteins to DNA, as soon as the sequences of both, protein and DNA-motifs are known linker region 45 Brand et al., 2013a Brand et al., 2013b
46 Molecular Dynamics (MD) simulations with WRKY33-domains Interactome weak binder strong binder MD gives an impression about binding strength requires usually complex physical experiments 46 Brand et al., 2013a Brand et al., 2013b
47 A DPI-ELISA screen with stress responsive WRKY-TFs Interactome Very similar WRKY domains have very different binding motifs 47 Brand et al., 2013a Brand et al., 2013b
48 Homology modeling and interactome provide functional clues Interactome affinity high low WRKY50 DBD hngach WRKY33 cdbd nngacw WRKY11 DBD ydgacy WRKY proteins recognize (slightly) different binding consensi Different binding consensi are bound with different affinities Quantitative binding information can be translated into a probability matrix to search genomic information for possible binding sites 48 Brand et al., 2013a Brand et al., 2013b
49 Microarray Transcriptome Oligonucleotide-Array (Genechip-Arrays): ATH1-Genechip: probe cells (79% of all A. thaliana genes) >24000 genes (85%) can be detected due to the high redundanzcy in the genome unique probe sets about (~60%) genes hybridize with probes to generate analyzable data We made use of the 1295 AtGenExpress Expression Profiles + ~1500 additional datasets for co-expression analyses
50 Transcriptome Homology modeling and interactome provide functional clues Which genes are co-expressed with WRKY genes AND have the respective binding site in their promoter?
51 Co-expression analysis in complex Microarray datasets Transcriptome Search for WRKYs as hubs ~ genes; analyzed in ~3000 microarray hybridizations Wanke et al., QBIC 2010
52 Genome analysis combined with transcriptome analysis Genome Transcriptome mrna transcription-start WRKY cis cis TATA distal proximal ATG cis Introns * Exons cis Promoter Identify binding sites in promoters Identify WRKY-co-expressed genes
53 Genome analysis combined with transcriptome analysis Genome - Transcriptome affinity high WRKY50 DBD WRKY33 cdbd WRKY11 DBD hngach nngacw ydgacy low No. co-expressed genes: ~300 ~400 ~400 No. binding sites: ~8000 ~ 5000 ~ 3000 No. put. targets ~250 ~60 ~40 We used publicly available microarray data on WRKY33 to verify our data.
54 Genome analysis combined with transcriptome analysis Transcriptome 100 mock stress stress normalized signal intensity (log scale) activator repressor control AtWRKY33 Integration of metadata from the Somssich Lab (Birkenbihl et al., 2012)
55 NGS coupled genomics: In vivo target gene identification by chromatin immuno-precipitation Genome Sequencing by NGS 100 Isolation of bound DNA normalized signal intensity (log scale) activator All ~60 putative target genes were independently identified as true in vivo targets! repressor AtWRKY33 Integration of metadata from the Somssich Lab (Birkenbihl et al., 2012)
56 What is the function of the putative target genes! Genome - Transcriptome affinity high WRKY50 DBD WRKY33 cdbd WRKY11 DBD hngach nngacw ydgacy low cell cycle chromatin remodeling pathogen response basal resistance No. put. targets ~250 ~60 ~40
57 What is the function of plant specific BBR/BPC factors? 0.05 changes Os10g Os10g HvBBR GmGAGA-BP AtBPC2 AtBPC1 AtBPC3 AtBPC7 AtBPC4 AtBPC5 AtBPC6 Os06g Ceratopteris richardii Group I Group II Physcomitrella patens BBR/BPCs arose evolutionary late in the green plant lineage Bind to GAGA-motifs and recruit LHP1 to DNA Wanke et al., 2011; Santi et al., 2003
58 Transcriptome Microarray vs. NGS Mutants in transcription factor genes exhibit differences in development Control lhp1-4 bpc4 bpc6 lhp1-4 bpc4 bpc6
59 Microarray vs. NGS Transcriptome Strong defects in flowers and seed production! Control lhp1-4 bpc4 bpc6 lhp1-4 bpc4 bpc6 Transcriptome (microarray) analysis to understand what is wrong!
60 Microarray vs. NGS microarrays provide the potential to compare the data with already existing microarrays (intercomparability)
61 Microarray vs. NGS more induced genes in the mutants function as repressor proteins repression coincides with already known repressive H3K27me3 marks
62 Microarray vs. NGS identical RNA used in both experiments two replicates libraries paired end reads ~ 80 Mio analyzable reads from each library microarray NGS Verify the data by Transcriptome (NGS) analysis!
63 Microarray vs. NGS microarray NGS Both methods provide Transcriptome data, but which is correct?
64 Microarray vs. NGS Genome - Interactome two replicate libraries from BPC6-expressing plants single end reads ~ 50 Mio analyzable reads each Isolation of target DNA Sequencing by NGS Interactome (ChIP-seq) analysis to identify direct target genes! We identified 350 putative in vivo target genes!
65 Microarray vs. NGS microarray NGS 33 putative direct targets overlapped with microarray expression 47 putative direct targets overlapped with NGS expression only a total of 8 were picked up with both techniques
66 Next Generation Sequencing (NGS) Transcriptome GLOBAL vs. LOCAL NGS is a quantitative approach, but reveals mostly (high value) qualitative data Quantitative analyses on a Global scale are informative, while specific local questions provide qualitative data only or remain hidden. Unfortunately, NGS for transcriptomics is often useless (to date) local expression analysis is difficult co-expression analysis is almost impossible Chromatin-ImmunoPrecipitation-NGS (ChIP-seq) data is highly informative, but mainly of qualitative nature!
67 Transcription factor omics Quantitative omics data can be normalized and, thus, is comparable. Quantitative data from different sources can be used directly in a simultaneous combined analysis (e.g. transcriptomics and metabolomics) NGS data harbors great potential, but (at present) can not substitute quantitative gene expression data e.g. in co-expression analyses Novel methods, such as the DPI-ELISA, enable quantitative proteomics The combined analysis of quantitative transcriptome and interactome data provide unique insights into protein function
68 What does ANALYSIS mean - personally? Challenge: Mine for meaningful data Problem: Conflicting interests bioinformatics biologists significant output is not interested in a special gene or outcome (unbiased) independent of funding biologically relevant output is ONLY interested in his special gene(s) of interest (biased) important for funding
69 Thank you! Tübingen Andreas Hecker Sachie Kimura Rebecca Dautel Joachim Kilian Luise Brand Saarland Nathalie Sebening Christine Zehren Angelika Anna Dagmar Pogomara Jan Hirsch MPI Tübingen Markus Schmid Silvio Collani Uni Rostock Birgit Piechulla Katrin Wenke INRA Versailles Valerie Gaudin ETH Zürich Sam Zeeman Sebastian Soyk Uni Würzburg Wolfgang Dröge Laser Christoph Weiste
How many of you have checked out the web site on protein-dna interactions?
How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss
More informationCurrent Motif Discovery Tools and their Limitations
Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.
More informationNew Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.
New Technologies for Sensitive, Low-Input RNA-Seq Clontech Laboratories, Inc. Outline Introduction Single-Cell-Capable mrna-seq Using SMART Technology SMARTer Ultra Low RNA Kit for the Fluidigm C 1 System
More informationAnalysis of gene expression data. Ulf Leser and Philippe Thomas
Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:
More informationBBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
More informationBIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
More informationAnalysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics
Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics Christopher Benner, PhD Director, Integrative Genomics and Bioinformatics Core (IGC) idash Webinar,
More informationHow To Understand How Gene Expression Is Regulated
What makes cells different from each other? How do cells respond to information from environment? Regulation of: - Transcription - prokaryotes - eukaryotes - mrna splicing - mrna localisation and translation
More informationInteraktionen von Nukleinsäuren und Proteinen
Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 DNA is never naked in a cell DNA is usually in association with proteins. In all domains of life there are small, basic chromosomal
More informationA Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
More informationFaculty of Medicine. Settore disciplinare: BIO/10. functional domains. Monica Soldi. IFOM-IEO Campus, Milan. Matricola n. R08407
PhD degree in Molecular Medicine European School of Molecular Medicine (SEMM), University of Milan and University of Naples Federico II Faculty of Medicine Settore disciplinare: BIO/10 Establishment and
More informationGENE REGULATION. Teacher Packet
AP * BIOLOGY GENE REGULATION Teacher Packet AP* is a trademark of the College Entrance Examination Board. The College Entrance Examination Board was not involved in the production of this material. Pictures
More informationDiscovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat
Bioinformatique et Séquençage Haut Débit, Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat 1 RNA Transcription to RNA and subsequent
More informationDiscovery & Modeling of Genomic Regulatory Networks with Big Data
Discovery & Modeling of Genomic Regulatory Networks with Big Data Hamid Bolouri Division of Human Biology Fred Hutchinson Cancer Research Center labs.fhcrc.org/bolouri I have no financial relationships
More informationCore Facility Genomics
Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray
More informationMicroarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008
Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Custom arrays and Probe design Probe design in a pangenomic world Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5 C6 K GlnR C1 C4 C7
More informationLecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.
Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.au What is Gene Expression & Gene Regulation? 1. Gene Expression
More informationJust the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
More informationBIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS
BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics
More informationNext Generation Sequencing
Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977
More informationControl of Gene Expression
Control of Gene Expression What is Gene Expression? Gene expression is the process by which informa9on from a gene is used in the synthesis of a func9onal gene product. What is Gene Expression? Figure
More informationGMQL Functional Comparison with BEDTools and BEDOPS
GMQL Functional Comparison with BEDTools and BEDOPS Genomic Computing Group Dipartimento di Elettronica, Informazione e Bioingegneria Politecnico di Milano This document presents a functional comparison
More informationControl of Gene Expression
Home Gene Regulation Is Necessary? Control of Gene Expression By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection favoring
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationTitle: Surveying Genome to Identify Origins of DNA Replication In Silico
Title: Surveying Genome to Identify Origins of DNA Replication In Silico Abstract: DNA replication origins are the bases to realize the process of chromosome replication and analyze the progress of cell
More informationCorrelation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York
Correlation of microarray and quantitative real-time PCR results Elisa Wurmbach Mount Sinai School of Medicine New York Microarray techniques Oligo-array: Affymetrix, Codelink, spotted oligo-arrays (60-70mers)
More informationGene Expression Analysis
Gene Expression Analysis Jie Peng Department of Statistics University of California, Davis May 2012 RNA expression technologies High-throughput technologies to measure the expression levels of thousands
More informationSystematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals
Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh
More informationThe world of non-coding RNA. Espen Enerly
The world of non-coding RNA Espen Enerly ncrna in general Different groups Small RNAs Outline mirnas and sirnas Speculations Common for all ncrna Per def.: never translated Not spurious transcripts Always/often
More informationShouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center
Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing
More informationQualitative Simulation and Model Checking in Genetic Regulatory Networks
An Application of Model Checking to a realistic biological problem: Qualitative Simulation and Model Checking in Genetic Regulatory Networks A presentation of Formal Methods in Biology Justin Hogg justinshogg@gmail.com
More informationRETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
More informationSystems Biology through Data Analysis and Simulation
Biomolecular Networks Initiative Systems Biology through Data Analysis and Simulation William Cannon Computational Biosciences 5/30/03 Cellular Dynamics Microbial Cell Dynamics Data Mining Nitrate NARX
More informationGene Models & Bed format: What they represent.
GeneModels&Bedformat:Whattheyrepresent. Gene models are hypotheses about the structure of transcripts produced by a gene. Like all models, they may be correct, partly correct, or entirely wrong. Typically,
More informationComplex multicellular organisms are produced by cells that switch genes on and off during development.
Home Control of Gene Expression Gene Regulation Is Necessary? By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection favoring
More informationInteraktionen von RNAs und Proteinen
Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 Studying RNA-protein interactions Given: target protein known to bind to RNA problem: find binding partners and binding sites experimental
More informationPlant Growth & Development. Growth Stages. Differences in the Developmental Mechanisms of Plants and Animals. Development
Plant Growth & Development Plant body is unable to move. To survive and grow, plants must be able to alter its growth, development and physiology. Plants are able to produce complex, yet variable forms
More informationAppendix 2 Molecular Biology Core Curriculum. Websites and Other Resources
Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold
More informationMolecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
More informationFrequently Asked Questions Next Generation Sequencing
Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided
More informationData Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov
Data Integration Lectures 16 & 17 Lectures Outline Goals for Data Integration Homogeneous data integration time series data (Filkov et al. 2002) Heterogeneous data integration microarray + sequence microarray
More informationCore Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat
More information13.4 Gene Regulation and Expression
13.4 Gene Regulation and Expression Lesson Objectives Describe gene regulation in prokaryotes. Explain how most eukaryotic genes are regulated. Relate gene regulation to development in multicellular organisms.
More informationIntroduction to Genome Annotation
Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT
More informationStandards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium
Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable
More informationQuality Assessment of Exon and Gene Arrays
Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such
More informationGenetomic Promototypes
Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,
More informationQuantitative proteomics background
Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran
More informationFlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem
FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem Elsa Bernard Laurent Jacob Julien Mairal Jean-Philippe Vert September 24, 2013 Abstract FlipFlop implements a fast method for de novo transcript
More informationA Brief Introduction on DNase-Seq Data Aanalysis
A Brief Introduction on DNase-Seq Data Aanalysis Hashem Koohy, Thomas Down, Mikhail Spivakov and Tim Hubbard Spivakov s and Fraser s Lab September 13, 2014 1 Introduction DNaseI is an enzyme which cuts
More informationDr Alexander Henzing
Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander
More informationModule 3 Questions. 7. Chemotaxis is an example of signal transduction. Explain, with the use of diagrams.
Module 3 Questions Section 1. Essay and Short Answers. Use diagrams wherever possible 1. With the use of a diagram, provide an overview of the general regulation strategies available to a bacterial cell.
More informationFeed Forward Loops in Biological Systems
Feed Forward Loops in Biological Systems Dr. M. Vijayalakshmi School of Chemical and Biotechnology SASTRA University Joint Initiative of IITs and IISc Funded by MHRD Page 1 of 7 Table of Contents 1 INTRODUCTION...
More informationSchool of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
More informationExpression Quantification (I)
Expression Quantification (I) Mario Fasold, LIFE, IZBI Sequencing Technology One Illumina HiSeq 2000 run produces 2 times (paired-end) ca. 1,2 Billion reads ca. 120 GB FASTQ file RNA-seq protocol Task
More informationComparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
More informationCOMPUTATIONAL FRAMEWORKS FOR UNDERSTANDING THE FUNCTION AND EVOLUTION OF DEVELOPMENTAL ENHANCERS IN DROSOPHILA
COMPUTATIONAL FRAMEWORKS FOR UNDERSTANDING THE FUNCTION AND EVOLUTION OF DEVELOPMENTAL ENHANCERS IN DROSOPHILA Saurabh Sinha, Dept of Computer Science, University of Illinois Cis-regulatory modules (enhancers)
More informationComputational localization of promoters and transcription start sites in mammalian genomes
Computational localization of promoters and transcription start sites in mammalian genomes Thomas Down This dissertation is submitted for the degree of Doctor of Philosophy Wellcome Trust Sanger Institute
More informationControl of Gene Expression
Control of Gene Expression (Learning Objectives) Explain the role of gene expression is differentiation of function of cells which leads to the emergence of different tissues, organs, and organ systems
More informationGenetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism )
Biology 1406 Exam 3 Notes Structure of DNA Ch. 10 Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Proteins
More informationSoftware and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University
Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation
More informationChapter 18 Regulation of Gene Expression
Chapter 18 Regulation of Gene Expression 18.1. Gene Regulation Is Necessary By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection
More informationNext Generation Sequencing: Technology, Mapping, and Analysis
Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took
More informationProtein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
More informationRow Quantile Normalisation of Microarrays
Row Quantile Normalisation of Microarrays W. B. Langdon Departments of Mathematical Sciences and Biological Sciences University of Essex, CO4 3SQ Technical Report CES-484 ISSN: 1744-8050 23 June 2008 Abstract
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)
Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) A typical RNA Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
More informationTwincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie
Twincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie 0 HELMHOLTZ I ZENTRUM FÜR INFEKTIONSFORSCHUNG Technische Universität Braunschweig Institut
More informationGene Regulation -- The Lac Operon
Gene Regulation -- The Lac Operon Specific proteins are present in different tissues and some appear only at certain times during development. All cells of a higher organism have the full set of genes:
More informationWeb-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic
More informationBasic Analysis of Microarray Data
Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.
More informationFACULTY OF MEDICAL SCIENCE
Doctor of Philosophy in Biochemistry FACULTY OF MEDICAL SCIENCE Naresuan University 73 Doctor of Philosophy in Biochemistry The Biochemistry Department at Naresuan University is a leader in lower northern
More informationGeneProf and the new GeneProf Web Services
GeneProf and the new GeneProf Web Services Florian Halbritter florian.halbritter@ed.ac.uk Stem Cell Bioinformatics Group (Simon R. Tomlinson) simon.tomlinson@ed.ac.uk December 10, 2012 Florian Halbritter
More informationUnderstanding West Nile Virus Infection
Understanding West Nile Virus Infection The QIAGEN Bioinformatics Solution: Biomedical Genomics Workbench (BXWB) + Ingenuity Pathway Analysis (IPA) Functional Genomics & Predictive Medicine, May 21-22,
More informationUniversity of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology
University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology Programme Structure - the MSc outcome will require 180 credits total (full-time only) - 60
More informationNext Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013
Next Generation Sequencing: Adjusting to Big Data Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013 Outline Human Genome Project Next-Generation Sequencing Personalized Medicine
More informationT cell Epitope Prediction
Institute for Immunology and Informatics T cell Epitope Prediction EpiMatrix Eric Gustafson January 6, 2011 Overview Gathering raw data Popular sources Data Management Conservation Analysis Multiple Alignments
More informationIntroduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director
Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director Gene expression depends upon multiple factors Gene Transcription
More informationChallenges associated with analysis and storage of NGS data
Challenges associated with analysis and storage of NGS data Gabriella Rustici Research and training coordinator Functional Genomics Group gabry@ebi.ac.uk Next-generation sequencing Next-generation sequencing
More informationFinal Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
More informationPlant Metabolomics. For BOT 6516
Plant Metabolomics For BOT 6516 Introduction Modern metabolomics began about ten years ago and yet many continue to question the relative performance of this area of technology in advancing plant biology.
More informationSupplementary Figure 1: Quality Assessment of Mouse Arrays. Supplementary Figure 2: Quality Assessment of Rat Arrays
Supplementary Figure 1: Quality Assessment of Mouse Arrays The mouse microarray data were subjected to an extensive quality-control procedure prior to conducting downstream analyses. We assessed the spread
More informationData Acquisition. DNA microarrays. The functional genomics pipeline. Experimental design affects outcome data analysis
Data Acquisition DNA microarrays The functional genomics pipeline Experimental design affects outcome data analysis Data acquisition microarray processing Data preprocessing scaling/normalization/filtering
More informationJoint transcriptomic and metabolomic analyses reveal changes in the primary metabolism and
Joint transcriptomic and metabolomic analyses reveal changes in the primary metabolism and imbalances in the subgenome orchestration in the bread wheat molecular response to Fusarium graminearum Thomas
More informationPreciseTM Whitepaper
Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis
More informationAP BIOLOGY 2009 SCORING GUIDELINES
AP BIOLOGY 2009 SCORING GUIDELINES Question 4 The flow of genetic information from DNA to protein in eukaryotic cells is called the central dogma of biology. (a) Explain the role of each of the following
More informationSupplementary Information
Supplementary Information S1: Degree Distribution of TFs in the E.coli TRN and CRN based on Operons 1000 TRN Number of TFs 100 10 y = 619.55x -1.4163 R 2 = 0.8346 1 1 10 100 1000 Degree of TFs CRN 100
More informationAlgorithms in Computational Biology (236522) spring 2007 Lecture #1
Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office
More informationIntegrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon
Integrating DNA Motif Discovery and Genome-Wide Expression Analysis Department of Mathematics and Statistics University of Massachusetts Amherst Statistics in Functional Genomics Workshop Ascona, Switzerland
More informationPrimePCR Assay Validation Report
Gene Information Gene Name sorbin and SH3 domain containing 2 Gene Symbol Organism Gene Summary Gene Aliases RefSeq Accession No. UniGene ID Ensembl Gene ID SORBS2 Human Arg and c-abl represent the mammalian
More informationMORPHEUS. http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix.
MORPHEUS http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix. Reference: MORPHEUS, a Webtool for Transcripton Factor Binding Analysis Using
More informationGene expression analysis. Ulf Leser and Karin Zimmermann
Gene expression analysis Ulf Leser and Karin Zimmermann Ulf Leser: Bioinformatics, Wintersemester 2010/2011 1 Last lecture What are microarrays? - Biomolecular devices measuring the transcriptome of a
More informationLecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM lecrom@biologie.ens.fr Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
More informationGo where the biology takes you. Genome Analyzer IIx Genome Analyzer IIe
Go where the biology takes you. Genome Analyzer IIx Genome Analyzer IIe Go where the biology takes you. To published results faster With proven scalability To the forefront of discovery To limitless applications
More informationComputational Genomics. Next generation sequencing (NGS)
Computational Genomics Next generation sequencing (NGS) Sequencing technology defies Moore s law Nature Methods 2011 Log 10 (price) Sequencing the Human Genome 2001: Human Genome Project 2.7G$, 11 years
More informationAnalyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6
Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,
More informationOverview of Eukaryotic Gene Prediction
Overview of Eukaryotic Gene Prediction CBB 231 / COMPSCI 261 W.H. Majoros What is DNA? Nucleus Chromosome Telomere Centromere Cell Telomere base pairs histones DNA (double helix) DNA is a Double Helix
More informationPrimePCR Assay Validation Report
Gene Information Gene Name Gene Symbol Organism Gene Summary Gene Aliases RefSeq Accession No. UniGene ID Ensembl Gene ID papillary renal cell carcinoma (translocation-associated) PRCC Human This gene
More informationName Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.
13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both
More informationUnderstanding the dynamics and function of cellular networks
Understanding the dynamics and function of cellular networks Cells are complex systems functionally diverse elements diverse interactions that form networks signal transduction-, gene regulatory-, metabolic-
More informationThe Advantages and Disadvantages of Using Gene Ontology
Extracting Biological Information from Gene Lists Simon Andrews, Laura Biggins, Boo Virk simon.andrews@babraham.ac.uk laura.biggins@babraham.ac.uk boo.virk@babraham.ac.uk v1.0 Biological material Sample
More information