Interaktionen von Nukleinsäuren und Proteinen



Similar documents
Interaktionen von RNAs und Proteinen

MORPHEUS. Prediction of Transcription Factors Binding Sites based on Position Weight Matrix.

MicroRNA formation. 4th International Symposium on Non-Surgical Contraceptive Methods of Pet Population Control

What is the difference between basal and activated transcription?

How many of you have checked out the web site on protein-dna interactions?

Current Motif Discovery Tools and their Limitations

Illumina TruSeq DNA Adapters De-Mystified James Schiemer

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

Expression and Purification of Recombinant Protein in bacteria and Yeast. Presented By: Puspa pandey, Mohit sachdeva & Ming yu

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

T cell Epitope Prediction

Module 3 Questions. 7. Chemotaxis is an example of signal transduction. Explain, with the use of diagrams.

Genome-wide measurements of protein-dna interaction by chromatin immunoprecipitation

GENE REGULATION. Teacher Packet

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three

1. Molecular computation uses molecules to represent information and molecular processes to implement information processing.

ChIP TROUBLESHOOTING TIPS

Recombinant DNA & Genetic Engineering. Tools for Genetic Manipulation

Lectures 1 and February 7, Genomics 2012: Repetitorium. Peter N Robinson. VL1: Next- Generation Sequencing. VL8 9: Variant Calling

Translation Study Guide

VLLM0421c Medical Microbiology I, practical sessions. Protocol to topic J10

Replication Study Guide

Forensic DNA Testing Terminology

Name Class Date. Figure Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.

DNA (genetic information in genes) RNA (copies of genes) proteins (functional molecules) directionality along the backbone 5 (phosphate) to 3 (OH)

Lecture 7 Mitosis & Meiosis

Next Generation Sequencing: Technology, Mapping, and Analysis

DNA. Discovery of the DNA double helix

K'NEX DNA Models. Developed by Dr. Gary Benson Department of Biomathematical Sciences Mount Sinai School of Medicine

Chapter 3. Protein Structure and Function

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

RNA Structure and folding

New Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

Nucleotides and Nucleic Acids

Control of Gene Expression

Viral Infection: Receptors

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

European Medicines Agency

1. Location matters: Primers should flank the DNA you want to amplify

Chapter 11: Molecular Structure of DNA and RNA

MATCH Commun. Math. Comput. Chem. 61 (2009)

An Overview of DNA Sequencing

Gene Expression Assays

Becker Muscular Dystrophy

DNA and Forensic Science

Lezioni Dipartimento di Oncologia Farmacologia Molecolare. RNA interference. Giovanna Damia 29 maggio 2006

Computer Programs for PCR Primer Design and Analysis

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Sample Questions for Exam 3

A Brief Introduction on DNase-Seq Data Aanalysis

HuCAL Custom Monoclonal Antibodies

June 09, 2009 Random Mutagenesis

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

restriction enzymes 350 Home R. Ward: Spring 2001

Essentials of Real Time PCR. About Sequence Detection Chemistries

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

Biological Sciences Initiative. Human Genome

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

Chapter 6 DNA Replication

Structure and Function of DNA

Transcription and Translation of DNA

Faculty of Medicine. Settore disciplinare: BIO/10. functional domains. Monica Soldi. IFOM-IEO Campus, Milan. Matricola n. R08407

Every time a cell divides the genome must be duplicated and passed on to the offspring. That is:

Next Generation Sequencing

Appendix C DNA Replication & Mitosis

Guided Notes: Chapter 9 Cellular Reproduction

The Nucleus: DNA, Chromatin And Chromosomes

Helices From Readily in Biological Structures

PRESTWICK ACADEMY NATIONAL 5 BIOLOGY CELL BIOLOGY SUMMARY

1.5 page 3 DNA Replication S. Preston 1

CCR Biology - Chapter 8 Practice Test - Summer 2012

The Structure, Replication, and Chromosomal Organization of DNA

Central Dogma. Lecture 10. Discussing DNA replication. DNA Replication. DNA mutation and repair. Transcription

Complex multicellular organisms are produced by cells that switch genes on and off during development.

Pairwise Sequence Alignment

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, b.patel@griffith.edu.

HiPer RT-PCR Teaching Kit

DNA Technology Mapping a plasmid digesting How do restriction enzymes work?

Protein-nucleic acid interactions

Control of Gene Expression

Genetomic Promototypes

Data Analysis for Ion Torrent Sequencing

Overview of Eukaryotic Gene Prediction

Bioinformatics Grid - Enabled Tools For Biologists.

The world of non-coding RNA. Espen Enerly

CCR Biology - Chapter 9 Practice Test - Summer 2012

LabGenius. Technical design notes. The world s most advanced synthetic DNA libraries. hi@labgeni.us V1.5 NOV 15

KMS-Specialist & Customized Biosimilar Service

HCS Exercise 1 Dr. Jones Spring Recombinant DNA (Molecular Cloning) exercise:

Control of Gene Expression

Genetics Module B, Anchor 3

RNA & Protein Synthesis

HuCAL Custom Monoclonal Antibodies

Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director

Transcription:

Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015

DNA is never naked in a cell DNA is usually in association with proteins. In all domains of life there are small, basic chromosomal architectural proteins attached to the DNA forming chromatin (bacteria HU, archaea HU, Alba & histones, eukaryots histones). Their function is to prevent DNA from agglutination, ensuring stability and flexibility, aid structure formation ( packaging ) and engage in gene regulation. Sequence-specific transcription factors associate with specific binding sites. There functions is commonly gene regulation.

general vs. specific binder general binder small basic proteins contact negatively charged DNA backbone e.g. HU, Alba, histones (double stranded) e.g. SSB (single stranded) bind everywhere, often bend the DNA specific binder large protein with DNA-binding domain(s) (DBD) contact major groove of DNA DBDs: e.g. HTH (helix-turn-helix), zinc finger, leucine zipper, helix-loop-helix bind to specific sites sequence-dependent

DNA-binding Proteins a Molecular View Example Proteins that bind to DNA make contact with the bases via the major groove of the double helix. The protein is said to bind to the DNA in a sequence-specific manner. A single binding domain usually contacts 4-8 basepairs. RAR... retinoic acid receptor RXR... retinoid X receptor A) view along the DNA helix. RAR and RXR contact the DNA from opposite sides (top left and top right). B) view from the side. The contact sites from RAR and RXR are seperated by half a turn.

How to study DNA-protein binding?

How to study DNA-protein binding? Given a protein, which DNA sequence does it bind preferentially? in vitro selection SELEX SELEX (systematic evolution of ligands by exponential enrichment) also known as in-vitro evolution a very large oligonucleotide library randomly generated sequences of fixed length is exposed to the target protein unbound oligos are removed (by affinity chromatography) bound sequences are amplified by PCR subsequent rounds of selection with increase stringency sequencing returns consensus motif

How to study DNA-protein binding?

How to represent a motif? Given an alignment of n motifs of length l here the result of the SELEX experiment: n = 43, l = 7 consensus string most frequent nucleotide per position 5 -TGAGTCA-3 consensus pattern all possible nucleotides per position 5 -T(G T)(A T)(G C)TCA-3 K = {G,T}, W = {A,T}, S = {G,C} 5 -TKWSTCA-3 position frequency matrix PFM Alphabeth = {A,C,G,T}, size of Alphabet a, matrix a l motif logo

What to Do With the Motif? How often would you expect to see a motif m in a sequence M? assume the motif is a string, e.g. 5 -GGCCT-3 of motif length l = 5 the sequence M, e.g. the human HoxA cluster, has a length of L = 163001 Alphabet {A,C,G,T} 1. assume uniform nucleotide distribution: f(x) = 0.25 2. use mono-nucleotide distribution: fa = 0.2428,fC = 0.2555,f(G) = 0.2552,f(T) = 0.2466 3. use di-nucleotide distribution: fgg = 0.0812,f(GC) = 0.0677,f(CC) = 0.0815,f(CT) = 0.0751 Results: E UNI (m) = 159;E MONO (m) = 171;E DI (m) = 329;

Which expectation comes closer to the observation? Observed sites: count the motif m in M! O = 312 Expected sites: E UNI (m) = 159;E MONO (m) = 171;E DI (m) = 329; Use the chi-squared test (χ 2 ) to determine whether there is a significant difference between the expected frequency and the observed frequency. χ 2 = (O E)2 E look-up values of the χ 2 distribution for one degree of freedom significance at 5% level when χ 2 >= 3.841 significance at 1% level when χ 2 >= 6.635 Only E DI (m) = 329 is a good prediction for the number of sites.

How to study DNA-protein binding? Given a protein, which DNA sequence does it bind preferentially? in vivo ChIP-seq ChIPseq (Chromatin Immuno-Precipitation followed by sequencing) proteins are cross-liked to DNA in vivo chromatin is isolated sonicate to obtain chromatin fragments immunoprecipitate with protein-specific antibodies purify immunocomplexes (remove unbound chromatin fragments) reverse cross-linking purify DNA from chromatin fragments sequence DNA fragments

Study DNA-protein binding with ChIP-seq

Do Predictions Fit Observations?

Sensitivity and Specificity of the Prediction

ROC (curve) to Compare Predictions