Small RNA gene identification and mrna target predictions in bacteria

Size: px
Start display at page:

Download "Small RNA gene identification and mrna target predictions in bacteria"

Transcription

1 BIOINFORMATICS REVIEW Vol. 24 no , pages doi: /bioinformatics/btn560 Genome analysis Small RNA gene identification and mrna target predictions in bacteria Christophe Pichon 1 and Brice Felden 2, 1 Unité Pathogénie Bactérienne des Muqueuses, Institut Pasteur, Rue du Docteur Roux, 75724, Paris and 2 Inserm U835, UPRES JE2311, Laboratoire de Biochimie Pharmaceutique, Université de Rennes 1, 2 Av. du Pr. Léon Bernard Rennes, France Received on July 09, 2008; revised and accepted on October 25, 2008 Advance Access publication October 29, 2008 Associate Editor: Tracy Knight ABSTRACT Motivation: Bacterial small ribonucleic acids (srnas) that are not ribosomal and transfer or messenger RNAs were initially identified in the sixties, whereas their molecular functions are still under active investigation today. It is now widely accepted that most play central roles in gene expression regulation in response to environmental changes. Interestingly, some are also implicated in bacterial virulence. Functional studies revealed that a large subset of these srnas act by an antisense mechanism thanks to pairing interactions with dedicated mrna targets, usually around their translation start sites, to modulate gene expression at the posttranscriptional level. Some srnas modulate protein activity or mimic the structure of other macromolecules. In the last few years, in silico methods have been developed to detect more bacterial srnas. Among these, computational analyses of the bacterial genomes by comparative genomics have predicted the existence of a plethora of srnas, some that were confirmed to be expressed in vivo. The prediction accuracy of these computational tools is highly variable and can be perfectible. Here we review the computational studies that have contributed to detecting the srna gene and mrna targets in bacteria and the methods for their experimental testing. In addition, the remaining challenges are discussed. Contact: bfelden@univ-rennes1.fr 1 INTRODUCTION Small ribonucleic acids (srnas) function directly at RNA level in cells in all three domains of life. srnas are also called nonconventional, noncoding, functional or regulatory RNAs but are not messenger (mrna), transfer (trna) or ribosomal (rrna) RNAs. In recent years, many computational strategies have identified a wealth of srna candidates in various organisms, from bacteria (Vogel and Sharma, 2005) to humans (Washietl et al., 2005b). Some were subsequently validated experimentally and considered as novel srnas [see Gottesman et al. (2006) for the srnas in bacteria]. They possess tremendous heterogeneity in sizes (from 50- to 600-nt long), structures and functions, and are usually not translated into proteins (there are some exceptions). In the bacterial metabolism, srnas are involved in a growing number of regulatory pathways in response to environmental changes. To whom correspondence should be addressed. Until a decade ago, only a handful of bacterial srnas were known, but post-genomic investigations have revealed, unexpectedly, their abundance in many bacterial species (Argaman et al., 2001; Axmann et al., 2005; Ostberg et al., 2004; Pichon and Felden, 2005; Rivas and Eddy, 2001; Wassarman et al., 2001). The field of bacterial srna regulations has recently received a boost of attention, thanks to the availability of many sequenced bacterial genomes and is a fast, exciting and stimulating area of research. srnas have been most intensively tracked in Escherichia coli. To date, the majority of the prokaryotic srnas annotated in the available databases [e.g. the Rfam database, Griffiths-Jones et al. (2005)], were identified in this bacterium. Nowadays, the quest for new srnas among diverse bacteria is very intense. For bacterial genes encoding proteins, sophisticated algorithms were developed that have a strong accuracy in predicting coding sequences (CS) (Kang et al., 2007). The use of computational methods to discover novel bacterial srnas, however, is a difficult task because (i) they usually do not contain recurrent nucleotide motifs (biased word occurrences) as the ribosome binding sites or the codons specifying the 20 amino acids for the protein CS, (ii) are generally small, (iii) most are only conserved among closely related bacterial species and sometimes and (iv) only expressed in pathotype-specific strains of the same bacterial species. Powerful general methods based on the statistical analysis of genomic sequences, RNA structure similarity searches and comparative genomics allow searching for novel srnas in most sequenced bacterial genomes and they will be reviewed here. Combining the prediction of promoters and terminators, base composition statistics, genome annotations, sequence and structure conservation suggest the existence of novel srna genes that have to be subjected to experimental testing (Fig. 1). The effectiveness of all the reported computational methods, however, is still perfectible suggesting that new computational approaches have to be developed. Nevertheless, in the last few years, the number of validated srnas has increased tremendously and thousands of srna encoding gene candidates have yet to be verified experimentally. This recent success in detecting novel srnas has come from bioinformatic searches for srnas outside of E.coli. The majority of the recently discovered bacterial srnas is only conserved among closely related species. A significant portion of the srnas characterized to date interact with dedicated mrnas targets at and around their translation start sites, affecting their stability and/or translation. Trans-encoded srnas are encoded The Author Published by Oxford University Press. All rights reserved. For Permissions, please journals.permissions@oxfordjournals.org 2807

2 C.Pichon and B.Felden Fig. 1. Computational prediction of srna candidates and experimental validation of novel srna genes. The selection criteria for identifying srna candidates are presented, together with their accompanying experimental testing methods. at genomic locations distant from mrna-encoding genes they regulate. The mrna targets of each of these srna regulators are, for the most part, currently unknown. To add more complexity to the problem, usually each srna is thought to regulate the expression of more than one mrna. As a specific example, the interactions between RNAIII from Staphylococcus aureus and its mrna targets involve several structural domains from its 3 -domain via one or two loop loop interactions with its mrna targets (Boisset et al., 2007). Based on structural probing, the RNAIII mrna pairings are imperfect and contain mismatches with a few nucleotides being unpaired. Therefore, predicting such srna mrna interactions by computational studies is a difficult task. Based on experimentally validated srna mrna interactions, algorithms have been developed, predicting putative interaction sites and proposing candidate mrna targets (Mandin et al., 2007; Rehmsmeier et al., 2004; Tjaden et al., 2006), but their success rates are perfectible. Another growing subset of functionally characterized srnas corresponds to the protein sequestrators, as for example the 6S RNA that regulates complex formation between the σ 70 promoter and the σ 70 RNA polymerase complex. There are currently no general in silico methods to predict the existence of new sequestrator-like srnas. Excellent reviews summarize the identification of srnas by biochemical approaches (Altuvia, 2007; Hüttenhofer and Vogel, 2006) and mrna targets (Vogel and Wagner, 2007) in E.coli and other species (Livny and Waldor, 2007). Here, the primary focus is on predictive bioinformatic searches for srnas and mrna targets in bacterial species, with their accompanying experimental testing methods. 2 DETECTING SMALL RNA ENCODING GENES THROUGH BIOINFORMATICS The methods are either based on the RNA primary sequences, RNA secondary structures or depend upon the analysis, at a larger scale, on the comparison of phylogenetically related bacterial genomic Fig. 2. Different combinations of the three main computational methodologies for srna detection in the bacterial genomes. Comparative genomics (Wassarman et al., 2001) including transcriptional unit and Rhoindependent terminator predictions (Chen et al., 2002), RNA secondary structure prediction (Uzilov et al., 2006) and base composition statistics (Carter et al., 2001; Schattner, 2002) can be used alone or in combinations. Groups (1) (4) represent the combinations of each method in pairs or altogether. Specific examples of each of these four groups are available in the literature: For group (1), see di Bernardo et al. (2003) and Gruber et al. (2007). For group (2), see Coventry et al. (2004) and Pichon and Felden (2005). For group (3), see Rivas and Eddy (2000) and Uzilov et al. (2006). Representative of group (4) is only available with the ISI algorithm that combines all sources of data (Pichon and Felden, 2003). sequences. Recently reported methods are a combination of the latter (Fig. 2, intersections). 2.1 Base-composition statistics The hypothesis that genomic regions rich in srnas can be identified using local variations in single-base and dinucleotide statistics has been investigated for some prokaryotic species (Carter et al., 2001; Pichon and Felden, 2003; Rivas and Eddy, 2000; Schattner, 2002). In bacteria, the GC content of the whole genome is not correlated with the optimal growth temperature of the living organism but, however, the GC content of structural RNAs is a selective response to high temperature, being higher for the micro-organisms living in hot environments (Hurst and Merchant, 2001). In consequence, genome variations in base composition display an elevated percentage of GC in structural RNA sequences (e.g. trnas) than in mrnas. Computing a local GC content curve, however, is not sufficient by itself to automatically detect features such as srnas. In a first attempt to find srnas by RNA secondary structures analyses in an entire genome (Rivas and Eddy, 2000), a base composition model, formally a stochastic regular grammar with one state, was used to filter non-srna sequences by calculating the local GC content of the genomic sequence. Unfortunately, no experimental validations were performed. This approach can be used to detect srna genes in the AT-rich genomes, as demonstrated for Pyrococcus furiosus and Methanococcus jannaschii, two AT-rich archaeas 2808

3 Small RNA gene identification and mrna target predictions (Klein et al., 2002). Analyzing the local GC content of genome sequences has already shown its effectiveness for a low GC firmicute, S.aureus (Pichon and Felden, 2005). The expression of all srna candidates had been tested by northern blot experiments for all previous cited bacteria, showing that kind of methodology-enhanced discovery of new srna genes. A combination of comparative genomics, gene structure prediction and local GC calculations is described for srna gene identifications, using the ISI software (Pichon and Felden, 2003). ISI creates a local GC content curve by calculating the GC percentage of the sequences within a small window that moves along the genomic sequence. Other base composition statistics are performed by a recent version of ISI (C. Pichon et al., unpublished data). The analysis of the GC content is useful for srna gene detection in bacteria especially for srnas that are subjected to structural constraints that influence their dinucleotide compositions. In 2002, Schattner combined local GC, GC skew and dinucleotide frequencies to eliminate false positive results, detecting srna candidates in the M.jannaschii genome (Schattner, 2002). A combining of base composition statistics with RNA secondary structure predictions (Yachie et al., 2006) or with neural networks (Carter et al., 2001) has been proposed but their usefulness is difficult to assess due to the lack of experimental testing of the detected candidates. 2.2 RNA structure similarity searches Algorithms to detect RNA structural patterns trnas are highly structured RNAs acting as amino acid donors during protein synthesis. With the exception of the CCA 3 -end, the primary sequences of trnas vary, whereas their secondary and tertiary structures are well conserved and fold, respectively, as cloverleaves and L-shapes. These conserved structural characteristics have allowed the design of specific bioinformatic software for trna identification in genomic sequences. For example, trnascan detects genes containing a cloverleaf structure with variable lengths for the stems and loops. To reduce false positives, the training is performed on a set of known trna genes (Fichant and Burks, 1991). Combining several trna genefinders leads to the design of trnascan-se that detects trna genes with a 100% success rate (Lowe and Eddy, 1997). These approaches were also utilized to detect srnas including the 4.5S RNA (Regalia et al., 2002) and trna mrna (Laslett and Canback, 2004), with an increased difficulty since such RNAs have longer, more variable and intricate structures. Such computational approaches, however, are only applicable in the case of a known srna secondary structure. In a more general case, RNAMotif (Lambert et al., 2004; Macke et al., 2001); interprets a user-defined file in which the RNA structure template is described. This software can, in theory, describe and search for any RNA structural element. In addition, it provides a valuable scoring system due to its flexibility for the user who can implement his/her own values Algorithms based on RNA secondary structure predictions Bacterial srna secondary structures can be predicted by algorithms based on computed global minimum free energy (Zuker, 1989), that should be validated experimentally using structural probes. Unfortunately, secondary structure prediction alone is not sufficient for the detection of bacterial srnas (Rivas and Eddy, 2000). These algorithms assume that RNAs for which the native state (minimum free energy secondary structure) is functionally important, all have lower folding energy than random RNAs of the same length and dinucleotide frequency (Clote et al., 2005). These tools are useful during an initial screening but should be combined with additional tools searching for RNA structure conservations within organisms which are phylogenetically related. Thermodynamic methods based on free energy ( G ) minimization, the identification of conserved structural motifs and the use of sequence covariance between species which are phylogenetically related, are accurate tools for srna detection (Babak et al., 2007; Coventry et al., 2004; di Bernardo et al., 2003; Gruber et al., 2007; Rivas and Eddy, 2001; Washietl et al., 2005a). These studies describe various algorithms and scoring systems for the prediction candidates but, unfortunately, some are restricted to the intergenic regions (IGRs) of the genomes, with no experimental testing of the predictions. Comparative genomics between related species, in combination with RNA structure prediction, are considered as the more effective methods to predict the existence of novel bacterial srnas (Pichon and Felden, 2005). More recent developments (Gruber et al., 2007; Uzilov et al., 2006) are promising because they mix RNA structure identification methods and comparative genomics. 2.3 Comparative genomics The use of sequence homology between closely related species to identify novel srna genes in IGRs has initiated a new era in srna identification, initially in E.coli (Argaman et al., 2001; Rivas et al., 2001; Wassarman et al., 2001) and more recently in many other bacterial species [for recent works, see Silvaggi et al. (2006) for searches in Bacillus subtilis and del Val et al. (2007) for searches in S.meliloti genomes]. Parts of the nucleotide sequences and secondary structures of the srna genes are conserved between closely phylogenetically related species (a few srnas, however, are present in all the bacterial species), sometimes even presumably required for function, thanks to compensatory mutations. QRNA and RNAz are computational tools for srna gene predictions based on comparative sequence analyses and structure prediction (Rivas and Eddy, 2001; Washietl et al., 2005a). The QRNA software is one of the first attempts to combined RNA structure prediction and comparative genomics. QRNA is based on three probabilistic models, one for detecting coding regions, another for locating putative srna loci and a null hypothesis model. The protein genes are detected based on the variation of the third base of the codon triplets when sequences are compared, whereas the first two nucleotides of the codon are conserved. The srna probabilistic model was designed to detect covariances in stem-loop structures by implementing a stochastic context free grammar (SCFG). The third model detects mutations that can occur at single positions within a nucleotide sequence. QRNA, however, is only able to compare two nucleotide sequences. Instead of QRNA in which the RNA structure prediction is based on a probabilistic model, RNAz computes a global RNA consensus secondary structure with the Vienna software package and identifies covariations from the multiple alignments (for details, see Washietl et al., 2005a). Together with primary sequence and RNA secondary structure homologies, the search for putative transcription signals, including the promoters and the intrinsic transcription terminators, and the GC content within bacterial IGRs, especially 2809

4 C.Pichon and B.Felden for the AT-rich bacterial genomes, were of particular interest for selecting a subset of candidate IGRs. Algorithms that have combined these selection criteria have become available (ISI, Pichon and Felden, 2003; srnapredict2, Livny et al., 2005). These powerful methods have allowed to identify novel srnas in many bacterial species, including Gram-positive bacteria. The current limitations of these approaches are often the small number of phylogenetically related sequences of the bacterium under study and the difficulty in extending the search for srna-encoding genes into proteinencoding regions. Also, comparative genomics do not often allow to detect srna genes that are specific to a given bacterial strain, but only the core RNome of a species. Many srna genefinder algorithms have evaluated their detection performances by counting the known srnas they could detect in a given micro-organism. However, they systematically test a different subset of srnas. Therefore, the rated values are not comparable and the assessment of the various in silico methods is difficult to perform. 3 COMBINING PREDICTIVE BIOINFORMATIC SEARCHES WITH EXPERIMENTAL RNOMICS Initially, a few bacterial srna genes were discovered fortuitously, in the absence of any computational methods. Highly expressed bacterial srnas such as the 4.5S RNA (part of the secretion machinery), the 6S RNA which modulates RNA polymerase activity, the RNase P RNA that is the catalytic part of the ribozyme for 5 -end pre-trna maturation, tmrna that releases eubacterial ribosomes stalled on defective mrnas and the antisense regulator spot42 were all discovered in E.coli by metabolic labeling of total RNA and direct analysis by fractionations, together with a substantial amount of serendipity. These approaches favor the detection of the few abundant and stable srnas. Starting in 2000, computation and global tracking for srnas, based on sequence conservations, have suggested the presence of a plethora of novel srna genes (Argaman et al., 2001; Rivas and Eddy, 2000; Rivas et al., 2001; Wassarman et al., 2001). Most of the currently experimentally validated srnas were detected from these genomewide analyses. From the huge amount of predicted bacterial srna genes, the in vivo expression was later confirmed for only a fraction of those. It suggests that computational tools dig out false positive candidates and also that, for some predictions, the experimental conditions that trigger their expressions has not been found yet. Several methods are available to experimentally assay computer predictions (Fig. 3A). For the few srna genes that are highly expressed under specific conditions, their cellular expression can be directly verified after purification, size separation by denaturing PAGE, extraction, labeling and RNA sequencing by enzymes or chemicals (Pichon and Felden, 2005). Those expressed to lower levels can be detected by northern blots, using oligonucleotides complementary to the predicted transcribed DNA strand. An alternative strategy consists in generating cdna libraries, isolating approximately from 50- to 500-nt long RNAs by size separation on denaturing PAGE, reverse transcribing into cdna, shotgun cloning and sequencing the srnas (Vogel et al., 2003). This method implies powerful sequencing facilities and to be aware of the occurrence Fig. 3. Overview of the validation methods used in previous srna genes and mrna targets identification studies. (A) Validation pathways of srna candidates. (B) Validation pathways for srna mrna target prediction methods. of many false positive sequences due to trnas and rrnas degradations. Wider testing includes microarray analyses using high-density oligonucleotide probe arrays within each bacterial gene and IGRs, efficiently assaying various experimental conditions for high-throughput transcript detection (Tjaden et al., 2002), some that can be expressed to very low levels. Microarray analyses of transcripts from a given growth condition should reveal the existence of freestanding transcripts in places where no gene has been annotated. Positive signals revealed by fluorescent dyes or by antibodies detecting DNA RNA hybrids, however, should be analyzed further by RACE (rapid amplification of cdna ends) mapping to discriminate novel srna genes from mrna leader sequences from neighboring genes. Moreover, the validation of the microarray data by independent methods, such as northern blots or RT quantitative PCR, is required. Reliable computational tools are essential for transcriptomic data analysis, including intensity normalization (Pichon and Felden, 2005; Tjaden et al., 2002). Elegant methods combine an initial round of immunoprecipitation of the extracted RNAs in complex with a known purified srnabinding protein, followed by the direct detection of the bound RNAs on genomic microarrays. The use of the association of a subset 2810

5 Small RNA gene identification and mrna target predictions of E.coli srnas with the RNA chaperone Hfq is described earlier (Zhang et al., 2003). 4 DETECTING THE mrna TARGETS OF BACTERIAL srnas Many bacterial srnas act as post-transcriptional regulators by base-pairing with target mrnas (Storz and Gottesman, 2006). Unlike cis-encoded antisense srnas, trans-encoded srnas have imperfect and sometimes only short complementarities with their mrna targets including non canonical pairings implying that their identifications, based on computational methods, is tricky. The starting point has to be all the experimentally supported srna target mrna interactions in bacteria. In the easiest cases, simple BlastN or Fasta3 searches could detect mrna targets that were confirmed experimentally by genetic (Chen et al., 2004) or biochemical (Pichon and Felden, 2005) approaches. Dedicated algorithms for predicting the secondary structure of two interacting RNA molecules by means of free energy minimization have been proposed (Alkan et al., 2006). Another approach consists in calculating optimal hybridization scores between a srna and all the mrnas from a given genome, focusing on the translational start sites, providing a list of candidate mrnas (TargetRNA, Tjaden et al., 2006). An approach that was successfully applied on the mica-ompa regulation searches for srna complementarities in sequence windows containing translation initiation sites, allowing noncontiguous pairing, the candidate target being compared to reiterated searches in related bacteria (Udekwu et al., 2005). Another strategy uses a training set of validated srna target mrna pairs, and pairing energies between an srna and putative mrna targets are calculated, maximized and the gene sequences are selected on their abilities to pair with the srna around the translation start and stop sites (Mandin et al., 2007). The RNAup software computes the probabilities that a sequence interval is unpaired, allowing to determine binding free energies of short oligomers to mrna targets (Mückstein et al., 2006). Another software, RNA-hybrid, predicts multiple potential binding sites of srnas in target mrnas, finding the most energetically favorable hybridization sites (Rehmsmeier et al., 2004). The likelihood of detecting all the mrna targets of a given srna by the above computational methods, however, is still low and perfectible since several known interactions failed detection using these tools. The intrinsic structure of the mrna target should be systematically considered during srna mrna pairing predictions. Therefore, experimental strategies, either alone or in combination with bioinformatic approaches, are essential for mrna target detections (Fig. 3B). Global experimental searches for mrna targets of an srna can be performed by microarrays, either by affinity capture of target mrnas by a biotinylated srna (Douchin et al., 2006) or by comparing the mrna profiles of an srnaoverproducing strain versus a control strain, using differential fluorescence labeling of the two cdna pools (Massé et al., 2005). Also, cellular RNAs that co-immunoprecipitate with the Hfq protein on an Affymetrix K12 E.coli high-density oligonucleotide array have detected mrna fragments that are putative mrna targets of Hfq-associated srnas in E.coli (Zhang et al., 2003). srnas or srna protein complexes can be used as bait for capturing target mrnas by affinity purification (Antal et al., 2005). mrna targets can also be detected by proteomic approaches, in comparing total protein extracts of a bacterial strain lacking or overexpressing a given srna against a wild-type strain (Udekwu et al., 2005). This is achieved by using 1D or 2D gels followed by identification of the proteins by mass spectrometry. When a given protein expression level is either reduced or increased when the srna is lacking or is overexpressed, the regulation can be either direct (interaction between the srna and the mrna encoding the protein) or indirect, via additional regulators. 5 FUTURE PROSPECTS In this review, the available computational methods for the detection of srna genes and for the prediction of the mrnas, whose expression levels are regulated by srnas acting through antisense pairing, are described. The most recent approaches for srna gene detection are a combination of several pre-existing independent methods, to increase their sensitivity and predictive potentials. A plethora of RNA secondary structure prediction methods are available, with some tested in combination with comparative genomic approaches (Pichon and Felden, 2003) or with statistical methods (Uzilov et al., 2006). While these strategies are interesting, their limitations come from extensive secondary structure variations for some bacterial srnas that escape identification using these methods (the structural homology is probably only detectable at the 3D level). The actual tendency to combine various approaches can be further extended, mixing comparative genomics, RNA structure, transcription unit and rho-independent terminator detections, and any other signatures specific of the srna-encoding genes. In bacterial genomics, most algorithms were initially designed and applied to the Gram-negative E.coli bacterium, with serious limitations and the need for adjustments for their use on other genomes. Indeed, transcription promoters are highly variable among bacterial species and their DNA sequence consensus is unknown in most bacterial species. Also, E.coli is a mesophile and srna gene detection has to be adjusted in the case of GC- or AT-rich genomes. Another important limitation of the current studies is the restriction of the computational searches for novel srna genes located in the IGRs. Recent studies using a hidden Markov model (Yachie et al., 2006) enabled identification of srnas in protein-coding regions but their efficiencies should be improved. The bacterial srnas partially or entirely overlapping protein-coding genes on the opposite DNA strand escape from being detected by the current tools: an interesting but difficult challenge would be to discriminate sequence conservations depending upon the presence of a protein (or polypeptide) coding sequence from conservations due to the presence of an srna gene. Quite a few recent scientific reports describe novel srna genefinders and their validations consist in counting the number of previously known srnas that their tools are able to detect, with no detections of new srnas, limiting the interest of these new tools for biologists. Each of the existing srna genefinders, however, is unable to identify all the experimentally validated E.coli srna genes, indicating that all these in silico methods are perfectible and also that their predictions have to be systematically tested experimentally. A substantial number of proteins are closely 2811

6 C.Pichon and B.Felden associated with bacterial srna function and structures [see Pichon and Felden (2007) for a review]. Among them, a growing class of srnas acting as protein sequestrators (Babitzke and Romeo, 2007), trapping and therefore controlling the activity of regulatory proteins is currently only detected by experimental methods. The computational predictions of srna Protein interactions are one of the next challenges. ACKNOWLEDGEMENTS We would like to thank E. Westhof (IMBC, Strasbourg) and C. Le Bouguénec (Institut Pasteur, Paris) for critical reading of the article. Funding: ANR grant (ERA-NET Pathogenomics Project to C.P.) Deciphering the intersection of commensal and extra intestinal pathogenic E.Coli ; ACI (BCMS 136 to B.F.); ANR grant (program MIME to B.F.). Conflict of Interest: none declared. REFERENCES Alkan,C. et al. (2006) RNA-RNA interaction prediction and antisense RNA target search. J. Comput. Biol., 13, Altuvia,S. (2007) Identification of bacterial small non-coding RNAs: experimental approaches. Curr. Opin. Microbiol., 10, Antal,M. et al. (2005) A small bacterial RNA regulates a putative ABC transporter. J. Biol. Chem., 280, Argaman,L. et al. (2001) Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Curr. Biol., 11, Axmann,I. et al. (2005) Identification of cyanobacterial non-coding RNAs by comparative genome analysis. Genome Biol., 6, R73. Babak,T. et al. (2007) Considerations in the identification of functional RNA structural elements in genomic alignments. BMC Bioinformatics, 8, 33. Babitzke,P. and Romeo,T. (2007) CsrB srna family: sequestration of RNA-binding regulatory proteins. Curr. Opin. Microbiol., 10, Boisset,S. et al. (2007) Staphylococcus aureus RNAIII coordinately represses the synthesis of virulence factors and the transcription regulator Rot by an antisense mechanism. Genes Dev., 21, Carter,R. et al. (2001) A computational approach to identify genes for functional RNAs in genomic sequences. Nucleic Acids Res., 29, Chen,S. et al. (2002) A bioinformatics based approach to discover small RNA genes in the Escherichia coli genome. BioSystems, 65, Chen,S. et al. (2004) MicC, second small-rna regulator of Omp protein expression in Escherichia coli. J. Bacteriol., 186, Clote,P. et al. (2005) Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency. RNA, 11, Coventry,A. et al. (2004) MSARI: multiple sequence alignments forstatistical detection of RNA secondary structure. Proc. Natl Acad. Sci. USA, 101, del Val,C. et al. (2007) Identification of differentially expressed small non-coding RNAs in the legume endosymbiont Sinorhizobium meliloti by comparative genomics. Mol. Microbiol., 66, di Bernardo,D. et al. (2003) ddbrna: detection of conserved secondary structures in multiple alignments. Bioinformatics, 19, Douchin,V. et al. (2006) Down-regulation of porins by a small RNA bypasses the essentiality of the regulated intramembrane proteolysis protease RsePin Escherichia coli. J. Biol. Chem., 281, Fichant,G.A. and Burks,C. (1991) Identifying potential trna genes in genomic DNA sequences. J. Mol. Biol., 220, Griffiths-Jones,S. et al. (2005) Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res., 33, Gruber,A.R. et al. (2007) The RNAz web server: prediction of thermodynamically stable and evolutionarily conserved RNAstructures. Nucleic Acids Res., 35, W335 W338. Gottesman,S. et al. (2006) Small RNA regulators and the bacterial response to stress. Cold Spring Harb. Symp. Quant. Biol., 71, Hurst,L.D. and Merchant,A.R. (2001) High guanine-cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes. Proc. Biol. Sci., 268, Hüttenhofer,A. and Vogel,J. (2006) Experimental approaches to identify non-coding RNAs. Nucleic Acids Res., 34, Kang,S. et al. (2007) CONSORF: a consensus prediction system for prokaryotic coding sequences. Bioinformatics, 23, Klein,R.J. et al. (2002) Noncoding RNA genes identified in AT-rich hyperthermophiles. Proc. Natl Acad. Sci. USA, 99, Lambert,A. et al. (2004) The ERPIN server: an interface to profile-based RNA motif identification. Nucleic Acids Res., 32, W160 W165. Laslett,D. and Canback,B. (2004) ARAGORN, a program to detect trna genes and tmrna genes in nucleotide sequences. Nucleic Acids Res., 32, Livny,J. et al. (2005) srnapredict: an integrative computational approach to identify srnas in bacterial genomes. Nucleic Acids Res., 13, Livny,J. and Waldor,M.K. (2007) Identification of small RNAs in diverse bacterial species. Curr. Opin. Microbiol., 10, Lowe,T.M. and Eddy,S.R. (1997) trnascan-se: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res., 25, Macke,T.J. et al. (2001) RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res., 29, Mandin,P. et al. (2007) Identification of new noncoding RNAs in Listeria monocytogenes and prediction of mrna targets. Nucleic Acids Res., 35, Massé,E. et al. (2005) Effect of RyhB small RNA on global iron use in Escherichia coli. J. Bacteriol., 187, Mückstein,U. et al. (2006) Thermodynamics of RNA-RNA binding. Bioinformatics, 22, Ostberg,Y. et al. (2004) The etiological agent of Lyme disease, Borrelia burgdorferi, appears to contain only a few small RNA molecules. J. Bacteriol., 186, Pichon,C. and Felden,B. (2003) Intergenic sequence inspector: searching and identifying bacterial RNAs. Bioinformatics, 19, Pichon,C. and Felden,B. (2005) Small RNA genes expressed from Staphylococcus aureus genomic and pathogenicity islands with specific expression among pathogenic strains. Proc. Natl Acad. Sci. USA, 102, Pichon,C. and Felden,B. (2007) Proteins that interact with bacterial small RNA regulators. FEMS Microbiol. Rev., 31, Regalia,M. et al. (2002) Prediction of signal recognition particle RNA genes. Nucleic Acid Res., 30, Rehmsmeier,M. et al. (2004) Fast and effective prediction of microrna/target duplexes. RNA, 10, Rivas,E. and Eddy,S.R. (2000) Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs. Bioinformatics, 16, Rivas,E. and Eddy,S. (2001) Noncoding RNA gene detection using comparative sequence analysis. BMC Bioinformatics, 2, 8. Rivas,E. et al. (2001) Computational identification of noncoding RNAs in E. coli by comparative genomics. Curr. Biol., 11, Schattner,P. (2002) Searching for RNA genes using base composition statistics. Nucleic Acid Res., 30, Storz,G. and Gottesman,S. (2006) Versatile roles of small RNA regulators in bacteria. In Gesteland,R.F., Cech,T.R. and Atkins,J.F. (eds), The RNA World, 3rd edn. Cold Spring Harbor Laboratory Press, USA, pp Tjaden,B. et al. (2002) Transcriptome analysis of Escherichia coli using high-density oligonucleotide probe arrays. Nucleic Acid Res., 30, Tjaden,B. et al. (2006) Target prediction for small, noncoding RNAs in bacteria. Nucleic Acids Res., 34, Udekwu,K.I. et al. (2005) Hfq-dependent regulation of OmpA synthesis is mediated by an antisense RNA. Genes Dev., 19, Uzilov,A. et al. (2006) Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC Bioinformatics, 7, Vogel,J. et al. (2003) RNomics in Escherichia coli detects new srna species and indicates parallel transcriptional output in bacteria. Nucleic Acids Res., 31, Vogel,J. and Sharma,C.M. (2005) How to find small non-coding RNAs in bacteria. Biol. Chem., 386, Vogel,J. and Wagner,E.G.H. (2007) Target identification of small noncoding RNAs in bacteria. Curr. Opin. Microbiol., 10, Washietl,S. et al. (2005a) Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome. Nat. Biotechnol., 23, Washietl,S. et al. (2005b) Fast and reliable prediction of noncoding RNAs. Proc. Natl Acad. Sci. USA, 102, Wassarman,K.M. et al. (1999) Small RNAs in Escherichia coli. Trends Microbiol., 7,

7 Small RNA gene identification and mrna target predictions Wassarman,K. et al. (2001) Identification of novel small RNAs using comparative genomics and microarrays. Genes Dev., 15, Yachie,N. et al. (2006) Prediction of non-coding and antisense RNA genes in Escherichia coli with Gapped Markov Model. Gene, 372, Zhang,A. et al. (2003) Global analysis of small RNA and mrna targets of Hfq. Mol. Microbiol., 50, Zuker,M. (1989) On finding all suboptimal foldings of an RNA molecule. Science, 244,

The world of non-coding RNA. Espen Enerly

The world of non-coding RNA. Espen Enerly The world of non-coding RNA Espen Enerly ncrna in general Different groups Small RNAs Outline mirnas and sirnas Speculations Common for all ncrna Per def.: never translated Not spurious transcripts Always/often

More information

Supplemental Tables and Figure

Supplemental Tables and Figure A bacterial regulatory RNA attenuates virulence, spread and human host cell phagocytosis. Hélène Le Pabic, Noëlla Germain-Amiot, Valérie Bordeau and Brice Felden* Inserm U835-Upres EA2311, Biochimie Pharmaceutique,

More information

Translation Study Guide

Translation Study Guide Translation Study Guide This study guide is a written version of the material you have seen presented in the replication unit. In translation, the cell uses the genetic information contained in mrna to

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources 1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools

More information

Basic Concepts of DNA, Proteins, Genes and Genomes

Basic Concepts of DNA, Proteins, Genes and Genomes Basic Concepts of DNA, Proteins, Genes and Genomes Kun-Mao Chao 1,2,3 1 Graduate Institute of Biomedical Electronics and Bioinformatics 2 Department of Computer Science and Information Engineering 3 Graduate

More information

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains Proteins From DNA to Protein Chapter 13 All proteins consist of polypeptide chains A linear sequence of amino acids Each chain corresponds to the nucleotide base sequence of a gene The Path From Genes

More information

Profiling of non-coding RNA classes Gunter Meister

Profiling of non-coding RNA classes Gunter Meister Profiling of non-coding RNA classes Gunter Meister RNA Biology Regensburg University Universitätsstrasse 31 93053 Regensburg Overview Classes of non-coding RNAs Profiling strategies Validation Protein-RNA

More information

RNA & Protein Synthesis

RNA & Protein Synthesis RNA & Protein Synthesis Genes send messages to cellular machinery RNA Plays a major role in process Process has three phases (Genetic) Transcription (Genetic) Translation Protein Synthesis RNA Synthesis

More information

Modeling and Simulation of Gene Regulatory Networks

Modeling and Simulation of Gene Regulatory Networks Modeling and Simulation of Gene Regulatory Networks Hidde de Jong INRIA Grenoble - Rhône-Alpes Hidde.de-Jong@inria.fr http://ibis.inrialpes.fr INRIA Grenoble - Rhône-Alpes and IBIS IBIS: systems biology

More information

Protein Synthesis How Genes Become Constituent Molecules

Protein Synthesis How Genes Become Constituent Molecules Protein Synthesis Protein Synthesis How Genes Become Constituent Molecules Mendel and The Idea of Gene What is a Chromosome? A chromosome is a molecule of DNA 50% 50% 1. True 2. False True False Protein

More information

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism )

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Biology 1406 Exam 3 Notes Structure of DNA Ch. 10 Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Proteins

More information

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh

More information

Essentials of Real Time PCR. About Sequence Detection Chemistries

Essentials of Real Time PCR. About Sequence Detection Chemistries Essentials of Real Time PCR About Real-Time PCR Assays Real-time Polymerase Chain Reaction (PCR) is the ability to monitor the progress of the PCR as it occurs (i.e., in real time). Data is therefore collected

More information

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides Lecture Series 7 From DNA to Protein: Genotype to Phenotype Reading Assignments Read Chapter 7 From DNA to Protein A. Genes and the Synthesis of Polypeptides Genes are made up of DNA and are expressed

More information

Transcription and Translation of DNA

Transcription and Translation of DNA Transcription and Translation of DNA Genotype our genetic constitution ( makeup) is determined (controlled) by the sequence of bases in its genes Phenotype determined by the proteins synthesised when genes

More information

Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.

Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu. Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.au What is Gene Expression & Gene Regulation? 1. Gene Expression

More information

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!! DNA Replication & Protein Synthesis This isn t a baaaaaaaddd chapter!!! The Discovery of DNA s Structure Watson and Crick s discovery of DNA s structure was based on almost fifty years of research by other

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Structure and Function of DNA

Structure and Function of DNA Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four

More information

Current Motif Discovery Tools and their Limitations

Current Motif Discovery Tools and their Limitations Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.

More information

Name Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.

Name Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d. 13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both

More information

Molecular Genetics. RNA, Transcription, & Protein Synthesis

Molecular Genetics. RNA, Transcription, & Protein Synthesis Molecular Genetics RNA, Transcription, & Protein Synthesis Section 1 RNA AND TRANSCRIPTION Objectives Describe the primary functions of RNA Identify how RNA differs from DNA Describe the structure and

More information

Module 3 Questions. 7. Chemotaxis is an example of signal transduction. Explain, with the use of diagrams.

Module 3 Questions. 7. Chemotaxis is an example of signal transduction. Explain, with the use of diagrams. Module 3 Questions Section 1. Essay and Short Answers. Use diagrams wherever possible 1. With the use of a diagram, provide an overview of the general regulation strategies available to a bacterial cell.

More information

How many of you have checked out the web site on protein-dna interactions?

How many of you have checked out the web site on protein-dna interactions? How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss

More information

Chapter 18 Regulation of Gene Expression

Chapter 18 Regulation of Gene Expression Chapter 18 Regulation of Gene Expression 18.1. Gene Regulation Is Necessary By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection

More information

Introduction To Real Time Quantitative PCR (qpcr)

Introduction To Real Time Quantitative PCR (qpcr) Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors

More information

DNA (genetic information in genes) RNA (copies of genes) proteins (functional molecules) directionality along the backbone 5 (phosphate) to 3 (OH)

DNA (genetic information in genes) RNA (copies of genes) proteins (functional molecules) directionality along the backbone 5 (phosphate) to 3 (OH) DNA, RNA, replication, translation, and transcription Overview Recall the central dogma of biology: DNA (genetic information in genes) RNA (copies of genes) proteins (functional molecules) DNA structure

More information

Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium

Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable

More information

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening Micro RNAs: potentielle Biomarker für das Blutspenderscreening micrornas - Background Types of RNA -Coding: messenger RNA (mrna) -Non-coding (examples): Ribosomal RNA (rrna) Transfer RNA (trna) Small nuclear

More information

13.4 Gene Regulation and Expression

13.4 Gene Regulation and Expression 13.4 Gene Regulation and Expression Lesson Objectives Describe gene regulation in prokaryotes. Explain how most eukaryotic genes are regulated. Relate gene regulation to development in multicellular organisms.

More information

CCR Biology - Chapter 9 Practice Test - Summer 2012

CCR Biology - Chapter 9 Practice Test - Summer 2012 Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible

More information

Interaktionen von RNAs und Proteinen

Interaktionen von RNAs und Proteinen Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 Studying RNA-protein interactions Given: target protein known to bind to RNA problem: find binding partners and binding sites experimental

More information

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three Chem 121 Chapter 22. Nucleic Acids 1. Any given nucleotide in a nucleic acid contains A) two bases and a sugar. B) one sugar, two bases and one phosphate. C) two sugars and one phosphate. D) one sugar,

More information

Transcription in prokaryotes. Elongation and termination

Transcription in prokaryotes. Elongation and termination Transcription in prokaryotes Elongation and termination After initiation the σ factor leaves the scene. Core polymerase is conducting the elongation of the chain. The core polymerase contains main nucleotide

More information

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in DNA, RNA, Protein Synthesis Keystone 1. During the process shown above, the two strands of one DNA molecule are unwound. Then, DNA polymerases add complementary nucleotides to each strand which results

More information

PreciseTM Whitepaper

PreciseTM Whitepaper Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis

More information

AERES report on the research unit

AERES report on the research unit Section des Unités de recherche AERES report on the research unit Fonction structure et inactivation d ARN bactériens From the University Rennes 1 INSERM December 2010 Section des Unités de recherche AERES

More information

Biotechnology: DNA Technology & Genomics

Biotechnology: DNA Technology & Genomics Chapter 20. Biotechnology: DNA Technology & Genomics 2003-2004 The BIG Questions How can we use our knowledge of DNA to: diagnose disease or defect? cure disease or defect? change/improve organisms? What

More information

RNA Structure and folding

RNA Structure and folding RNA Structure and folding Overview: The main functional biomolecules in cells are polymers DNA, RNA and proteins For RNA and Proteins, the specific sequence of the polymer dictates its final structure

More information

Specific problems. The genetic code. The genetic code. Adaptor molecules match amino acids to mrna codons

Specific problems. The genetic code. The genetic code. Adaptor molecules match amino acids to mrna codons Tutorial II Gene expression: mrna translation and protein synthesis Piergiorgio Percipalle, PhD Program Control of gene transcription and RNA processing mrna translation and protein synthesis KAROLINSKA

More information

Microarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008

Microarray Data Analysis Workshop. Custom arrays and Probe design Probe design in a pangenomic world. Carsten Friis. MedVetNet Workshop, DTU 2008 Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Custom arrays and Probe design Probe design in a pangenomic world Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5 C6 K GlnR C1 C4 C7

More information

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office

More information

Translation. Translation: Assembly of polypeptides on a ribosome

Translation. Translation: Assembly of polypeptides on a ribosome Translation Translation: Assembly of polypeptides on a ribosome Living cells devote more energy to the synthesis of proteins than to any other aspect of metabolism. About a third of the dry mass of a cell

More information

Control of Gene Expression

Control of Gene Expression Home Gene Regulation Is Necessary? Control of Gene Expression By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection favoring

More information

How To Understand How Gene Expression Is Regulated

How To Understand How Gene Expression Is Regulated What makes cells different from each other? How do cells respond to information from environment? Regulation of: - Transcription - prokaryotes - eukaryotes - mrna splicing - mrna localisation and translation

More information

FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem

FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem Elsa Bernard Laurent Jacob Julien Mairal Jean-Philippe Vert September 24, 2013 Abstract FlipFlop implements a fast method for de novo transcript

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold

More information

Next Generation Sequencing

Next Generation Sequencing Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977

More information

BCH401G Lecture 39 Andres

BCH401G Lecture 39 Andres BCH401G Lecture 39 Andres Lecture Summary: Ribosome: Understand its role in translation and differences between translation in prokaryotes and eukaryotes. Translation: Understand the chemistry of this

More information

Control of Gene Expression

Control of Gene Expression Control of Gene Expression What is Gene Expression? Gene expression is the process by which informa9on from a gene is used in the synthesis of a func9onal gene product. What is Gene Expression? Figure

More information

Synthetic RNA-based devices --- Programming cellular networks using synthetic riboregulators ---

Synthetic RNA-based devices --- Programming cellular networks using synthetic riboregulators --- Synthetic RNA-based devices --- Programming cellular networks using synthetic riboregulators --- Synthetic Microbiology Heinrich-Heine-Universität Düsseldorf 24.06.2014 Synthetic RNA-based devices ---

More information

Outline. interfering RNA - What is dat? Brief history of RNA interference. What does it do? How does it work?

Outline. interfering RNA - What is dat? Brief history of RNA interference. What does it do? How does it work? Outline Outline interfering RNA - What is dat? Brief history of RNA interference. What does it do? How does it work? What is RNA interference? Recently discovered regulatory level. Genome immune system.

More information

Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat

Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat Bioinformatique et Séquençage Haut Débit, Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat 1 RNA Transcription to RNA and subsequent

More information

Activity 7.21 Transcription factors

Activity 7.21 Transcription factors Purpose To consolidate understanding of protein synthesis. To explain the role of transcription factors and hormones in switching genes on and off. Play the transcription initiation complex game Regulation

More information

DNA, RNA, Protein synthesis, and Mutations. Chapters 12-13.3

DNA, RNA, Protein synthesis, and Mutations. Chapters 12-13.3 DNA, RNA, Protein synthesis, and Mutations Chapters 12-13.3 1A)Identify the components of DNA and explain its role in heredity. DNA s Role in heredity: Contains the genetic information of a cell that can

More information

Bioinformatics Resources at a Glance

Bioinformatics Resources at a Glance Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences

More information

Data Analysis for Ion Torrent Sequencing

Data Analysis for Ion Torrent Sequencing IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page

More information

Chapter 11: Molecular Structure of DNA and RNA

Chapter 11: Molecular Structure of DNA and RNA Chapter 11: Molecular Structure of DNA and RNA Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand the major experiments that led to the discovery of DNA as

More information

Nucleic Acid Techniques in Bacterial Systematics

Nucleic Acid Techniques in Bacterial Systematics Nucleic Acid Techniques in Bacterial Systematics Edited by Erko Stackebrandt Department of Microbiology University of Queensland St Lucia, Australia and Michael Goodfellow Department of Microbiology University

More information

Complex multicellular organisms are produced by cells that switch genes on and off during development.

Complex multicellular organisms are produced by cells that switch genes on and off during development. Home Control of Gene Expression Gene Regulation Is Necessary? By switching genes off when they are not needed, cells can prevent resources from being wasted. There should be natural selection favoring

More information

Core Facility Genomics

Core Facility Genomics Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray

More information

Introduction to Genome Annotation

Introduction to Genome Annotation Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT

More information

PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR. Results Interpretation Guide

PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR. Results Interpretation Guide PATHOGEN DETECTION SYSTEMS BY REAL TIME PCR Results Interpretation Guide Pathogen Detection Systems by Real Time PCR Microbial offers real time PCR based systems for the detection of pathogenic bacteria

More information

Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual

Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual Thermo Scientific DyNAmo cdna Synthesis Kit for qrt-pcr Technical Manual F- 470S 20 cdna synthesis reactions (20 µl each) F- 470L 100 cdna synthesis reactions (20 µl each) Table of contents 1. Description...

More information

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Analysis of gene expression data. Ulf Leser and Philippe Thomas Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:

More information

4. DNA replication Pages: 979-984 Difficulty: 2 Ans: C Which one of the following statements about enzymes that interact with DNA is true?

4. DNA replication Pages: 979-984 Difficulty: 2 Ans: C Which one of the following statements about enzymes that interact with DNA is true? Chapter 25 DNA Metabolism Multiple Choice Questions 1. DNA replication Page: 977 Difficulty: 2 Ans: C The Meselson-Stahl experiment established that: A) DNA polymerase has a crucial role in DNA synthesis.

More information

trna Processing and Modification

trna Processing and Modification trna Processing and Modification RNA POL III - TRANSCRIPTS 5S RNA, trna, repetitive Sequenzen (Alu-typ), versch. kleine stabile RNAs (7SL - RNA vom signal recognition particle (SRP)), U6 RNA 5S RNA nicht

More information

1 Mutation and Genetic Change

1 Mutation and Genetic Change CHAPTER 14 1 Mutation and Genetic Change SECTION Genes in Action KEY IDEAS As you read this section, keep these questions in mind: What is the origin of genetic differences among organisms? What kinds

More information

Dicer Substrate RNAi Design

Dicer Substrate RNAi Design INTEGRATED DNA TECHNOLOGIES, INC. Dicer Substrate RNAi Design How to design and order 27-mer Dicer-substrate Duplex RNAs for use as RNA interference reagents The following document provides a summary of

More information

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray

More information

Molecular Biology Techniques: A Classroom Laboratory Manual THIRD EDITION

Molecular Biology Techniques: A Classroom Laboratory Manual THIRD EDITION Molecular Biology Techniques: A Classroom Laboratory Manual THIRD EDITION Susan Carson Heather B. Miller D.Scott Witherow ELSEVIER AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN

More information

Functional RNAs; RNA catalysts, mirna,

Functional RNAs; RNA catalysts, mirna, Functional RNAs; RNA catalysts, mirna, srna, RNAi... RNAs have many functions rrna (ribosomal RNA) trna (transfer RNA) mrna (Messenger RNA) snrna (including snorna) ) (Small nuclear RNA- splicing) Other

More information

Chapter 6 DNA Replication

Chapter 6 DNA Replication Chapter 6 DNA Replication Each strand of the DNA double helix contains a sequence of nucleotides that is exactly complementary to the nucleotide sequence of its partner strand. Each strand can therefore

More information

The Steps. 1. Transcription. 2. Transferal. 3. Translation

The Steps. 1. Transcription. 2. Transferal. 3. Translation Protein Synthesis Protein synthesis is simply the "making of proteins." Although the term itself is easy to understand, the multiple steps that a cell in a plant or animal must go through are not. In order

More information

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized: Module 3F Protein Synthesis So far in this unit, we have examined: How genes are transmitted from one generation to the next Where genes are located What genes are made of How genes are replicated How

More information

New Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc.

New Technologies for Sensitive, Low-Input RNA-Seq. Clontech Laboratories, Inc. New Technologies for Sensitive, Low-Input RNA-Seq Clontech Laboratories, Inc. Outline Introduction Single-Cell-Capable mrna-seq Using SMART Technology SMARTer Ultra Low RNA Kit for the Fluidigm C 1 System

More information

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) A typical RNA Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,

More information

GenBank, Entrez, & FASTA

GenBank, Entrez, & FASTA GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS

More information

Ms. Campbell Protein Synthesis Practice Questions Regents L.E.

Ms. Campbell Protein Synthesis Practice Questions Regents L.E. Name Student # Ms. Campbell Protein Synthesis Practice Questions Regents L.E. 1. A sequence of three nitrogenous bases in a messenger-rna molecule is known as a 1) codon 2) gene 3) polypeptide 4) nucleotide

More information

PRACTICE TEST QUESTIONS

PRACTICE TEST QUESTIONS PART A: MULTIPLE CHOICE QUESTIONS PRACTICE TEST QUESTIONS DNA & PROTEIN SYNTHESIS B 1. One of the functions of DNA is to A. secrete vacuoles. B. make copies of itself. C. join amino acids to each other.

More information

Quantitative proteomics background

Quantitative proteomics background Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran

More information

Sickle cell anemia: Altered beta chain Single AA change (#6 Glu to Val) Consequence: Protein polymerizes Change in RBC shape ---> phenotypes

Sickle cell anemia: Altered beta chain Single AA change (#6 Glu to Val) Consequence: Protein polymerizes Change in RBC shape ---> phenotypes Protein Structure Polypeptide: Protein: Therefore: Example: Single chain of amino acids 1 or more polypeptide chains All polypeptides are proteins Some proteins contain >1 polypeptide Hemoglobin (O 2 binding

More information

Genetomic Promototypes

Genetomic Promototypes Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,

More information

Title: Surveying Genome to Identify Origins of DNA Replication In Silico

Title: Surveying Genome to Identify Origins of DNA Replication In Silico Title: Surveying Genome to Identify Origins of DNA Replication In Silico Abstract: DNA replication origins are the bases to realize the process of chromosome replication and analyze the progress of cell

More information

Twincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie

Twincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie Twincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie 0 HELMHOLTZ I ZENTRUM FÜR INFEKTIONSFORSCHUNG Technische Universität Braunschweig Institut

More information

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon Integrating DNA Motif Discovery and Genome-Wide Expression Analysis Department of Mathematics and Statistics University of Massachusetts Amherst Statistics in Functional Genomics Workshop Ascona, Switzerland

More information

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled Biology 101 Chapter 14 Name: Fill-in-the-Blanks Which base follows the next in a strand of DNA is referred to. as the base (1) Sequence. The region of DNA that calls for the assembly of specific amino

More information

Functional and Biomedical Aspects of Genome Research

Functional and Biomedical Aspects of Genome Research Functional and Biomedical Aspects of Genome Research 20 11 35 Vorlesung SS 04 Bartsch, Jockusch & Schmitt-John Mi. 9:15-10:00, in W7-135 13 Functional RNAs Thomas Schmitt-John micro RNAs small interfering

More information

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Single nucleotide polymorphisms or SNPs (pronounced "snips") are DNA sequence variations that occur

More information

Analytical Study of Hexapod mirnas using Phylogenetic Methods

Analytical Study of Hexapod mirnas using Phylogenetic Methods Analytical Study of Hexapod mirnas using Phylogenetic Methods A.K. Mishra and H.Chandrasekharan Unit of Simulation & Informatics, Indian Agricultural Research Institute, New Delhi, India akmishra@iari.res.in,

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

PROC. CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE 2006 1. E-mail: msm_eng@k-space.org

PROC. CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE 2006 1. E-mail: msm_eng@k-space.org BIOINFTool: Bioinformatics and sequence data analysis in molecular biology using Matlab Mai S. Mabrouk 1, Marwa Hamdy 2, Marwa Mamdouh 2, Marwa Aboelfotoh 2,Yasser M. Kadah 2 1 Biomedical Engineering Department,

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

Interaktionen von Nukleinsäuren und Proteinen

Interaktionen von Nukleinsäuren und Proteinen Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 DNA is never naked in a cell DNA is usually in association with proteins. In all domains of life there are small, basic chromosomal

More information

RNA Viruses. A Practical Approac h. Alan J. Cann

RNA Viruses. A Practical Approac h. Alan J. Cann RNA Viruses A Practical Approac h Alan J. Cann List of protocols page xiii Abbreviations xvii Investigation of RNA virus genome structure 1 A j. Easton, A.C. Marriott and C.R. Pringl e 1 Introduction-the

More information

Technical Note. Roche Applied Science. No. LC 18/2004. Assay Formats for Use in Real-Time PCR

Technical Note. Roche Applied Science. No. LC 18/2004. Assay Formats for Use in Real-Time PCR Roche Applied Science Technical Note No. LC 18/2004 Purpose of this Note Assay Formats for Use in Real-Time PCR The LightCycler Instrument uses several detection channels to monitor the amplification of

More information