The world of non-coding RNA Espen Enerly
ncrna in general Different groups Small RNAs Outline mirnas and sirnas Speculations
Common for all ncrna Per def.: never translated Not spurious transcripts Always/often works together with proteins RNA-RNA, RNA, RNA-DNA and RNA-Protein Different mutational pattern than CDS
Non coding RNA genes Gene groups (today ): rrna trna snrna snorna Long mrna like ncrna Antisense (transcribed from same locus; cis-acting) mirna Other groups tncrna (tiny non coding RNA; mirna like) strna (small temporal RNA; early grouping of mirna) Riboregulators/sRNA/saRNA (bacteria; cis- and trans-acting) acting) sirna (processed RNA and not a gene group) Introns
RNA processing Mattick et al. 2003
Well known ncrna rrna ribosomal RNA: component of the ribosome trna - transfer RNA: Guides the adding of amino acids during translation snrna Component of the splicing complex
Small nucleolar RNA. rrna editing via complementary binding 2 groups different secondary structure Modify differently Function: Ribosome assembly and/or function! 100 200 genes identified Eukaryotes and archaebacteria 1. H/ACA snorna 2. C/D box snorna
Long mrna like ncrna No ORF (no coding potential) Spliced Many well known, essential genes e.g XIST, X-chromosome inactivation ~20 kb cdna Non sequence specific function Regulated by an antisense (TSIX). Number of genes? Several thousands mouse cdna cluster lack coding potential Many remains to be discovered
Antisense Transcribed from the same locus on the opposite strand dsrna in nucleus Effect: A. Editing of sense transcript which prevents export) B. Local silencing and heterochromatin formation Retrotransposons Repetitive elements Common among imprinted genes >8% of the human genes contain antisense transcripts e.g. Air Antisense to a cluster of imprinted genes
MicroRNAs Definition of mirnas ~22 nt long Processed from Dicer Hairpin precursor Phylogenetic conservation RNase III enzymes Recognize dsrna e.g. Dicer, Drosha ++ proteins Drosha Dicer Lee et al., EMBO 2002 and Nature 2003
MicroRNAs (2) ~200-300 human mirna genes (most of them identified) Most famous: Lin-4 and Let-7 Developmental regulators in C. elegans Target unknown in most situations Ambros, Cell 2001
RNA interference (RNAi) and small interfering RNA (sirna) RNA interference Mechanism/process/ method Degrades long double stranded RNA Dicer Used to generate KO Introduce long dsrna or sirna (~22 nt) Dicer Long dsrna sirna mirnp-risc McManus and Sharp, 2002 Target cleavage
RNA interference (RNAi) and small interfering RNA (sirna) RNA interference Mechanism/process method Degrades long double stranded RNA Dicer Used to generate KO Introduce long dsrna or sirna (~22 nt) Target recognition Plant/animal disparity PremiRNA Dicer Long dsrna sirna and mirna mirnp-risc Translational repression McManus and Sharp, 2002 Target cleavage
RNA interference (RNAi) and small interfering RNA (sirna) Secondary sirnas and Transitive RNAi RdRP (RNA dependent RNA Polymerase) Amplification Found in Neurospora, C. elegans, and Arabidopsis Not in Drosophila and human Random degradative PCR- model Therapeutic possibilities Delivery, delivery, delivery Specificity (!) Systemic response (spreading) Plants, C. elegans Design of sirna In silico prediction sirna cocktail Secondary sirnas, generated from the dsrna extended (orange) upstream of the primary target region (blue) by RdRP, promote transitive RNAi against their sequence homologous gene family members (A) or alternatively spliced mrnas (B). Nishikura, 2001
mirna examples Altered mirna metabolism = pleiotropic developmental defects Leaf morphogenesis in plants JAW locus encode a mirna Cleave TCP4 mrna and regulate the level of this transcript Bantam i Drosophila Mir-23 in human Scarecrow i Arabidopsis Hot topics Target identification Regulation of mirna Pre-miRNA Mature mirna mrna Wild type Mutant
Localization and organization? Localization Intergenic regions (Between protein coding genes) Intronic e.g. mirna, long ncrna, e.g. snorna,, mirna(!) Organization Individual e.g. long ncrna, mirna and snorna Polycistronic e.g. rrna, mirna, and snorna
Summary of the groups Gene groups (today ): snorna Long mrna like ncrna Antisense (transcribed from same locus; cis-acting) mirna Other groups tncrna (tiny non coding RNA; mirna like) strna (small temporal RNA; early grouping of mirna) Riboregulators/sRNA/saRNA (bacteria; cis- and trans-acting) acting) sirna (processed RNA from mirna, exogenous dsrna,, viral dsrna or antisense) Introns
Processes which involves ncrnas Protein synthesis (rrna/trna/snorna snorna) Splicing and alternative splicing (snrna/snorna( snrna/snorna/+) /+) Translational regulation (mirna) mrna stability (mirna/sirna) Virus protection Suggested to be the evolutionary origin of RNAi mrna export (antisense) Chromatin modification (antisense/sirna) Transposon control Genomic imprinting X-chromosome inactivation DNA methylation ++
Non-coding RNAs: The architects of eukaryotic complexity (John Mattick) The traditional view most genes encode proteins and these are sufficient in themselves to specify and organize the autopoietic programming of complex biological entities The new view (Mattick s( view) 1) A second tier of genetic output and a network of parallel RNA- mediated interactions has evolved in multi-cellular eukaryotes 2) Expansion of ncrna in higher organisms suggests that evolution of complexity may not only have been dependent on more proteins and protein isoforms 3) Alteration in the RNA control architecture is responsible for much of the phenotypic alteration between individuals and species 4) Much of this is hidden in the introns
Introns as ncrna 95 % of pre-mrnas are introns Traditional view: Rapidly degraded, only minor regulatory functions Some introns contain ncrnas (snorna, mirna) Some are stabile and exported to cytoplasm Conserved elements (5'-introns)
Why is it discovered first now? RNA isolation disfavor ncrnas Poly-A A extraction Small RNA discarded Gene identification programs are specialized for protein coding genes RNA interference (1998) Tool to generate knockouts Gene therapy
Q & A
Bioinformatics In silico ncrna gene identification In silico target identification Implication for network construction Many genes have dual output RNAs as a hidden layer of regulation Increasing interest for RNA structure prediction ++
Antisense Air