Bioinformatics, Sequences and Genomes
|
|
- Nelson Phelps
- 8 years ago
- Views:
Transcription
1 Bioinformatics, Sequences and Genomes BL4273 Bioinformatics for Biologists Week 1 Daniel Barker, School of Biology, University of St Andrews db60@st-andrews.ac.uk
2 BL4273 and 4273π 4273π is a custom Linux operating system for the Raspberry Pi, including an Open Access course in bioinformatics. The Open Access course is based on BL4273 and is similar, but not identical. See: Barker, D., Ferrier, D.E.K., Holland, P.W.H., Mitchell, J.B.O., Plaisier, H., Ritchie, M.D. and Smart, S.D. (2013) 4273π: bioinformatics education on low cost ARM hardware. BMC Bioinformatics 14: 243. Further discussion in BMC s Biome magazine:
3 What is bioinformatics? Use of computers in biology DNA, RNA, proteins sequences, structure, function, expression Amount of data is large Release 123 of the European Nucleotide Archive database (March 2015) contains 1,154,095,724,206 nucleotides Degree of automation should be as high as possible
4 Some research examples Alderson, R.G., Barker, D. and Mitchell, J.B.O. (2014) One origin for metallo-β-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees. Journal of Molecular Evolution 79: It is unclear whether metallo-β-lactamse activity evolved once or twice. Latysheva, N., Junker, V.L., Palmer, W.J., Codd, G.A. and Barker, D. (2012) The evolution of nitrogen fixation in cyanobacteria. Bioinformatics 28: The last common ancestor of cyanobacteria may have fixed nitrogen. Lim, J.-H., Iggo, R.D. and Barker, D. (2013) Nucleic Acids Research 41: Chromatin modification data help us identify functional p53 transcription factor binding sites. Mendivil Ramos, O., Barker, D. and Ferrier, D.E.K. (2012) Ghost loci imply Hox and ParaHox existence in the last common ancestor of animals. Current Biology 22:
5 Nucleotide database growth Source:
6 The major nucleotide sequence databases GenBank (USA), EMBL (EU), DDBJ (Japan) These three databases share their data. To access the data, we can use GQuery at the National Center for Biotechnology Information (NCBI), USA
7 A typical DNA record at the NCBI accession.version Source: NCBI GQuery,
8
9
10
11 Same sequence in Fasta format Source: NCBI GQuery,
12
13 What is a genome? Sum total of heritable nucleic acids in an individual (or organelle, or virus particle) DNA for some viruses and all (other) life RNA for most viruses
14 Raw genome data Genome sequence Transcriptome sequence Gene expression data Variation data (polymorphisms, repeats, allele frequencies) Subsequent bioinformatic analysis Genome annotation Functional annotation
15 Growth of genome sequence databases In June 2015: In 2012: Source: GOLD database,
16 Genome sequencing Decent job sequencing a eukaryotic genome: 20,000 to 1,000,000. Decent job sequencing a eukaryotic transcriptome: 1,000 to 2,000. Technology is moving fast. Sequencing a eukaryotic genome to draft standard will soon cost < 1,000. (Doing a proper job of sequencing a genome to finished standard remains extremely expensive.)
17 Genome analysis Stage: Methods: Genome sequencing (laboratory) Genome assembly computers Genome annotation computers Computational genome analysis single-genome analyses computers comparative analyses computers (Validate computational predictions laboratory)
18 DNA sequencing Reads Source: NCBI Traces Archive,
19 Genome assembly Genomes Reads Contigs fragment assembly software whole-genome assembly software Contigs Assembly
20 Genome annotation Genes Genomic DNA (contigs / assembly) Known mrnas and proteins + ab initio methods Genome annotation procedure Putative genes location in the genome, DNA, mrna and protein sequences
21 Genes Biology How do genes vary? Correlate with environmental, phenotypic and genomic features How are gene products affected? Secondary and tertiary structure Active sites Protein-protein and protein-dna interaction surfaces What are the consequences? Disease Virulence Drug susceptibility and resistance Fitness and adaptation Speciation
22 Protein sequences Most protein sequences are obtained by conceptual translation of coding DNA. It is usually DNA which is sequenced in the laboratory, rather than protein. The protein database available via Gquery includes translations of all annotated coding sequences in the DNA database. Additionally it includes SwissProt, a highquality (but smaller) protein sequence database.
23 Protein sequence databases GenPept (via GQuery) Translation of all coding sequences in Genbank Automatically generated without review High volume, unpredictable quality Swissprot (via GQuery and various sites) Whole-genome data: genomic DNA Manually curated Low volume, high quality
24 Protein sequence in Fasta format Single-letter amino acid codes G P A V L I M C F Y glycine proline alanine valine leucine isoleucine methionine cysteine phenylalanine tyrosine W H K R Q N E D S T tryptophan histidine lysine arginine glutamine asparagine glutamic Acid aspartic Acid serine threonine
25 Genome databases Contain sequence and annotation Often better than general databases, for the genome in question There are many for example Ensembl NCBI Genome
26 Gene expression database GEO Further protein databases RefSeq, genome-oriented PDB: structures
27 General aims of BL4273 To see what research-grade bioinformatics is like. To reach the point where, in a biology or biochemistry research project, you can begin to do the necessary bioinformatics for that project. Underlying principles, data, software, computers, scripting. Linux operating system, command-line software. Have fun.
28 Module contents Five lectures Seven practical classes, 2 hours each One tutorial, up to 2 hours One seminar, 2 hours One practical-project to complete and write up in your own time (50% of final grade) One exam (50% of final grade)
29 Recommended reading: main journals Bioinformatics PLoS Computational Biology BMC Bioinformatics Nucleic Acids Research Genome Research All papers more than a year old in Bioinformatics, and more than six months old in Genome Research, are available free online. PLoS Computational Biology, BMC Bioinformatics and Nucleic Acids Research are Open Access journals, with all published content available free online.
30 Recommended reading: main books Bradnam, K. and Korf, I. (2012) UNIX and Perl to the Rescue! A field guide for the life sciences (and other data-rich pursuits). (Cambridge: Cambridge University Press) Baxevanis, A.D. and Ouellette, B.F.F., editors (2005) Bioinformatics: A practical Guide to the Analysis of Genes and Proteins, 3rd edition (Hoboken, New Jersey: John Wiley). Dear, P.H., editor (2007) Bioinformatics (Bloxham, Oxfordshire: Scion). Schwartz, R.L., Phoenix, T. and foy, b d (2008) Learning Perl, 5th edition (Sebastopol, California: O'Reilly). Zvelebil, M. and Baum, J.O. (2007) Understanding Bioinformatics (New York: Garland Science).
31 Recommended reading: other books Robbins, A. (2006). UNIX in a Nutshell, 4th edition (Sebastopol, California: O Reilly). Christiansen, T., foy, b d and Orwant, J. (2012) Programming Perl, 4th edition (Sebastopol, California: O Reilly). [3rd ed. is also OK.] Brown, T.A. (2007) Genomes, 3rd edition (New York: Garland Science). Cristianini, N. and Hahn, M.W. (2007) Introduction to Computational Genomics: A Case Studies Approach (Cambridge: Cambridge University Press). Kernighan, B.W. and Pike, R. (1984) The UNIX Programming Environment (Englewood Cliffs: Prentice-Hall). Orengo, C.A., Jones, D.T. and Thornton, J.M. (2003) Bioinformatics: Genes, Proteins and Computers (Oxford: BIOS Scientific). Sunnerhagen, P. and Piškur, J., editors (2006) Comparative Genomics: Using Fungi as Models (Heidelberg: Springer).
32 Further reading for the keen: genre-defining classics Durbin, R., Eddy, S., Krogh, A. and Mitchison, G. (1998) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids (Cambridge: Cambridge University Press). Baldi, P. and Brunak, S. (2001) Bioinformatics: The Machine Learning Approach, 2nd edition (Cambridge, Massachusetts: MIT Press). Deonier, R.C., Tavaré, S. and Waterman, M. (2005) Computational Genome Analysis: An Introduction (New York: Springer). These three books are beyond the scope of this course. But you may wish to take a look.
Concluding lesson. Student manual. What kind of protein are you? (Basic)
Concluding lesson Student manual What kind of protein are you? (Basic) Part 1 The hereditary material of an organism is stored in a coded way on the DNA. This code consists of four different nucleotides:
More informationIntroduction to Bioinformatics 2. DNA Sequence Retrieval and comparison
Introduction to Bioinformatics 2. DNA Sequence Retrieval and comparison Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 matthewb@ba.ars.usda.gov
More informationA Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
More informationBOC334 (Proteomics) Practical 1. Calculating the charge of proteins
BC334 (Proteomics) Practical 1 Calculating the charge of proteins Aliphatic amino acids (VAGLIP) N H 2 H Glycine, Gly, G no charge Hydrophobicity = 0.67 MW 57Da pk a CH = 2.35 pk a NH 2 = 9.6 pi=5.97 CH
More informationRETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
More informationBIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:
More informationModule 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
More informationPipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?
Pipe Cleaner Proteins GPS: SB1 Students will analyze the nature of the relationships between structures and functions in living cells. Essential question: How does the structure of proteins relate to their
More informationRNA and Protein Synthesis
Name lass Date RN and Protein Synthesis Information and Heredity Q: How does information fl ow from DN to RN to direct the synthesis of proteins? 13.1 What is RN? WHT I KNOW SMPLE NSWER: RN is a nucleic
More informationHuman Tubal Fluid (HTF) Media & Modifi ed Human Tubal Fluid (mhtf) Medium with Gentamicin
Human Tubal Fluid (HTF) Media & Modifi ed Human Tubal Fluid (mhtf) Medium with Gentamicin HTF Media are intended for use in assisted reproductive procedures which include gamete and embryo manipulation
More informationIntroduction to Bioinformatics 3. DNA editing and contig assembly
Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 matthewb@ba.ars.usda.gov
More informationBiological Sequence Data Formats
Biological Sequence Data Formats Here we present three standard formats in which biological sequence data (DNA, RNA and protein) can be stored and presented. Raw Sequence: Data without description. FASTA
More informationMultiple Choice Write the letter that best answers the question or completes the statement on the line provided.
Name lass Date hapter 12 DN and RN hapter Test Multiple hoice Write the letter that best answers the question or completes the statement on the line provided. Pearson Education, Inc. ll rights reserved.
More informationGuidelines for Writing a Scientific Paper
Guidelines for Writing a Scientific Paper Writing an effective scientific paper is not easy. A good rule of thumb is to write as if your paper will be read by a person who knows about the field in general
More informationFINDING RELATION BETWEEN AGING AND
FINDING RELATION BETWEEN AGING AND TELOMERE BY APRIORI AND DECISION TREE Jieun Sung 1, Youngshin Joo, and Taeseon Yoon 1 Department of National Science, Hankuk Academy of Foreign Studies, Yong-In, Republic
More informationIV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins
IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon A. Acid/Base properties 1. carboxyl group is proton donor! weak acid 2. amino group is proton acceptor! weak base 3. At physiological ph: H
More informationAdvanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK
Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Dai Lu, Ph.D. dlu@tamhsc.edu Tel: 361-221-0745 Office: RCOP, Room 307 Drug Discovery and Development Drug Molecules Medicinal
More informationPRACTICE TEST QUESTIONS
PART A: MULTIPLE CHOICE QUESTIONS PRACTICE TEST QUESTIONS DNA & PROTEIN SYNTHESIS B 1. One of the functions of DNA is to A. secrete vacuoles. B. make copies of itself. C. join amino acids to each other.
More information200630 - FBIO - Fundations of Bioinformatics
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND
More informationGenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
More informationAmino Acids, Peptides, Proteins
Amino Acids, Peptides, Proteins Functions of proteins: Enzymes Transport and Storage Motion, muscle contraction Hormones Mechanical support Immune protection (Antibodies) Generate and transmit nerve impulses
More informationA Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
More informationAmino Acids and Proteins
Amino Acids and Proteins Proteins are composed of amino acids. There are 20 amino acids commonly found in proteins. All have: N2 C α R COO Amino acids at neutral p are dipolar ions (zwitterions) because
More informationH H N - C - C 2 R. Three possible forms (not counting R group) depending on ph
Amino acids - 0 common amino acids there are others found naturally but much less frequently - Common structure for amino acid - C, -N, and functional groups all attached to the alpha carbon N - C - C
More informationShu-Ping Lin, Ph.D. E-mail: splin@dragon.nchu.edu.tw
Amino Acids & Proteins Shu-Ping Lin, Ph.D. Institute te of Biomedical Engineering ing E-mail: splin@dragon.nchu.edu.tw Website: http://web.nchu.edu.tw/pweb/users/splin/ edu tw/pweb/users/splin/ Date: 10.13.2010
More informationCore Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat
More informationLecture Outline. Introduction to Databases. Introduction. Data Formats Sample databases How to text search databases. Shifra Ben-Dor Irit Orr
Introduction to Databases Shifra Ben-Dor Irit Orr Lecture Outline Introduction Data and Database types Database components Data Formats Sample databases How to text search databases What units of information
More informationBiochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II
Biochemistry - I Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II In the last class we studied the enzyme mechanisms of ribonuclease A
More informationFocusing on results not data comprehensive data analysis for targeted next generation sequencing
Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes
More information4273?: a novel platform for bioinformatics education
4273?: a novel platform for bioinformatics education Gaining an understanding of bioinformatics is now par for the course when it comes to training undergraduate students in biology. However, facilitating
More informationBioinformatics: course introduction
Bioinformatics: course introduction Filip Železný Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab http://ida.felk.cvut.cz
More informationTHE UNIVERSITY OF MANCHESTER Unit Specification
1. GENERAL INFORMATION Title Unit code Credit rating 15 Level 7 Contact hours 30 Other Scheduled teaching and learning activities* Pre-requisite units Co-requisite units School responsible Member of staff
More informationSharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility Report of a meeting organized by the Wellcome Trust and held on 14 15 January 2003 at Fort Lauderdale,
More informationCh18_PT MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Ch18_PT MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) All of the following can be classified as biomolecules except A) lipids. B) proteins. C)
More information2007 7.013 Problem Set 1 KEY
2007 7.013 Problem Set 1 KEY Due before 5 PM on FRIDAY, February 16, 2007. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. Where in a eukaryotic cell do you
More informationGenetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism )
Biology 1406 Exam 3 Notes Structure of DNA Ch. 10 Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Proteins
More informationGenome and DNA Sequence Databases. BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009
Genome and DNA Sequence Databases BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009 Admin Reading: Chapters 1 & 2 Notes available in PDF format on-line (see class calendar page): http://www.soe.ucsc.edu/classes/bme110/spring09/bme110-calendar.html
More informationJust the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
More informationActivity 7.21 Transcription factors
Purpose To consolidate understanding of protein synthesis. To explain the role of transcription factors and hormones in switching genes on and off. Play the transcription initiation complex game Regulation
More informationBioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
More informationBIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS
BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics
More informationSGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD
White Paper SGI High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems Haruna Cofer*, PhD January, 2012 Abstract The SGI High Throughput Computing (HTC) Wrapper
More informationBioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
More informationINTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS
More informationIntro to Bioinformatics
Intro to Bioinformatics Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Sarah A Pendergrass, PhD Research Associate
More informationDatabase searching with DNA and protein sequences: An introduction Clare Sansom Date received (in revised form): 12th November 1999
Dr Clare Sansom works part time at Birkbeck College, London, and part time as a freelance computer consultant and science writer At Birkbeck she coordinates an innovative graduate-level Advanced Certificate
More informationSeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications
Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each
More informationThe Organic Chemistry of Amino Acids, Peptides, and Proteins
Essential rganic Chemistry Chapter 16 The rganic Chemistry of Amino Acids, Peptides, and Proteins Amino Acids a-amino carboxylic acids. The building blocks from which proteins are made. H 2 N C 2 H Note:
More informationAlgorithms in Computational Biology (236522) spring 2007 Lecture #1
Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office
More informationTeaching Bioinformatics to Undergraduates
Teaching Bioinformatics to Undergraduates http://www.med.nyu.edu/rcr/asm Stuart M. Brown Research Computing, NYU School of Medicine I. What is Bioinformatics? II. Challenges of teaching bioinformatics
More informationMutations and Genetic Variability. 1. What is occurring in the diagram below?
Mutations and Genetic Variability 1. What is occurring in the diagram below? A. Sister chromatids are separating. B. Alleles are independently assorting. C. Genes are replicating. D. Segments of DNA are
More information582606 Introduction to bioinformatics
582606 Introduction to bioinformatics Autumn 2007 Esa Pitkänen Master's Degree Programme in Bioinformatics (MBI) Department of Computer Science, University of Helsinki http://www.cs.helsinki.fi/mbi/courses/07-08/itb/
More informationPaper: 6 Chemistry 2.130 University I Chemistry: Models Page: 2 of 7. 4. Which of the following weak acids would make the best buffer at ph = 5.0?
Paper: 6 Chemistry 2.130 University I Chemistry: Models Page: 2 of 7 4. Which of the following weak acids would make the best buffer at ph = 5.0? A) Acetic acid (Ka = 1.74 x 10-5 ) B) H 2 PO - 4 (Ka =
More informationPart A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)
ChemActivity 46 Amino Acids, Polypeptides and Proteins 1 ChemActivity 46 Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?) Model 1: The 20 Amino Acids at Biological p See
More information13.2 Ribosomes & Protein Synthesis
13.2 Ribosomes & Protein Synthesis Introduction: *A specific sequence of bases in DNA carries the directions for forming a polypeptide, a chain of amino acids (there are 20 different types of amino acid).
More informationMs. Campbell Protein Synthesis Practice Questions Regents L.E.
Name Student # Ms. Campbell Protein Synthesis Practice Questions Regents L.E. 1. A sequence of three nitrogenous bases in a messenger-rna molecule is known as a 1) codon 2) gene 3) polypeptide 4) nucleotide
More informationUGENE Quick Start Guide
Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.
More informationMolecular Facts and Figures
Nucleic Acids Molecular Facts and Figures DNA/RNA bases: DNA and RNA are composed of four bases each. In DNA the four are Adenine (A), Thymidine (T), Cytosine (C), and Guanine (G). In RNA the four are
More informationVad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Dirk.Repsilber@oru.se 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför
More informationClusterControl: A Web Interface for Distributing and Monitoring Bioinformatics Applications on a Linux Cluster
Bioinformatics Advance Access published January 29, 2004 ClusterControl: A Web Interface for Distributing and Monitoring Bioinformatics Applications on a Linux Cluster Gernot Stocker, Dietmar Rieder, and
More informationFrom DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains
Proteins From DNA to Protein Chapter 13 All proteins consist of polypeptide chains A linear sequence of amino acids Each chain corresponds to the nucleotide base sequence of a gene The Path From Genes
More informationCore Bioinformatics. Titulació Tipus Curs Semestre. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Codi: 42397 Crèdits: 12 Titulació Tipus Curs Semestre 4313473 Bioinformàtica/Bioinformatics OB 0 1 Professor de contacte Nom: Sònia Casillas Viladerrams Correu electrònic:
More informationDrug-resistant Tuberculosis: A Genetic Analysis Using Online Bioinformatics Tools
Inquiry & Investigation Drug-resistant Tuberculosis: A Genetic Analysis Using Online Bioinformatics Tools Jessica M. Taylor, Rebecca M. Davidson, Michael Strong Abstrac t Tuberculosis (TB) continues to
More informationREGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])
820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor
More informationStructure and properties of proteins. Vladimíra Kvasnicová
Structure and properties of proteins Vladimíra Kvasnicová Chemical nature of proteins biopolymers of amino acids macromolecules (M r > 10 000) Classification of proteins 1) by localization in an organism
More informationGene mutation and molecular medicine Chapter 15
Gene mutation and molecular medicine Chapter 15 Lecture Objectives What Are Mutations? How Are DNA Molecules and Mutations Analyzed? How Do Defective Proteins Lead to Diseases? What DNA Changes Lead to
More informationWhen you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
More informationBIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
More informationName Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.
13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both
More informationCommittee on WIPO Standards (CWS)
E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION
More informationThe EcoCyc Curation Process
The EcoCyc Curation Process Ingrid M. Keseler SRI International 1 HOW OFTEN IS THE GOLDEN GATE BRIDGE PAINTED? Many misconceptions exist about how often the Bridge is painted. Some say once every seven
More informationThis article reprinted from: Kosinski, R. J. 2007. An introduction to bioinformatics. Pages 145-182, in
This article reprinted from: Kosinski, R. J. 2007. An introduction to bioinformatics. Pages 145-182, in Tested Studies for Laboratory Teaching, Volume 28 (M.A. O'Donnell, Editor). Proceedings of the 28th
More informationDelivering the power of the world s most successful genomics platform
Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE
More informationGenomes and SNPs in Malaria and Sickle Cell Anemia
Genomes and SNPs in Malaria and Sickle Cell Anemia Introduction to Genome Browsing with Ensembl Ensembl The vast amount of information in biological databases today demands a way of organising and accessing
More informationThe human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28.
Tutorial Module 5 BioMart You will learn about BioMart, a joint project developed and maintained at EBI and OiCR www.biomart.org How to use BioMart to quickly obtain lists of gene information from Ensembl
More informationLinear Sequence Analysis. 3-D Structure Analysis
Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical properties Molecular weight (MW), isoelectric point (pi), amino acid content, hydropathy (hydrophilic
More informationStructure and Function of DNA
Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four
More informationDistributed Bioinformatics Computing System for DNA Sequence Analysis
Global Journal of Computer Science and Technology: A Hardware & Computation Volume 14 Issue 1 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.
More informationGrow Taller 4 Idiots 2
1 Grow Taller 4 Idiots 2 Warning and Disclaimer Every effort has been made to make this digital book as complete and as accurate as possible, but no warranty or fitness is implied. The information provided
More informationProvincial Exam Questions. 9. Give one role of each of the following nucleic acids in the production of an enzyme.
Provincial Exam Questions Unit: Cell Biology: Protein Synthesis (B7 & B8) 2010 Jan 3. Describe the process of translation. (4 marks) 2009 Sample 8. What is the role of ribosomes in protein synthesis? A.
More informationGenome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome
Module 2 Genome Viewing Using Genome Browsers to View Annotation of the Human Genome Bert Overduin, Ph.D. PANDA Coordination & Outreach EMBL - European Bioinformatics Institute Wellcome Trust Genome Campus
More informationLecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM lecrom@biologie.ens.fr Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
More informationCore Bioinformatics. Degree Type Year Semester
Core Bioinformatics 2015/2016 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat Teachers Use of
More informationProtein. Protein. Why is protein important?
Protein Protein Though protein is often perceived as an area of concern for vegans, it is actually very easily accessible on a vegan diet, particularly if a variety of plant foods are consumed regularly.
More informationNon-standard amino acids. mgr Adrian Jasiński Theoretical Molecular Biophysics/Bioinformatics Group
Non-standard amino acids mgr Adrian Jasiński Theoretical Molecular Biophysics/Bioinformatics Group Outline Introduction Non-standard amino acids Variations to the standard genetic code Modifications to
More informationNCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013
NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include
More informationSUBMITTING DNA SEQUENCES TO THE DATABASES
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, Second Edition Andreas D. Baxevanis, B.F. Francis Ouellette Copyright 2001 John Wiley & Sons, Inc. ISBNs: 0-471-38390-2 (Hardback);
More informationTHE CHEMICAL SYNTHESIS OF PEPTIDES
TE EMIAL SYTESIS F PEPTIDES Peptides are the long molecular chains that make up proteins. Synthetic peptides are used either as drugs (as they are biologically active) or in the diagnosis of disease. Peptides
More informationBioinformatics Tools Tutorial Project Gene ID: KRas
Bioinformatics Tools Tutorial Project Gene ID: KRas Bednarski 2011 Original project funded by HHMI Bioinformatics Projects Introduction and Tutorial Purpose of this tutorial Illustrate the link between
More informationEuropean Medicines Agency
European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein
More informationNIH s Genomic Data Sharing Policy
NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific
More informationUNIVERSITETET I OSLO Det matematisk-naturvitenskapelige fakultet
1 UNIVERSITETET I OSLO Det matematisk-naturvitenskapelige fakultet Exam in: MBV4010 Arbeidsmetoder i molekylærbiologi og biokjemi I MBV4010 Methods in molecular biology and biochemistry I Day of exam:.
More informationBiological Databases and Protein Sequence Analysis
Biological Databases and Protein Sequence Analysis Introduction M. Madan Babu, Center for Biotechnology, Anna University, Chennai 25, India Bioinformatics is the application of Information technology to
More informationPatients with COPD were recruited according to the inclusion criteria; aged 40-75, current or
Online Data Supplement Materials and Methods The ECLIPSE Cohort Patients with COPD were recruited according to the inclusion criteria; aged 40-75, current or ex-smokers with >10 pack-year history; a post
More informationBuilt from 20 kinds of amino acids
Built from 20 kinds of amino acids Each Protein has a three dimensional structure. Majority of proteins are compact. Highly convoluted molecules. Proteins are folded polypeptides. There are four levels
More informationCoding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein
Assignment 3 Michele Owens Vocabulary Gene: A sequence of DNA that instructs a cell to produce a particular protein Promoter a control sequence near the start of a gene Coding sequence the sequence of
More informationSICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE
AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,
More informationLecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.
Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION Professor Bharat Patel Office: Science 2, 2.36 Email: b.patel@griffith.edu.au What is Gene Expression & Gene Regulation? 1. Gene Expression
More information