Module 10: Bioinformatics
|
|
- Katherine Lawson
- 8 years ago
- Views:
Transcription
1 Module 10: Bioinformatics 1.) Goal: To understand the general approaches for basic in silico (computer) analysis of DNA- and protein sequences. We are going to discuss sequence formatting required prior to analysis, DNA restriction mapping, DNA translation into protein coding regions (= finding open reading frames ORFs), protein sequence analysis, sequence comparisons and database searching. 2.) Introduction DNA sequencing has of late become very easy, fast and cheap. The elucidation of the complete human genome sequence (a mere 3 x 10 9 basepairs) has only been possible because of these technical advances. Protein sequencing on the other hand is also possible, but technically much harder, and slower. Because a DNA sequence predicts the encoded protein sequence thru the rules of the genetic code, we can sequence a piece of DNA and deduce or predict its encoded protein sequence, instead of painstakingly purifying and then sequencing the corresponding protein. Given the large size of some of the genomes that have been sequenced to date, it becomes clear that powerful in silico approaches must go hand in hand with the wet lab procedures. 3.) Background information Sequence input files can be generated in WORD, or can be copied from other source documents, websites etc formatting: for some applications, files first have to be converted into a specific format ( FASTA being a popular choice); removal of non- standard characters from the files is necessary (example: DNA sequence files can only contain the characters G, A, T and C) Many free sequence analysis programs are available on line; all you have to do is simply copy and paste your sequence(s) into a browser window and run the analysis, but remember in some cases the correct file format must be used. Tip: when working with sequence files in WORD, use the Courier font, as it is the only font in which each letter/character uses the same amount of space, resulting in well- aligned sequences 4.) Steps in an in silico exercise. Note: different exercises may use a different set of steps Open sequence file supplied in WORD Check for absence of non- standard characters, convert file into appropriate format Copy sequence Open browser window with particular application Paste sequence into window Choose specific analysis parameters Run analysis Results and files can be copied out of browser window and pasted/saved back into the original Word file 5.) Materials supplied: general plasmid map (Appendix A), related DHFR protein sequence for sequence alignment (Appendix B), Complete Plasmid Sequence with the Bacillus thermophilis DHFR (Appendix C), 6.) Boyer book chapter: #2 1
2 7.) Basic examples of things that can be done with in silico analyses With DNA sequence sequence generation: type or copy- paste in Word format sequence formatting, sequence length : restriction digestion and mapping (linear vs circular maps): translation of open reading frames (ORFs): With Protein sequence determine AA sequence length, AA composition, molecular weight, pi, molar extinction coefficient : DNA of Protein Sequence comparison these programs can be used to compare complete protein sequences to establish evolutionary relationships or find single point mutations Pairwise DNA alignment: Pairwise Protein alignment: Multiple sequence alignment: Sequence Databases DNA and Pubmed: Also: (well curated protein DB, can do Blasts and other alignments) 8.) Protocol: Do the following: 1. Convert the complete plasmid sequence in Appendix C to GCG and EMBL format, indicate length of plasmid DNA in bp. Include properly labeled copies of these in your report 2. Perform restriction mapping for the complete plasmid sequence supplied in Appendix C, using the restriction enzymes NdeI and BamHI. Show result table in your report 3. In your report show translation of all 6 reading frames and indicate the frame with the DHFR ORF (open reading frame). The Bacillus thermophilus ORF starts with MISHI. Show the amino acid sequence of the complete B. thermophilus ORF in your report. 4. Protein analysis: Use the B. thermophilus DHFR protein sequence. In your report only include molecular weight, number of amino acids, pi, and molar extinction coefficient 5. Sequence comparison: use the DHFR - protein sequence from above and align one at a time to the three DHFR protein sequences supplied. Show the sequence alignment and % identity for all three (B. thermophilus with human; B. thermophilus with Bacillus amyloliquefaciens; B. thermophilus with Geobacillus thermodenitrificans) alignments in your report. 6. Sequence comparison: Align all four DHFR protein sequences. Show the sequence alignment in your report. Hand in via e- mail, as a word document, one per group 2
3 9.) Materials Appendix A: Plasmid Map Appendix B: DHFR sequences to be used for sequence alignment: This is the sequence for human DHFR: 1 mvgslnciva vsqnmgigkn gdlpwpplrn efryfqrmtt tssvegkqnl vimgkktwfs 61 ipeknrplkg rinlvlsrel keppqgahfl srslddalkl teqpelankv dmvwivggss 121 vykeamnhpg hlklfvtrim qdfesdtffp eidlekykll peypgvlsdv qeekgikykf 181 evyeknd This is the DHFR sequence from Bacillus amyloliquefaciens: 1 misfifamde nrligkdndl pwhlpddlay fkkvttghti vmgrktfesi grplpnrrni 61 vvtsrdeslf pgcitadsae evlklippde ecfviggaql ysalfpyadr lymtkihhvf 121 egdrffpefn eaeweltsrk qgvkdeknpy dyeylvyekk n This is the DHFR sequence from Geobacillus thermodenitrificans: 1 mnmtilkssv mtlirrlkrq wrckgektmi shivamdenr vigkdnqlpw hlpadlayfk 61 rvtmghaivm grktfeaigr plpgrdnvvv trnpqfrpeg clvlhsleev kqwiaargee 121 vfiiggaelf katmpiadrl yvtnifasfp gdtfyppise kewkvvsytp gvkdeknpye 181 hafliyerk 3
4 Appendix C: Complete Plasmid Sequence with the Bacillus thermophilis DHFR DHFR ORF is situated between pos 5205 (NdeI)and pos 5699 (BamHI) tggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtgg tggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgct cctttcgctttcttcccttcctttctcgccacgttcgccggctttccccg tcaagctctaaatcgggggctccctttagggttccgatttagtgctttac ggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtggg ccatcgccctgatagacggtttttcgccctttgacgttggagtccacgtt ctttaatagtggactcttgttccaaactggaacaacactcaaccctatct cggtctattcttttgatttataagggattttgccgatttcggcctattgg ttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaat attaacgtttacaatttcaggtggcacttttcggggaaatgtgcgcggaa cccctatttgtttatttttctaaatacattcaaatatgtatccgctcatg agacaataaccctgataaatgcttcaataatattgaaaaaggaagagtat gagtattcaacatttccgtgtcgcccttattcccttttttgcggcatttt gccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgct gaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacag cggtaagatccttgagagttttcgccccgaagaacgttttccaatgatga gcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgcc gggcaagagcaactcggtcgccgcatacactattctcagaatgacttggt tgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaa gagaattatgcagtgctgccataaccatgagtgataacactgcggccaac ttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgca caacatgggggatcatgtaactcgccttgatcgttgggaaccggagctga atgaagccataccaaacgacgagcgtgacaccacgatgcctgcagcaatg gcaacaacgttgcgcaaactattaactggcgaactacttactctagcttc ccggcaacaattaatagactggatggaggcggataaagttgcaggaccac ttctgcgctcggcccttccggctggctggtttattgctgataaatctgga gccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatgg taagccctcccgtatcgtagttatctacacgacggggagtcaggcaacta tggatgaacgaaatagacagatcgctgagataggtgcctcactgattaag cattggtaactgtcagaccaagtttactcatatatactttagattgattt aaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgata atctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtca gaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcg cgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggttt gtttgccggatcaagagctaccaactctttttccgaaggtaactggcttc agcagagcgcagataccaaatactgtccttctagtgtagccgtagttagg ccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaa tcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccggg ttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaac ggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaac tgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaaggg agaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcg cacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcg ggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggg gggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcct ggccttttgctggccttttgctcacatgttctttcctgcgttatcccctg attctgtggataaccgtattaccgcctttgagtgagctgataccgctcgc cgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaaga gcgcctgatgcggtattttctccttacgcatctgtgcggtatttcacacc gcatatatggtgcactctcagtacaatctgctctgatgccgcatagttaa gccagtatacactccgctatcgctacgtgactgggtcatggctgcgcccc gacacccgccaacacccgctgacgcgccctgacgggcttgtctgctcccg gcatccgcttacagacaagctgtgaccgtctccgggagctgcatgtgtca gaggttttcaccgtcatcaccgaaacgcgcgaggcagctgcggtaaagct catcagcgtggtcgtgaagcgattcacagatgtctgcctgttcatccgcg tccagctcgttgagtttctccagaagcgttaatgtctggcttctgataaa gcgggccatgttaagggcggttttttcctgtttggtcactgatgcctccg tgtaagggggatttctgttcatgggggtaatgataccgatgaaacgagag aggatgctcacgatacgggttactgatgatgaacatgcccggttactgga acgttgtgagggtaaacaactggcggtatggatgcggcgggaccagagaa aaatcactcagggtcaatgccagcgcttcgttaatacagatgtaggtgtt ccacagggtagccagcagcatcctgcgatgcagatccggaacataatggt gcagggcgctgacttccgcgtttccagactttacgaaacacggaaaccga agaccattcatgttgttgctcaggtcgcagacgttttgcagcagcagtcg cttcacgttcgctcgcgtatcggtgattcattctgctaaccagtaaggca accccgccagcctagccgggtcctcaacgacaggagcacgatcatgcgca cccgtggggccgccatgccggcgataatggcctgcttctcgccgaaacgt ttggtggcgggaccagtgacgaaggcttgagcgagggcgtgcaagattcc gaataccgcaagcgacaggccgatcatcgtcgcgctccagcgaaagcggt cctcgccgaaaatgacccagagcgctgccggcacctgtcctacgagttgc 4
5 5 atgataaagaagacagtcataagtgcggcgacgatagtcatgccccgcgc ccaccggaaggagctgactgggttgaaggctctcaagggcatcggtcgag atcccggtgcctaatgagtgagctaacttacattaattgcgttgcgctca ctgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaat cggccaacgcgcggggagaggcggtttgcgtattgggcgccagggtggtt tttcttttcaccagtgagacgggcaacagctgattgcccttcaccgcctg gccctgagagagttgcagcaagcggtccacgctggtttgccccagcaggc gaaaatcctgtttgatggtggttaacggcgggatataacatgagctgtct tcggtatcgtcgtatcccactaccgagatatccgcaccaacgcgcagccc ggactcggtaatggcgcgcattgcgcccagcgccatctgatcgttggcaa ccagcatcgcagtgggaacgatgccctcattcagcatttgcatggtttgt tgaaaaccggacatggcactccagtcgccttcccgttccgctatcggctg aatttgattgcgagtgagatatttatgccagccagccagacgcagacgcg ccgagacagaacttaatgggcccgctaacagcgcgatttgctggtgaccc aatgcgaccagatgctccacgcccagtcgcgtaccgtcttcatgggagaa aataatactgttgatgggtgtctggtcagagacatcaagaaataacgccg gaacattagtgcaggcagcttccacagcaatggcatcctggtcatccagc ggatagttaatgatcagcccactgacgcgttgcgcgagaagattgtgcac cgccgctttacaggcttcgacgccgcttcgttctaccatcgacaccacca cgctggcacccagttgatcggcgcgagatttaatcgccgcgacaatttgc gacggcgcgtgcagggccagactggaggtggcaacgccaatcagcaacga ctgtttgcccgccagttgttgtgccacgcggttgggaatgtaattcagct ccgccatcgccgcttccactttttcccgcgttttcgcagaaacgtggctg gcctggttcaccacgcgggaaacggtctgataagagacaccggcatactc tgcgacatcgtataacgttactggtttcacattcaccaccctgaattgac tctcttccgggcgctatcatgccataccgcgaaaggttttgcgccattcg atggtgtccgggatctcgacgctctcccttatgcgactcctgcattagga agcagcccagtagtaggttgaggccgttgagcaccgccgccgcaaggaat ggtgcatgcaaggagatggcgcccaacagtcccccggccacggggcctgc caccatacccacgccgaaacaagcgctcatgagcccgaagtggcgagccc gatcttccccatcggtgatgtcggcgatataggcgccagcaaccgcacct gtggcgccggtgatgccggccacgatgcgtccggcgtagaggatcgagat ctcgatcccgcgaaattaatacgactcactataggggaattgtgagcgga taacaattcccctctagaaataattttgtttaactttaagaaggagatat acatatgatttcgcacattgtggcaatggatgaaaaccgggtgatcggca aagacaaccgcttgccttggcatttgccggccgatttggcgtattttaaa cgggtgacaatgggccatgccatcgtgatggggcgcaagacgtttgaagc gatcggccggccgcttcccggccgcgataacgtcgttgtcacgcgcaacc gctcgtttcgtccggaaggctgccttgtgcttcattcgctcgaggaagtc aagcaatggatcgcatcgcgcgctgatgaagtgtttatcatcggcggggc cgaactgtttcgggcgacgatgccgattgtcgaccggctgtatgtgacaa aaatttttgcttccttccccggcgatacgttttatccgcccatttctgac gatgaatgggaaatcgtttcctatacgccaggagggaaagatgaaaagaa tccgtatgaacacgcctttatcatttatgagcggaaaaaggcgaaataat GGATCCgaattcgagctccgtcgacaagcttgcggccgcactcgagcacc accaccaccaccactgagatccggctgctaacaaagcccgaaaggaagct gagttggctgctgccaccgctgagcaataactagcataaccccttggggc ctctaaacgggtcttgaggggttttttgctgaaaggaggaactatatccg gat
Bioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
More informationRETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
More informationGenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
More informationA Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
More informationMAKING AN EVOLUTIONARY TREE
Student manual MAKING AN EVOLUTIONARY TREE THEORY The relationship between different species can be derived from different information sources. The connection between species may turn out by similarities
More informationBioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
More informationBIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:
More informationBio-Informatics Lectures. A Short Introduction
Bio-Informatics Lectures A Short Introduction The History of Bioinformatics Sanger Sequencing PCR in presence of fluorescent, chain-terminating dideoxynucleotides Massively Parallel Sequencing Massively
More informationBiological Sequence Data Formats
Biological Sequence Data Formats Here we present three standard formats in which biological sequence data (DNA, RNA and protein) can be stored and presented. Raw Sequence: Data without description. FASTA
More informationIntroduction to Bioinformatics 3. DNA editing and contig assembly
Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 matthewb@ba.ars.usda.gov
More informationID of alternative translational initiation events. Description of gene function Reference of NCBI database access and relative literatures
Data resource: In this database, 650 alternatively translated variants assigned to a total of 300 genes are contained. These database records of alternative translational initiation have been collected
More informationUsability in bioinformatics mobile applications
Usability in bioinformatics mobile applications what we are working on Noura Chelbah, Sergio Díaz, Óscar Torreño, and myself Juan Falgueras App name Performs Advantajes Dissatvantajes Link The problem
More informationClone Manager. Getting Started
Clone Manager for Windows Professional Edition Volume 2 Alignment, Primer Operations Version 9.5 Getting Started Copyright 1994-2015 Scientific & Educational Software. All rights reserved. The software
More informationCore Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat
More informationTutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment
Tutorial for Windows and Macintosh Preparing Your Data for NGS Alignment 2015 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) 1.734.769.7249
More informationIntroduction to Bioinformatics AS 250.265 Laboratory Assignment 6
Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 In the last lab, you learned how to perform basic multiple sequence alignments. While useful in themselves for determining conserved residues
More informationPairwise Sequence Alignment
Pairwise Sequence Alignment carolin.kosiol@vetmeduni.ac.at SS 2013 Outline Pairwise sequence alignment global - Needleman Wunsch Gotoh algorithm local - Smith Waterman algorithm BLAST - heuristics What
More informationGene Models & Bed format: What they represent.
GeneModels&Bedformat:Whattheyrepresent. Gene models are hypotheses about the structure of transcripts produced by a gene. Like all models, they may be correct, partly correct, or entirely wrong. Typically,
More informationActivity 7.21 Transcription factors
Purpose To consolidate understanding of protein synthesis. To explain the role of transcription factors and hormones in switching genes on and off. Play the transcription initiation complex game Regulation
More informationGenome and DNA Sequence Databases. BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009
Genome and DNA Sequence Databases BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009 Admin Reading: Chapters 1 & 2 Notes available in PDF format on-line (see class calendar page): http://www.soe.ucsc.edu/classes/bme110/spring09/bme110-calendar.html
More informationSearching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
More informationArabidopsis. A Practical Approach. Edited by ZOE A. WILSON Plant Science Division, School of Biological Sciences, University of Nottingham
Arabidopsis A Practical Approach Edited by ZOE A. WILSON Plant Science Division, School of Biological Sciences, University of Nottingham OXPORD UNIVERSITY PRESS List of Contributors Abbreviations xv xvu
More informationSequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
More informationBUDAPEST: Bioinformatics Utility for Data Analysis of Proteomics using ESTs
BUDAPEST: Bioinformatics Utility for Data Analysis of Proteomics using ESTs Richard J. Edwards 2008. Contents 1. Introduction... 2 1.1. Version...2 1.2. Using this Manual...2 1.3. Why use BUDAPEST?...2
More informationJust the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
More informationAmazing DNA facts. Hands-on DNA: A Question of Taste Amazing facts and quiz questions
Amazing DNA facts These facts can form the basis of a quiz (for example, how many base pairs are there in the human genome?). Students should be familiar with most of this material, so the quiz could be
More informationBLAST. Anders Gorm Pedersen & Rasmus Wernersson
BLAST Anders Gorm Pedersen & Rasmus Wernersson Database searching Using pairwise alignments to search databases for similar sequences Query sequence Database Database searching Most common use of pairwise
More informationSimilarity Searches on Sequence Databases: BLAST, FASTA. Lorenza Bordoli Swiss Institute of Bioinformatics EMBnet Course, Basel, October 2003
Similarity Searches on Sequence Databases: BLAST, FASTA Lorenza Bordoli Swiss Institute of Bioinformatics EMBnet Course, Basel, October 2003 Outline Importance of Similarity Heuristic Sequence Alignment:
More informationCalculating Nucleic Acid or Protein Concentration Using the GloMax Multi+ Microplate Instrument
Calculating Nucleic Acid or Protein Concentration Using the GloMax Multi+ Microplate Instrument Technical Note INTRODUCTION Direct measurements of nucleic acid samples at OD 260 or protein samples at OD
More information1. INTRODUCTION TABLE OF CONTENTS INTRODUCTION 1-3. How This Guide Is Organized 1-3 Additional Documentation 1-4 Conventions Used in This Guide 1-4
1. INTRODUCTION TABLE OF CONTENTS The Introduction to the HUSAR/GCG User s Guide describes basic information you need to get started with the HUSAR/GCG Sequence Analysis Software Package. INTRODUCTION
More informationModule 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
More informationDNA Technology Mapping a plasmid digesting How do restriction enzymes work?
DNA Technology Mapping a plasmid A first step in working with DNA is mapping the DNA molecule. One way to do this is to use restriction enzymes (restriction endonucleases) that are naturally found in bacteria
More informationBIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
More informationMitochondrial DNA Analysis
Mitochondrial DNA Analysis Lineage Markers Lineage markers are passed down from generation to generation without changing Except for rare mutation events They can help determine the lineage (family tree)
More informationA Tutorial in Genetic Sequence Classification Tools and Techniques
A Tutorial in Genetic Sequence Classification Tools and Techniques Jake Drew Data Mining CSE 8331 Southern Methodist University jakemdrew@gmail.com www.jakemdrew.com Sequence Characters IUPAC nucleotide
More information4.2.1. What is a contig? 4.2.2. What are the contig assembly programs?
Table of Contents 4.1. DNA Sequencing 4.1.1. Trace Viewer in GCG SeqLab Table. Box. Select the editor mode in the SeqLab main window. Import sequencer trace files from the File menu. Select the trace files
More informationEuropean Medicines Agency
European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein
More informationLinear Sequence Analysis. 3-D Structure Analysis
Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical properties Molecular weight (MW), isoelectric point (pi), amino acid content, hydropathy (hydrophilic
More informationBiotechnology: DNA Technology & Genomics
Chapter 20. Biotechnology: DNA Technology & Genomics 2003-2004 The BIG Questions How can we use our knowledge of DNA to: diagnose disease or defect? cure disease or defect? change/improve organisms? What
More informationLecture Outline. Introduction to Databases. Introduction. Data Formats Sample databases How to text search databases. Shifra Ben-Dor Irit Orr
Introduction to Databases Shifra Ben-Dor Irit Orr Lecture Outline Introduction Data and Database types Database components Data Formats Sample databases How to text search databases What units of information
More informationWhen you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
More informationDNA Scissors: Introduction to Restriction Enzymes
DNA Scissors: Introduction to Restriction Enzymes Objectives At the end of this activity, students should be able to 1. Describe a typical restriction site as a 4- or 6-base- pair palindrome; 2. Describe
More informationPROC. CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE 2006 1. E-mail: msm_eng@k-space.org
BIOINFTool: Bioinformatics and sequence data analysis in molecular biology using Matlab Mai S. Mabrouk 1, Marwa Hamdy 2, Marwa Mamdouh 2, Marwa Aboelfotoh 2,Yasser M. Kadah 2 1 Biomedical Engineering Department,
More informationThe sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:
Module 3F Protein Synthesis So far in this unit, we have examined: How genes are transmitted from one generation to the next Where genes are located What genes are made of How genes are replicated How
More informationLESSON 4. Using Bioinformatics to Analyze Protein Sequences. Introduction. Learning Objectives. Key Concepts
4 Using Bioinformatics to Analyze Protein Sequences Introduction In this lesson, students perform a paper exercise designed to reinforce the student understanding of the complementary nature of DNA and
More informationCore Bioinformatics. Titulació Tipus Curs Semestre. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Codi: 42397 Crèdits: 12 Titulació Tipus Curs Semestre 4313473 Bioinformàtica/Bioinformatics OB 0 1 Professor de contacte Nom: Sònia Casillas Viladerrams Correu electrònic:
More informationINTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS
More informationCommittee on WIPO Standards (CWS)
E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION
More informationReplication Study Guide
Replication Study Guide This study guide is a written version of the material you have seen presented in the replication unit. Self-reproduction is a function of life that human-engineered systems have
More informationBecker Muscular Dystrophy
Muscular Dystrophy A Case Study of Positional Cloning Described by Benjamin Duchenne (1868) X-linked recessive disease causing severe muscular degeneration. 100 % penetrance X d Y affected male Frequency
More informationOrganelle Speed Dating Game Instructions and answers for teachers
Organelle Speed Dating Game Instructions and answers for teachers These instructions should accompany the OCR resources GCSE (9 1) Combined Science 21 st Century Science B Organelle Speed Dating Game learner
More informationGreen Fluorescent Protein (GFP): Genetic Transformation, Synthesis and Purification of the Recombinant Protein
Green Fluorescent Protein (GFP): Genetic Transformation, Synthesis and Purification of the Recombinant Protein INTRODUCTION Green Fluorescent Protein (GFP) is a novel protein produced by the bioluminescent
More informationCD-HIT User s Guide. Last updated: April 5, 2010. http://cd-hit.org http://bioinformatics.org/cd-hit/
CD-HIT User s Guide Last updated: April 5, 2010 http://cd-hit.org http://bioinformatics.org/cd-hit/ Program developed by Weizhong Li s lab at UCSD http://weizhong-lab.ucsd.edu liwz@sdsc.edu 1. Introduction
More informationThe Galaxy workflow. George Magklaras PhD RHCE
The Galaxy workflow George Magklaras PhD RHCE Biotechnology Center of Oslo & The Norwegian Center of Molecular Medicine University of Oslo, Norway http://www.biotek.uio.no http://www.ncmm.uio.no http://www.no.embnet.org
More informationChoices, choices, choices... Which sequence database? Which modifications? What mass tolerance?
Optimization 1 Choices, choices, choices... Which sequence database? Which modifications? What mass tolerance? Where to begin? 2 Sequence Databases Swiss-prot MSDB, NCBI nr dbest Species specific ORFS
More informationExercises for the UCSC Genome Browser Introduction
Exercises for the UCSC Genome Browser Introduction 1) Find out if the mouse Brca1 gene has non-synonymous SNPs, color them blue, and get external data about a codon-changing SNP. Skills: basic text search;
More informationGenetic Technology. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.
Name: Class: Date: Genetic Technology Multiple Choice Identify the choice that best completes the statement or answers the question. 1. An application of using DNA technology to help environmental scientists
More informationVector NTI Advance 11 Quick Start Guide
Vector NTI Advance 11 Quick Start Guide Catalog no. 12605050, 12605099, 12605103 Version 11.0 December 15, 2008 12605022 Published by: Invitrogen Corporation 5791 Van Allen Way Carlsbad, CA 92008 U.S.A.
More informationBIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS
BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics
More informationMolecular and Cell Biology Laboratory (BIOL-UA 223) Instructor: Ignatius Tan Phone: 212-998-8295 Office: 764 Brown Email: ignatius.tan@nyu.
Molecular and Cell Biology Laboratory (BIOL-UA 223) Instructor: Ignatius Tan Phone: 212-998-8295 Office: 764 Brown Email: ignatius.tan@nyu.edu Course Hours: Section 1: Mon: 12:30-3:15 Section 2: Wed: 12:30-3:15
More informationLecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology
Lecture 13: DNA Technology DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology DNA Sequencing determine order of nucleotides in a strand of DNA > bases = A,
More informationGuide for Bioinformatics Project Module 3
Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first
More informationDNA Sequence formats
DNA Sequence formats [Plain] [EMBL] [FASTA] [GCG] [GenBank] [IG] [IUPAC] [How Genomatix represents sequence annotation] Plain sequence format A sequence in plain format may contain only IUPAC characters
More informationStructure and Function of DNA
Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four
More informationLab 2/Phylogenetics/September 16, 2002 1 PHYLOGENETICS
Lab 2/Phylogenetics/September 16, 2002 1 Read: Tudge Chapter 2 PHYLOGENETICS Objective of the Lab: To understand how DNA and protein sequence information can be used to make comparisons and assess evolutionary
More informationUsing MATLAB: Bioinformatics Toolbox for Life Sciences
Using MATLAB: Bioinformatics Toolbox for Life Sciences MR. SARAWUT WONGPHAYAK BIOINFORMATICS PROGRAM, SCHOOL OF BIORESOURCES AND TECHNOLOGY, AND SCHOOL OF INFORMATION TECHNOLOGY, KING MONGKUT S UNIVERSITY
More informationIntroduction to Genome Annotation
Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT
More informationSTUDIES ON SEED STORAGE PROTEINS OF SOME ECONOMICALLY MINOR PLANTS
STUDIES ON SEED STORAGE PROTEINS OF SOME ECONOMICALLY MINOR PLANTS THESIS SUBMITTED FOR THE DEGREB OF DOCTOR OF PHILOSOPHY (SCIENCE) OF THE UNIVERSITY OF CALCUTTA 1996 NRISINHA DE, M.Sc DEPARTMENT OF BIOCHEMISTRY
More informationBiology Behind the Crime Scene Week 4: Lab #4 Genetics Exercise (Meiosis) and RFLP Analysis of DNA
Page 1 of 5 Biology Behind the Crime Scene Week 4: Lab #4 Genetics Exercise (Meiosis) and RFLP Analysis of DNA Genetics Exercise: Understanding how meiosis affects genetic inheritance and DNA patterns
More informationREGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])
820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor
More informationFrom DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains
Proteins From DNA to Protein Chapter 13 All proteins consist of polypeptide chains A linear sequence of amino acids Each chain corresponds to the nucleotide base sequence of a gene The Path From Genes
More information1 Mutation and Genetic Change
CHAPTER 14 1 Mutation and Genetic Change SECTION Genes in Action KEY IDEAS As you read this section, keep these questions in mind: What is the origin of genetic differences among organisms? What kinds
More information2006 7.012 Problem Set 3 KEY
2006 7.012 Problem Set 3 KEY Due before 5 PM on FRIDAY, October 13, 2006. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. Which reaction is catalyzed by each
More informationDNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!
DNA Replication & Protein Synthesis This isn t a baaaaaaaddd chapter!!! The Discovery of DNA s Structure Watson and Crick s discovery of DNA s structure was based on almost fifty years of research by other
More informationThe world of non-coding RNA. Espen Enerly
The world of non-coding RNA Espen Enerly ncrna in general Different groups Small RNAs Outline mirnas and sirnas Speculations Common for all ncrna Per def.: never translated Not spurious transcripts Always/often
More informationAppendix 2 Molecular Biology Core Curriculum. Websites and Other Resources
Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold
More informationRecombinant DNA Unit Exam
Recombinant DNA Unit Exam Question 1 Restriction enzymes are extensively used in molecular biology. Below are the recognition sites of two of these enzymes, BamHI and BclI. a) BamHI, cleaves after the
More informationRNA Structure and folding
RNA Structure and folding Overview: The main functional biomolecules in cells are polymers DNA, RNA and proteins For RNA and Proteins, the specific sequence of the polymer dictates its final structure
More informationHCS604.03 Exercise 1 Dr. Jones Spring 2005. Recombinant DNA (Molecular Cloning) exercise:
HCS604.03 Exercise 1 Dr. Jones Spring 2005 Recombinant DNA (Molecular Cloning) exercise: The purpose of this exercise is to learn techniques used to create recombinant DNA or clone genes. You will clone
More informationBio 102 Practice Problems Recombinant DNA and Biotechnology
Bio 102 Practice Problems Recombinant DNA and Biotechnology Multiple choice: Unless otherwise directed, circle the one best answer: 1. Which of the following DNA sequences could be the recognition site
More informationAnnex 6: Nucleotide Sequence Information System BEETLE. Biological and Ecological Evaluation towards Long-Term Effects
Annex 6: Nucleotide Sequence Information System BEETLE Biological and Ecological Evaluation towards Long-Term Effects Long-term effects of genetically modified (GM) crops on health, biodiversity and the
More informationError Tolerant Searching of Uninterpreted MS/MS Data
Error Tolerant Searching of Uninterpreted MS/MS Data 1 In any search of a large LC-MS/MS dataset 2 There are always a number of spectra which get poor scores, or even no match at all. 3 Sometimes, this
More informationDescription: Molecular Biology Services and DNA Sequencing
Description: Molecular Biology s and DNA Sequencing DNA Sequencing s Single Pass Sequencing Sequence data only, for plasmids or PCR products Plasmid DNA or PCR products Plasmid DNA: 20 100 ng/μl PCR Product:
More informationGenetics Lecture Notes 7.03 2005. Lectures 1 2
Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several
More informationGENEWIZ, Inc. DNA Sequencing Service Details for USC Norris Comprehensive Cancer Center DNA Core
DNA Sequencing Services Pre-Mixed o Provide template and primer, mixed into the same tube* Pre-Defined o Provide template and primer in separate tubes* Custom o Full-service for samples with unknown concentration
More informationLab # 12: DNA and RNA
115 116 Concepts to be explored: Structure of DNA Nucleotides Amino Acids Proteins Genetic Code Mutation RNA Transcription to RNA Translation to a Protein Figure 12. 1: DNA double helix Introduction Long
More informationIntegrated Protein Services
Integrated Protein Services Custom protein expression & purification Version DC04-0012 Expression strategy The first step in the recombinant protein generation process is to design an appropriate expression
More informationAP BIOLOGY 2007 SCORING GUIDELINES
AP BIOLOGY 2007 SCORING GUIDELINES Question 4 A bacterial plasmid is 100 kb in length. The plasmid DNA was digested to completion with two restriction enzymes in three separate treatments: EcoRI, HaeIII,
More informationComparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
More informationGENE CONSTRUCTION KIT 4
GENE CONSTRUCTION KIT 4 Tutorials & User Manual from Textco BioSoftware, Inc. September 2012, First Edition Gene Construction Kit 4 Manual is Copyright Textco Bio- Software, Inc. 2003-2012. All rights
More informationInnovations in Molecular Epidemiology
Innovations in Molecular Epidemiology Molecular Epidemiology Measure current rates of active transmission Determine whether recurrent tuberculosis is attributable to exogenous reinfection Determine whether
More informationMolecular Genetics. RNA, Transcription, & Protein Synthesis
Molecular Genetics RNA, Transcription, & Protein Synthesis Section 1 RNA AND TRANSCRIPTION Objectives Describe the primary functions of RNA Identify how RNA differs from DNA Describe the structure and
More informationThe Techniques of Molecular Biology: Forensic DNA Fingerprinting
Revised Fall 2011 The Techniques of Molecular Biology: Forensic DNA Fingerprinting The techniques of molecular biology are used to manipulate the structure and function of molecules such as DNA and proteins
More information2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three
Chem 121 Chapter 22. Nucleic Acids 1. Any given nucleotide in a nucleic acid contains A) two bases and a sugar. B) one sugar, two bases and one phosphate. C) two sugars and one phosphate. D) one sugar,
More informationBioBoot Camp Genetics
BioBoot Camp Genetics BIO.B.1.2.1 Describe how the process of DNA replication results in the transmission and/or conservation of genetic information DNA Replication is the process of DNA being copied before
More informationRNA & Protein Synthesis
RNA & Protein Synthesis Genes send messages to cellular machinery RNA Plays a major role in process Process has three phases (Genetic) Transcription (Genetic) Translation Protein Synthesis RNA Synthesis
More informationA Web Based Software for Synonymous Codon Usage Indices
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 3 (2013), pp. 147-152 International Research Publications House http://www. irphouse.com /ijict.htm A Web
More informationUnipro UGENE User Manual Version 1.12.3
Unipro UGENE User Manual Version 1.12.3 April 01, 2014 Contents 1 About Unipro................................... 10 1.1 Contacts.......................................... 10 2 About UGENE..................................
More informationMs. Campbell Protein Synthesis Practice Questions Regents L.E.
Name Student # Ms. Campbell Protein Synthesis Practice Questions Regents L.E. 1. A sequence of three nitrogenous bases in a messenger-rna molecule is known as a 1) codon 2) gene 3) polypeptide 4) nucleotide
More informationBMC Bioinformatics. Open Access. Abstract
BMC Bioinformatics BioMed Central Software Recent Hits Acquired by BLAST (ReHAB): A tool to identify new hits in sequence similarity searches Joe Whitney, David J Esteban and Chris Upton* Open Access Address:
More information