Biology & Big Data. Debasis Mitra Professor, Computer Science, FIT
|
|
|
- Myles Daniel
- 10 years ago
- Views:
Transcription
1 Biology & Big Data Debasis Mitra Professor, Computer Science, FIT
2 Cloud? Debasis Mitra, Florida Tech
3 Data as Service Transparent to user Multiple locations Robustness Software as Service Software location transparent Pay-as-you-use Software running platform transparent Cost-based speed Underlying middle-layer makes everything transparent
4 Platform as Service Middle-layer + Software environment + User interface Infrastructure as Service Hardware + Internet + Virtual Machine Source: Dai et al. Biology Direct 2012, 7:43
5 Bio-informatics Cloud / Grid
6 Life begins with Cell A cell is a smallest structural unit of an organism that is capable of independent functioning All cells have some common features
7 Molecular Biology An Introduction to Bioinformatics Algorithms (Computational Molecular Biology) Neil C. Jones, Pavel A. Pevzner
8 Organizations of life Nucleus = library Chromosomes = bookshelves Genes = books Same libraries and the same sets of books for every cell in an organism Books represent all the information to carry out its various functions.
9 DNA: The Code of Life The structure and the four genomic letters code for all living organisms Adenine, Guanine, Thymine, and Cytosine which pair A-T and C-G on complimentary strands.
10 DNA, continued DNA has a double helix structure which is composed of sugar molecule phosphate group and a base (A,C,G,T) 5 ATTTAGGCC 3 3 TAAATCCGG 5
11 Genetic Information: Chromosomes
12 Genes Make Proteins genome-> genes ->protein(forms cellular structural & life functional)->pathways & physiology
13
14 Some Terminology Genome: an organism s genetic material Human genome is about 3,000,000,000 base-pair long Gene: a discrete units of hereditary information located on the chromosomes and consisting of DNA. Nucleic acid: Biological molecules(rna and DNA) that allow organisms to reproduce: A, T (U), C, G
15 Major events in the history of Molecular Biology James D. Watson and Francis H. C. Crick deduced the double helical structure of DNA with four molecules A, T, C, G James Watson and Francis Crick 1 Biologist + 1 Physics student words = Nobel Prize
16 Major events in the history of Molecular Biology Howard Temin and David Baltimore independently isolate the first restriction enzyme DNA can be cut into reproducible pieces with restriction enzymes; (gene cloning or recombinant DNA technology)
17 Major events in the history of Molecular Biology Leroy Hood: Developed automated sequencing mechanism 1986 Human Genome Initiative announced Leroy Hood 1990 The 15 year Human Genome project is launched by congress 1995 Moderate-resolution maps of chromosomes 3, 11, 12, and 22 maps published (These maps provide the locations of markers on each chromosome to make locating genes easier)
18 Major events in the history of Molecular Biology John Craig Venter: First bacterial genomes sequenced Challenged the genome sequencing project by developing shotgun approach approach depends on assembling the sequences by computer John Craig Venter
19 Major events in the history of Molecular Biology E. Coli sequenced 1998 PerkinsElmer, Inc.. Developed 96-capillary sequencer 1998 Complete sequence of the Caenorhabditis elegans genome 1999 First human chromosome (number 22) sequenced
20 Major events in the history of Molecular Biology Complete sequence of the euchromatic portion of the Drosophila melanogaster genome 2001 International Human Genome Sequencing:first draft of the sequence of the human genome published
21 Major events in the history of Molecular Biology Present April 2003 Human Genome Project Completed. Mouse genome is sequenced. Jan 15, 2014, Illumina Your genome may be sequenced < $1,000 IBM Challenge: <$100
22 Why sequence? MUtAsHONS What happens to genes when the DNA is mutated?
23 The Good, the Bad, and the Silent Mutations can serve the organism in three ways: The Good : The Bad : A mutation can cause a trait that enhances the organism s function: Mutation in the sickle cell gene provides resistance to malaria. A mutation can cause a trait that is harmful, sometimes fatal to the organism: Huntington s disease, a symptom of a gene mutation, is a degenerative disease of the nervous system. The Silent: A mutation can simply cause no difference in the function of the organism. Campbell, Biology, 5 th edition, p. 255
24 Real data: human & fruitfly eyeless This is a global alignment of human & fruitfly Eyeless-gene Next few slides are from Dr Avril Coghlan, Sanger Institute, UK
25 Real data: human & fruitfly eyeless There are 2 short regions of high similarity Outside those regions, there are many mismatches and gaps It might be more sensible to make local alignments of one or both of the regions of high similarity
26 Real data: human & fruitfly This is a local alignment of human & fruitfly What parts of the sequences were used in the local alignment?
27 The Smith-Waterman algorithm local alignment of 2 sequences The alignment of all possible subsequences (parts) of sequences S 1 and S 2 The 0 th row and 0 th column of T are first filled with zeroes The recurrence relation used to fill table T is: T(i, j) = max T(i-1, j-1) + σ(s 1 (i), S 2 (j)) T(i-1, j) + gap penalty T(i, j-1) + gap penalty 0 The traceback starts at the highest scoring cell in the matrix T, and travels up/left while the score is still positive
28 eg., to find the best local alignment of sequences ACCTAAGG and GGCTCAATCA, using +2 for a match, -1 for a mismatch, and -2 for a gap: We first make matrix T (as in N-W): The 0 th row and 0 th column of T are filled with zeroes The recurrence relation is then used to fill the matrix T A 0 C 0 C 0 T 0 A 0 A 0 G 0 G 0 G G C T C A A T C A
29 G G C T C A A T C A A 0? 0? C 0 C 0 T 0 A 0 A 0 G 0 G 0 We first calculate T(1,1) using the recurrence relation: T(i-1, j-1) + σ(s 1 (i), S 2 (j)) = 0 1 = -1 T(i, j) = max T(i-1, j) + gap penalty = 0-2 = -2 T(i, j-1) + gap penalty = 0-2 = -2 0 The maximum value is 0, so we set T(1,1) to 0
30 You fill in the whole of T, recording the previous cell (if any) to calculate the value of each T(i, j): G G C T C A A T C A A used C C T A A G G The traceback starts at the highest scoring cell in the matrix T, and travels up/left while the score is still positive
31 G G C T C A A T C A A C C T A A G G Work out the best local alignment from the traceback: Score of the alignment is in the bottom right cell of the traceback (6 = 4 (score of 2 per match) + 1 (-2 per gap)) C C T T C - A A A A
32 Molecular Clock Linus and Pauling found that α-chains of human and gorilla differ by 2 residues, and β-chains by 1 residues. They then calculated the time of divergence between human and gorilla using evolutionary molecular clock. Gorilla and human β chain were found to diverge about 7.3 years ago. Ancestor Human β Chain β Chain Gorilla β Chain
33 Beta globins: Beta globin chains of closely related species are highly similar: Observe simple alignments below: Human β chain: MVHLTPEEKSAVTALWGKV NVDEVGGEALGRLL Mouse β chain: MVHLTDAEKAAVNGLWGKVNPDDVGGEALGRLL Human β chain: VVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG Mouse β chain: VVYPWTQRYFDSFGDLSSASAIMGNPKVKAHGKK VIN Human β chain: AFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGN Mouse β chain: AFNDGLKHLDNLKGTFAHLSELHCDKLHVDPENFRLLGN Human β chain: VLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH Mouse β chain: MI VI VLGHHLGKEFTPCAQAAFQKVVAGVASALAHKYH There are a total of 27 mismatches, or (147 27) / 147 = 81.7 % identical
34 Beta globins: Cont. Human β chain: Chicken β chain: MVH L TPEEKSAVTALWGKVNVDEVGGEALGRLL MVHWTAEEKQL I TGLWGKVNVAECGAEALARLL Human β chain: Chicken β chain: VVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG IVYPWTQRFF ASFGNLSSPTA I LGNPMVRAHGKKVLT Human β chain: Chicken β chain: AFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGN SFGDAVKNLDNIK NTFSQLSELHCDKLHVDPENFRLLGD Human β chain: Mouse β chain: VLVCVLAHHFGKEFTPPVQAAY QKVVAGVANALAHKYH I L I I VLAAHFSKDFTPECQAAWQKLVRVVAHALARKYH -There are a total of 44 mismatches, or (147 44) / 147 = 70.1 % identical - As expected, mouse β chain is closer to that of human than chicken s.
35 Molecular evolution can be visualized with phylogenetic tree.
36 Mouse and Human overview Mouse has 2.1 x10 9 base pairs versus 2.9 x 10 9 in human. About 95% of genetic material is shared. 99% of genes shared of about 30,000 total. The 300 genes that have no homologue in either species deal largely with immunity, detoxification, smell and sex* *Scientific American Dec. 5, 2002
37 Human and Mouse Significant chromosomal rearranging occurred between the diverging point of humans and mice. Here is a mapping of human chromosome 3. It contains homologous sequences to at least 5 mouse chromosomes.
38 Important discovery
39 DNA Reversal 5 A T G C C T G T A C T A 3 Break and Invert 3 T A C G G A C A T G A T 5 5 A T G T A C A G G C T A 3 3 T A C A T G T C C G A T 5
40 Waardenburg s syndrome Genetic disorder Characterized by loss of hearing and pigmentary dysphasia Found on human chromosome 2
41
42 It is Sequenced, What s Next? Tracing Phylogeny Finding family relationships between species by tracking similarities between species. Gene Annotation (cooperative genomics) Comparison of similar species. Proteomics From DNA sequence to a folded protein. Determining Regulatory Networks How the genes react to certain stimuli? How cancer progresses?
43 Sequence Driven Problems in Bioinformatics Genomics Fragment assembly of the DNA sequence. Not possible to read entire sequence. Cut up into small fragments using restriction enzymes. Then need to do fragment assembly. Overlapping similarities to matching fragments. N-P complete problem. Finding Genes Identify open reading frames Exons are spliced out. Junk in between genes
44 More Computing problems Proteomics Protein Folding 1D Sequence 3D Structure What drives this process? Identification of functional domains in protein s sequence Determining functional pieces in proteins. Most problems need mathematical solutions & For Large data they need software
45 Biological Databases Online databases to archive, search, and run programs on massive amount of data NCBI GeneBank Huge collection of databases, the most prominent being the nucleotide sequence database Protein Data Bank Database of protein 3D-structures SWISSPROT Database of annotated protein sequences PROSITE Database of protein active site motifs
46 How gene expressions translate to Prognosis or Diagnosis: A Cancer Question
47
48
49 Microarray Experiments lead to understanding Gene Regulatory Networks
50 Big Data?
51 Current Thoughts on Big-data & Biology a Conference Bioinformatics for Big Data: How Applications of Big Data will Drive Research Forward February 10-12, 2014 Moscone North Convention Center San Francisco, CA
52 Session: TECHNOLOGIES GENERATING BIG BIOMEDICAL DATA 11:00 Cancer Genomics David Haussler, Ph.D., Distinguished Professor and Director, Center for Biomolecular Science & Engineering, Univ. of California Santa Cruz UCSC has built the Cancer Genomics Hub (CGHub) for the US National Cancer Institute, designed to hold up to 5 petabytes of research genomics data (up to 50,000 whole genomes), including data for all major NCI projects. To date it has served more than 8.3 petabytes of data to more than 300 research labs. Cancer is exceedingly complex, with thousands of subtypes involving an immense number of different combinations of mutations. The only way we will understand it is to gather together DNA data from many thousands of cancer genomes so that we have the statistical power to distinguish between recurring combinations of mutations that drive cancer progression and "passenger" mutations that occur by random chance. Currently, with the exception of a few projects such as ICGC and TCGA, most cancer genomics research is taking place in research silos, with little opportunity for data sharing. If this trend continues, we lose an incredible opportunity. Soon cancer genome sequencing will be widespread in clinical practice, making it possible in principle to study as many as a million cancer genomes. For these data to also have impact on understanding cancer, we must begin soon to move data into a global cloud storage and computing system, and design mechanisms t hat allow clinical data to be used in research with appropriate patient consent. A global alliance for sharing genomic and clinical data is emerging to address this problem. This is an opportunity we cannot turn away from, but involves both social and technical challenges. Translation: Huge database of mutated gene s
53 Session: DATA STORAGE AND MAINTENANCE 2:20 Implementing Big Data Analysis and Archival Solutions for NGS Data Zhiyan Fu, Ph.D., Chief Scientific Computing Officer, Genome Institute of Singapore (A*STAR) This presentation shows the latest development in big data analysis, compression and storage management. It provides a practical case to implement the big data technologies to a mid-size genome center. Attendees will understand the challenges of big data life-cycle management in a genome center and see how the latest big data technologies are implemented, and the pros and cons of some of the techniques, including Hadoop, HDF5, and different NGS compression algorithms evaluated by GIS. Translation: Experience in managing Next Gen. Sequencing Data storage
54 Session: HOW BIG DATA WILL DRIVE RESEARCH FORWARD 1:45 Harnessing Big Data to Accelerate Drug Development Vinod Kumar, Ph.D., Senior Investigator, Computational Biology, GlaxoSmithKline Pharmaceuticals With the rapid development of high-throughput technologies and ever-increasing accumulation of whole genome-level datasets, an increasing number of diseases and drugs can be comprehensively characterized by the changes they induce in gene expression, protein, metabolites and phenotypes. Integrating and querying such large volumes of data, often spanning domains and residing in diverse sources, constitutes a significant obstacle. This talk presents two distinct approaches that utilize these data types to systematically evaluate and suggest new disease indications for new and existing drugs. Translation: Predict new disease indications from Data for Drug Modeling, first by software
55 Session: BIG DATA DRIVING PERSONALIZED MEDICINE 5:05 It s Not Just About Big Data... Big Analytics for Identifying What Works and for Whom in Healthcare Iya Khalil, Ph.D., Executive Vice President and Co-Founder, GNS Healthcare We are living in the era of big data in healthcare, with unprecedented ability to collect data at multiple levels (genomic/'omic', phenotypic, health records, mobile health, etc.) and at scale. The key will be leveraging advanced analytics and appropriate feedback loops to identify what works on an individual patient level. Translation: Smart Algorithms needed for connecting multi-omics Big Data
56 Thank you! Debasis Mitra Professor, Computer Science, FIT
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
CCR Biology - Chapter 9 Practice Test - Summer 2012
Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
12.1 The Role of DNA in Heredity
12.1 The Role of DNA in Heredity Only in the last 50 years have scientists understood the role of DNA in heredity. That understanding began with the discovery of DNA s structure. In 1952, Rosalind Franklin
Basic Concepts of DNA, Proteins, Genes and Genomes
Basic Concepts of DNA, Proteins, Genes and Genomes Kun-Mao Chao 1,2,3 1 Graduate Institute of Biomedical Electronics and Bioinformatics 2 Department of Computer Science and Information Engineering 3 Graduate
Structure and Function of DNA
Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four
Algorithms in Computational Biology (236522) spring 2007 Lecture #1
Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office
Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in
DNA, RNA, Protein Synthesis Keystone 1. During the process shown above, the two strands of one DNA molecule are unwound. Then, DNA polymerases add complementary nucleotides to each strand which results
Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism )
Biology 1406 Exam 3 Notes Structure of DNA Ch. 10 Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure 3.11 3.15 enzymes control cell chemistry ( metabolism ) Proteins
Biological Sciences Initiative. Human Genome
Biological Sciences Initiative HHMI Human Genome Introduction In 2000, researchers from around the world published a draft sequence of the entire genome. 20 labs from 6 countries worked on the sequence.
14.3 Studying the Human Genome
14.3 Studying the Human Genome Lesson Objectives Summarize the methods of DNA analysis. State the goals of the Human Genome Project and explain what we have learned so far. Lesson Summary Manipulating
DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!
DNA Replication & Protein Synthesis This isn t a baaaaaaaddd chapter!!! The Discovery of DNA s Structure Watson and Crick s discovery of DNA s structure was based on almost fifty years of research by other
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives [email protected] 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför
Human Genome and Human Genome Project. Louxin Zhang
Human Genome and Human Genome Project Louxin Zhang A Primer to Genomics Cells are the fundamental working units of every living systems. DNA is made of 4 nucleotide bases. The DNA sequence is the particular
Big data in cancer research : DNA sequencing and personalised medicine
Big in cancer research : DNA sequencing and personalised medicine Philippe Hupé Conférence BIGDATA 04/04/2013 1 - Titre de la présentation - nom du département émetteur et/ ou rédacteur - 00/00/2005 Deciphering
Worksheet - COMPARATIVE MAPPING 1
Worksheet - COMPARATIVE MAPPING 1 The arrangement of genes and other DNA markers is compared between species in Comparative genome mapping. As early as 1915, the geneticist J.B.S Haldane reported that
CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA
CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA INTRODUCTION DNA : DNA is deoxyribose nucleic acid. It is made up of a base consisting of sugar, phosphate and one nitrogen base.the
DNA, RNA, Protein synthesis, and Mutations. Chapters 12-13.3
DNA, RNA, Protein synthesis, and Mutations Chapters 12-13.3 1A)Identify the components of DNA and explain its role in heredity. DNA s Role in heredity: Contains the genetic information of a cell that can
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:
Genetics Test Biology I
Genetics Test Biology I Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Avery s experiments showed that bacteria are transformed by a. RNA. c. proteins.
Chapter 11: Molecular Structure of DNA and RNA
Chapter 11: Molecular Structure of DNA and RNA Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand the major experiments that led to the discovery of DNA as
Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources
Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold
Genetics Module B, Anchor 3
Genetics Module B, Anchor 3 Key Concepts: - An individual s characteristics are determines by factors that are passed from one parental generation to the next. - During gamete formation, the alleles for
Answer: 2. Uracil. Answer: 2. hydrogen bonds. Adenine, Cytosine and Guanine are found in both RNA and DNA.
Answer: 2. Uracil Adenine, Cytosine and Guanine are found in both RNA and DNA. Thymine is found only in DNA; Uracil takes its (Thymine) place in RNA molecules. Answer: 2. hydrogen bonds The complementary
MAKING AN EVOLUTIONARY TREE
Student manual MAKING AN EVOLUTIONARY TREE THEORY The relationship between different species can be derived from different information sources. The connection between species may turn out by similarities
Transcription and Translation of DNA
Transcription and Translation of DNA Genotype our genetic constitution ( makeup) is determined (controlled) by the sequence of bases in its genes Phenotype determined by the proteins synthesised when genes
How To Understand The Science Of Genomics
Curs Bioinformática. Grau Genética GENÓMICA INTRODUCTION TO GENOME SCIENCE Antonio Barbadilla Group Genomics, Bioinformatics & Evolution Institut Biotecnologia I Biomedicina Departament de Genètica i Microbiologia
2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three
Chem 121 Chapter 22. Nucleic Acids 1. Any given nucleotide in a nucleic acid contains A) two bases and a sugar. B) one sugar, two bases and one phosphate. C) two sugars and one phosphate. D) one sugar,
Chapter 6 DNA Replication
Chapter 6 DNA Replication Each strand of the DNA double helix contains a sequence of nucleotides that is exactly complementary to the nucleotide sequence of its partner strand. Each strand can therefore
Genetics Lecture Notes 7.03 2005. Lectures 1 2
Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several
Human Genome Organization: An Update. Genome Organization: An Update
Human Genome Organization: An Update Genome Organization: An Update Highlights of Human Genome Project Timetable Proposed in 1990 as 3 billion dollar joint venture between DOE and NIH with 15 year completion
Activity IT S ALL RELATIVES The Role of DNA Evidence in Forensic Investigations
Activity IT S ALL RELATIVES The Role of DNA Evidence in Forensic Investigations SCENARIO You have responded, as a result of a call from the police to the Coroner s Office, to the scene of the death of
How Cancer Begins???????? Chithra Manikandan Nov 2009
Cancer Cancer is one of the most common diseases in the developed world: 1 in 4 deaths are due to cancer 1 in 17 deaths are due to lung cancer Lung cancer is the most common cancer in men Breast cancer
Next Generation Sequencing: Technology, Mapping, and Analysis
Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University [email protected] http://tandem.bu.edu/ The Human Genome Project took
Automated DNA sequencing 20/12/2009. Next Generation Sequencing
DNA sequencing the beginnings Ghent University (Fiers et al) pioneers sequencing first complete gene (1972) first complete genome (1976) Next Generation Sequencing Fred Sanger develops dideoxy sequencing
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
Searching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
The Human Genome Project
The Human Genome Project Brief History of the Human Genome Project Physical Chromosome Maps Genetic (or Linkage) Maps DNA Markers Sequencing and Annotating Genomic DNA What Have We learned from the HGP?
Bioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
Lab # 12: DNA and RNA
115 116 Concepts to be explored: Structure of DNA Nucleotides Amino Acids Proteins Genetic Code Mutation RNA Transcription to RNA Translation to a Protein Figure 12. 1: DNA double helix Introduction Long
Academic Nucleic Acids and Protein Synthesis Test
Academic Nucleic Acids and Protein Synthesis Test Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. Each organism has a unique combination
BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, [email protected]) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
Teacher Guide: Have Your DNA and Eat It Too ACTIVITY OVERVIEW. http://gslc.genetics.utah.edu
ACTIVITY OVERVIEW Abstract: Students build an edible model of DNA while learning basic DNA structure and the rules of base pairing. Module: The Basics and Beyond Prior Knowledge Needed: DNA contains heritable
A Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
Lecture 26: Overview of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) structure
Lecture 26: Overview of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) structure Nucleic acids play an important role in the storage and expression of genetic information. They are divided into
To be able to describe polypeptide synthesis including transcription and splicing
Thursday 8th March COPY LO: To be able to describe polypeptide synthesis including transcription and splicing Starter Explain the difference between transcription and translation BATS Describe and explain
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 [email protected] Genomics A genome is an organism s
AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
Bioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
SNP Essentials The same SNP story
HOW SNPS HELP RESEARCHERS FIND THE GENETIC CAUSES OF DISEASE SNP Essentials One of the findings of the Human Genome Project is that the DNA of any two people, all 3.1 billion molecules of it, is more than
Pairwise Sequence Alignment
Pairwise Sequence Alignment [email protected] SS 2013 Outline Pairwise sequence alignment global - Needleman Wunsch Gotoh algorithm local - Smith Waterman algorithm BLAST - heuristics What
Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics
Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics Christopher Benner, PhD Director, Integrative Genomics and Bioinformatics Core (IGC) idash Webinar,
DNA and the Cell. Version 2.3. English version. ELLS European Learning Laboratory for the Life Sciences
DNA and the Cell Anastasios Koutsos Alexandra Manaia Julia Willingale-Theune Version 2.3 English version ELLS European Learning Laboratory for the Life Sciences Anastasios Koutsos, Alexandra Manaia and
Genomes and SNPs in Malaria and Sickle Cell Anemia
Genomes and SNPs in Malaria and Sickle Cell Anemia Introduction to Genome Browsing with Ensembl Ensembl The vast amount of information in biological databases today demands a way of organising and accessing
A Genomic Timeline Tim Shank 2003
A Genomic Timeline Tim Shank 2003 1800s 1865 Gregor Mendel reports the results of his pea plant expts, from which he discerned several fundamental laws of heredity. His results appeared in an obscure journal
Bio-Informatics Lectures. A Short Introduction
Bio-Informatics Lectures A Short Introduction The History of Bioinformatics Sanger Sequencing PCR in presence of fluorescent, chain-terminating dideoxynucleotides Massively Parallel Sequencing Massively
Bioinformatics: course introduction
Bioinformatics: course introduction Filip Železný Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab http://ida.felk.cvut.cz
How To Change Medicine
P4 Medicine: Personalized, Predictive, Preventive, Participatory A Change of View that Changes Everything Leroy E. Hood Institute for Systems Biology David J. Galas Battelle Memorial Institute Version
Acceleration for Personalized Medicine Big Data Applications
Acceleration for Personalized Medicine Big Data Applications Zaid Al-Ars Computer Engineering (CE) Lab Delft Data Science Delft University of Technology 1" Introduction Definition & relevance Personalized
Forensic DNA Testing Terminology
Forensic DNA Testing Terminology ABI 310 Genetic Analyzer a capillary electrophoresis instrument used by forensic DNA laboratories to separate short tandem repeat (STR) loci on the basis of their size.
1865 Discovery: Heredity Transmitted in Units
1859 Discovery: Natural Selection Genetic Timeline Charles Darwin wrote On the Origin of Species by Means of Natural Selection, or the Preservation of Favored Races in the Struggle for Life. 1865 Discovery:
Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company
Genetic engineering: humans Gene replacement therapy or gene therapy Many technical and ethical issues implications for gene pool for germ-line gene therapy what traits constitute disease rather than just
Name Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.
13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both
GENE REGULATION. Teacher Packet
AP * BIOLOGY GENE REGULATION Teacher Packet AP* is a trademark of the College Entrance Examination Board. The College Entrance Examination Board was not involved in the production of this material. Pictures
Protein Synthesis. Page 41 Page 44 Page 47 Page 42 Page 45 Page 48 Page 43 Page 46 Page 49. Page 41. DNA RNA Protein. Vocabulary
Protein Synthesis Vocabulary Transcription Translation Translocation Chromosomal mutation Deoxyribonucleic acid Frame shift mutation Gene expression Mutation Point mutation Page 41 Page 41 Page 44 Page
Gene mutation and molecular medicine Chapter 15
Gene mutation and molecular medicine Chapter 15 Lecture Objectives What Are Mutations? How Are DNA Molecules and Mutations Analyzed? How Do Defective Proteins Lead to Diseases? What DNA Changes Lead to
The Structure, Replication, and Chromosomal Organization of DNA
Michael Cummings Chapter 8 The Structure, Replication, and Chromosomal Organization of DNA David Reisman University of South Carolina History of DNA Discoveries Friedrich Miescher Isolated nuclein from
A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
Intro to Bioinformatics
Intro to Bioinformatics Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Sarah A Pendergrass, PhD Research Associate
RNA and Protein Synthesis
Name lass Date RN and Protein Synthesis Information and Heredity Q: How does information fl ow from DN to RN to direct the synthesis of proteins? 13.1 What is RN? WHT I KNOW SMPLE NSWER: RN is a nucleic
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
AP Biology Essential Knowledge Student Diagnostic
AP Biology Essential Knowledge Student Diagnostic Background The Essential Knowledge statements provided in the AP Biology Curriculum Framework are scientific claims describing phenomenon occurring in
Replication Study Guide
Replication Study Guide This study guide is a written version of the material you have seen presented in the replication unit. Self-reproduction is a function of life that human-engineered systems have
DNA is found in all organisms from the smallest bacteria to humans. DNA has the same composition and structure in all organisms!
Biological Sciences Initiative HHMI DNA omponents and Structure Introduction Nucleic acids are molecules that are essential to, and characteristic of, life on Earth. There are two basic types of nucleic
Next Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013
Next Generation Sequencing: Adjusting to Big Data Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013 Outline Human Genome Project Next-Generation Sequencing Personalized Medicine
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS
Module 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
PRACTICE TEST QUESTIONS
PART A: MULTIPLE CHOICE QUESTIONS PRACTICE TEST QUESTIONS DNA & PROTEIN SYNTHESIS B 1. One of the functions of DNA is to A. secrete vacuoles. B. make copies of itself. C. join amino acids to each other.
An Introduction to Genomics and SAS Scientific Discovery Solutions
An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!
Core Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: [email protected]
Genetic diagnostics the gateway to personalized medicine
Micronova 20.11.2012 Genetic diagnostics the gateway to personalized medicine Kristiina Assoc. professor, Director of Genetic Department HUSLAB, Helsinki University Central Hospital The Human Genome Packed
Introduction to Bioinformatics 3. DNA editing and contig assembly
Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 [email protected]
3120-1 - Page 1. Name:
Name: 1) Which series is arranged in correct order according to decreasing size of structures? A) DNA, nucleus, chromosome, nucleotide, nitrogenous base B) chromosome, nucleus, nitrogenous base, nucleotide,
Lab 2/Phylogenetics/September 16, 2002 1 PHYLOGENETICS
Lab 2/Phylogenetics/September 16, 2002 1 Read: Tudge Chapter 2 PHYLOGENETICS Objective of the Lab: To understand how DNA and protein sequence information can be used to make comparisons and assess evolutionary
The National Institute of Genomic Medicine (INMEGEN) was
Genome is...... the complete set of genetic information contained within all of the chromosomes of an organism. It defines the particular phenotype of an individual. What is Genomics? The study of the
Next Generation Sequencing
Next Generation Sequencing Technology and applications 10/1/2015 Jeroen Van Houdt - Genomics Core - KU Leuven - UZ Leuven 1 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977
Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)
Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Single nucleotide polymorphisms or SNPs (pronounced "snips") are DNA sequence variations that occur
Introduction to The DNA Discovery Kit
...where molecules become real TM Introduction to The DNA Discovery Kit Photos by Sean Ryan All rights reserved on DNA Discovery Kit. US Patent 6,471,520 B1 1 ...where molecules become real TM Dear Friends
Protein & DNA Sequence Analysis. Bobbie-Jo Webb-Robertson May 3, 2004
Protein & DNA Sequence Analysis Bobbie-Jo Webb-Robertson May 3, 2004 Sequence Analysis Anything connected to identifying higher biological meaning out of raw sequence data. 2 Genomic & Proteomic Data Sequence
Mitochondrial DNA Analysis
Mitochondrial DNA Analysis Lineage Markers Lineage markers are passed down from generation to generation without changing Except for rare mutation events They can help determine the lineage (family tree)
Polar Covalent Bonds and Hydrogen Bonds
Lesson 6.1: Polar Covalent Bonds and Hydrogen Bonds The last section of code will add hydrogen bonding functionality between molecules. To do so, we have to understand the chemistry of polar covalent bonds
