CS691K Bioinformatics Kulp Lecture Notes #0 Molecular & Cell Biology. Fall 2005

Similar documents
Name Class Date. Figure Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

Structure and Function of DNA

Given these characteristics of life, which of the following objects is considered a living organism? W. X. Y. Z.

Chapter 6 DNA Replication

1 Mutation and Genetic Change

Basic Concepts of DNA, Proteins, Genes and Genomes

Genetics Module B, Anchor 3

12.1 The Role of DNA in Heredity

Translation Study Guide

Forensic DNA Testing Terminology

Protein Synthesis How Genes Become Constituent Molecules

Page 1. Name:

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

RNA & Protein Synthesis

Transcription and Translation of DNA

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three

13.2 Ribosomes & Protein Synthesis

Appendix C DNA Replication & Mitosis

Molecular Genetics. RNA, Transcription, & Protein Synthesis

Thymine = orange Adenine = dark green Guanine = purple Cytosine = yellow Uracil = brown

To be able to describe polypeptide synthesis including transcription and splicing

PRACTICE TEST QUESTIONS

1.5 page 3 DNA Replication S. Preston 1

BME Engineering Molecular Cell Biology. Lecture 02: Structural and Functional Organization of

Respiration occurs in the mitochondria in cells.

Academic Nucleic Acids and Protein Synthesis Test

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Nucleotides and Nucleic Acids

Answer: 2. Uracil. Answer: 2. hydrogen bonds. Adenine, Cytosine and Guanine are found in both RNA and DNA.

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

Chapter 11: Molecular Structure of DNA and RNA

Replication Study Guide

From DNA to Protein

Genetics Lecture Notes Lectures 1 2

Basic Biological Principles Module A Anchor 1

Bob Jesberg. Boston, MA April 3, 2014

Cell Growth and Reproduction Module B, Anchor 1

Sample Questions for Exam 3

Module 3 Questions. 7. Chemotaxis is an example of signal transduction. Explain, with the use of diagrams.

AS Biology Unit 2 Key Terms and Definitions. Make sure you use these terms when answering exam questions!

Protein Synthesis. Page 41 Page 44 Page 47 Page 42 Page 45 Page 48 Page 43 Page 46 Page 49. Page 41. DNA RNA Protein. Vocabulary

The Molecules of Cells

Modeling DNA Replication and Protein Synthesis

DNA, RNA, Protein synthesis, and Mutations. Chapters

Ms. Campbell Protein Synthesis Practice Questions Regents L.E.

Genetics Test Biology I

DNA Paper Model Activity Level: Grade 6-8

Introduction. What is Ecological Genetics?

Essentials of Human Anatomy & Physiology 11 th Edition, 2015 Marieb

4. DNA replication Pages: Difficulty: 2 Ans: C Which one of the following statements about enzymes that interact with DNA is true?

1. When new cells are formed through the process of mitosis, the number of chromosomes in the new cells

DNA. Discovery of the DNA double helix

BioBoot Camp Genetics

An Overview of Cells and Cell Research

Name: Date: Period: DNA Unit: DNA Webquest

Central Dogma. Lecture 10. Discussing DNA replication. DNA Replication. DNA mutation and repair. Transcription

Unit I: Introduction To Scientific Processes

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

13.4 Gene Regulation and Expression

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

How Cancer Begins???????? Chithra Manikandan Nov 2009

MCAS Biology. Review Packet

AP Biology Essential Knowledge Student Diagnostic

Chapter 13: Meiosis and Sexual Life Cycles

Today you will extract DNA from some of your cells and learn more about DNA. Extracting DNA from Your Cells

CCR Biology - Chapter 8 Practice Test - Summer 2012

MUTATION, DNA REPAIR AND CANCER

STRUCTURES OF NUCLEIC ACIDS

CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA

RNA and Protein Synthesis

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

Organelle Speed Dating Game Instructions and answers for teachers

Provincial Exam Questions. 9. Give one role of each of the following nucleic acids in the production of an enzyme.

Lab # 12: DNA and RNA

Molecular Cell Biology WS2011

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled

Biology Final Exam Study Guide: Semester 2

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water

Regents Biology REGENTS REVIEW: PROTEIN SYNTHESIS

Multiple Choice Write the letter that best answers the question or completes the statement on the line provided.

4. Why are common names not good to use when classifying organisms? Give an example.

Proteins and Nucleic Acids

The Structure, Replication, and Chromosomal Organization of DNA

Basic Concepts Recombinant DNA Use with Chapter 13, Section 13.2

Viruses. Viral components: Capsid. Chapter 10: Viruses. Viral components: Nucleic Acid. Viral components: Envelope

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

Cellular Respiration Worksheet What are the 3 phases of the cellular respiration process? Glycolysis, Krebs Cycle, Electron Transport Chain.

AP Biology 2015 Free-Response Questions

Name: LAB SECTION: Circle your answer on the test sheet: completely erase or block out unwanted answers.

A and B are not absolutely linked. They could be far enough apart on the chromosome that they assort independently.

AP Biology Syllabus

DNA: Structure and Replication

The Steps. 1. Transcription. 2. Transferal. 3. Translation

Teacher Guide: Have Your DNA and Eat It Too ACTIVITY OVERVIEW.

Transcription:

CS691K Bioinformatics Kulp Lecture Notes #0 Molecular & Cell Biology Fall 2005 dkulp@cs.umass.edu

Syllabus distributed Logistics Class taught in 3 stages by faculty in CS, math/stats, and microbio Grades will be based on up to six homework assignments Office hours on syllabus. All faculty are readily available by email. We are happy to discuss the class with you personally. Not all notes will be available online - you should attend all lectures and take good notes Diverse group of students Emphasis will be on understanding methods and practical use of existing bioinformatics tools Why are you here? What is your background? What are you hoping to get out of this class? Please sign the email sheet! Homework will involve the use of the unix ED-LAB computers. There will be a special meeting on WEDNESDAY, SEPTEMBER 14 for novice unix users.

What is Bioinformatics Computational Biology: The use of algorithmic, mathematical, and statistical methods to analyze genome sequences (i.e. DNA, RNA, protein) and derived data (e.g. expression, NMR, etc.) Informatics: The software and data management methodologies for storing, retrieving, and intrigrating such data Data Mining / In-silico Biology: Hypothesis generation and testing from genome data sets

Topics Detecting similar sequences (homology) Pairwise and multiple sequence alignment Protein function/structure prediction Sequence pattern modeling and recognition Motif discovery Gene finding Analyzing high-dimension data Function prediction, target discovery, etc. from gene expression Constructing trees Phylogenetics Informatics and integration Genome biology

The Cell Prokaryotes are unicellular with minimal compartments - bacteria, archaea Eukaryotes are multicellular with differentiation and many organelles including the nucleus that typically can reproduce sexually - all higher organisms including mammals, birds, fish, invertebrates, mushrooms, plants, and yeast. ~300,000,000,000,000 cells in a human.

The Cell The cell is composed of and makes thousands of proteins, e.g. the cell wall is made of a layer of proteins and lipids. There are special proteins embedded in the wall as channels and pumps And the cell makes (synthesizes) proteins DNA makes RNA, RNA makes proteins, and proteins make us! F. Crick The cell is a chemical catalytic machine Networks: one type of network are metabolic networks describing catalytic reactions for the consumption or synthesis of products necessary for life. Many of these are fairly well understood. (e.g. photosynthesis) Another type of network are signaling networks where information is conveyed about the environment. These are partially understood. (e.g. protein kinases are involved in cell differentiation and cell death)

From KEGG (http://www.genome.ad.jp/kegg/pathway.html)

The Cell - Genetic Information There is a third major type of network: genetic information processing. We will focus on these networks. To understand this: we describe the nature of DNA Tangentially mention homology and conservation Then discuss the process of translation

DNA Structure - Eukaryotic Chromosome DNA - a string of nucleic acids (Adenine, Guanine, Cytosine, and Thymine) Regular, long, stable, oriented, double-stranded, helical structure Humans: 23 pairs of chromosomes. Total ~3B bases (x2) DNA resides in nucleus in eukaryotes

DNA DNA Structure Always: chemical pairing of A-T and C-G. Thus, strands are complementary. Two chains run in opposite directions: 5 to 3 5 3 3 5

Prokaryotes (and mitochondria) have one circular chromosome Prokaryotic Chromosomes This shows the E. coli genome with orange and yellow bars indicating the positions of the genes on the two strands.

RNA RNA is a similar molecule composed of 4 nucleic acids (A, C, G, and U) Single-stranded. Can base-pair with DNA (synthesis) Can self-base-pair and fold

DNA Replication We won t be discussing the details of DNA replication. There are 2 processes: Mitosis for normal cell duplication Meiosis for gametes for sexual reproduction - single, recombined chromosomes In both processes, DNA is copied by breaking doublestrand (dsdna) into single-strands (ssdna) at origins of replication and synthesizing a complementary copy from the template. 50 bp/sec * 15K origins = ~1 hr to replicate human genome Problem: How does DNA polymerase find the origins? Are there sequence patterns?

The Tree of Life Single common ancestral genome!

DNA Conservation and Variation Mutations occur in DNA due to environmental effects (e.g. radiation) and random mistakes during synthesis. Usually just single nucleotides are changes, sometimes large rearrangements. Those changes occurring in somatic (non-sex) cells cause local damage, usually cell death, but can cause cancer. (Search for the common mutations that cause different types of cancers.) Those changes occurring in gametes can be inherited and if favorable can become fixed Variation in non-functional (junk) DNA tends to drift, whereas functional DNA (e.g. containing genes) tends to remain conserved. Problems: Given a set of sequences from different organisms: Identify and align sequences from a common ancestor (homologous) What are the important (conserved) parts? What was the evolutionary history? (Reconstruct the tree ) Given a model organism (e.g. mouse, yeast, fruitfly, etc.), find the orthologous locus in human

Examples of Sequence Conservation A segment from the RNA needed for protein synthesis - a fundamental process in all life forms. It is conserved across all 3 major branches of the tree of life. A multiple alignment of homologous protein sequences. Colors indicate different classes of amino acids. Dots are inserts/deletes.

DNA contains GENES Genes are heriditary units of DNA We now know that, for the most part, genes are regions that code for proteins Proteins are derived from DNA according to the central dogma : DNA => RNA => Protein Like DNA replication, DNA is opened into two single strands. Using a ssdna as a template, a complementary copy of RNA is synthesized for a small region of the genome (1000-100000nt) The RNA is processed and transported (more about that in later lectures) Each triple of RNA (codon) is translated to one of 20 amino acids creating a polypeptide chain, which folds into a protein Problems: How does the cell know where to find a gene? (Sequence patterns?) How does RNA transcription know when to stop? (Patterns?) How is RNA edited?

Central Dogma - DNA - RNA - Protein 1998 by Alberts, Bray, Johnson, Lewis, Raff, Roberts, Walter

Codon Translation Each triplet translates to a unique amino acid. For example, CUU is Leucine. There are 4*4*4=64 possible codons that translate into 20 amino acids This translation table is fixed for almost all life

Cell Differentiation Eukaryotes have many different cell types (skin, muscle, neurons, etc.) that each play a different role. To accomplish the cell s role, different genes must be activated Problems: How are genes activated? What regulatory patterns are in the DNA? What genes control other genes? What network associations among genes can be found? What genes are differentially expressed?

Cell Differentiation

Differential Expression Interleukin 1 alpha expressed in different cell types

Protein Sequence, Structure, Function Lastly, given a protein sequence, what is the 3-D structure and function? The most common approach is to exploit conservation (see earlier) Problem: Find similar proteins to my query protein. Maybe I can assign structure or function to my new query protein, if structure or function is already known for a homologous protein. (Sequence similarity searching, protein family modeling)

Protein Structure

Further Reading Many online intros to genome biology E.g. http://www.ncbi.nlm.nih.gov/about/primer/ Any molecular biology text E.g. Molecular Biology of the Cell by Alberts, et al or Genomes by Brown.