Basic Concepts of DNA, Proteins, Genes and Genomes

Similar documents
Structure and Function of DNA

Molecular Genetics. RNA, Transcription, & Protein Synthesis

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

DNA, RNA, Protein synthesis, and Mutations. Chapters

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

PRACTICE TEST QUESTIONS

Answer: 2. Uracil. Answer: 2. hydrogen bonds. Adenine, Cytosine and Guanine are found in both RNA and DNA.

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

Translation Study Guide

RNA & Protein Synthesis

13.2 Ribosomes & Protein Synthesis

Name Class Date. Figure Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.

Transcription and Translation of DNA

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

Lecture 26: Overview of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) structure

Thymine = orange Adenine = dark green Guanine = purple Cytosine = yellow Uracil = brown

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three

Proteins and Nucleic Acids

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water

Provincial Exam Questions. 9. Give one role of each of the following nucleic acids in the production of an enzyme.

Protein Synthesis How Genes Become Constituent Molecules

Nucleotides and Nucleic Acids

12.1 The Role of DNA in Heredity

Academic Nucleic Acids and Protein Synthesis Test

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

Lab # 12: DNA and RNA

To be able to describe polypeptide synthesis including transcription and splicing

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

The Steps. 1. Transcription. 2. Transferal. 3. Translation

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled

Chapter 11: Molecular Structure of DNA and RNA

Genetics Module B, Anchor 3

RNA Structure and folding

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

A Genomic Timeline Tim Shank 2003

DNA and the Cell. Version 2.3. English version. ELLS European Learning Laboratory for the Life Sciences

Name: Date: Period: DNA Unit: DNA Webquest

The Molecules of Cells

Protein Synthesis. Page 41 Page 44 Page 47 Page 42 Page 45 Page 48 Page 43 Page 46 Page 49. Page 41. DNA RNA Protein. Vocabulary

Cellular Respiration Worksheet What are the 3 phases of the cellular respiration process? Glycolysis, Krebs Cycle, Electron Transport Chain.

Replication Study Guide

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

ISTEP+: Biology I End-of-Course Assessment Released Items and Scoring Notes

DNA. Discovery of the DNA double helix

Page 1. Name:

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

Ms. Campbell Protein Synthesis Practice Questions Regents L.E.

CCR Biology - Chapter 8 Practice Test - Summer 2012

Chapter 3 Molecules of Cells

Proteins. Proteins. Amino Acids. Most diverse and most important molecule in. Functions: Functions (cont d)

DNA is found in all organisms from the smallest bacteria to humans. DNA has the same composition and structure in all organisms!

Modeling DNA Replication and Protein Synthesis

13.4 Gene Regulation and Expression

Specific problems. The genetic code. The genetic code. Adaptor molecules match amino acids to mrna codons

RNA and Protein Synthesis

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Polar Covalent Bonds and Hydrogen Bonds

How To Understand The Chemistry Of Organic Molecules

From DNA to Protein

AS and A LEVEL. Delivery Guide H020/H420 BIOLOGY A. Theme: Nucleotides and Nucleic Acids March 2015

Bob Jesberg. Boston, MA April 3, 2014

Genetics Lecture Notes Lectures 1 2

Teacher Guide: Have Your DNA and Eat It Too ACTIVITY OVERVIEW.

The Puzzle of Life A Lesson Plan for Life S cien ce Teach ers From: The G reat Lakes S cien ce C ent er, C lev elan d, OH

Chapter 5: The Structure and Function of Large Biological Molecules

Chapter 3: Biological Molecules. 1. Carbohydrates 2. Lipids 3. Proteins 4. Nucleic Acids

TRANSCRIPTION TRANSLATION - GENETIC CODE AND OUTLINE OF PROTEIN SYNTHESIS

Human Genome and Human Genome Project. Louxin Zhang

Chapter 5. The Structure and Function of Macromolecule s

1865 Discovery: Heredity Transmitted in Units

BCH401G Lecture 39 Andres

DNA (genetic information in genes) RNA (copies of genes) proteins (functional molecules) directionality along the backbone 5 (phosphate) to 3 (OH)

Sample Questions for Exam 3

A Brief History of DNA

Chapter 17: From Gene to Protein

Translation. Translation: Assembly of polypeptides on a ribosome

Forensic DNA Testing Terminology

STRUCTURES OF NUCLEIC ACIDS

4. Which carbohydrate would you find as part of a molecule of RNA? a. Galactose b. Deoxyribose c. Ribose d. Glucose

Central Dogma. Lecture 10. Discussing DNA replication. DNA Replication. DNA mutation and repair. Transcription

Genetics Test Biology I

DNA and RNA are long linear polymers, called nucleic acids, that carry. DNA, RNA, and the Flow of Genetic Information CHAPTER 4

Elements in Biological Molecules

The Structure, Replication, and Chromosomal Organization of DNA

GENE REGULATION. Teacher Packet

The DNA Discovery Kit The Discovery Approach & Teacher Notes

K'NEX DNA Models. Developed by Dr. Gary Benson Department of Biomathematical Sciences Mount Sinai School of Medicine

Biological molecules:

Concluding lesson. Student manual. What kind of protein are you? (Basic)

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening

MAKING AN EVOLUTIONARY TREE

agucacaaacgcu agugcuaguuua uaugcagucuua

Overview of Eukaryotic Gene Prediction

Announcements. Chapter 15. Proteins: Function. Proteins: Function. Proteins: Structure. Peptide Bonds. Lab Next Week. Help Session: Monday 6pm LSS 277

Transcription: RNA Synthesis, Processing & Modification

CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA

Bio 102 Practice Problems Genetic Code and Mutation

Transcription:

Basic Concepts of DNA, Proteins, Genes and Genomes Kun-Mao Chao 1,2,3 1 Graduate Institute of Biomedical Electronics and Bioinformatics 2 Department of Computer Science and Information Engineering 3 Graduate Institute of Networking and Multimedia National Taiwan University, Taipei, Taiwan 106 Email: kmchao@csie.ntu.edu.tw October 2, 2006 1 Introduction Deoxyribonucleic acid (DNA) is the genetic material of cells. It carries information in a coded form from cell to cell and from parent to offspring. A gene is a linear array of nucleotides located in a particular position on a particular chromosome that encodes a specific functional product (a protein or RNA molecule). When a gene is active, its information is copied first into another nucleic acid, ribonucleic acid (RNA), which in turn directs the synthesis of the gene products, the specific proteins. This lecture introduces some basic concepts of DNA, proteins, genes and genomes. 2 The Nucleic Acids: DNA and RNA Each nucleic acid contains four types of base. DNA is made up of four similar chemicals: adenine, guanine, cytosine and thymine that are repeated millions or billions of times throughout a genome. The human genome, for example, 1

has about three billion pairs of bases. RNA is made of four chemicals: adenine, guanine, cytosine and uracil. The bases are usually referred to by their initial letters: A, G, C, T for DNA and A, G, C, U for RNA. The particular order of As, Gs, Cs, and Ts is extremely important. The order underlies all of life s diversity, even dictating whether an organism is human or another species such as yeast, rice, or fruit fly, all of which have their own genomes and are themselves the focus of genome projects. In the late 1940s, Erwin Chargaff noted an important similarity: the amount of adenine in DNA molecules is always equal to the amount of thymine, and the mount of guanine is always equal to the amount of cytosine (A = T and G = C). In 1953, based on the x-ray diffraction data of Rosalind Franklin and Maurice Wilkins, James Watson and Francis Crick proposed a model for DNA structure. The Watson-Crick model states that the DNA molecule is a double helix (two strands twisted together). The only two pairs that are possible are AT and CG. This yields a molecule in which A = T and G = C. The model also suggests that the basis for copying the genetic information is the complementarity of its bases. For example, if the sequence on one strand is AGATC, then the sequence of the other strand would have to be TCTAG - its complementary bases. For their ground-breaking theory, Watson and Crick shared the Nobel Prize in 1962. There are several different kinds of RNA made by the cell. In particular, mrna, messenger RNA, is a copy of a gene. It acts as a photocpoy of a gene by having a sequence complementary to one strand of the DNA and identical to the other strand. Other RNAs include trna (transfer RNA), rrna (ribosomal RNA) and snrna (small nuclear RNA). RNA is too bulky to form a stable double helix. In fact, RNA exists as a single-stranded molecule. However, regions of double helix can form where there is some base pair complementation (U and A, G and C), resulting in hairpin loops. The RNA molecule with its hairpin loops is said to have a secondary structure. 3 Proteins The building blocks of proteins are the amino acids. Only 20 different amino acids make up the diverse array of proteins found in living things. Each protein differs according to the amount, type and arrangement of amino acids that make up its structure. The chains of amino acids are linked by peptide bounds. A long chain of amino acids linked by peptide bonds is a 2

polypeptide. Proteins are long, complex polypeptides. The sequence of amino acids that makes up a particular polypeptide chain is called the primary structure of a protein. The primary structure folds into the secondary structure, which describes the path that the polypeptide backbone of the protein follows in space. The tertiary structure describes the organization in three dimensions of all the atoms in the polypeptide chain. The quaternary structure consists of aggregates of more than one polypeptide chain. The sequence of amino acids, as well as shape, is crucial to the functioning of a protein. 4 Genes Genes are the fundamental physical and functional units of heredity. Genes carry information for making all the proteins required by all organisms. These proteins determine, among other things, how the organism looks, how well its body metabolizes food or fights infection, and sometimes even how it behaves. A gene is an ordered sequence of nucleotides located in a particular position on a particular chromosome that encodes a specific functional product (a protein or RNA molecule). Expressed genes include those that are transcribed into mrna and then translated into protein and those that are transcribed into RNA but not translated into protein (e.g., trna and rrna). How does the sequence of a strand of DNA correspond to the amino acid sequence of a protein? This concept is explained by the central dogma of molecular biology. Information flow (with the exception of reverse transcription) is from DNA to RNA via the process of transcription, and then to protein via translation. Transcription is the making of an RNA molecule off a DNA template. Translation is the construction of an amino acid sequence (polypeptide) from an RNA molecule. How does an mrna specify amino acid sequence? The answer lies in the genetic code. It would be impossible for each amino acid to be specified by one nucleotide, because there are only 4 nucleotides and 20 amino acids. Similarly, two nucleotide combinations could only specify 16 amino acids. The final conclusion is that each amino acid is specified by a particular combination of three nucleotides, called a codon. To code for the 20 essential amino acids a genetic code must consist of at least a 3-base set (triplet) of the 4 bases. If one considers the possibilities 3

of arranging four things 3 at a time (4x4x4), we get 64 possible code words, or codons (a 3-base sequence on the mrna that codes for either a specific amino acid or a control word). The genetic code was broken by Marshall Nirenberg and Heinrich Matthaei, a decade after Watson and Crick s work. 5 The Genomes A genome is all the DNA in an organism, including its genes. In 1990, the Human Genome Project was launched by the U.S. Department of Energy and the National Institutes of Health. The project originally was planned to last 15 years, but rapid technological advances have accelerated the draft completion date to 2001. The goals of the project are to identify all the genes in human DNA, determine the sequences of the 3 billion base pairs that make up human DNA, store this information in databases, develop tools for data analysis, and address the ethical, legal, and social issues that may arise from the project. To help achieve these goals, researchers are also studying the genetic makeup of several nonhuman organisms. Because all organisms are related through similarities in DNA sequences, insights gained from nonhuman genomes often lead to new knowledge about human biology. These include the bacterium Escherichia coli, the fruit fly, and the laboratory mouse. The much larger genomes of two frequently-used higher organisms have also been sequenced. They are baker s yeast (Saccharomyces cerevisiae) and the roundworm (Caenorhabditis elegans). The project will bring fantastic benefits for humankind, some that we can anticipate and others that will surprise us. Researchers will be provided with detailed DNA information that will be key to understanding the structure, organization, and function of DNA in chromosomes. Genome maps of other organisms will provide the basis for comparative studies that are often critical to understanding more complex biological systems. Information generated and technologies developed will revolutionize future biological explorations. Wouldn t you agree that the genomes are the largest programs written in the oldest language, and are quite adaptable, flexible, and fault-tolerant? 4

Acknowledgements Kun-Mao Chao was supported in part by NSC grants 94-2213-E-002-018 and 95-2221-E-002-126-MY3 from the National Science Council, Taiwan. 5