CSC 2427: Algorithms for Molecular Biology Spring 2006. Lecture 16 March 10



Similar documents
Built from 20 kinds of amino acids

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

4. Which carbohydrate would you find as part of a molecule of RNA? a. Galactose b. Deoxyribose c. Ribose d. Glucose

Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)

IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins

18.2 Protein Structure and Function: An Overview

Helices From Readily in Biological Structures

Carbohydrates, proteins and lipids

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

A. A peptide with 12 amino acids has the following amino acid composition: 2 Met, 1 Tyr, 1 Trp, 2 Glu, 1 Lys, 1 Arg, 1 Thr, 1 Asn, 1 Ile, 1 Cys

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

Proteins and Nucleic Acids

Structure of proteins

Problem Set 1 KEY

Disulfide Bonds at the Hair Salon

Chapter 5. The Structure and Function of Macromolecule s

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water


The Molecules of Cells

Paper: 6 Chemistry University I Chemistry: Models Page: 2 of Which of the following weak acids would make the best buffer at ph = 5.0?

Shu-Ping Lin, Ph.D.

Structure and properties of proteins. Vladimíra Kvasnicová

How To Understand The Chemistry Of Organic Molecules

Biological Molecules

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

Chapter 3 Molecules of Cells

Lecture 19: Proteins, Primary Struture

Nafith Abu Tarboush DDS, MSc, PhD

Proteins. Proteins. Amino Acids. Most diverse and most important molecule in. Functions: Functions (cont d)

Chapter 3: Biological Molecules. 1. Carbohydrates 2. Lipids 3. Proteins 4. Nucleic Acids

Translation Study Guide

Disaccharides consist of two monosaccharide monomers covalently linked by a glycosidic bond. They function in sugar transport.

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

The peptide bond is rigid and planar

Myoglobin and Hemoglobin

Replication Study Guide

Combinatorial Biochemistry and Phage Display

The peptide bond Peptides and proteins are linear polymers of amino acids. The amino acids are

NO CALCULATORS OR CELL PHONES ALLOWED

Molecular Genetics. RNA, Transcription, & Protein Synthesis

Chapter 3. Protein Structure and Function

Amino Acids and Proteins

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Molecular Cell Biology

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

RNA & Protein Synthesis

Chapter 12 - Proteins

Introduction to Proteins and Enzymes

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are

Concluding lesson. Student manual. What kind of protein are you? (Basic)

Amino Acids, Proteins, and Enzymes. Primary and Secondary Structure Tertiary and Quaternary Structure Protein Hydrolysis and Denaturation

Elements in Biological Molecules

Chapter 5: The Structure and Function of Large Biological Molecules

Structures of Proteins. Primary structure - amino acid sequence

INTRODUCTION TO PROTEIN STRUCTURE

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

Lecture Conformation of proteins Conformation of a protein three-dimensional structure native state. native condition

Carbon-organic Compounds

H H N - C - C 2 R. Three possible forms (not counting R group) depending on ph

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.

Transcription and Translation of DNA

AP BIOLOGY 2008 SCORING GUIDELINES

Peptide Bonds: Structure

Structure and Function of DNA

PROTEINS THE PEPTIDE BOND. The peptide bond, shown above enclosed in the blue curves, generates the basic structural unit for proteins.

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?

Biological molecules:

FTIR Analysis of Protein Structure

The Organic Chemistry of Amino Acids, Peptides, and Proteins

Chemical Basis of Life Module A Anchor 2

Peptide Bond Amino acids are linked together by peptide bonds to form polypepetide chain.

Ionization of amino acids

Preliminary MFM Quiz

Basic Concepts of DNA, Proteins, Genes and Genomes

Name: Hour: Elements & Macromolecules in Organisms

Translation. Translation: Assembly of polypeptides on a ribosome

DNA is found in all organisms from the smallest bacteria to humans. DNA has the same composition and structure in all organisms!

Computational Systems Biology. Lecture 2: Enzymes

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function

Previously published in Biophysical Society On-line Textbook PROTEINS CHAPTER 1. PROTEIN STRUCTURE. Section 1. Primary structure, secondary motifs,

Hydrogen Bonds The electrostatic nature of hydrogen bonds

Invariant residue-a residue that is always conserved. It is assumed that these residues are essential to the structure or function of the protein.

MCAT Organic Chemistry - Problem Drill 23: Amino Acids, Peptides and Proteins

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

Elements & Macromolecules in Organisms

I N V E S T I C E D O R O Z V O J E V Z D Ě L Á V Á N Í

Specific problems. The genetic code. The genetic code. Adaptor molecules match amino acids to mrna codons

Biochemistry of Cells

Recognizing Organic Molecules: Carbohydrates, Lipids and Proteins

PRACTICE TEST QUESTIONS

Chemistry 110. Bettelheim, Brown, Campbell & Farrell. Introduction to General, Organic and Biochemistry Chapter 22 Proteins

Exam 4 Outline CH 105 Spring 2012

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

The Chemical Basis of Life. Chemical Bonds

The Steps. 1. Transcription. 2. Transferal. 3. Translation

Student name ID # 2. (4 pts) What is the terminal electron acceptor in respiration? In photosynthesis? O2, NADP+

I. Chapter 5 Summary. II. Nucleotides & Nucleic Acids. III. Lipids

8/20/2012 H C OH H R. Proteins

Transcription:

CSC 2427: Algorithms for Molecular Biology Spring 2006 Lecture 16 March 10 Lecturer: Michael Brudno Scribe: Jim Huang 16.1 Overview of proteins Proteins are long chains of amino acids (AA) which are produced through the operation of translating messenger RNA. Proteins play an crucial role in enzymatic activity, storage and transport of material, signal transduction and many other biological functions. There are 20 different AAs that serve as the building blocks for proteins: we can therefore think of proteins as sequences of symbols drawn from this alphabet of AAs. Each AA is produced from a codon, or triplet of nucleotides in the messenger RNA. Since the codon ATG is a start codon and ATG also translates to the aminoacid methionine (M), all proteins begin with a methionine. Each AA has a specific chemical structure which contains a carbon backbone and a side chain, or R group. All 20 different AAs have this same general structure, but their side-chain groups vary in size, shape, charge, hydrophobicity, and reactivity. AAs can be classified into a few distinct categories based primarily on their solubility in water. AAs with polar side groups are soluble in aqueous solutions and are thus called hydrophilic. In contrast, AAs with nonpolar side groups avoid water and are said to be hydrophobic. These aggregate to form the water-insoluble core of proteins. The polarity of AA side chains thus is one of the forces responsible for shaping the final three-dimensional structure of proteins (more on this below). AAs in a protein are connected to one another in a linear, unbranched chain through peptide bonds. A chain of peptide bonds forms the backbone of a protein molecule: the various side-chain groups for each AA in the protein project outwards from this backbone Fig. 16.1. Figure 16.1. An amino acid and a peptide chain (figure reproduced from [1]. Sidechains/R-groups hang off the backbone of the peptide chain, or polymer) Some AAs are more abundant in proteins than other AAs. Cysteine, tryptophan, and methionine are rare AAs; together they constitute approximately 5 percent of the AAs in a protein. Four AAsleucine, serine, lysine, and glutamic acidare the most abundant AAs, totaling 32 percent of all the AA residues in a typical protein. However, the AA composition of proteins can vary widely from one protein to another. Many terms are used to denote the chains formed by polymerization of amino acids. A short chain of amino acids linked by peptide bonds and having a defined sequence is a peptide; longer peptides are referred to as polypeptides. Peptides generally contain fewer than 2030 AAs, whereas polypeptides contain as many as 4000 AAs. We ll use the term protein for a polypeptide (or a complex of polypeptides) that has a 3D structure. 16-1

16.2 Protein structure The AA composition of a protein uniquely determines (for given environmental conditions) the structure of the protein (e.g., two proteins with the same AA sequence will have the same structure for the same conditions). There are four hierarchical levels of organization which are used to describe the structure of proteins: The primary structure of a protein is simply the linear sequence of AA residues that constitute the polypeptide chain (e.g: M ACILV GT); Secondary structure refers to the organization of parts of a polypeptide chain, which can assume several different spatial arrangements. Without any stabilizing interactions, a polypeptide assumes a randomcoil structure. However, when stabilizing hydrogen bonds form between certain residues, the backbone folds periodically into one of two geometric arrangements: either an α helix, which is a spiral, rodlike structure (see Fig. 16.2), or a β sheet (Fig. 16.3), which is a planar structure composed of alignments of two or more short, fully extended segments of the backbone. Finally, U-shaped four-residue segments stabilized by hydrogen bonds between their arms are called turns. They are located at the surfaces of proteins and redirect the polypeptide chain toward the interior. Figure 16.2. α-helix secondary structure of a polypeptide chain: the polypeptide backbone is folded into a spiral that is held in place by hydrogen bonds (black dots) between backbone oxygen atoms and hydrogen atoms. Note that all the hydrogen bonds have the same polarity. The outer surface of the helix is covered by the side-chain R groups (figures reproduced from [1]) Tertiary structure, the next-higher level of structure, refers to the overall conformation of a polypeptide chain, that is, the three-dimensional arrangement of all the amino acids residues. In contrast to secondary structure, which is stabilized by hydrogen bonds, tertiary structure is stabilized by hydrophobic interactions between the nonpolar side chains and, in some proteins, by disulfide bonds. These stabilizing forces hold the helices, strands, turns, and random coils in a compact internal scaffold. Thus, a proteins size and shape is dependent not only on its sequence but also on the number, 16-2

Figure 16.3. β-sheet secondary structure of a polypeptide chain: (a) A simple two-stranded β-sheet with antiparallel β- strands. A sheet is stabilized by hydrogen bonds (black dots) between the β-strands. The planarity of the peptide bond forces a β-sheet to be pleated; hence, this structure is also called a β-pleated sheet, or simply a pleated sheet. (b) Side view of a β-sheet showing how the R groups protrude above and below the plane of the sheet. (c) Model of binding site in class I MHC (major histocompatibility complex) molecules, which are involved in graft rejection. (figures reproduced from [1]) size, and arrangement of its secondary structures. For proteins that consist of a single polypeptide chain, monomeric proteins, tertiary structure is the highest level of organization. Quaternary structure describes the number (stoichiometry) and relative positions of the subunits in a multimeric protein. Hemagglutinin is a trimer of three identical subunits; other multimeric proteins can be composed of any number of identical or different subunits. In a fashion similar to the hierarchy of structures that make up a protein, proteins themselves are part of a hierarchy of cellular structures. Proteins can associate into larger structures termed macromolecular assemblies. Examples of such macromolecular assemblies include the protein coat of a virus, a bundle of actin filaments, the nuclear pore complex, and other large submicroscopic objects. Macromolecular assemblies in turn combine with other cell biopolymers like lipids, carbohydrates, and nucleic acids to form complex cell organelles. 16.3 Structural Classification of Protein (SCOP) Nearly all proteins have structural similarities with other proteins and, in some of these cases, share a common evolutionary origin. The SCOP database [2], created by manual inspection and abetted by a battery of automated methods, aims to provide a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known. As such, it provides a broad 16-3

survey of all known protein folds, detailed information about the close relatives of any particular protein, and a framework for future research and classification. Proteins are classified to reflect both structural and evolutionary relatedness. The different major levels in the classification hierarchy are: Family (Clear evolutionarily relationship): Proteins clustered together into families are clearly evolutionarily related. Generally, this means that pairwise residue identities between the proteins are 30% and greater. However, in some cases similar functions and structures provide definitive evidence of common descent in the absense of high sequence identity; for example, many globins form a family though some members have sequence identities of only 15%; Superfamily (Probable common evolutionary origin): Proteins that have low sequence identities, but whose structural and functional features suggest that a common evolutionary origin is probable are placed together in superfamilies. For example, actin, the ATPase domain of the heat shock protein, and hexakinase together form a superfamily; Fold (Major structural similarity): Proteins are defined as having a common fold if they have the same major secondary structures in the same arrangement and with the same topological connections. Different proteins with the same fold often have peripheral elements of secondary structure and turn regions that differ in size and conformation. In some cases, these differing peripheral regions may comprise half the structure. Proteins placed together in the same fold category may not have a common evolutionary origin: the structural similarities could arise just from the physics and chemistry of proteins favoring certain packing arrangements and chain topologies. 16-4

Bibliography [1] Lodish H, Berk A, Zipursky LS, Matsudaira P, Baltimore D and Darnell J (2000). Molecular Cell Biology, Fourth Edition. W. H. Freeman. [2] Murzin AG, Brenner SE, Hubbard T, Chothia C (1995). SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536-540. 5