Protein primary structure: Amino acids

Similar documents
IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

Amino Acids, Peptides, Proteins

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.

BOC334 (Proteomics) Practical 1. Calculating the charge of proteins

Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)

H H N - C - C 2 R. Three possible forms (not counting R group) depending on ph

Shu-Ping Lin, Ph.D.

Amino Acids and Proteins

Ionization of amino acids

The Organic Chemistry of Amino Acids, Peptides, and Proteins

Paper: 6 Chemistry University I Chemistry: Models Page: 2 of Which of the following weak acids would make the best buffer at ph = 5.0?

Structure and properties of proteins. Vladimíra Kvasnicová

A. A peptide with 12 amino acids has the following amino acid composition: 2 Met, 1 Tyr, 1 Trp, 2 Glu, 1 Lys, 1 Arg, 1 Thr, 1 Asn, 1 Ile, 1 Cys

Previously published in Biophysical Society On-line Textbook PROTEINS CHAPTER 1. PROTEIN STRUCTURE. Section 1. Primary structure, secondary motifs,

The peptide bond is rigid and planar

Built from 20 kinds of amino acids

Ch18_PT MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Structure of proteins

Biochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II

Introduction to Chemical Biology

Amino Acids, Proteins, and Enzymes. Primary and Secondary Structure Tertiary and Quaternary Structure Protein Hydrolysis and Denaturation

Peptide Bond Amino acids are linked together by peptide bonds to form polypepetide chain.

18.2 Protein Structure and Function: An Overview

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

The peptide bond Peptides and proteins are linear polymers of amino acids. The amino acids are

Combinatorial Biochemistry and Phage Display

Chapter 26 Biomolecules: Amino Acids, Peptides, and Proteins

In addition to being shorter than a single bond, the double bonds in ethylene don t twist the way single bonds do. In other words, the other atoms

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?

CHAPTER 29 AMINO ACIDS, POLYPEPTIDES, AND PROTEINS SOLUTIONS TO REVIEW QUESTIONS

AMINO ACIDS & PEPTIDE BONDS STRUCTURE, CLASSIFICATION & METABOLISM

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

Chapter 16 Amino Acids, Proteins, and Enzymes

Preliminary MFM Quiz

Hydrogen Bonds The electrostatic nature of hydrogen bonds

CSC 2427: Algorithms for Molecular Biology Spring Lecture 16 March 10

INTRODUCTION TO PROTEIN STRUCTURE

Peptide Bonds: Structure

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water

Concluding lesson. Student manual. What kind of protein are you? (Basic)

MCAT Organic Chemistry - Problem Drill 23: Amino Acids, Peptides and Proteins

CHAPTER 15: ANSWERS TO SELECTED PROBLEMS

Chemistry 110. Bettelheim, Brown, Campbell & Farrell. Introduction to General, Organic and Biochemistry Chapter 22 Proteins

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are

Chapter 12 - Proteins

PROTEINS STRUCTURE AND FUNCTION (DR. TRAISH)

Amino Acids as Acids, Bases and Buffers:

Conformational Properties of Polypeptide Chains

Chapter 3: Biological Molecules. 1. Carbohydrates 2. Lipids 3. Proteins 4. Nucleic Acids

Lecture 15: Enzymes & Kinetics Mechanisms

Helices From Readily in Biological Structures

Acidic amino acids: Those whose side chains can carry a negative charge at certain ph values. Typically aspartic acid, glutamic acid.

From Sequence to Structure

THE CHEMICAL SYNTHESIS OF PEPTIDES

Exam 4 Outline CH 105 Spring 2012

Proteins the primary biological macromolecules of living organisms

Myoglobin and Hemoglobin

Lecture Conformation of proteins Conformation of a protein three-dimensional structure native state. native condition

Lecture 4: Peptides and Protein Primary Structure [PDF] Key Concepts. Objectives See also posted Peptide/pH/Ionization practice problems.

Invariant residue-a residue that is always conserved. It is assumed that these residues are essential to the structure or function of the protein.

Application Note. Determination of 17 AQC derivatized Amino acids in baby food samples. Summary. Introduction. Category Bio science, food Matrix

Carbohydrates, proteins and lipids

Section Activity #1: Fill out the following table for biology s most common elements assuming that each atom is neutrally charged.

Molecular Models in Biology

Role of Hydrogen Bonding on Protein Secondary Structure Introduction

Chapter 2: Biochemistry Problems

Problem Set 1 KEY

Chapter 3 Molecules of Cells

Previous lecture: Today:

Covalent bonds are the strongest chemical bonds contributing to the protein structure A peptide bond is formed between with of the following?

Structures of Proteins. Primary structure - amino acid sequence

4. Which carbohydrate would you find as part of a molecule of RNA? a. Galactose b. Deoxyribose c. Ribose d. Glucose

Molecular Facts and Figures

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

Recognizing Organic Molecules: Carbohydrates, Lipids and Proteins

Disaccharides consist of two monosaccharide monomers covalently linked by a glycosidic bond. They function in sugar transport.

Protein Structure and Function

Guidelines for Writing a Scientific Paper

I N V E S T I C E D O R O Z V O J E V Z D Ě L Á V Á N Í

Disulfide Bonds at the Hair Salon

Survival Organic Chemistry Part I: Molecular Models

Elements in Biological Molecules

Amino Acid Properties and Consequences of Substitutions

The Molecules of Cells

Application Note. Determination of Amino acids by UHPLC with automated OPA- Derivatization by the Autosampler. Summary. Fig. 1.

Chemical Basis of Life Module A Anchor 2

PROTEINS THE PEPTIDE BOND. The peptide bond, shown above enclosed in the blue curves, generates the basic structural unit for proteins.

PRACTICE TEST QUESTIONS

The chemistry of insulin

Refinement of a pdb-structure and Convert

Introduction to the Protein Folding Problem

Chapter 5 Classification of Organic Compounds by Solubility

Biochemistry 462a Hemoglobin Structure and Function Reading - Chapter 7 Practice problems - Chapter 7: 1-6; Proteins extra problems

Chemical Bonds and Groups - Part 1

Nafith Abu Tarboush DDS, MSc, PhD

Chapter 2 Polar Covalent Bonds; Acids and Bases

Transcription:

Protein primary structure: Amino acids I. Diversity of proteins Numerous biological functions are performed by proteins. These include oxygen transport and storage (hemoglobin and myoglobin, respectively), muscle functioning (titin), growth of bones (collagen), cell adhesion (fibronectin) etc. The diversity in functions suggests a wide diversity in protein structures. Indeed, proteins show remarkable variation in size and shape. The largest protein, titin, consists of about 30,000 amino acids and includes approximately 300 domains. Collagen is composed of three individual chains, which are wound in a triple helix elongated structure (Fig. 1). One of the smallest proteins is WW domain (FBP28, shown in Fig. 1), which contains from 35 to 40 amino acids and is stable without disulfide bonds. The WW domains adopt a β-sheet structure in their native state, which includes only three β-strands. Fig. 1 Native conformations of triple helix collagen (PDB access code 1cag, upper panel) and three β-strand FBP28 WW domain (PDB access code 1eol, lower panel).

The diversity of proteins is also reflected in the existence of three classes of proteins, namely globular, membrane and fibrous proteins. Collagen and WW domains belong to fibrous and globular proteins, respectively. II. Water structure Water is an excellent solvent and plays a critical role in determining the structure and stability of proteins. Despite the simplicity of its molecular structure, water shows very unusual properties. For example, water expands upon freezing transition and, in fact, expands even in the liquid form when temperature is reduced from 4 C to 0 C. Water also has an unusually large heat capacity (specific heat). The structure of liquid water is mainly determined by the formation of hydrogen bonds (HB, for definition see below). Each water molecule forms four HBs, two of which are formed by an oxygen atom and another two - by two hydrogen atoms (Fig. 2). Furthermore, because the threedimensional distribution of HBs is far from being planar, water molecules are arranged in molecular tetrahedrons. Fig. 2 Schematic representation of liquid water structure. Each water molecule is engaged in four HBs with four other waters shown by blue dotted lines. Oxygen and hydrogen atoms are displayed in pink and yellow, respectively. Hydrogen bonding D-H A is a weak electrostatic interactions resulting from sharing of a hydrogen atom H between the donor atom D with the acceptor atom A. The hydrogen atom H and the donor atom A are covalently linked. Both donor and acceptor atoms carry partial negative charges. In proteins, a classical intraprotein HB is formed between backbone amide and carbonyl groups. In this case, D and A are the backbone nitrogen and carbonyl oxygen atoms. The formation of HB can be observed by using radial density distribution functions g(r) for the atom pairs H and A or D and A. Specifically, in the case of H..A pair of atoms g(r) gives the density of the atoms A at the distance r from the hydrogen H. Consider now the structure of water around protein backbone. In this example, H is amide hydrogen and O is water oxygen (Fig. 3). A well defined peak of

g(r) indicates a local build-up of water density at the distance of about 1.8Å, which is associated with the formation of hydration shell (out of water molecules) around protein backbone. The maximum in Fig. 3 is due to the formation of HB between water oxygens and protein amide hydrogens. For the D and A pair of atoms the maximum in g(r) function is reached at about 2.9 Å. The protein-water HBs are also formed between water hydrogens and backbone carbonyl oxygens as well as between water molecules and protein side chains. Protein-protein HBs also include those formed between side chains, between side chains and backbone, and also weak and rare HBs involving C α -carbons as a donor atom (of the C α -H O type). There is no universal definition of a HB, although two empiric definitions, geometric and energetic, are commonly used. The geometric definition states that a HB is formed, if the distance r DA 3.5Å and the angle D-H A is greater than 120 (note that other numerical values for r DA and DHA angle are sometimes used). The energetic definition applied for intraprotein backbone HBs relies on the computation of electrostatic energy E hb due to the interactions between hydrogen H and nitrogen N of the amide group and oxygen O and carbon C atoms of the carbonyl group. This definition was first proposed by Kabsch and Sanders (Biopolymers 22, 2577 (1983)) and is used in DSSP database. The HB is formed, if E hb -0.5 kcal/mol, where E hb 1 1 1 1 = 332q + 1 q2 (1) rno rhc rho rnc In Eq. 1 r are the distances between respective atoms, q 1 =0.2 and q 2 =0.42 are the partial charges on (hydrogen, nitrogen) and (carbon, oxygen) pairs, respectively, and 332 is a factor expressed in kcal mol -1 Å. Partial charges are expressed in the units of e, which is an absolute value of electron charge. The typical energy of a HB is several kcal/mol. Fig. 3 Radial density distribution function g(r) for backbone amide hydrogen H and water oxygen O. The distance r represents the separation between H and O.

III. Protein amino acids The sequence of amino acids forms a primary protein structure. Each amino acid contains central C α -carbon, hydrogen atom H, protonated NH + 3, dissociated COO - group, and side chain R (Fig. 4). This form of amino acid is typical at normal ph. These atomic groups form tetrahedral structure, which leads to two isomer (mirror image) forms (left-handed L-isomer and right-handed D-isomer, see Fig. 5 for the example of alanine isomers). Interestingly, only L-isomers are found in wild-type proteins. H NH + 3 C α COO - R i Fig. 4. Structure of amino acid. Fig. 5 Spatial tetrahedral structures of L- (left) and D- (right) alanines. Nitrogen and oxygen atoms are shown in blue and red, respectively. Wild-type proteins, which are synthesized on ribosomes, utilize 20 amino acids. The generic structure of amino acid incorporated in a protein sequence is shown in Fig. 6. H N C α C H R i O n Fig. 6 Structure of generic amino acid i in a polypeptide sequence of n residues.

In general, amino acids can be divided into hydrophobic, polar (or hydrophilic) and charged residues. Below we consider the structures and properties of individual amino acids. Aliphatic amino acids: These amino acids include glycine, alanine, valine, leucine, and isoleucine. The aliphatic residues are hydrophobic (structures of amino acids below are arranged in the ascending order of their hydrophobicity), have open, sometimes branched side chains. Aliphatic amino acids do not usually form HB and are neutral and not polar. The smallest is Gly, which has no side chain and demonstrates the largest backbone flexibility. As the length of side chain increases, so does the number of available side chain conformations (the rotamer library, see below). As the side chains become bulkier, the flexibility of the backbone decreases. Glycine (Gly, G) Alanine (Ala, A) Valine (Val, V) Leucine (Leu, L) Isoleucine (Ile, I) Aliphatic hydroxyl amino acids: Serine and threonine belong to this group and contain hydroxyl group OH in their side chains. As a result these amino acids are polar, capable of forming HBs with water and, consequently, are hydrophilic. The side chains of these amino acids are neutral. Serine (Ser, S) Threonine (Thr, T) Acidic amino acids and amide derivatives: Asparagine, glutamine, aspartic acid, and glutamic acid are included in this group. Asparagine and glutamine contain amide groups in their side chains and are uncharged. Aspartic and glutamic acids contain deprotonated hydroxyl groups and are negatively charged at ph values above approximately 4. All these four residues are considered polar (and hydrophilic) and capable of forming HBs. Note also that parts of Asp and Glu side chains are hydrophobic.

Aspartic acid (Asp, D) Glutamic acid (Glu, E) Asparagine (Asn, N) Glutamine (Gln, Q) Basic amino acids: The amino acids of basic group are lysine, arginine, and histidine. Lys and Arg contain basic amino groups and are positively charged (protonated) at normal ph. Their pk values (i.e., the values of ph, at which their side chains become neutral) are in the range of 10 to 12. His side chain has the pk value of 6.5, therefore, its charge state depends sensitively on the specific cellular environment. Basic amino acids are highly polar and can participate in hydrogen bonding. Histidine (His, H) Lysine (Lys, K) Arginine (Arg, R) Aromatic amino acids: Phenylalanine, tyrosine, tryptophan amino acids belong to this group. Because the side chains contain aromatic rings, these amino acids are generally hydrophobic, although the degree of their hydrophobicity varies. Phe is the most hydrophobic, while Tyr and Trp are mildly hydrophobic because of partially polar properties of their side chains. These amino acids are uncharged at normal ph. Tyr and Trp, but not Phe, may form HBs

Phenylalanine (Phe, F) Tyrosine (Tyr, Y) Tryptophan (Trp, W) Sulfur containing amino acids: Two amino acids, metheonine and cysteine, contain sulfur atoms in the side chains. Met and Cys are hydrophobic and uncharged. As a result of oxidation of SH groups Cys residues can form disulfide bonds, the strength of which are comparable with covalent ones. Their role in folding of BPTI protein was discussed previously. Methionine (Met, M) Cysteine (Cys, C) Cyclic amino acids: Proline is the only cyclic amino acid, in which the side chain makes a covalent bond with the amide backbone group. Generally, Pro has aliphatic properties, but because of cyclic side chain its conformational flexibility is highly limited. Proline (Pro, P) The frequency of occurrence of amino acids in globular proteins may be evaluated by examining a non-homologous (a maximum sequence similarity of 40%) set of PDB proteins (Journal of Molecular Biology 273, 349 (1997)). In all, the PDB40 dataset contains 971 proteins. The results shown in Table 1 suggests that, in general, frequency of amino acid occurrence ranges from 1 to about 9 %. Two hydrophobic amino acids,

Leu and Ala, have the highest frequency of occurrence (>8%), whereas sulfur containing residues, Cys and Met, aromatic amino acid Trp and His appear relatively rare (<3%). It should be noted that the frequencies of amino acids for membrane proteins are skewed towards hydrophobic residues, because these proteins are embedded in lipid bilayers. Similarly, few amino acids, such as Gly or Pro, are greatly overrepresented in fibrous proteins. Table 1. Frequencies (in %) of amino acids in PDB40 protein dataset Alanine 8.44 Arginine 4.72 Asparagine 5.98 Aspartic acid 4.68 Cysteine 1.69 Glutamine 3.70 Glutamic acid 6.16 Glycine 7.95 Histidine 2.19 Isoleucine 5.49 Leucine 8.28 Lysine 5.80 Methionine 2.18 Phenylalanine 4.03 Proline 4.61 Serine 6.08 Threonine 5.87 Tryptophan 1.47 Tyrosine 3.61 Valine 6.94 IV. Conformational flexibility in proteins The flexibility of protein backbone is determined by dihedral angles. Fig. 7 introduces one of the possible definitions of the dihedral angle. Consider four backbone atoms i,i+1,i+2, and i+3 with the radius vectors r i, r i+ 1, r i+ 2, r i+ 3. The bond vectors are r r r r r r r r r i, i + 1 = i + 1 i, i+ 1, i+ 2 = i+ 2 i+ 1, i+ 2, i+ 3 = i+ 3 i 2. Let us define the vector normal to r r r the plane (i,i+1,i+2) as a vector product n1 = i, i+ 1 i+ 1, i+ 2. Similarly, we define the r r r vector normal to (i+1,i+2,i+3) plane n2 = i + 1, i+ 2 i+ 2, i+ 3. The angle between the vectors n r 1 and n r 2 is called the dihedral angle φ, which may be computed 1 r r r r as φ = cos ( n 1n2 /( n1 n2 )). The sign of φ is determined by the sign of the scalar product n r r ). Negative sign of this scalar product corresponds to φ<0. The dihedral angle ( 1 r i+ 2, i + 3

r r r r 1 may be also defined as = cos ( t t /( t t ) φ 1 2 1 2, where t r 1 and t r 2 are the vectors normal to the line (i+1,i+2) and lying in the planes (i,i+1,i+2) and (i+1,i+2,i+3), r r respectively. In r sign φ ) sign ( t t. The trans ( = 1 2 ) r i + 2, i+ 3 this case the sign of φ is determined as ( ) conformation of four atoms corresponds to φ=180, cis state is represented by φ=0, and gauch states are φ=±60. Two dihedral angles φ and ψ defined for the protein backbone (see Lecture 3, Fig. 1) determine completely the conformational flexibility of the backbone. The third dihedral angle ω defined by the four atoms C αi,c i,n i,c α,i+1 of the residues i and i+1 is frozen in the trans state due to the double bond character of the peptide bond. This arrangement sets C α -carbons at the maximum spatial separation. Steric clashes between neighboring side chains severely restrict the values of accessible φ and ψ as it is seen from Ramachandran plots. t 1 i n 1 i+1 n 2 t 2 i+2 i+3 Fig. 7 Backbone vectors used in the definition of dihedral angle φ. In a similar way, the dihedral angles may be defined for the long side chains. The side chain dihedral angles χ i are allocated as C α CH 2 CH 2 CH 2 χ 1 χ 2 χ 3