Chapter 3 Amino Acids, Peptides, Proteins



Similar documents
Amino Acids, Peptides, Proteins

Ionization of amino acids

H H N - C - C 2 R. Three possible forms (not counting R group) depending on ph

The Organic Chemistry of Amino Acids, Peptides, and Proteins

Amino Acids and Proteins

Separation of Amino Acids by Paper Chromatography

PROTEIN SEQUENCING. First Sequence

Biochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.

--not necessarily a protein! (all proteins are polypeptides, but the converse is not true)

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?

Ch18_PT MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins

Lecture 15: Enzymes & Kinetics Mechanisms

Previous lecture: Today:

Biological Molecules

Carbohydrates, proteins and lipids

Chapter 3. Protein Structure and Function

Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)

Amino Acids as Acids, Bases and Buffers:

Lecture Conformation of proteins Conformation of a protein three-dimensional structure native state. native condition

6 Characterization of Casein and Bovine Serum Albumin

Chapter 26 Biomolecules: Amino Acids, Peptides, and Proteins

8/20/2012 H C OH H R. Proteins

18.2 Protein Structure and Function: An Overview

CHEMISTRY STANDARDS BASED RUBRIC ATOMIC STRUCTURE AND BONDING

Chemical Basis of Life Module A Anchor 2

A. A peptide with 12 amino acids has the following amino acid composition: 2 Met, 1 Tyr, 1 Trp, 2 Glu, 1 Lys, 1 Arg, 1 Thr, 1 Asn, 1 Ile, 1 Cys

Preliminary MFM Quiz

Chemistry 20 Chapters 15 Enzymes

Chapter 2 Polar Covalent Bonds; Acids and Bases

Nuclear Structure. particle relative charge relative mass proton +1 1 atomic mass unit neutron 0 1 atomic mass unit electron -1 negligible mass


Guide to Reverse Phase SpinColumns Chromatography for Sample Prep

Transmembrane proteins span the bilayer. α-helix transmembrane domain. Multiple transmembrane helices in one polypeptide

Chem101: General Chemistry Lecture 9 Acids and Bases

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

NO CALCULATORS OR CELL PHONES ALLOWED

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

Proteins and Nucleic Acids

Marmara Üniversitesi Fen-Edebiyat Fakültesi Kimya Bölümü / Biyokimya Anabilim Dalı

Structure and properties of proteins. Vladimíra Kvasnicová

Acids and Bases. but we will use the term Lewis acid to denote only those acids to which a bond can be made without breaking another bond

ph: Measurement and Uses

CHAPTER 4: Enzyme Structure ENZYMES

Chemistry 201. Practical aspects of buffers. NC State University. Lecture 15

Combinatorial Biochemistry and Phage Display

Problem Set 1 KEY

Paper: 6 Chemistry University I Chemistry: Models Page: 2 of Which of the following weak acids would make the best buffer at ph = 5.0?

Chapter 2: The Chemical Context of Life

Lecture 11 Enzymes: Kinetics

Shu-Ping Lin, Ph.D.

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Organic Molecules of Life - Exercise 2

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

Peptides: Synthesis and Biological Interest

Keystone Review Practice Test Module A Cells and Cell Processes. 1. Which characteristic is shared by all prokaryotes and eukaryotes?

Worksheet Chapter 13: Human biochemistry glossary

Exam 4 Outline CH 105 Spring 2012

Mass Spec - Fragmentation

PROTEINS STRUCTURE AND FUNCTION (DR. TRAISH)

Molecular Spectroscopy

serum protein and A/ G ratio

4. Which carbohydrate would you find as part of a molecule of RNA? a. Galactose b. Deoxyribose c. Ribose d. Glucose

Electrospray Ion Trap Mass Spectrometry. Introduction

Chemistry B11 Chapter 6 Solutions and Colloids

1. The diagram below represents a biological process

Covalent bonds are the strongest chemical bonds contributing to the protein structure A peptide bond is formed between with of the following?

Chapter 2. The Chemistry of Life Worksheets

How To Understand The Chemistry Of Organic Molecules

Chapter 3: Biological Molecules. 1. Carbohydrates 2. Lipids 3. Proteins 4. Nucleic Acids

Acids and Bases: A Brief Review

Prentice Hall. Chemistry (Wilbraham) 2008, National Student Edition - South Carolina Teacher s Edition. High School. High School

I N V E S T I C E D O R O Z V O J E V Z D Ě L Á V Á N Í

Introduction, Noncovalent Bonds, and Properties of Water

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water

Chapter 17. How are acids different from bases? Acid Physical properties. Base. Explaining the difference in properties of acids and bases

Chapter 3 Molecules of Cells

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

Marmara Üniversitesi Fen-Edebiyat Fakültesi Kimya Bölümü / Biyokimya Anabilim Dalı PURIFICATION AND CHARACTERIZATION OF PROTEINS

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

Chem 405 Biochemistry Lab I Experiment 2 Quantitation of an unknown protein solution.

Name: Hour: Elements & Macromolecules in Organisms

Chapter 5 Classification of Organic Compounds by Solubility

Guidance for Industry

EXPERIMENT 5: DIPEPTIDE RESEARCH PROJECT

The Molecules of Cells

Disaccharides consist of two monosaccharide monomers covalently linked by a glycosidic bond. They function in sugar transport.

Chapter 16 Amino Acids, Proteins, and Enzymes

Chapter 8: Energy and Metabolism

1) Technical informations. - a) How does it work? - b) Purification - c) Quality Control. 2) Standard synthesis

What does pka tell you?

Amino Acids, Peptides, and Proteins

ACID-BASE TITRATIONS: DETERMINATION OF CARBONATE BY TITRATION WITH HYDROCHLORIC ACID BACKGROUND

ph. Weak acids. A. Introduction

Molecular Cell Biology. Prof. D. Karunagaran. Department of Biotechnology. Indian Institute of Technology Madras

The chemistry of insulin

RNA) = 1 1 = 1 EU

Molecular Models in Biology

Chapter 13 EDTA titrations

Transcription:

Chapter 3 Amino Acids, Peptides, Proteins Also start memorizing AA: names, structures, abbreviations, 1 letter codes, pka s of side chains 3.1 Amino acids Amino acid is monomer condensed into peptide (small) protein (large) Both can be degraded back to AA s by hydrolysis (At neutral ph hydrolysis is slow but faster in acid or base Large bulk of proteins broken down to 20 different amino acids A. Amino acids share Common structural features. i. C atom center, Amino group on one side carboxylic acid on the other, and an R group that can be H or more complicated will explore in a minute ii. 20 common AA called standard AA s other AA s derived from these AA s usually by post-translational modification (ie AA was changed after it was made into a protein) iii. All 20 AA s have trivial names All have a 3 letter code All have a 1 letter code Key convention paragraph in text may help you to remember the 1 letter code I expect you to know codes and structures for all AA s Naming substituents on AA s is not standard organic chem base C is called alpha C then out from there is beta, gamma, delta, epsilon iv. All AA s expect Glycine are chiral All AA s expect Glycine have 4 different constituents around Cá. Why is this important? Chiral center non-superimposable mirror images (Figure 3-3) enantiomers optically active (Will rotate plane polarized light and display differential absorbtion of circularly polarized light) Won t worry about designating individual C (á,â...) Or Ring C s What did you learn in organic about nomenclature in this system?

2 a. L-D (oldest) dates back to 1851 Emil Fisher relate structure to L or D glyceraldehyde Figure 3-4 Under this nomenclature AA s are L. All prepared by living things are L. Ingest D and will die of starvation. b. based on how rotate plane polarized light + or - AA s can be either c. R or S AA s can be either. If good at this give a few a try, not used much in Biochem so won t go over B. Classification by R groups Figure 3-5 Have looked at common part, now lets look at different part, the R group What are useful chemical properties Charge +/- Acid or base (tied to above base usually +. Acid usually -) polarity Table 3-1 Be able to look at an amino acid (or your memorized Figure 3-5 structure) and place into 1 of 5 groups non-polar aliphatic G, A, P, V, L, I, M non-polar aromatic F, Y, W Polar uncharged S, T, C, N, Q + charged (Base) R, K, H - charged (acid) D, E Note: some of these classifications are not clear cut. Tyrosine is somewhat polar Also note: should remember roughly the pka s of sidechains Backbone COOH is usually about 2 and NH is about 9.6 Special notes on special groups i. Aromatics Give protein absorbance in 280 nm range (Figure 3-6) ii. cysteine may occur as free SH, or may be reduced to form

3 cystine (Figure 3-7) iii. His with pka of about 7, is use often as a catalyst because can act as either proton donor or acceptor at neutral ph C. Uncommon Amino Acids also have Important Functions Other AA s observed in nature Figure 3-8 i. 4-hydroyproline (found in plant cell walls and collagen) ii. 5 hydrolysine (found in collagen) iii. 6-N-methyl lysine Found in myosin, a muscle protein iv. ã carboxy glutamate (found in blood clotting proteins All of above are made by incorporating base AA into protein, then modifying after protein is made (post-translational modification) Ornithine and citruline are intermediated in amino acid synthesis, but are not incorporated into proteins Selenocysteine (looks like cysteine with a seleneium instead of a sulfur) Synthesized from serine Incorporated into a very very few proteins in specific organisms A rare bird D. Amino acids act as acids and bases Since have common form of NH and COOH have both acid and base functions called zwitterions when have both + and negative form also called amphoteric Look at titration curve of an AA without a charged R groups Figure 3-10 COOH pka? (2) NH 3 (9.6) Is this normal for an organic COOH and NH 3? Figure 3-11 NO COOH is lower than expected. Usually more like 4 (acetic acid is 4.8) NH 3 is also lower than expected. methylamine 10.6 Can you explain this? A lower pk a means is more or less acidic? More acidic More acidic means is gets rid of H more easily Why can get rid of H more easily? Let s think about the fully protonated form H A + 2

4 + Has charge on NH 3+, wants to get rid of H from COOH to - become negative (COO ) and then no net charge on the molecule. Another way to think is that charge-charge interaction stabilizes the zwitterion so is occurs more easily - Use a different logic on NH 2. Base is close to COO. These - electronegative atoms (O s of COO ) pull electrons toward + them. NH 3 is more easily deprotonated (more acidic) because these electronegative atoms help to delocalize the electron that remains on the NH 2 as it deprotonates, the + NH group is more acidic 3. E. Titration Curves Notes don t follow text Construct a curve like figure 3-10 at what ph has a negative charge? At what ph has a positive charge? At what ph is neutral called isoelectric point pi formulas from Analytical? (pk + pk )/2 1 2 Now how about a more complicated AA, say Glutamate or His Figure 3-12 Amino Acids with >2 ionizable groups What were basic AA s (Lys, Arg, His) What were acidic AA (Asp, Glu) What do titration curves of these AA s look like Examples: Glu and His Figure 3-12 Glu pka s 2.19, 4.25, 9.67 His pka s 1.82, 6.00, 9.17 What are structures at these 2,5,7,9,10 etc Where is pi? What is charge at neutral ph? Note - His is special because is only AA in the bunch with pka close to neutral so depending on situation can be in either ionized or unionized

5 Before we leave lets look at either Tyr (2.2, 9.11, 10.07) or Cys (1.96, 10.28, 8.18) Both have 3 ionizable groups - why not acid or base? Both acids but extremely weak, so at neutral ph uncharged, so usually don t see much. However when place near another group get very interesting results 3.2 Peptides & proteins Now start building monomer into polymer A. Peptides are chains of amino acids Condense 2 AA s via a peptide bond kicking out water (like figure 3-13 on board) The resulting peptide bond has special properties will use in next chapter. Lets see a hint Is acid or base? (Amide bond - neither) Polar or nonpolar? (Polar and slightly resonant so even more polar) Can make hydrogen bonds? (Yes to both sides) Note equilibrium is toward AA s not peptide peptides/proteins breaking down need to input E to make bond Will see is done biologically next semester, chemicals later this chapter Actually kinetically slow process, at ph 7 t 1/2 about 7 years Acid and base catalyzed so at high or low ph much faster Peptide kind of a generic term for several AA s hooked together oligopeptide a few AA s (<30?) polypeptide many AA s (30-100?) Proteins lots of AA s (>=100) In biological systems is always linear. Not branched. Note: order is important. Who is at which end AA with free á amino group in main chain is call the amino-terminal or N-Terminal AA with free carboxylic acid in main chain is called the carboxyl terminal or C-terminal

B. Can distinguish peptides by ionization behavior Peptides will have ionizable groups at N and C terminus, Plus any ionizable groups from side chains. Thus can do for peptides what did for amino acids, predict charge at any ph and predict pi. Can use this to find methods to separate different peptides Won t correspond exactly to pka given in 5-1 because environment changes pka s. For instance will Terminal COOH and NH 3 still have pka s of 2 and 9? No - no longer near charged groups so no inductive effect, and will be more normal with 4 and 10 6 C. Biologically active peptides come in many sizes and compositions from 2 to 1000's of residues Nutrasweet (Asp-Phe-methyl ester often have effect at very low concentrations many hormones are peptides Thyrotropin releasing factor (3) Oxytocin (9 AA) Bradykinin (9 AA) Glucagon(29) Cortiotropin(39) Insulin (51 AA 30+21) Proteins Cytochrome C about 100 residues Titin (from muscle) 27,000 AA - MW 3,000,000 Average MW of a residue roughly 110 Proteins can also be multi-subunit so can have several associated together! Peptide composition Each protein has a different # of AA s, different % composition of AA s, different sequence of AA s. All will have different physical properties Some AA s may be there others may not compare 2 proteins Table 3-3 D. Many protein will have other things besides amino acids may be covalently attached or non-covalently associated added group called prosthetic groups Proteins with groups called conjugated proteins Lipoproteins -lipids glycoproteins -carbohydate

7 phophoproteins -phosporus hemoprotein -heme metalloporteins -metals Apoprotein - Protein without prosthetic group 3.3 Working with Proteins How do we know what we know about proteins? Isolate and characterize in the lab a bit different than regular chemistry instead of synthesizing a new compound and isolating is from a few reactants or products, are trying to isolate a single protein out of an organism that contains thousands of proteins How do you do this? A. Separation and purification Step 1 Isolate tissue break open cell in some way lyse cell- grind cell- sonicate- etc resulting solution called a crude extract If possible, isolate a particular organelle of cell that contains POI (protein of interest) See discussion back in chapter 1 on how this is done Then begin to separate different proteins from each other i. And oldest method - selective precipitation usually adding salts like (NH 4) 2SO4 First try to get other proteins to ppt Filter and get rid of other proteins Now get yours to ppt Save filtrate, and dissolve in water and back in business This method usually very crude, but can get protein from 100' of L of cells pptd down to a few mls of protein paste so is useful Now what? dissolve back in water. Now have protein dissolved in water, but what is wrong? Have salt still in water as well. How to get rid of salt? Dialysis Place in membrane with pores lets salts out keeps proteins in.

8 May use dialysis often in purification procedure whenever want to get rid of small ions ii. Now need chromatography What is chromatography? Method where a mixture is applied to some medium, and as the mixture moves through the medium, different components in the mixture are retarded differently so materials begin to separate from each other. Any examples of chromatography from class?? (Bring in a demo??) Lots of chromatographic methods, biochemistry uses 3 major methods (Figure 3-17) Ion exchange Medium is a solid polymer polymer contains charged species Ions of opposite charge in medium are attracted to solid and stick with different affinity Least attracted come right through, most attracted may be permanently bond Your s comes off in between? If didn t come off Change ionic strength Change ph Size exclusion chromatography Polymer has pores of different sizes Small guys penetrate hole, takes longer to get through Large guys say outside and get through faster Nifty technique. Plot log MW vs elution can get MW! Affinity Covalently attach something to polymer that protein likes to grab All other proteins was through Your protein is stuck Add excess of ligand Flushes off your protein, and is now pure Any one method will not purify completely, so typically will use at least two or more of the above methods

Once pure, then go on and characterize the protein 9

B. Separation and Characterization via electrophoresis One technique that is used often in Biochem is electrophoresis Electrophoresis refers to the movement of charged species in an electric field -nice thing about this technique can be used both for characterization and separation at the same time so don t need pure sample before use for characterization -bad thing is that isn t a great technique for bulk separation, to hard to do gram quantities. How does it work Place material in electric field If NET + will move to - If NET - will move to + If no net charge? Won t move How get so no net charge? Hold that thought What will make move faster? Larger potential (increase V) More charge on molecule What will make move slower Large floppy molecule more drag Putting something in way, gel matrix Who has used in Cell biology or genetic, what was done, how did it work Now lets try on proteins If ph < pi, what is charge on protein? (+) If ph > PI, charge is -, move to + So can see will separate proteins based on how close or far ph is from pi What else? Size/structure So pretty complex for an characterization tool See figure 3-18b for electrophoresis of proteins as fractionated Can simplify with a couple special sub techniques SDS gel electrophoresis Add detergent Sodium dodecyl sulfate - + CH 3(CH 2) 11SO4 Na Hydrophobic binds to protein About 1 molecule SDS/2 AA 10

11 Protein gets denatured so any structure blown away Gets uniform coating of negative charge, so migrates in simple manner. In fact migration relative to log MW Figure 3-19 for SDS Gel So get measure of MW and purity in 1 easy shot! Isoelectric focusing Remember earlier thought If ph < ph + PH > pi - PH = pi doesn t move Make gel with ph gradient using mixture of organic acids and bases Figure 3-20 Apply to one end of gel let move until it stops. Now at ph measure ph of gel and have pi. Now you know why we calculated pi s earlier, it a value can find experimentally Can combine both in one Figure 3-21 This is a cutting edge technique in new field of proteomics Studying changes that occur in proteins can compare host of proteins form one person to next to see how are different, and apply to clinical situation C. Quantification of protein As you purify a protein you must be able to detect and quantify the protein in the presence of the other proteins If this is the first time the protein has been purified, you have no idea of its physical or chemical properties so you can t use those as possible markers For proteins that are enzymes you need to create an assay To create an assay you need to know: The chemical reaction the enzyme catalyzes A way to follow kinetics of product appearing or substrate disappearing If enzyme requires any cofactors Dependence of activity on substrate concentration Optimum ph T at which is stable and most active

12 By international agreement 1 unit of activity -Amount of enzyme required to Transform 1.0 ìmole of substrate / minute @ 25 o C Activity - Total units of enzyme in a solution Specific Activity - Number of units of enzyme per mg of protein Specific activity is a measure of enzyme purity Should increase during purification procedure Should maximize ant be constant when enzyme is pure Once have assay can now look at POI as purification occurs Table 3-5 Note difference between activity and specific activity Specific activity is activity/wt of protein. Should be at max when finished How do you know when purification is complete? Only 1 protein in SDS gel Specific activity is at max Proteins that are not enzymes are a bit problematic! 3.4 Covalent structure of Proteins As will see in next chapter that there are four hierarchies of structure in proteins o 1 structure - The connection between amino acids - Covalent Structure - Sequence and disulfides o 2 structure - local structure of amino acids and nearest neighbors o 3 structure - overall 3D folding o 4 structure - how peptides associate together in larger polypeptide structure This section deals with 1, will talk more about 2, 3, & 4 in next chapter o (1 structure) A. The function of a protein depends on its sequence Each type of a protein has a unique sequence the unique sequence makes it fold into unique 3D structure The 3D structure give the protein its unique function Change the sequence change the function Leads to genetic diseases Proteins with similar functions in different species can have similar sequences

13 Not entirely iron clad 20-30% or proteins in human are polymorphic That is - have small variations in sequence Even within protein Some regions have more variability than others B. Amino Acid Sequence have been determined for Millions of proteins 1953 read letter year for Biochem Watson and Crick deduced DNA double helix Sanger worked out sequence of insulin Dr. Z born Getting sequence of insulin important because first protein ever sequenced Most protein sequencing now done by DNA sequencing because is fast and easy There are also better current instrumental methods (Section E Mass Spec) But will spend a bit of time in this chapter outlining the older classical methods C. Classical Protein Sequencing Since I learned these classical techniques, I am going to include some additional older techniques in Green for completeness. I don t know If I will include this in the lecture or not. Once we have isolated a protein, and before we start to analyze the sequence, there are some things wee need that the book skips over. I. Determination of Molecular weight One of the first thing we want to now is the molecular weight. Because that will tell us roughly how many AA s we have. How do you get MW? Have seen SDS gel and gel permeation chromatography give you MW +/- 5% so not terribly accurate Can be further off if funky protein New technology way is via mass spec How does mass spec work Put material is gas phase (usually in vacuum) Put charge on material (various ways - let s not worry about) Now can use charge to accelerate material in electric field Once moving can detect mass by observing material How long does it take to travel

14 - in a magnetic field how quickly does it turn From these measurement can get Mass/charge ratio And from there get mass Methods establish for organic molecule for 20 years In past 10 years has been extended to large molecules like proteins II. Amino Acid Composition Next step is usually to get overall AA composition. Used to compare proteins with each other and can we used as a check when you have the final sequence to make sure it fits with known data. i. Hydrolyze protein in either strong acid or base (Boil in 6M HCL, 12, 24, 48hrs This knocks proteins into AA Separate AAs by chromatography and quantitate All has been mechanized onto machines call amino a acid analyzers Need a few mg of protein Some acids destroyed in base, others in acids, so usually need both acid and base hydrolysis can double check results against MW to be sure you have everything Note - this only tells you overall % composition, NOT sequence. For that you need to dig harder C. Sequence Determination i. (a start) Figure 3-25 Take protein and modify with FDNB 2,4-dinitrofluorobenzene also called Sanger reagent after inventor of process show structure on board. This chemical spontaneously reacts with all free amines on proteins. Where would you find free amines? N terminus and lys. Do AA analysis again. Will find modified lysines because they will run a bit differently, but not-aproblem, will find 1 AA in the whole bunch that runs differently because now has 2,4-DNB group attached, and that is your N-terminus so have identified AA 1 ii. (continued) Figure 3-27 Let s try to get more than one residue the chemical phenylisothiocyanate is another reagent that reacts with free amines This is called an Edman Degradation after Pehr Edman, inventor With this reagent, if you hit the protein with 6M HCL for just a minute (not the boiling for 24 hrs required for hydrolysis)- the reagent forms a phenylthiohydantoin derivative that pulls of the first AA and leaves the rest

15 of the protein unchanged run extract on HPLC, identify AA hit protein a another time to get next AA Again has been automated into a machine called a protein sequenator Use a few mg of protein and read off sequence Not quite that simple Reaction not 100% Say only 97% Lets say sequence is gly-pro-lys Reaction 1 get 97% of gly off end Reaction 2 get 3% of gly that missed first time, and.97(.97) or.94% fo pro off Reaction 3 get 6% of pro and now only.94x.97 or 91% of lys Can see getting less and less of correct AA and more and more of other AA s Usually can get to between 20 and 50 before gets so bad that can t follow any further. So can t do entire protein this way Note machine works fast, get this in about 24 hours Getting the complete sequence So need to cut protein into smaller pieces i. Remember that protein MAY contain a few disulfide Need to take care of these first Not removed by other procedure and may interfere Can remove disulfides by 2 methods (figure 3-28) Oxidation with performic acid Reduction with dithiothreitol Will air oxidize back together so need Additional acetylation step to prevent re-oxidation ii. Now can cut into peptides Use with chemical methods (CNBr) Or enzymes to cut into smaller pieces See table 3-6 for specificity Get peptides Sequence peptide iii. Note: Which one goes first, second, third?? Need second cleavage method (or more) So get overlaps and put overlaps together Have linear sequence

Summarized Figure 3-25 Still have to go back to disulfides Cleave it again Isolate peptides If get peptides tied together and have 2 N terminals then are tied by disulfides. Put with other information to get complete sequence E. Current Methods - Mass Spec I have outlined classical approach done biochemistry from 1950's thru 1980's Mass spectrometry has largely changed this Classical mass spec Get an ion in a vacuum moving in a magnetic field Magnetic field make ion move in a curved path From analysis of path can determine m/z (mass to charge) ratio If know change can then determine mass very accurately. Big problem for proteins can you see it? How do you get a protein with 100's to 1000's of amino acids into the gas phase? 2 ways MALDI - Matrix Assisted laser desorption Ionization ESI - Electrospray Ionization MALDI not well described in text Put protein solution on a light absorbing matrix Put in vacuum Hit with a laser that is at the same wavelength as matrix Pulse of laser light destroys matrix and protein is released into the vacuum. Matrix also ionizes protein Time how long it takes the protein ion to travel down a tube Analogy - how do you make an elephant fly? By blowing the ground out from underneath him! ESI Figure 3-30 Solution with protein is pumped through a tiny needle The needle is charged with high + voltage Outside of needle is a vacuum Sprays out fine mist of charged micro droplets Micro droplets are also + charged with protons 16

Solvent in droplets evaporates in vacuum Charge gets condensed onto protein And protein is in vacuum, so it is a gas phase ion usually with multiple charges and can do mass spec Get family of peaks each +1 in mass and charge (due to added proton) So can figure out the base peak Takes very little protein - as little as in a peak of a 2D gel We have such a machine at BHSU in Dr. D s Lab Sequencing with Mass spec Figure 3-31 Can sequence short peptides with tandem mass spec (also designated MS/MS) Cleave peptide into fragments as did with older techniques Shoot the fragments onto an HPLC As an individual fragment comes off the HPLC put it on an ESI mass spec As fragment peak comes of the mass spec, bombard it with a collision gas (usually Ar or Ne) This makes peptide break- usually at a peptide bond Now run it through a second mass spec and read off the size of all the fragments. From the size of the fragments and the known masses of the amino acids can figure out the sequence Note Val and and ILE have same mass so not perfect Comparison Standard chemical methods, slow, require comparatively large amounts Mass spec methods - fast - use trace amounts Ambiguity between Val and Ile Some peptide fragments don t work so can be gaps DNA methods Also fast, require no protein But DNA and then RNA may be processed so DNA sequence is not the same as the protein sequence! So may need all three to properly characterize a protein!! 17 F. Small Peptides and Proteins can be Chemically Synthesized As chemist ultimate test is synthesize o have done analysis, think you know 1 sequence. Best proof you are right is to synthesize from scratch. If what you make has identical properties to

18 what is found in nature, then you had it right. How do you synthesize a protein with a MW in the 1000's? Do standard chemical synthesis as in chem lab, get about 4 residues. Need something new and different Bruce Merrifield invented way to link AA to insoluble resin and cycle reactions to make efficient, simple and mechanized. Got Nobel prize. What I will show you is not his original chemistry, but a modern evolution using his principles Figure 3-32 Works well. Synthesizers cost 10-50K One problem, reaction not 100% So slowly errors accumulate See table 3-7 Practical limit about 100 AA if everything is optimal, but usually lower Need to do solution chemistry to link peptides together if you want a large protein Compare with biology On synthesizer can prepare 100AA protein in a few days and it is about 80% pure E coli, does in 5 seconds, 100% correct G. Amino Acid Sequences Provide Important Biochemical Information o Now have protein 1 structure. What good is it? Can provide insight into 3-D structure/function Cellular location Evolution Structures of 1000's of proteins available on web sequences many more sequence available Search sequence for similarities and relationships BIOINFORMATICS Structure/function Have been trying to predict structure from sequence for many years- still can t do it But if part of a family, (usually 25% sequence homology) Can look at family s structure and function

19 And make a good guess Sometime not entire protein, but a domain will work Also certain sequences Signals for location of protein in cell Tell where to attach prosthetic groups H. Protein sequences can elucidate History Bioinformatics - extracting information from sequence related proteins should have similar sequence Should be able to trace changes in sequence to evolution But not linear Some AA and protein critical so can change or change slowly Others are less critical so can change quickly So get different evolutionary rates Also complicated by gene transfer Where genes from one organism are are transferred to another rather than evolving This is how bacteria gain antibiotic resistance So usually focus on family of closely relate proteins Usually essential for metabolism of cell So little chance of transfer! For example EF-1á in eukariots or EF-Tu in bacteria a protein needed for protein synthesis (chapter 27) Homologous proteins or homologs Homologous proteins from different species called orthologs Same Paralogs Figure 3-33 So use computer programs to search data bank to see if your sequence lines up with another sequence or family of sequences Tricky programing - gaps, changes, etc Is change conservative (same amino acid properties- say ASP for GLu) or radical (Say Gly for Phe) Layer on top of this statistics for random or non-random alignment Integrate together to come up with evolutionary tree Figure 3-35 or 3-36