The Leucine Binding Proteins of Escherichia coli as Models for Studying the Relationships Between Protein Structure and Function

Similar documents
UNIVERSITETET I OSLO Det matematisk-naturvitenskapelige fakultet

( TUTORIAL. (July 2006)

Hands on Simulation of Mutation

GENEWIZ, Inc. DNA Sequencing Service Details for USC Norris Comprehensive Cancer Center DNA Core

(A) Microarray analysis was performed on ATM and MDM isolated from 4 obese donors.

Mutations and Genetic Variability. 1. What is occurring in the diagram below?

The p53 MUTATION HANDBOOK

Table S1. Related to Figure 4

10 µg lyophilized plasmid DNA (store lyophilized plasmid at 20 C)

Chapter 9. Applications of probability. 9.1 The genetic code

Mutation. Mutation provides raw material to evolution. Different kinds of mutations have different effects

Part ONE. a. Assuming each of the four bases occurs with equal probability, how many bits of information does a nucleotide contain?

DNA Sample preparation and Submission Guidelines

Introduction to Perl Programming Input/Output, Regular Expressions, String Manipulation. Beginning Perl, Chap 4 6. Example 1

pcas-guide System Validation in Genome Editing

Gene Synthesis 191. Mutagenesis 194. Gene Cloning 196. AccuGeneBlock Service 198. Gene Synthesis FAQs 201. User Protocol 204

DNA Bracelets

Inverse PCR & Cycle Sequencing of P Element Insertions for STS Generation

Molecular Facts and Figures

Supplementary Online Material for Morris et al. sirna-induced transcriptional gene

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

Drosophila NK-homeobox genes

Title : Parallel DNA Synthesis : Two PCR product from one DNA template

Supplementary Information. Binding region and interaction properties of sulfoquinovosylacylglycerol (SQAG) with human

Problem Set 3 KEY

ISTEP+: Biology I End-of-Course Assessment Released Items and Scoring Notes

Provincial Exam Questions. 9. Give one role of each of the following nucleic acids in the production of an enzyme.

Introduction to Bioinformatics (Master ChemoInformatique)

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.

Next Generation Sequencing

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled

Gene Finding CMSC 423

Module 6: Digital DNA

Characterization of cdna clones of the family of trypsin/a-amylase inhibitors (CM-proteins) in barley {Hordeum vulgare L.)

SERVICES CATALOGUE WITH SUBMISSION GUIDELINES

Molecular analyses of EGFR: mutation and amplification detection

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

Inverse PCR and Sequencing of P-element, piggybac and Minos Insertion Sites in the Drosophila Gene Disruption Project

pcmv6-neo Vector Application Guide Contents

The nucleotide sequence of the gene for human protein C

ANALYSIS OF A CIRCULAR CODE MODEL

and revertant strains. The present paper demonstrates that the yeast gene for subunit II can also be translated to yield a polypeptide

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins

Marine Biology DEC 2004; 146(1) : Copyright 2004 Springer

Cloning, sequencing, and expression of H.a. YNRI and H.a. YNII, encoding nitrate and nitrite reductases in the yeast Hansenula anomala

Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)

DNA Sequencing of the eta Gene Coding for Staphylococcal Exfoliative Toxin Serotype A

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Gene and Chromosome Mutation Worksheet (reference pgs in Modern Biology textbook)

Shu-Ping Lin, Ph.D.

Supplemental Data. Short Article. PPARγ Activation Primes Human Monocytes. into Alternative M2 Macrophages. with Anti-inflammatory Properties

How To Clone Into Pcdna 3.1/V5-His

BD BaculoGold Baculovirus Expression System Innovative Solutions for Proteomics

All commonly-used expression vectors used in the Jia Lab contain the following multiple cloning site: BamHI EcoRI SmaI SalI XhoI_ NotI

Hiding Data in DNA. 1 Introduction

Mutation of the SPSl-encoded protein kinase of Saccharomyces cerevisiae leads to defects in transcription and morphology during spore formation

Insulin mrna to Protein Kit

Y-chromosome haplotype distribution in Han Chinese populations and modern human origin in East Asians

The making of The Genoma Music

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

NimbleGen SeqCap EZ Library SR User s Guide Version 3.0

ANALYSIS OF GROWTH HORMONE IN TENCH (TINCA TINCA) ANALÝZA RŮSTOVÉHO HORMONU LÍNA OBECNÉHO (TINCA TINCA)

cdna sequence and expression pattern of the putative

were demonstrated to be, respectively, the catalytic and regulatory subunits of protein phosphatase 2A (PP2A) (29).

The DNA-"Wave Biocomputer"

Transmembrane Signaling in Chimeras of the E. coli Chemotaxis Receptors and Bacterial Class III Adenylyl Cyclases

Insulin Receptor Gene Mutations in Iranian Patients with Type II Diabetes Mellitus

Molecular chaperones involved in preprotein. targeting to plant organelles

June 09, 2009 Random Mutagenesis

2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three

BOC334 (Proteomics) Practical 1. Calculating the charge of proteins

An Introduction to Bioinformatics Algorithms Gene Prediction

Nucleotide sequence and the encoded amino acids of human serum

PRACTICE TEST QUESTIONS

Bio 102 Practice Problems Recombinant DNA and Biotechnology

Genomes and SNPs in Malaria and Sickle Cell Anemia

pentr Directional TOPO Cloning Kits

Chlamydomonas adapted Green Fluorescent Protein (CrGFP)


Heraeus Sepatech, Kendro Laboratory Products GmbH, Berlin. Becton Dickinson,Heidelberg. Biozym, Hessisch Oldendorf. Eppendorf, Hamburg

Five-minute cloning of Taq polymerase-amplified PCR products

RNA & Protein Synthesis

Sickle cell anemia: Altered beta chain Single AA change (#6 Glu to Val) Consequence: Protein polymerizes Change in RBC shape ---> phenotypes

pbad/his A, B, and C pbad/myc-his A, B, and C

Amino Acids, Peptides, Proteins

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

From DNA to Protein

Protein Synthesis Simulation

Production and Characterization of a Murine/Human Chimeric Anti-Idiotype Antibody That Mimics Ganglioside1

Chem 465 Biochemistry II

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

C YTOPLASMIC organelles proliferate by the growth and

Biopython Tutorial and Cookbook

Synthetic Biology: DNA Digital Storage, Computation and the Organic Computer

The Arabinosyltransferase EmbC Is Inhibited by Ethambutol in Mycobacterium tuberculosis

expressed histone genes have intervening sequences and encode polyadenylylated mrnas

On Covert Data Communication Channels Employing DNA Recombinant and Mutagenesis-based Steganographic Techniques

Transcription:

Journal of Cellular Biochemistry 29:209-216 (1985) Protein Structure, Folding, and Design 39-46 The Leucine Binding Proteins of Escherichia coli as Models for Studying the Relationships Between Protein Structure and Function Tammy K. Antonucci, Robert Landick, and Dale L. Oxender Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan 481 09 The genes encoding the leucine binding proteins in E coli have been cloned and their DNA sequences have been determined. One of the binding proteins (LIV- BP) binds leucine, isoleucine, valine, threonine, and alanine, whereas the other (LS-BP) binds only the D- and L-isomers of leucine. These proteins bind their solutes as they enter the periplasm, then interact with three membrane components, IivH, livc, and livm, to achieve the translocation of the solute across the bacterial cell membrane. Another feature of the binding proteins is that they must be secreted into the periplasmic space where they carry out their function. The amino acid sequence of the two binding proteins is 80% homologous, indicating that they are the products of an ancestral gene duplication. Because of these characteristics of the leucine binding proteins, we are using them as models for studying the relationships between protein structure and function. Key words: amino acid transport, binding proteins, secretion, gene duplication, oligonucleotidedirected rnutagenesis There are three major systems operating in Escherichiu coli for the transport of the branched-chain amino acids. Two of these systems consist of a binding protein and three membrane components while the third system appears to involve only a single membrane component. In this discussion we will describe the biochemical and genetic properties of the two binding protein-dependent transport systems, emphasizing the multiple functions of the binding protein components of the transport systems and, as a consequence, the value of the binding proteins for understanding the relationship(s) between protein structure and function. The LIV-I system transports L-leucine, L-isoleucine, and L-valine with Kd values of approximately lop6 to lop7 pm [ 1-41, In addition, this system is responsible for the transport of threonine and alanine, but with somewhat lower affinities [2]. Robert Landick's present address is the Department of Biological Sciences, Stanford University, Stanford, CA 94305. Received June 6, 1985; revised and accepted August 6, 1985. 0 1985 Alan R. Liss, Inc.

21O:JCB Antonucci, Landick, and Oxender The other transport system to be considered is the leucine-specific transport system (LS). The LS system transports L-leucine with a Kd value of approximately pm; D-leucine is also an acceptable substrate but with a lower affinity [ 1-41. All structural genes for the LS and LIV-I transport systems are located at the minute 76 region on the E coli chromosome [4]. The genes encoding the components of the branched-chain amino acid transport systems LIV-I and LS in E mli have been cloned 151. The gene organization is shown in Figure 1. There are three functional operons with their directions of transcription indicated by the arrows. The livj and livk gene products are the periplasmic binding proteins, called LIV-BP and LS-BP, respectively. The LIV-BP binds isoleucine, leucine, and valine with high affinity and threonine and alanine with a somewhat lower affinity. The LS-BP binds only isomers of leucine. The livh, livm, and livc genes encode proteins that have been localized to the inner membrane of the bacterial cell 161. These proteins presumably interact with the BP-amino acid complex to effect the transport of the amino acid into the cytoplasm of the cell. Thus the LIV-I and LS transport systems consist of individual binding proteins that have different substrate specificities yet interact with the same membrane components during the transport process. The livl operon produces a 17 kd protein but its role in transport, if any, has not been determined [7]. CHARACTERISTICS OF THE LIV-BP AND THE LS-BP The DNA sequence of the livj and livk genes encoding the two binding proteins has been determined 181 and is shown in Figure 2. A comparison of the coding regions of the two sequences shows that the two binding protein genes are 80% homologous, indicating that they are the products of an ancestral gene duplication 18-11]. While the sequences of the 5' flanking regions and coding regions revealed no strong homology with the known ribosome-binding sites, the pattern of codon usage in both genes implies that they are translated with average efficiency. The amino acid sequence of the LIV-BP has been previously reported [ 121 and is consistent with the DNA sequence except for two positions 181. Moreover, the crystal structure of the mature LIV-BP was determined to a resolution of 2.0 A (131. The crystal structure reveals that the LIV-BP has two domains that fold to form a cleft 1131. One domain of LIV-BP is composed of the N-terminal half of the protein, while the other domain is formed by most of the C terminal portion of the molecule. The secondary structure obtained from the crystallographic analysis shows that each domain is formed by an alternating series of alpha helices and beta sheets [8]. Figure 3 shows the predicted secondary structural features superimposed onto a drawing of the crystal structure. The filled-in dots represent differences in amino acid sequences between the two binding proteins. The two arrows in Figure 2 point out regions where amino acid insertions occur in the LS-BP. A disulfide bridge can be positioned between segments beta-2 and beta-3. Substrate Binding One of the major functions carried out by the BPs is binding of substrate. As mentioned above, the LIV-BP and the LS-BP have different substrate specificities. Saper and Quiocho [I31 have observed that when L-leucine is diffused into crystals 40:PSFD

Leucine Binding Proteins of E culi JCB:211. PSFD:41

212:JCB Antonucci, Landick, and Oxender -20-10 -1 1 ATG AAC ACA AAG GGC AAA GCG TTA CTG GCA GGA TTG ATC GCG CTG GCA TTC AGC AAT ATG GCT CTG GCA GAA GAT ATT AAA GTC GCG GTC Met Asn Thr Lys Gly Lys Ala Leu Leu Ala Gly Leu Ile Ala Leu Ala Phe Ser Asn Met Ala Leu Ala Glu Asp Ile Lys Val Ala Val I.. * *. I.. * * * Met Lys Arg Asn Ala Lys Thr Ile Ile Ala Gly Met Ile Ala Leu Ala lie Ser His Thr Ala Met Ala Asp Asp Ile Lys Val Ala Val ATG AAA CGG AAT GCG AAA ACT ATC ATC GCA GGG ATG ATT GCA CTG GCA ATT TCA CAC ACC GCT ATG GCT GAC GAT ATT AAA GTC GCC titt 10 20 30 GTG GGC GCA ATG TCC GGT CCG GTT GCG CAG TAC GGT GAC CAG GAG TTT ACC GGC GCA GAG CAG GCG GTT GCG GAT ATC AAC GCT AAA GGC Val Gly Ala Met Ser Gly Pro Val Ala Gln Tyr Gly Asp Gln Glu Phe Thr Gly Ala Glu Gln Ala Val Ala Asp Ile Asn Ala Lys Gly * * * *. * * f f * * I + * *,.. t *, t t Val Gly Ala Met Ser Gly Pro I l e Ala Gln Trp Gly I l e Met Glu Phe Asn Gly Ala Glu Gln Ala Ile Lys Asp Ile Asn Ala Lys Gly GTC GGC GCG ATG TCC GGC CCG AT1 GCC CAG TGG GGC ATA ATG GAA TTT AAC GGC GCG GAG CAG GCG AT1 AAA GAC ATT AAT GCC AAA GGG 40 50 60 GGC AlT A M GGC AAC AAA CTG CAA ATC GCA AAA TAT GAC GAT GCC TGT GAT CCG AAA CAG GCG GTT GCG GTG GCG AAC AAA GTC GTT AAC Gly Ile Lys Gly Asn Lys Leu Gln Ile Ala Lys Tyr Asp Asp Ala Cys Asp Pro Lys Gln Ala Val Ala Val Ala Asn Lys Val Val Asn. *. I * f * * t t * * * l * * * * t * * t. * * Gly Ile Lys Gly Asp Lys Leu Val Gly Val Glu Tyr Asp Asp Ala Cys Asp Pro Lys Gln Ala Val Ala Val Ala Asn Lys Ile Val Asn GGA ATT AAG GGC GAT AAA CTG GTT GGC GTG GAA TAT GAC GAC GCA TGC GAC CCG AAA CAA GCC GTT GCG GTC GCC AAC AAA ATC GTT AAT 70 80 90 GAC GGC AT1 AAA TAT GTG ATT GGT CAC CTC TGT TCC TCA TCA ACG CAG CCT GCG TCG GAT ATC TAC GAA GAC GAA GGC ATT TTG ATG ATC Asp Gly Ile Lys Tyr Val Ile Gly His Leu Cys Ser Ser Ser Thr Gln Pro Ala Ser Asp Ile Tyr Glu Asp Glu Gly Ile Leu Met Ile * * ~ t.. * t. * t l t t * + * * + * ~.. ~ * * ~ ~.. Asp Gly Ile Lys Tyr Val Ile Gly His Leu Cys Ser Ser Ser Thr Gln Pro Ala Ser Asp lle Tyr Glu Asp Glu Gly lle Leu Met Ile GAC GGC ATT AAA TAC GTT ATT GGT CAT CTG TGT TCT TCT TCT ACC CAG.CCT GCG TCA GAC ATC TAT GAA GAC GAA GGT ATT CTA ATG ATC 100 110 120 ACC CCA GCG GCA ACC GCG CCG GAG CTG ACC GCC CGT GGC TAT CAG CTG ATC CTG CGA ACC ACC GGC CTG GAT TCC GAC CAA GGG CCG ACG Thr Pro Ala Ala Thr Ala Pro Glu Leu Thr Ala Arg Gly Tyr Gln Leu Ile Leu Arg Thr Thr Gly Leu Asp Ser Asp Gin Gly Pro Thr * * * * * * * + * * * t t t t t l. * *.. Ser Pro Gly Ala Thr Ala Pro Glu Leu Thr Gln Arg Gly Tyr Gln His I l e Met Arg Thr Ala Gly Leu Asp Ser Ser Gln Gly Pro Thr TCG CCG GGA GCG ACC GCG CCG GAA CTA ACC CAA CGC GGT TAT CAA CAC ATT ATG CGT ACT GCC GGG CTG GAC TCT TCC CAG GGG CCA ACG 130 140 150 GCT GCC AAA TAT ATT CTT GAG AAA GTG AAA CCG CAG CGT ATT GCT ATC GTT CAC GAC AAA CAG CAA TAC GGC GAA GGT CTG GCG CGA GCG Ala Ala Lys Tyr Ile Leu Glu Lys Val Lys Pro Gln Arg Ile Ala Ile Val His Asp Lys Gln Gln Tyr Gly Glu Gly Leu Ala Arg Ala * * *. * * *. *. * * * * *. * * *, * * * * * *. Ala Ala Lys Tyr I l e Leu Glu Thr Val Lys Pro Gln Arg Ile Ala Ile Ile His Asp Lys Gln Gln Tyr Gly Glu Gly Leu Ala Arg Ser GCG GCA AAA TAC ATT CTT GAG ACG GTG AAG CCC CAG CGC ATC GCC ATC ATC CAC GAC AAA CAA GAG TAT GGC GAA GGG CTG GCG CGT TCG 160 170 I80 GTG CAG GAC GGC CTG AAG AAA GGC AAT GCA AAC GTG GTG TTC TTT GA1 GGC ATC ACC GCC GGG GAA AAA GAT TTC TCA ACG CTG GTG GCG Val Gln Asp Gly Leu Lys Lys Gly Asn Ala Asn Val Val Phe Phe Asp Gly Ile Thr Ala Gly Glu Lys Asp Phe Ser Thr Leu Val Ala * * *. + * * * * * * * *.. * *. * * * * *, Val Gln Asp Gly Leu Lys Ala Ala Asn Ala Asn Val Val Phe Phe Asp Gly I l e Thr Ala Gly Glu Ly5 Asp Phe Ser Ala Leu Ile Ala GTG CAG GAC GGG CTG AAA GCG GCT AAC GCC AAC GTC GTC TTC TTC GAT GGT ATT ACC GCC GGG GAG A M GAT TTC TCC GCG CTG ATC GCC 190 200 210 CGT CTG AAA AAA GAG AAT ATC GAC TTC GTT TAC TAC GGC GGT TAT CAC CCG GAA ATG GGG CAA ATC CTG CGT CAG GCA CGC GCG GCA GGG Arg Leu Lys Lys Glu Asn I l e Asp Phe Val Tyr Tyr Gly Gly Tyr His Pro Glu Met Gly Gln Ile Leu Arg Gln Ala Arg Ala Ala Gly. t * * t + * * * l * * t t * * * * * * * * *. * Arg Leu Lys Lys Glu Asn l i e Asp Phe Val Tyr Tyr Gly Gly Tyr Tyr Pro Glu Met Gly Gln Met Leu Arg Gln Ala Arg Ser Val Gly CGC CTG AAA AAA GAA AAC ATC GAC TTC GTT TAC TAC GGC GGT TAC TAC CCG GAA ATG GGG CAG ATG C.TG CGC CAG GCC CGT TCC GTT GGC 220 230 240 CTG AAA ACT CAG TTI ATG GGG CCG GAA GGT GTG GCT AAC GTT TCG CTG TCT AAC ATT GCG GGC GAA TCA GCG GAA GGG CTA CTG GTG ACC Leu Lys Thr Gln Phe Met Gly Pro Glu Gly Val Ala Asn Val Ser Leu Ser Asn I l e Ala Gly Glu Ser Ala Glu Gly Leu Leu Val Thr *. * * * * * *. * * * t i * * * * * * 1,. * * * Leu Lys Thr Gln Phe Met Gly Pro Glu Gly Val Gly Asr; Ala Ser Leu Ser Aan Ile Ala Gly Asp Ala Ala Glu Gly Met Leu Val Thr CTG AAA ACC CAG TTT ATG GGG CCG GAA GGT GTG GGT AAT GCG TCG TTG TCG AAC ATT GCC GGT GAT GCC GCC GAA GGC ATG TTG GTC ACT. *. 250 260 270 AAA CCG AAG AAC TAC GAT CAG GTT CCG GCG AAC AAA CCC ATT GTT GAC GCG ATC A M GCG AAA AAA CAG GAC CCA AGT GGC GCA TTC GTT Lys Pro Lys Asn Tyr Asp Gln Val Pro Ala Asn Lys Pro I l e Val Asp Ala Ile Lys Ala Lys Lys Gln Asp Pro Ser Gly Ala Phe Val * *. t t t * I * I. * * * * Met Pro Lys Arg Tyr Asp Gln Asp Pro Ala Asn Gln Gly I l e Val Asp Ala Leu Lys Ala Asp Lys Lys Asp Pro Ser Gly Pro Tyr Val ATG CCA AAA CGC TAT GAC CAG GAT CCG GCA AAC CAG GGC ATC GTT GAT GCG CTG AAA GCA GAC AAG AAA GAT CCG TCC GGG CCT TAT GTC 280 290 300 TGG ACC ACC TAC GCC GCG CTG CAA TCT TTG CAG GCG GGC CTC AAT CAG TCT GAC GAT CCG GCT GAA ATC GCC AAA TAC CTG AAA Trp Thr Thr Tyr Ala Ala Leu Gln Ser Leu Gln Ala Gly Leu Asn Gln Ser Asp Asp Pro Ala Glu Ile Ala Lys Tyr Leu Lyr. * *. *. * f f. * Trp Ile Thr Tyr Ala Ala Val Gln Ser Leu Ala Thr Ala Leu Glu Arg Thr Gly Ser Asp Glu Pro Leu Ala Leu Val Lys Asp Leu Lys TGG ATC ACC TAC GCG GCG GTG CAA TCT CTG GCG ACT GCC CTT GAG CGT ACC GGC AGC GAT GAG CCG CTG GCG CTG GTG AAA GAT TTA AAA 310 320 330 GCG AAC TCC GTG GAT ACC GTA ATG GGC CCG CTG ACC TGG GAT GAG A M GGC GAT CTG AA4 GGC TTT GAG TTC GGC GTA TTT GAC TGG CAC AIa Asn Ser Val Asp Thr Val Met Gly Pro Leu Thr Trp Asp Glu Lys Gly Asp Leu Lys Gly Phe Glu Phe Gly Val Phe Asp Trp His * * t. * * * *. * * i l l * * * * *.... Ala Asn Gly Ala Asn Thr Val Ile Gly Pro Leu Asn Trp Asp Glu Lys Gly Asp Leu Lys Gly Phe Asp Phe Gly Val Phe Gln Trp His GCT AAC GGT GCA AAC ACC GTG AT1 GGG CCG CTG AAC TGG GAT GAA AAA GGC GAT CTT AAG GGA TTT GAT TTT GGT GTG TTC CAG TGG CAC 340 344 GCC AAC GGC ACG GCC ACC GAT GCG AAG Ala Asn Gly Thr Ala Thr Asp Ala Lys * * Ala Asp Gly Ser Ser Thr AIa Ala Lys GCC GAC GGT TCA TCC ACG GCA GCC AAG LIV-BP LS-BP Fig. 2. Nucleotide sequences of the genes encoding LIV-BP and LS-BP. Positive numbering indicates the mature protein sequences, while negative numbering designates the signal sequences. Homologous amino acids are noted by asterisks. 42 : PSFD

Leucine Binding Proteins of E coli JCB:213 Fig. 3. Structure of the LIV-BP. Predicted secondary structures are superimposed on the threedimensional structure reported by Saper and Quiocho [13]. Filled circles indicate the positions of amino acid differences between the LIV-BP and LS-EP sequences. The two dark arrows indicate the region where a two-amino acid insertion occurs in the LS-BP sequence. A disulfide bridge is indicated between segments beta-2 and beta-3. of the LIV-BP, it binds to the inside face of the N-terminal domain. It is reasonable to suggest [8] that amino acid differences between the two BP sequences within the cleft between the two domains are responsible for the different substrate specificities of the LIV-BP and LS-BP. Nonhomologous residues are present near the cleft in segments beta- 1-alpha- 1, beta-4-alpha-4, alpha-9, alpha- 10, beta-9, and beta- 10. Thus, these areas of the BPs may be involved either in forming the structure of the substrate binding site or in directly interacting with the substrate. INTERACTION OF BPS WITH MEMBRANE COMPONENTS Both the LIV-BP-substrate complex and the LS-BP-substrate complex must interact with the same three membrane components to effect the transport of the substrate. Therefore, it is interesting to note that some of the regions of high amino acid sequence conservation include the C terminus of beta-2, alpha-2, beta-3, alpha-3, the C terminus of beta-8, and alpha-8. Because of their sequence conservation and position on the BPs, these regions may function in interactions with the membrane components. It is tempting to speculate that this interaction may be the signal that initiates the membrane translocation process. SECRETION OF THE BPS INTO THE PERIPLASM Both the LIV-BP and LS-BP are synthesized as precursors with a 23-amino acid signal sequence that is processed in association with the BP secretion into the periplasm [7,10,14]. Using the rules of Chou and Fasman [ 151, the secondary structure of the N-terminal portion of the precursor would be predicted to form a helical hairpin structure. These two helices would form from the signal sequence and the mature N-terminal regions and would contain hydrophobic amino acids in the sections that might span the membrane. A model for the secretion of the BPs is shown in Figure PSFD:43

5 I4 CL &h c c3 W cc t7 B R c. F a g. k u a INNER MEMBRANE 0 a $ 1 2 3 4 Fig. 4. Model for the export of the periplasmic leucine-binding proteins. The amino acids are represented by single-letter codes in the circles. The small arrows indicate the directions (positive to negative) of helix dipoles. Step I represents the initial association of the BP with the inner membrane. Step 2 indicates the formation of a helical hairpin structure between the leader peptide and the N-terminal portion of the mature protein. This helical hairpin assumes a transmembrane orientation in the presence of a transmembrane potential. Step 3 indicates the cleavage of the leader sequence by the leader peptidase. Finally, steps 4 and 5 correspond to a refolding of the mature protein and its translocation into the periplasmic space.

Leucine Binding Proteins of E coli JCB:215 4. In this model, a major role of the membrane potential is suggested for the proper membrane orientation of the proposed helix hairpin formed by the signal sequence and the first segment of the mature portion of the BP [ 16,171, Recent work on the secretion of the BPs has been directed toward identifying the sequences required to achieve secretion. To this end, mutants of LS-BP carrying various deletions or sequence alterations have been constructed [17]. It was found that these altered binding proteins were processed and secreted normally. Moreover, hybrid molecules were constructed between the tryptophan synthetase alpha-subunit and various lengths of the N-terminal portion of the LS-BP [ 181. The results showed that almost the entire C-terminal half of the LS-BP can be removed with no adverse effect on processing or secretion of the hybrid protein. Thus, the signal sequence together with a portion of the N-terminal portion of the LS-BP is sufficient to direct its export into the periplasmic space. THE BPS AS MODELS IN STUDYING STRUCTURE-FUNCTION RELATIONSHIPS Because the LIV-BP and LS-BP have three known functions in E coli, they are proving useful in examining the relationships between protein structure and function. For example, using oligonucleotide-directed mutagenesis, the nonhomologous residues in the clefts of the BPs are being altered. If an alteration corresponds with a change in either substrate affinities or substrate specificities, then knowledge will be gained about the mechanism of substrate binding. Other changes are being directed toward the regions of high amino acid homology on the BPs that are potential regions for interacting with the membrane components. These altered BPs will be especially useful for study when the individual membrane proteins have been isolated. Finally, work in progress also involves analyzing additional LS-BP-trpA hybrid molecules containing larger C-terminal deletions to identify more precisely the N-terminal sequences required for export of the BPs into the periplasmic space. Since the LIV- BP and LS-BP are the products of gene duplication, a comparison of their structurefunction relationships should help illuminate the potential constraints placed on their evolutionary divergence. In conclusion, the leucine binding proteins of E coli have been thoroughly studied by biochemical, genetic, and recombinant DNA approaches. As a consequence, much is already known about their molecular structure and their relationship to each other. Because these proteins must be secreted and interact with the same membrane components yet exhibit divergent substrate specificities, they provide a model system for studying the relationship of protein structure to protein function. REFERENCES I. Piperno JR, Oxender DL: J Biol Chem 243:5914-5920, 1968. 2. Rahnianian M. Claus DR, Oxender DL: J Bacteriol 116: 1258-1266, 1973. 3. Wood JM: J Biol Chem 250:4477-4485, 1975. 4. Anderson JJ. Oxender DL: J Bacteriol 136: 168-174, 1978. 5. Oxender DL, Anderson JJ, Daniels CJ, Landick R. Gunsalus UP. Zurawski G. Sclker E. Yanofsky C: Proc Natl Acad Sci USA 77:1412-1416, 1980. 6. Nazoa PN: Ph.D. Thesis, University of Michigan, Ann Arbor, 1984. 7. Landick R: Ph.D. Thesis. University of Michigan, Ann Arbor. 1983. PSFD:45

216:JCB Antonucci, Landick, and Oxender 8. Landick R, Oxender DL: J Biol Chem, 260:8257-8261, 1985. 9. Landick R, Anderson JJ, Mayo MM, Gunsalus RP, Mavromara P, Daniels CJ, Oxender DL: J Supramolec Struct 14:527-537, 1981. 10. Oxender DL, Anderson JJ, Daniels CJ, Landick R, Gunsalus RP, Zurawski G, Yanofsky C: Proc Natl Acad Sci USA 77:2005-2009, 1980. 11. Riley M, Aninlionis A: Annu Rev Microbiol 32:519-560, 1978. 12. Ovchinnikov YA, Aldanova NA, Grinkevich VA, Arzamazova NM, Moroz IN, Nazimov IV: FEBS Lett 78:313-316, 1977. 13. Saper M, Quiocho FA: J Biol Chem 258: 11057-11062, 1983. 14. Daniels CJ, Anderson JJ, Landick R, Oxender DL: J Supramolec Struct 14:305-311, 1980. 15. Chou TY, Fasman GD: Annu Rev Biochem 47:251-276, 1978. 16. Daniels CJ, Bole DG, Quay SC, Oxender DL: Proc Natl Acad Sci USA 78:5396-5400, 1981. 17. Landick R, Duncan JR, Copeland BR, Nazos PM, Oxender DL: J Cell Biochem 24:331-344, 1984. 18. Su TZ, Copeland BR, Landick R, Oxender DL: J Biol Chem 1985, (submitted). 46:PSFD