Protein Structure Prediction and Analysis Tools Jianlin Cheng, PhD
|
|
|
- Priscilla McBride
- 10 years ago
- Views:
Transcription
1 Protein Structure Prediction and Analysis Tools Jianlin Cheng, PhD Assistant Professor Department of Computer Science & Informatics Institute University of Missouri, Columbia 2011
2 Sequence, Structure and Function AGCWY Cell
3 Protein Folding Movie
4 Alpha-Helix Jurnak, 2003
5 Beta-Sheet Anti-Parallel Parallel
6 Non-Repetitive Secondary Structure Beta-Turn Loop
7 myoglobin haemoglobin
8 Quaternary Structure: Complex G-Protein Complex
9 Protein Structure Determination X-ray crystallography Nuclear Magnetic Resonance (NMR) Spectroscopy X-ray: any size, accurate (1-3 Angstrom (10-10 m)), sometime hard to grow crystal NMR: small to medium size, moderate accuracy, structure in solution
10 Pacific Northwest National Laboratory's high magnetic field (800 MHz, 18.8 T) NMR spectrometer being loaded with a sample. Wikipedia, the free encyclopedia
11 Storage in Protein Data Bank Search database
12 Search protein 1VJG
13 PDB Format (2C8Q, insulin)
14
15 Structure Visualization Rasmol ( m) MDL Chime (plug-in) ( Protein Explorer ( Jmol: Pymol:
16 J. Pevsner, 2005
17 Rasmol (1VJG)
18 Structure Analysis Assign secondary structure for amino acids from 3D structure Generate solvent accessible area for amino acids from 3D structure Most widely used tool: DSSP (Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen-Bonded and Geometrical Features. Kabsch and Sander, 1983)
19 DSSP server: DSSP download: DSSP Code: H = alpha helix G = 3-helix (3/10 helix) I = 5 helix (pi helix) B = residue in isolated beta-bridge E = extended strand, participates in beta ladder T = hydrogen bonded turn S = bend Blank = loop
20 DSSP Web Service
21 Amino Acids Secondary Structure Solvent Accessibility
22 Solvent Accessibility Size of the area of an amino acid that is exposed to solvent (water). Maximum solvent accessible area for each amino acid is its whole surface area. Hydrophobic residues like to be Buried inside (interior). Hydrophilic residues like to be exposed on the surface.
23 Structure Comparison (Alignment) Are the structures of two protein similar? Are the two structure models of the same protein similar? Different measures (RMSD, GDT-TS (Zemla et al., 1999), MaxSub (Siew et al., 2000), TM score (Zhang and Skolnick,2005))
24 Superimposition David Baker, 2005
25 Useful Structure Alignment Tools CE ( DALI ( dali/) TM-Align: atics.ku.edu/tm-align/
26
27 Notation: protein structure 1D, 2D, 3D B. Rost, 2005
28 Goal of structure prediction Epstein & Anfinsen, 1961: sequence uniquely determines structure INPUT: OUTPUT: sequence 3D structure and function B. Rost, 2005
29 CASP Olympics of Protein Critical Assessment of Techniques of Protein Structure Prediction 1994,1996,1998,2000,20 02,2004,2006, 2008, 2010 Blind Test, Independent Evaluation CASP9: 116 targets Structure Prediction
30 1D Structure Prediction Secondary structure Solvent accessibility Disordered regions Domain boundary
31 1D: Secondary Structure Prediction Helix MWLKKFGINLLIGQSV Coil Neural Networks + Alignments CCCCHHHHHCCCSSSSS Strand Cheng, Randall, Sweredoski, Baldi. Nucleic Acid Research, 2005
32 Widely Used Tools (~78-80%) SSpro 4.1: Distill: PSI-PRED: software is also available SAM: PHD:
33 1D: Solvent Accessibility Prediction Exposed MWLKKFGINLLIGQSV Neural Networks + Alignments Buried eeeeeeebbbbbbbbeeeebbb Accuracy: 79% at 25% threshold Cheng, Randall, Sweredoski, Baldi. Nucleic Acid Research, 2005
34 Widely Used Tools (78%) ACCpro 4.1: software: SCRATCH: PHD: Distill:
35 1D: Disordered Region Prediction Using Neural Networks Disordered Region MWLKKFGINLLIGQSV 1D-RNN OOOOODDDDOOOOO 93% TP at 5% FP Deng, Eickholt, Cheng. BMC Bioinformatics, 2009
36 Tools PreDisorder: A collection of disorder predictors: Deng, Eickholt, Cheng. BMC Bioinformatics, 2009
37 1D: Protein Domain Prediction Using Neural Networks Boundary MWLKKFGINLLIGQSV + SS and SA 1D-RNN HIV capsid protein NNNNNNNBBBBBNNNN Inference/Cut Domain 1 Domain 2 Domains Top ab-initio domain predictor in CAFASP4 Cheng, Sweredoski, Baldi. Data Mining and Knowledge Discovery, 2006.
38 Tools DOMAC: Cheng, Nucleic Acids Research, 2007
39 Web:
40
41 Project 1 Predict Secondary Structure, Solvent Accessibility, Disorder Regions of soybean transcription factors Data: du/marc/muii_7005/seq_ TFP_90_2500.txt Select 10 proteins to make predictions
42 2D: Contact Map Prediction 3D Structure i n 2D Contact Map j..... n Distance Threshold = 8A o Cheng, Randall, Sweredoski, Baldi. Nucleic Acid Research, 2005
43 Contact Prediction SVMcon: NNcon: SCRATCH: SAM:
44 Tegge, Wang, Eickholt, Cheng, Nucleic Acids Research, 2009
45 Two Methodologies for 3D Structure Prediction AB Initio Method (physical-chemical principles / molecular dynamics, knowledge-based approaches) Template-Based Method (knowledge-based approaches)
46 Two Approaches Ab Initio Structure Prediction Physical force field protein folding Contact map - reconstruction Template-Based Structure Prediction MWLKKFGINLLIGQSV Simulation Select structure with minimum free energy Query protein MWLKKFGINKH Fold Recognition Alignment Protein Data Bank Template
47 Protein Energy Landscape C. Park, 2005
48 Markov Chain Monte Carlo Simulation
49 Template-Based Structure Prediction 1. Template identification 2. Query-template alignment 3. Model generation 4. Model evaluation 5. Model refinement Notes: if template is easy to identify, it is often called comparative Modeling or homology modeling. If template is hard to identify, it is often called fold recognition.
50 A. Fisher, 2005
51 Modeller Need an alignment file between query and template sequence in the PIR format Need the structure (atom coordinates) file of template protein You need to write a simple script (Python for version 8.2) to tell how to generate the model and where to find the alignment file and template structure file. Run Modeller on the script. Modeller will automatically copy coordinates and make necessary adjustments to generate a model. See project step 5-8 for more details.
52 An PIR Alignment Example Template id Structure determination method Start index End index >P1;1SDMA structurex:1sdma: 1: : 344: : : : : KIRVYCRLRPLCEKEIIAKERNAIRSVDEFTVEHLWKDDKAKQHMYDRVFDGNATQDDVFEDTKYL VQSAVDGYNVCIFAYGQTGSGKTFTIYGADSNPGLTPRAMSELFRIMKKDSNKFSFSLKAYMVELY QDTLVDLLLPKQAKRLKLDIKKDSKGMVSVENVTVVSISTYEELKTIIQRGSEQRHTTGTLMNEQS SRSHLIVSVIIESTNLQTQAIARGKLSFVDLAGSERVKKEAQSINKSLSALGDVISALSSGNQHIP YRNHKLTMLMSDSLGGNAKTLMFVNISPAESNLDETHNSLTYASRVRSIVNDPSKNVSSKEVARLK Query sequence id Template structure file id KLVSYWELEEIQDE* >P1;bioinfo : : : : : : : : : NIRVIARVRPVTKEDGEGPEATNAVTFDADDDSIIHLLHKGKPVSFELDKVFSPQASQQDVFQEVQ ALVTSCIDGFNVCIFAYGQTGAGKTYTMEGTAENPGINQRALQLLFSEVQEKASDWEYTITVSAAE IYNEVLRDLLGKEPQEKLEIRLCPDGSGQLYVPGLTEFQVQSVDDINKVFEFGHTNRTTEFTNLNE HSSRSHALLIVTVRGVDCSTGLRTTGKLNLVDLAGSERVGKSGAEGSRLREAQHINKSLSALGDVI AALRSRQGHVPFRNSKLTYLLQDSLSGDSKTLMVV QVSPVEKNTSETLYSLKFAER VR*
53 Structure File Example (1SDMA.atm) ATOM 1 N LYS N ATOM 2 CA LYS C ATOM 3 C LYS C ATOM 4 O LYS O ATOM 5 CB LYS C ATOM 6 CG LYS C ATOM 7 CD LYS C ATOM 8 CE LYS C ATOM 9 NZ LYS N ATOM 10 N ILE N ATOM 11 CA ILE C ATOM 12 C ILE C ATOM 13 O ILE O ATOM 14 CB ILE C ATOM 15 CG1 ILE C ATOM 16 CG2 ILE C ATOM 17 CD1 ILE C ATOM 18 N ARG N ATOM 19 CA ARG C ATOM 20 C ARG C ATOM 21 O ARG O ATOM 22 CB ARG C ATOM 23 CG ARG C ATOM 24 CD ARG C ATOM 25 NE ARG N ATOM 26 CZ ARG C ATOM 27 NH1 ARG N ATOM 28 NH2 ARG N ATOM 29 N VAL N ATOM 30 CA VAL C ATOM 31 C VAL C ATOM 32 O VAL O ATOM 33 CB VAL C ATOM 34 CG1 VAL C ATOM 35 CG2 VAL C
54 Modeller Python Script (bioinfo.py) # Homology modelling by the automodel class from modeller.automodel import * # Load the automodel class Where to find structure file log.verbose() # request verbose output env = environ() # create a new MODELLER environment to build this model in # directories for input atom files env.io.atom_files_directory = './:../atom_files' a = automodel(env, alnfile = 'bioinfo.pir', # alignment filename knowns = '1SDMA', # codes of the templates sequence = 'bioinfo') # code of the target a.starting_model= 1 # index of the first model a.ending_model = 1 # index of the last model # (determines how many models to calculate) a.make() # do the actual homology modelling PIR alignment file name Template structure file id Query sequence id
55 Output Example Command: mod8v2 bioinfo.py
56 Organism Homology modelling for entire genomes 0 5,000 10,000 15,000 20,000 H sapiens(chr. 22) H sapiens D melanogaster C elegans S cerevisiae U urealyticum T pallidum S PCC6803 R prowazekii N meningitidis M tuberculosis M pneumoniae M genitalium H pylori H influenzae E coli C trachomatis C pneumoniae C jejuni B burgdorferi B subtilis A aeolicus T maritima D radiodurans P horikoshii P abyssi M thermoautotrophicu M jannaschii A fulgidus A pernix Number of ORFs Number of ORFs with PDB hit 0 5,000 10,000 15,000 20,000 Number of proteins B. Rost, 2005
57 Sequence Identity and Alignment Quality in Structure Prediction Superimpose -> RMSD %Sequence Identity: percent of identical residues in alignment RMSD: square root of average distance between predicted structure and native structure.
58 3D Structure Prediction Tools MULTICOM ( ) I-TASSER ( HHpred ( Robetta ( 3D-Jury ( FFAS ( Pcons ( Sparks ( FUGUE ( FOLDpro ( SAM ( Phyre ( 3D-PSSM ( mgenthreader (
59 Protein Model Quality Assessment
60 APOLLO Output
61 Application of Structure Prediction Structure prediction is improving Template-based structure become more and more practical. Particularly, comparative / homology modeling is pretty accurate in many cases. Comparative modeling has been widely used in drug design. Protein structure prediction (both secondary and tertiary) has become an indispensable tool of investigating function of proteins and mechanisms of biological processes.
62 Baker and Sali (2000) J. Pevsner, 2005
63 Project 2 Select 5 soybean proteins Predict 3D structures Visualize the structures (data: /marc/muii_7005/seq_tfp_90_2500.txt)
Structure Tools and Visualization
Structure Tools and Visualization Gary Van Domselaar University of Alberta [email protected] Slides Adapted from Michel Dumontier, Blueprint Initiative 1 Visualization & Communication Visualization
Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.
Protein Structure Amino Acids Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain Alpha Carbon Amino Group Carboxyl Group Amino Acid Properties There are
Consensus alignment server for reliable comparative modeling with distant templates
W50 W54 Nucleic Acids Research, 2004, Vol. 32, Web Server issue DOI: 10.1093/nar/gkh456 Consensus alignment server for reliable comparative modeling with distant templates Jahnavi C. Prasad 1, Sandor Vajda
Statistical modeling of non-coding DNA
Statistical modeling of non-coding DNA 25/06/03 Infmeeting, Genova 23-25 June 2003 Overview Investigation of Inverted Repeats (IRs) in complete genomes of bacteria; IReD a web server of Inverted Repeats
Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK
Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Dai Lu, Ph.D. [email protected] Tel: 361-221-0745 Office: RCOP, Room 307 Drug Discovery and Development Drug Molecules Medicinal
Section I Using Jmol as a Computer Visualization Tool
Section I Using Jmol as a Computer Visualization Tool Jmol is a free open source molecular visualization program used by students, teachers, professors, and scientists to explore protein structures. Section
Bioinformatics for Biologists. Protein Structure
Bioinformatics for Biologists Comparative Protein Analysis: Part III. Protein Structure Prediction and Comparison Robert Latek, PhD Sr. Bioinformatics Scientist Whitehead Institute for Biomedical Research
Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1
Protein Physics A. V. Finkelstein & O. B. Ptitsyn LECTURE 1 PROTEINS Functions in a Cell MOLECULAR MACHINES BUILDING BLOCKS of a CELL ARMS of a CELL ENZYMES - enzymatic catalysis of biochemical reactions
Lecture 19: Proteins, Primary Struture
CPS260/BGT204.1 Algorithms in Computational Biology November 04, 2003 Lecture 19: Proteins, Primary Struture Lecturer: Pankaj K. Agarwal Scribe: Qiuhua Liu 19.1 The Building Blocks of Protein [1] Proteins
Steffen Lindert, René Staritzbichler, Nils Wötzel, Mert Karakaş, Phoebe L. Stewart, and Jens Meiler
Structure 17 Supplemental Data EM-Fold: De Novo Folding of α-helical Proteins Guided by Intermediate-Resolution Electron Microscopy Density Maps Steffen Lindert, René Staritzbichler, Nils Wötzel, Mert
Built from 20 kinds of amino acids
Built from 20 kinds of amino acids Each Protein has a three dimensional structure. Majority of proteins are compact. Highly convoluted molecules. Proteins are folded polypeptides. There are four levels
Basic Concepts of DNA, Proteins, Genes and Genomes
Basic Concepts of DNA, Proteins, Genes and Genomes Kun-Mao Chao 1,2,3 1 Graduate Institute of Biomedical Electronics and Bioinformatics 2 Department of Computer Science and Information Engineering 3 Graduate
The peptide bond is rigid and planar
Level Description Bonds Primary Sequence of amino acids in proteins Covalent (peptide bonds) Secondary Structural motifs in proteins: α- helix and β-sheet Hydrogen bonds (between NH and CO groups in backbone)
Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5
Protein physics, Lecture 5 Peptide bonds: resonance structure Properties of proteins: Peptide bonds and side chains Proteins are linear polymers However, the peptide binds and side chains restrict conformational
Protein Structure Determination. Why Bother With Structure?
Protein Structure Determination How are these structures determined? Why Bother With Structure? The amino acid sequence of a protein contains interesting information. A protein sequence can be compared
Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell
Lecture 2 Protein conformation ecap Proteins.. > 50% dry weight of a cell ell s building blocks and molecular tools. More important than genes A large variety of functions http://www.tcd.ie/biochemistry/courses/jf_lectures.php
Overview'of'Solid-Phase'Peptide'Synthesis'(SPPS)'and'Secondary'Structure'Determination'by'FTIR'
verviewofsolid-phasepeptidesynthesis(spps)andsecondarystructuredeterminationbyftir Introduction Proteinsareubiquitousinlivingorganismsandcells,andcanserveavarietyoffunctions.Proteinscanactas enzymes,hormones,antibiotics,receptors,orserveasstructuralsupportsintissuessuchasmuscle,hair,and
Proteins. Proteins. Amino Acids. Most diverse and most important molecule in. Functions: Functions (cont d)
Proteins Proteins Most diverse and most important molecule in living i organisms Functions: 1. Structural (keratin in hair, collagen in ligaments) 2. Storage (casein in mother s milk) 3. Transport (HAEMOGLOBIN!)
Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?
Pipe Cleaner Proteins GPS: SB1 Students will analyze the nature of the relationships between structures and functions in living cells. Essential question: How does the structure of proteins relate to their
18.2 Protein Structure and Function: An Overview
18.2 Protein Structure and Function: An Overview Protein: A large biological molecule made of many amino acids linked together through peptide bonds. Alpha-amino acid: Compound with an amino group bonded
INTRODUCTION TO PROTEIN STRUCTURE
Name Class: Partner, if any: INTRODUCTION TO PROTEIN STRUCTURE PRIMARY STRUCTURE: 1. Write the complete structural formula of the tripeptide shown (frame 10). Circle and label the three sidechains which
Peptide Bonds: Structure
Peptide Bonds: Structure Peptide primary structure The amino acid sequence, from - to C-terminus, determines the primary structure of a peptide or protein. The amino acids are linked through amide or peptide
Hydrogen Bonds The electrostatic nature of hydrogen bonds
Hydrogen Bonds Hydrogen bonds have played an incredibly important role in the history of structural biology. Both the structure of DNA and of protein a-helices and b-sheets were predicted based largely
Template-based protein structure modeling using the RaptorX web server
Template-based protein structure modeling using the RaptorX web server Morten Källberg 1 3, Haipeng Wang 1,3, Sheng Wang 1, Jian Peng 1, Zhiyong Wang 1, Hui Lu 2 & Jinbo Xu 1 1 Toyota Technological Institute
Biological Molecules
Biological Molecules I won t lie. This is probably the most boring topic you have ever done in any science. It s pretty much as simple as this: learn the material deal with it. Enjoy don t say I didn t
Linear Sequence Analysis. 3-D Structure Analysis
Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical properties Molecular weight (MW), isoelectric point (pi), amino acid content, hydropathy (hydrophilic
Protein annotation and modelling servers at University College London
Nucleic Acids Research Advance Access published May 27, 2010 Nucleic Acids Research, 2010, 1 6 doi:10.1093/nar/gkq427 Protein annotation and modelling servers at University College London D. W. A. Buchan*,
Structure Determination
5 Structure Determination Most of the protein structures described and discussed in this book have been determined either by X-ray crystallography or by nuclear magnetic resonance (NMR) spectroscopy. Although
Protein 3D-structure analysis. why and how
Protein 3D-structure analysis why and how 3D-structures are precious sources of information Shape and domain structure Protein classification Prediction of function for uncharacterized proteins Interaction
Structure of proteins
Structure of proteins Primary structure: is amino acids sequence or the covalent structure (50-2500) amino acids M.Wt. of amino acid=110 Dalton (56 110=5610 Dalton). Single chain or more than one polypeptide
Disulfide Bonds at the Hair Salon
Disulfide Bonds at the Hair Salon Three Alpha Helices Stabilized By Disulfide Bonds! In order for hair to grow 6 inches in one year, 9 1/2 turns of α helix must be produced every second!!! In some proteins,
(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?
Problem 1. (12 points total, 4 points each) The molecular weight of an unspecified protein, at physiological conditions, is 70,000 Dalton, as determined by sedimentation equilibrium measurements and by
Myoglobin and Hemoglobin
Myoglobin and Hemoglobin Myoglobin and hemoglobin are hemeproteins whose physiological importance is principally related to their ability to bind molecular oxygen. Myoglobin (Mb) The oxygen storage protein
BIOC351: Proteins. PyMOL Laboratory #1. Installing and Using
BIOC351: Proteins PyMOL Laboratory #1 Installing and Using Information and figures for this handout was obtained from the following sources: Introduction to PyMOL (2009) DeLano Scientific LLC. Installing
Biological Databases and Protein Sequence Analysis
Biological Databases and Protein Sequence Analysis Introduction M. Madan Babu, Center for Biotechnology, Anna University, Chennai 25, India Bioinformatics is the application of Information technology to
Functional Architecture of RNA Polymerase I
Cell, Volume 131 Supplemental Data Functional Architecture of RNA Polymerase I Claus-D. Kuhn, Sebastian R. Geiger, Sonja Baumli, Marco Gartmann, Jochen Gerber, Stefan Jennebach, Thorsten Mielke, Herbert
IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon. V. Polypeptides and Proteins
IV. -Amino Acids: carboxyl and amino groups bonded to -Carbon A. Acid/Base properties 1. carboxyl group is proton donor! weak acid 2. amino group is proton acceptor! weak base 3. At physiological ph: H
CSC 2427: Algorithms for Molecular Biology Spring 2006. Lecture 16 March 10
CSC 2427: Algorithms for Molecular Biology Spring 2006 Lecture 16 March 10 Lecturer: Michael Brudno Scribe: Jim Huang 16.1 Overview of proteins Proteins are long chains of amino acids (AA) which are produced
Refinement of a pdb-structure and Convert
Refinement of a pdb-structure and Convert A. Search for a pdb with the closest sequence to your protein of interest. B. Choose the most suitable entry (or several entries). C. Convert and resolve errors
LESSON 5. Learning to Use Cn3D: A Bioinformatics Tool. Introduction. Learning Objectives. Key Concepts
5 Learning to Use Cn3D: A Bioinformatics Tool Introduction Up to this point, students have seen the BRCA1 protein represented in a linear, sequential form. In this lesson, students are introduced to the
FTIR Analysis of Protein Structure
FTIR Analysis of Protein Structure Warren Gallagher A. Introduction to protein structure The first structures of proteins at an atomic resolution were determined in the late 1950 s. 1 From that time to
PyRy3D: a software tool for modeling of large macromolecular complexes MODELING OF STRUCTURES FOR LARGE MACROMOLECULAR COMPLEXES
MODELING OF STRUCTURES FOR LARGE MACROMOLECULAR COMPLEXES PyRy3D is a method for building low-resolution models of large macromolecular complexes. The components (proteins, nucleic acids and any other
3D structure visualization and high quality imaging. Chimera
3D structure visualization and high quality imaging. Chimera Vincent Zoete 2008 Contact : vincent.zoete@isb sib.ch 1/27 Table of Contents Presentation of Chimera...3 Exercise 1...4 Loading a structure
Guide for Bioinformatics Project Module 3
Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first
BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm377
Vol. 23 no. 19 2007, pages 2558 2565 BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm377 Structural bioinformatics Comparative protein structure modeling by combining multiple templates and
http://faculty.sau.edu.sa/h.alshehri
http://faculty.sau.edu.sa/h.alshehri Definition: Proteins are macromolecules with a backbone formed by polymerization of amino acids. Proteins carry out a number of functions in living organisms: - They
Computational methods for increasing the stability of type IV pilin protein from Pseudomonas aeruginosa
2/15/2015 Computational methods for increasing the stability of type IV pilin protein from Pseudomonas aeruginosa John Loft. Bioengineering 488 Computational Protein Design (Winter 2015), University of
Introduction to Proteins and Enzymes
Introduction to Proteins and Enzymes Basics of protein structure and composition The life of a protein Enzymes Theory of enzyme function Not all enzymes are proteins / not all proteins are enzymes Enzyme
NMR Nuclear Magnetic Resonance
NMR Nuclear Magnetic Resonance Nuclear magnetic resonance (NMR) is an effect whereby magnetic nuclei in a magnetic field absorb and re-emit electromagnetic (EM) energy. This energy is at a specific resonance
Disaccharides consist of two monosaccharide monomers covalently linked by a glycosidic bond. They function in sugar transport.
1. The fundamental life processes of plants and animals depend on a variety of chemical reactions that occur in specialized areas of the organism s cells. As a basis for understanding this concept: 1.
A. A peptide with 12 amino acids has the following amino acid composition: 2 Met, 1 Tyr, 1 Trp, 2 Glu, 1 Lys, 1 Arg, 1 Thr, 1 Asn, 1 Ile, 1 Cys
Questions- Proteins & Enzymes A. A peptide with 12 amino acids has the following amino acid composition: 2 Met, 1 Tyr, 1 Trp, 2 Glu, 1 Lys, 1 Arg, 1 Thr, 1 Asn, 1 Ile, 1 Cys Reaction of the intact peptide
Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?)
ChemActivity 46 Amino Acids, Polypeptides and Proteins 1 ChemActivity 46 Part A: Amino Acids and Peptides (Is the peptide IAG the same as the peptide GAI?) Model 1: The 20 Amino Acids at Biological p See
Helices From Readily in Biological Structures
The α Helix and the β Sheet Are Common Folding Patterns Although the overall conformation each protein is unique, there are only two different folding patterns are present in all proteins, which are α
Invariant residue-a residue that is always conserved. It is assumed that these residues are essential to the structure or function of the protein.
Chapter 6 The amino acid side chains have polar and nonpolar properties, and the relative hydrophobicity of the amino acid side chains is critical for the folding and stability of a protein. The more hydrophobic
Application of Neutralization Fingerprinting in Antibody Epitope Prediction and Delineating Antibody Recognition in HIV-1 Sera
Application of Neutralization Fingerprinting in Antibody Epitope Prediction and Delineating Antibody Recognition in HIV-1 Sera Gwo-Yu Chuang Structural Bioinformatics Section Vaccine Research Center/NIAID/NIH
Protein Sequence Analysis - Overview -
Protein Sequence Analysis - Overview - UDEL Workshop Raja Mazumder Research Associate Professor, Department of Biochemistry and Molecular Biology Georgetown University Medical Center Topics Why do protein
Structural Bioinformatics (C3210) Experimental Methods for Macromolecular Structure Determination
Structural Bioinformatics (C3210) Experimental Methods for Macromolecular Structure Determination Introduction Knowing the exact 3D-structure of bio-molecules is essential for any attempt to understand
Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
Amino Acids, Proteins, and Enzymes. Primary and Secondary Structure Tertiary and Quaternary Structure Protein Hydrolysis and Denaturation
Amino Acids, Proteins, and Enzymes Primary and Secondary Structure Tertiary and Quaternary Structure Protein Hydrolysis and Denaturation 1 Primary Structure of Proteins H 3 N The particular sequence of
A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.
CH 5 Structure & Function of Large Molecules: Macromolecules Molecules of Life All living things are made up of four classes of large biological molecules: carbohydrates, lipids, proteins, and nucleic
This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are
This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are put together. 1 A more detailed view of a single protein
Computational Systems Biology. Lecture 2: Enzymes
Computational Systems Biology Lecture 2: Enzymes 1 Images from: David L. Nelson, Lehninger Principles of Biochemistry, IV Edition, Freeman ed. or under creative commons license (search for images at http://search.creativecommons.org/)
Hands-on exercises on solvent models & electrostatics EMBnet - Molecular Modeling Course 2005
Hands-on exercises on solvent models & electrostatics EMBnet - Molecular Modeling Course 2005 Exercise 1. The purpose of this exercise is to color the solvent accessible surface of a protein according
Carbohydrates, proteins and lipids
Carbohydrates, proteins and lipids Chapter 3 MACROMOLECULES Macromolecules: polymers with molecular weights >1,000 Functional groups THE FOUR MACROMOLECULES IN LIFE Molecules in living organisms: proteins,
Chapter 5: The Structure and Function of Large Biological Molecules
Name Period Concept 5.1 Macromolecules are polymers, built from monomers 1. The large molecules of all living things fall into just four main classes. Name them. 2. Circle the three classes that are called
Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water
Lecture Overview special properties of water > water as a solvent > ph molecules of the cell > properties of carbon > carbohydrates > lipids > proteins > nucleic acids Hydrogen Bonds polarity of water
Combinatorial Biochemistry and Phage Display
Combinatorial Biochemistry and Phage Display Prof. Valery A. Petrenko Director - Valery Petrenko Instructors Galina Kouzmitcheva and I-Hsuan Chen Auburn 2006, Spring semester COMBINATORIAL BIOCHEMISTRY
4. Which carbohydrate would you find as part of a molecule of RNA? a. Galactose b. Deoxyribose c. Ribose d. Glucose
1. How is a polymer formed from multiple monomers? a. From the growth of the chain of carbon atoms b. By the removal of an OH group and a hydrogen atom c. By the addition of an OH group and a hydrogen
Chapter 3 Molecules of Cells
Bio 100 Molecules of cells 1 Chapter 3 Molecules of Cells Compounds containing carbon are called organic compounds Molecules such as methane that are only composed of carbon and hydrogen are called hydrocarbons
Structure and Function of DNA
Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four
Replication Study Guide
Replication Study Guide This study guide is a written version of the material you have seen presented in the replication unit. Self-reproduction is a function of life that human-engineered systems have
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE Anil Kumar Mandle 1, Pranita Jain 2, and Shailendra Kumar Shrivastava 3 1 Research Scholar, Information Technology Department Samrat Ashok Technological
Covalent bonds are the strongest chemical bonds contributing to the protein structure A peptide bond is formed between with of the following?
MCAT Question Covalent bonds are the strongest chemical bonds contributing to the protein structure A peptide bond is formed between with of the following? A. Carboxylic group and amino group B. Two carboxylic
Phase determination methods in macromolecular X- ray Crystallography
Phase determination methods in macromolecular X- ray Crystallography Importance of protein structure determination: Proteins are the life machinery and are very essential for the various functions in the
Amino Acids and Proteins
Amino Acids and Proteins Proteins are composed of amino acids. There are 20 amino acids commonly found in proteins. All have: N2 C α R COO Amino acids at neutral p are dipolar ions (zwitterions) because
Introduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
Bioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
HYDRONMR and Fast-HYDRONMR
File:hydronmr7c-pub.doc HYDRONMR and Fast-HYDRONMR Index 1. Introduction to HYDRONMR 2. Literature 3. Running HYDRONMR. Input data file 4. Output files 5. Notes and hints 7. Release notes Version 7c, September
Translation Study Guide
Translation Study Guide This study guide is a written version of the material you have seen presented in the replication unit. In translation, the cell uses the genetic information contained in mrna to
Chapter 3. Protein Structure and Function
Chapter 3 Protein Structure and Function Broad functional classes So Proteins have structure and function... Fine! -Why do we care to know more???? Understanding functional architechture gives us POWER
Structure and properties of proteins. Vladimíra Kvasnicová
Structure and properties of proteins Vladimíra Kvasnicová Chemical nature of proteins biopolymers of amino acids macromolecules (M r > 10 000) Classification of proteins 1) by localization in an organism
Proteins and Nucleic Acids
Proteins and Nucleic Acids Chapter 5 Macromolecules: Proteins Proteins Most structurally & functionally diverse group of biomolecules. : o Involved in almost everything o Enzymes o Structure (keratin,
EMBL-EBI. EM map display & Visualization
EM map display & Visualization Introduction Biological data comes from & is of interest to: Chemists : reaction mechanism, drug design Biologists : sequence, expression, homology, function. Structure biologists
Sickle cell anemia: Altered beta chain Single AA change (#6 Glu to Val) Consequence: Protein polymerizes Change in RBC shape ---> phenotypes
Protein Structure Polypeptide: Protein: Therefore: Example: Single chain of amino acids 1 or more polypeptide chains All polypeptides are proteins Some proteins contain >1 polypeptide Hemoglobin (O 2 binding
SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD
White Paper SGI High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems Haruna Cofer*, PhD January, 2012 Abstract The SGI High Throughput Computing (HTC) Wrapper
AISMIG an interactive server-side molecule image generator
AISMIG an interactive server-side molecule image generator Andreas Bohne-Lang, Wolf-Dieter Groch 1 and René Ranzinger* W705 W709 doi:10.1093/nar/gki438 German Cancer Research Center Heidelberg, Central
Gold (Genetic Optimization for Ligand Docking) G. Jones et al. 1996
Gold (Genetic Optimization for Ligand Docking) G. Jones et al. 1996 LMU Institut für Informatik, LFE Bioinformatik, Cheminformatics, Structure based methods J. Apostolakis 1 Genetic algorithms Inspired
Bioinformatics Tools Tutorial Project Gene ID: KRas
Bioinformatics Tools Tutorial Project Gene ID: KRas Bednarski 2011 Original project funded by HHMI Bioinformatics Projects Introduction and Tutorial Purpose of this tutorial Illustrate the link between
The Molecules of Cells
The Molecules of Cells I. Introduction A. Most of the world s population cannot digest milk-based foods. 1. These people are lactose intolerant because they lack the enzyme lactase. 2. This illustrates
Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function
Papers listed: Cell2 During the semester I will speak of information from several papers. For many of them you will not be required to read these papers, however, you can do so for the fun of it (and it
Syllabus of B.Sc. (Bioinformatics) Subject- Bioinformatics (as one subject) B.Sc. I Year Semester I Paper I: Basic of Bioinformatics 85 marks
Syllabus of B.Sc. (Bioinformatics) Subject- Bioinformatics (as one subject) B.Sc. I Year Semester I Paper I: Basic of Bioinformatics 85 marks Semester II Paper II: Mathematics I 85 marks B.Sc. II Year
Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet
Nucleic Acids Research, 2006, Vol. 34, Web Server issue W119 W123 doi:10.1093/nar/gkl199 Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet M. Tyagi 1,
