Gold (Genetic Optimization for Ligand Docking) G. Jones et al. 1996
1 Gold (Genetic Optimization for Ligand Docking), G. Jones et al.
LMU Institut für Informatik, LFE Bioinformatik, Cheminformatics, Structure based methods, J. Apostolakis
2 Genetic algorithms
- Inspired by evolution
- General principle:
3 Gold GA
- Gold uses a genetic algorithm for optimization
- Steady-state principle (single operations, no generations)
- No duplicates
- Roulette wheel selection of operators and parents
- Gray coding of binary features
- Approximate coding of the conformation
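A minimal sketch of what Gray coding buys for a binary-coded torsion: neighbouring integer values differ in exactly one bit, so a single bit flip can move an angle by one step. The 8-bit width and the mapping to [-180, 180) degrees are assumptions made for illustration, not values taken from the slides.

```python
def to_gray(n: int) -> int:
    """Binary-reflected Gray code of a non-negative integer."""
    return n ^ (n >> 1)


def from_gray(g: int) -> int:
    """Invert to_gray by XOR-ing all right shifts of the code word."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n


def decode_torsion(gray_value: int, bits: int = 8) -> float:
    """Map a Gray-coded value to an angle in [-180, 180) degrees.

    The bit width and the angular range are illustrative assumptions.
    """
    step = 360.0 / (1 << bits)
    return -180.0 + from_gray(gray_value) * step


if __name__ == "__main__":
    # 7 and 8 differ in four bits in plain binary, but in one bit in Gray code.
    print(bin(to_gray(7)), bin(to_gray(8)))
    print(decode_torsion(to_gray(128)))
```

In plain binary, stepping from 7 (0111) to 8 (1000) needs four simultaneous bit flips. In Gray code the same step is a single flip, which makes small mutations of a torsion more likely.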
4 The Gold chromosomes
- Each chromosome consists of two binary strings plus two integer strings
- The binary strings encode the torsions of the ligand and of the protein
- In the protein, the single bonds to terminal H-bond donors are rotatable
- The integer strings encode the translation and orientation of the ligand in terms of the H-bonds that are formed
- If the Nth integer in the FIRST integer string has the value P, the Nth H-donor of the ligand forms an H-bond with the Pth acceptor of the protein
- If the Nth integer in the SECOND integer string has the value P, the Nth H-acceptor of the ligand forms an H-bond with the Pth donor of the protein
- The actual position of the ligand is obtained with a least-squares fit
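The chromosome layout described above can be sketched as a small data structure. The class name, field names and the sentinel value 0 for "no H-bond" are hypothetical choices for this illustration. The least-squares fit that turns the pairings into an actual ligand pose is not shown.

```python
from dataclasses import dataclass
from typing import List, Tuple

NO_BOND = 0  # hypothetical sentinel: this ligand atom is not paired


@dataclass
class Chromosome:
    ligand_torsions: List[int]   # binary string 1: one coded value per ligand torsion
    protein_torsions: List[int]  # binary string 2: rotatable bonds to terminal protein donors
    donor_map: List[int]         # integer string 1: ligand donor N -> protein acceptor P
    acceptor_map: List[int]      # integer string 2: ligand acceptor N -> protein donor P


def hbond_pairs(chrom: Chromosome) -> Tuple[List[Tuple[int, int]], List[Tuple[int, int]]]:
    """Decode both integer strings into 1-based (ligand atom, protein atom) pairs."""
    donors = [(n, p) for n, p in enumerate(chrom.donor_map, start=1) if p != NO_BOND]
    acceptors = [(n, p) for n, p in enumerate(chrom.acceptor_map, start=1) if p != NO_BOND]
    return donors, acceptors


if __name__ == "__main__":
    c = Chromosome([0, 255], [17], [3, 0, 1], [2])
    # Ligand donor 1 pairs with protein acceptor 3, ligand donor 3 with acceptor 1,
    # and ligand acceptor 1 with protein donor 2.
    print(hbond_pairs(c))
```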
5 The H-Bonds
6 Gold
1. A set of reproduction operators (crossover, mutation, etc.) is chosen. Each operator is assigned a weight.
2. An initial population is created at random and the fitness of its members is determined.
3. An operator is chosen by roulette wheel selection based on the operator weights (10 for crossover, 40 for mutation).
4. The parents are chosen by roulette wheel selection based on fitness.
5. Offspring are generated and their fitness is evaluated.
6. If not already present in the population, the children replace the least fit members of the population.
7. After the set number of operations, stop; otherwise go to 3.
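The loop above can be sketched on a toy problem. This is not Gold's implementation: individuals are plain bit tuples, and the fitness function, population size and operation count are placeholders. Only the operator weights (10 and 40) come from the slide. Fitness is assumed to be non-negative so it can be used directly as a roulette weight.

```python
import random


def roulette(items, weights, rng, k=1):
    """Roulette wheel selection: draw k items with probability proportional to weight."""
    return rng.choices(items, weights=weights, k=k)


def crossover(a, b, rng):
    """One-point crossover of two bit tuples, returning two children."""
    cut = rng.randrange(1, len(a))
    return [a[:cut] + b[cut:], b[:cut] + a[cut:]]


def mutate(a, rng):
    """Flip one randomly chosen bit, returning one child."""
    i = rng.randrange(len(a))
    return [a[:i] + (1 - a[i],) + a[i + 1:]]


def steady_state_ga(fitness, n_bits=16, pop_size=20, n_ops=2000, seed=0):
    rng = random.Random(seed)
    operators = {"crossover": 10, "mutation": 40}        # step 1: weights from the slide
    pop = set()
    while len(pop) < pop_size:                            # step 2: random, duplicate-free start
        pop.add(tuple(rng.randint(0, 1) for _ in range(n_bits)))
    pop = list(pop)
    for _ in range(n_ops):                                # step 7: fixed number of operations
        op = roulette(list(operators), list(operators.values()), rng)[0]   # step 3
        weights = [fitness(ind) + 1e-9 for ind in pop]    # tiny offset avoids an all-zero wheel
        if op == "crossover":                             # step 4: parents by fitness
            p1, p2 = roulette(pop, weights, rng, k=2)
            children = crossover(p1, p2, rng)             # step 5
        else:
            children = mutate(roulette(pop, weights, rng)[0], rng)
        for child in children:
            if child not in pop:                          # step 6: no duplicates
                worst = min(range(len(pop)), key=lambda i: fitness(pop[i]))
                pop[worst] = child                        # replace the least fit member
    return max(pop, key=fitness)


if __name__ == "__main__":
    best = steady_state_ga(sum)   # toy fitness: number of 1 bits
    print(best, sum(best))
```

Because there are no generations, each operation changes at most two individuals, and the rest of the population carries over unchanged.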
7 The energy function
- H-bonds
- VdW between protein and ligand (12-6 potential)
- Intra-ligand VdW
- The energy function of Gold is one of its strengths
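For reference, a 12-6 potential in its textbook Lennard-Jones form, E(r) = 4ε[(σ/r)^12 - (σ/r)^6], can be sketched as follows. The parameters epsilon and sigma and the helper pairwise_vdw are illustrative assumptions. Gold's actual parameterisation and its H-bond term are not reproduced here.

```python
import math


def lennard_jones_12_6(r, epsilon=1.0, sigma=1.0):
    """Textbook 12-6 potential: repulsive r^-12 wall plus attractive r^-6 tail."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6)


def pairwise_vdw(coords_a, coords_b, epsilon=1.0, sigma=1.0):
    """Sum the 12-6 term over all atom pairs between two sets of 3D coordinates."""
    total = 0.0
    for a in coords_a:
        for b in coords_b:
            total += lennard_jones_12_6(math.dist(a, b), epsilon, sigma)
    return total


if __name__ == "__main__":
    r_min = 2 ** (1 / 6)   # position of the minimum for sigma = 1
    print(lennard_jones_12_6(1.0), lennard_jones_12_6(r_min))
```

The energy is zero at r = σ and reaches its minimum of -ε at r = 2^(1/6) σ.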
8 Efficiency depends strongly on the parameters (initial population, number of runs)
- The developers report very good results already with runs that take ~1 min per complex
9 Some results
10 A related approach
- AutoDock initially used a simulated annealing / Monte Carlo (SA/MC) approach
- The main advantage of SA is that it combines global optimization (high temperature) with local optimization (lower temperature)
- For flexible molecules (> 8 flexible dihedrals), SA turns out to be far too slow
11 LGA
- LGA or GALS: a Lamarckian GA, i.e. a GA with local search, has been implemented
- The idea is to adapt each individual to its environment by performing a local search (LS, minimization)
- Optimization takes place directly on the chromosomes
- The effect of the minimization is passed on to the offspring
- Force-field type of energy function
- G. M. Morris et al. 1998: comparison of SA, GA, LGA
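The Lamarckian idea can be sketched in a few lines: a local search improves an individual, and the improved genes are written back so that offspring inherit them. The stochastic hill climber below is a simple stand-in chosen for this sketch, not the local search used in AutoDock. The function names, the quadratic toy energy and all parameters are assumptions.

```python
import random


def local_search(genes, energy, rng, steps=50, step_size=0.1):
    """Crude stochastic hill climbing: keep a perturbed copy only if it lowers the energy."""
    best, best_e = list(genes), energy(genes)
    for _ in range(steps):
        trial = [g + rng.gauss(0.0, step_size) for g in best]
        e = energy(trial)
        if e < best_e:
            best, best_e = trial, e
    return best, best_e


def lamarckian_step(population, energy, rng, ls_rate=0.5):
    """Run local search on a random fraction of individuals and write the result back."""
    out = []
    for genes in population:
        if rng.random() < ls_rate:
            # Lamarckian: the optimised genes replace the originals and are inherited.
            genes, _ = local_search(genes, energy, rng)
        out.append(list(genes))
    return out


if __name__ == "__main__":
    toy_energy = lambda g: sum(x * x for x in g)
    rng = random.Random(1)
    pop = [[1.0, -2.0], [0.5, 0.5]]
    print(lamarckian_step(pop, toy_energy, rng, ls_rate=1.0))
```

A non-Lamarckian variant would use the minimised energy for selection only and leave the genes unchanged.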
12 LGA
13 SA/GA/LGA comparison (panels: SA, GA, LGA)
15 Conclusion GA
- GAs are very robust (default parameters used all along) and efficient (depending on the settings)
- They clearly outperform SA for docking problems (not in our hands)
- A significant part of the trick seems to be the combination with at least a crude type of local optimization
- Hydrogen bonds are crucial for docking
- How do GAs compare with systematic approaches?
16 GlamDock
Old GlamDock
- Gold-like interaction point matching search space
- Steady-state genetic algorithm search
- A ChemScore-like empirical function
New GlamDock
- Replaced the GA with a simpler MC/SA search + conformational stack
- Simpler configuration
- More efficient search
- Smooth, continuously differentiable ChemScore-based scoring, allowing a gradient-based minimization in torsion space
- More effective identification of local minima
17 GlamDock (MCM)
18 Comparison between 8 different docking tools
- Bissantz et al. J. Med. Chem. 2000, 43,
- Kellenberger et al. PROTEINS: Structure, Function, and Bioinformatics 57: (2004)
19 8 docking tools against each other
- Dock (negative image of binding site)
- FlexX (incremental construction)
- Fred (naive)
- Glide (systematic, funnel)
- Gold (GA)
- Slide (flexible protein side chains)
- Surflex (Det. GA)
- QXP (Monte Carlo)
- (Why not ICM?)
20 Sampling accuracy
21 Ranking accuracy
22 CPU time
23 GlamDock
24 Conclusion of comparison study
- Gold, Glide, Surflex, FlexX: best structure prediction (50-55%)
- Gold, Glide, Surflex, FlexX: best screening properties (50-55%)
Previous results
- Poor prediction of absolute free energies
- Reasonable results for virtual screening
- Docking, and especially virtual screening, depends mainly on the scoring function
- Consensus scoring improves results significantly
25 Conclusion of flexible ligand docking
- Flexible redocking is doable
- Best methods: GAs and incremental construction (and MCM)
- The main problem is the evaluation of the structures (score)
- Possibly, scoring functions have been fitted too strongly to the redocking of known ligands
26 Flexible receptor
27 Flexible receptor
- Side chain flexibility
- Backbone flexibility
- Hinge bending
- Domain flexibility
- Even small differences can be important!
- Induced fit
- Protein mutants
- Homology modelling
28 Substate view of protein dynamics
29 Induced fit
- Folding free energy lies between kcal for many proteins
- Less favorable substates may be stabilized by certain ligands
- Most of the time the differences are not very large, yet significant
30 Side chain flexibility of proteins upon ligand binding
- Najmanovich et al. Proteins 39:
31 Number of flexible side chains per binding site
32 Amino acid type dependence
33 AA dependence related to N tor
34 Backbone / Side chain flexibility
35 Conclusions
- Relatively few side chains move on average (≤ 3 for 85% of cases)
- Polar side chains move most
- Side chain flexibility does not correlate with backbone flexibility
36 Flexible receptor docking
37 Methods
- Simulation: MC/MD, SA
- Fuzzy
- Discrete
- Ensembles of structures
- Rotamer libraries
38 FlexE
- H. Claussen, J. Mol. Biol.
39 Protein flexibility
- Main idea: describe the protein structure variations with a set of protein structures representing the flexibility, mutations or alternative models of a protein
- The variability considered by FlexE is defined by the differences within the given input structures
40 United protein description
- Data structure that administers the protein structure variations
- Contains an ensemble of up to 30 possible conformations of the protein
- Most of them are low-energy conformations of the same protein
41 United protein description - construction
- Superposition
- Clustering
- Add picture - 8
42 United protein description - clustering
- The superimposed structures are combined by clustering each part separately
- Complete-linkage hierarchical clustering
- The clustered instances can be recombined to form new valid protein structures
43 Notation
- Component: all the atoms that belong to the same amino acid or to a mutation of that amino acid; contains a backbone part and a side chain part
- Part: set of instances
- Instance: one of the alternative conformations
44 Incompatibility
- Two instances of the united protein description are incompatible if they cannot be realized simultaneously
- Logical: two instances are alternatives to each other
- Geometric: two logically compatible instances overlap
- Structural: two instances of the same chain are unconnected
45 Incompatibility graph
46 Incompatibility graph
- The incompatibility is represented internally as a graph: the instances are the nodes, and each pair of incompatible instances is connected by an edge
- Valid protein structures correspond to independent sets in the graph
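A small sketch of this representation, using an adjacency-set dictionary. The instance labels ("A1", "A2", ...) and the function names are made up for illustration. A selection of instances is a valid combination exactly when no two selected instances share an edge.

```python
from itertools import combinations


def build_incompatibility_graph(instances, incompatible_pairs):
    """Nodes are instances; an undirected edge joins every incompatible pair."""
    graph = {i: set() for i in instances}
    for a, b in incompatible_pairs:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def is_independent_set(graph, selection):
    """True if no two selected instances are connected, i.e. all are mutually compatible."""
    return all(b not in graph[a] for a, b in combinations(selection, 2))


if __name__ == "__main__":
    # Hypothetical example: two alternative instances each for parts A and B,
    # plus a geometric clash between A2 and B1.
    g = build_incompatibility_graph(
        ["A1", "A2", "B1", "B2"],
        [("A1", "A2"), ("B1", "B2"), ("A2", "B1")],
    )
    print(is_independent_set(g, ["A1", "B1"]))   # compatible choice
    print(is_independent_set(g, ["A2", "B1"]))   # clashing choice
```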
47 Selection of instances
- The ligand is placed fragment by fragment into the active site by the incremental construction algorithm
- After each construction step, all possible interactions are determined
- The scoring function is applied to each instance
- The independent set (IS) with the highest score is chosen
48 Independent set
- The IS can be assembled from the ISs of the connected components
- Apply a modified version of the Bron-Kerbosch algorithm to the complementary graph
- Compatibility graph
- Independent components → cliques
49 Enumerating all cliques (Bron, Kerbosch, 1973)
- Clique: maximal complete subgraph (maximal: cannot be extended)
- Two versions of the algorithm
- Both are backtracking algorithms
- The two algorithms are quite similar
- The first goes through the cliques in an ordered fashion
- The second optimizes the order of the search and visits larger cliques at the beginning
- Version I is mainly relevant for illustration purposes
50 Version I
Three sets are important for the algorithms:
- Compsub: the current set; it is extended or reduced by one point by travelling along the edges of the backtracking tree
- Candidates: the set of all points that will in due time serve as an extension to compsub
- Not: the set of all points that have already served as an extension of the present configuration of compsub
51 Version I
Recursive extension operator:
Extend(COMPSUB, CANDIDATES, NOT, G)
    if CANDIDATES == ∅                  // cannot grow
        if NOT == ∅ print COMPSUB       // maximality
        return                          // backtrack
    end if
    for c ∈ CANDIDATES
        put c in COMPSUB
        update CANDIDATES and NOT       // remove all points not connected to the selected candidate (also for NOT)
        Extend(COMPSUB, CANDIDATES, NOT, G)
        remove c from COMPSUB and put it into NOT
    end for
    return
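A runnable sketch of Version I, written with Python sets rather than the array layout of the original algorithm. The function name and the adjacency-set graph format are choices made for this illustration. The graph is assumed to be undirected and free of self-loops.

```python
def enumerate_cliques(graph):
    """Return all maximal cliques of a graph given as {vertex: set_of_neighbours}."""
    cliques = []

    def extend(compsub, candidates, not_set):
        if not candidates:                    # cannot grow
            if not not_set:                   # nothing could have extended it: maximal
                cliques.append(sorted(compsub))
            return                            # backtrack
        for c in list(candidates):
            # Keep only the points connected to c, in CANDIDATES and in NOT alike.
            extend(compsub + [c], candidates & graph[c], not_set & graph[c])
            candidates.remove(c)              # c has now served as an extension ...
            not_set.add(c)                    # ... so it moves to NOT

    extend([], set(graph), set())
    return cliques


if __name__ == "__main__":
    # Triangle 1-2-3 with a pendant edge 3-4.
    g = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}
    print(sorted(enumerate_cliques(g)))
```

A branch that ends with CANDIDATES empty but NOT non-empty is a clique that some earlier vertex could still extend, so it is not reported.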
52 Some remarks
- The lists NOT and CANDIDATES can be concatenated into a single local array: NOT occupies positions 1..ne, CANDIDATES positions ne+1..ce
- For the indices ne, ce we have: ne ≤ ce
- ne = ce: CANDIDATES = ∅
- ne = 0: NOT = ∅
- ce = 0: NOT = CANDIDATES = ∅ → clique found
- If ne+1 is the current candidate, all we need to do at the end of Extend is ne = ne+1
- Both CANDIDATES and NOT must be empty when a clique is found
- If ∃ c ∈ NOT s.t. ∀ d ∈ CANDIDATES: (c,d) ∈ E, then c will never be removed from NOT → no cliques in this subtree
53 Version II
Is simply a clever way of choosing the next candidate:
- Pick the vertex c in NOT with the most edges to CANDIDATES
- Use as next candidate a vertex that is not connected to c
- With every iteration we are at least one step closer to cutting the subtree
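A sketch of the pivoting idea, with one stated deviation from the slide: the pivot is taken from NOT ∪ CANDIDATES rather than from NOT only, so that a pivot also exists when NOT is empty. Only candidates not connected to the pivot are tried. The complement helper shows how maximal independent sets of an incompatibility graph are obtained as cliques of the complementary graph. All names and the example graph are hypothetical.

```python
def enumerate_cliques_pivot(graph):
    """Maximal cliques of {vertex: set_of_neighbours}, branching only on non-neighbours of a pivot."""
    cliques = []

    def extend(compsub, candidates, not_set):
        if not candidates:
            if not not_set:
                cliques.append(sorted(compsub))
            return
        # Pivot: the vertex with the most edges into CANDIDATES.
        pivot = max(candidates | not_set, key=lambda v: len(graph[v] & candidates))
        for c in list(candidates - graph[pivot]):   # skip the pivot's neighbours
            extend(compsub + [c], candidates & graph[c], not_set & graph[c])
            candidates.remove(c)
            not_set.add(c)

    extend([], set(graph), set())
    return cliques


def complement(graph):
    """Complementary graph: connect exactly the pairs that were not connected before."""
    nodes = set(graph)
    return {v: nodes - graph[v] - {v} for v in graph}


if __name__ == "__main__":
    # Hypothetical incompatibility graph: a path A1 - A2 - B1 - B2.
    incompat = {"A1": {"A2"}, "A2": {"A1", "B1"}, "B1": {"A2", "B2"}, "B2": {"B1"}}
    # Maximal independent sets of the incompatibility graph are the maximal
    # cliques of its complement (the compatibility graph).
    print(sorted(enumerate_cliques_pivot(complement(incompat))))
```

Skipping the pivot's neighbours is safe because every maximal clique contains either the pivot or at least one vertex not connected to it.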
54 Evaluation
- FlexE was evaluated with ten protein structure ensembles containing 105 crystal structures from the PDB
- The structures within an ensemble have a highly similar backbone trace
- Different conformations for several side chains
56 Evaluation cont.
- FlexE finds a ligand position with an RMSD below 2 Å in 67% of the cases
- Average CPU time for the incremental construction algorithm is 5.5 minutes
58 Conclusion
- The ensemble approach is able to cope with several side chain conformations and even movements of loops
- Very efficient
- Motions of larger backbone segments or even domain movements are not covered by this approach
- Main problems: protein structures (where do they come from?), internal protein energy
Helices From Readily in Biological Structures
The α Helix and the β Sheet Are Common Folding Patterns Although the overall conformation each protein is unique, there are only two different folding patterns are present in all proteins, which are α
2. The number of different kinds of nucleotides present in any DNA molecule is A) four B) six C) two D) three
Chem 121 Chapter 22. Nucleic Acids 1. Any given nucleotide in a nucleic acid contains A) two bases and a sugar. B) one sugar, two bases and one phosphate. C) two sugars and one phosphate. D) one sugar,
CAB TRAVEL TIME PREDICTI - BASED ON HISTORICAL TRIP OBSERVATION
CAB TRAVEL TIME PREDICTI - BASED ON HISTORICAL TRIP OBSERVATION N PROBLEM DEFINITION Opportunity New Booking - Time of Arrival Shortest Route (Distance/Time) Taxi-Passenger Demand Distribution Value Accurate
Unit I: Introduction To Scientific Processes
Unit I: Introduction To Scientific Processes This unit is an introduction to the scientific process. This unit consists of a laboratory exercise where students go through the QPOE2 process step by step
Consensus alignment server for reliable comparative modeling with distant templates
W50 W54 Nucleic Acids Research, 2004, Vol. 32, Web Server issue DOI: 10.1093/nar/gkh456 Consensus alignment server for reliable comparative modeling with distant templates Jahnavi C. Prasad 1, Sandor Vajda
Alpha Cut based Novel Selection for Genetic Algorithm
Alpha Cut based Novel for Genetic Algorithm Rakesh Kumar Professor Girdhar Gopal Research Scholar Rajesh Kumar Assistant Professor ABSTRACT Genetic algorithm (GA) has several genetic operators that can
Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5
Protein physics, Lecture 5 Peptide bonds: resonance structure Properties of proteins: Peptide bonds and side chains Proteins are linear polymers However, the peptide binds and side chains restrict conformational
Demand Forecasting Optimization in Supply Chain
2011 International Conference on Information Management and Engineering (ICIME 2011) IPCSIT vol. 52 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V52.12 Demand Forecasting Optimization
CHM333 LECTURE 13 14: 2/13 15/13 SPRING 2013 Professor Christine Hrycyna
INTRODUCTION TO ENZYMES Enzymes are usually proteins (some RNA) In general, names end with suffix ase Enzymes are catalysts increase the rate of a reaction not consumed by the reaction act repeatedly to
Data Structure [Question Bank]
Unit I (Analysis of Algorithms) 1. What are algorithms and how they are useful? 2. Describe the factor on best algorithms depends on? 3. Differentiate: Correct & Incorrect Algorithms? 4. Write short note:
DATA STRUCTURES USING C
DATA STRUCTURES USING C QUESTION BANK UNIT I 1. Define data. 2. Define Entity. 3. Define information. 4. Define Array. 5. Define data structure. 6. Give any two applications of data structures. 7. Give
Clustering. Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016
Clustering Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016 1 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate data attributes with
VECTORAL IMAGING THE NEW DIRECTION IN AUTOMATED OPTICAL INSPECTION
VECTORAL IMAGING THE NEW DIRECTION IN AUTOMATED OPTICAL INSPECTION Mark J. Norris Vision Inspection Technology, LLC Haverhill, MA [email protected] ABSTRACT Traditional methods of identifying and
2. (a) Explain the strassen s matrix multiplication. (b) Write deletion algorithm, of Binary search tree. [8+8]
Code No: R05220502 Set No. 1 1. (a) Describe the performance analysis in detail. (b) Show that f 1 (n)+f 2 (n) = 0(max(g 1 (n), g 2 (n)) where f 1 (n) = 0(g 1 (n)) and f 2 (n) = 0(g 2 (n)). [8+8] 2. (a)
D-optimal plans in observational studies
D-optimal plans in observational studies Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational
3 Some Integer Functions
3 Some Integer Functions A Pair of Fundamental Integer Functions The integer function that is the heart of this section is the modulo function. However, before getting to it, let us look at some very simple
INTRODUCTION TO PROTEIN STRUCTURE
Name Class: Partner, if any: INTRODUCTION TO PROTEIN STRUCTURE PRIMARY STRUCTURE: 1. Write the complete structural formula of the tripeptide shown (frame 10). Circle and label the three sidechains which
Original Article Efficient Genetic Algorithm on Linear Programming Problem for Fittest Chromosomes
International Archive of Applied Sciences and Technology Volume 3 [2] June 2012: 47-57 ISSN: 0976-4828 Society of Education, India Website: www.soeagra.com/iaast/iaast.htm Original Article Efficient Genetic
Data Structures and Algorithms Written Examination
Data Structures and Algorithms Written Examination 22 February 2013 FIRST NAME STUDENT NUMBER LAST NAME SIGNATURE Instructions for students: Write First Name, Last Name, Student Number and Signature where
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling Background Bryan Orme and Rich Johnson, Sawtooth Software March, 2009 Market segmentation is pervasive
Performance Optimization of I-4 I 4 Gasoline Engine with Variable Valve Timing Using WAVE/iSIGHT
Performance Optimization of I-4 I 4 Gasoline Engine with Variable Valve Timing Using WAVE/iSIGHT Sean Li, DaimlerChrysler (sl60@dcx dcx.com) Charles Yuan, Engineous Software, Inc ([email protected]) Background!
Data Visualization in Cheminformatics. Simon Xi Computational Sciences CoE Pfizer Cambridge
Data Visualization in Cheminformatics Simon Xi Computational Sciences CoE Pfizer Cambridge My Background Professional Experience Senior Principal Scientist, Computational Sciences CoE, Pfizer Cambridge
SAnDReS Tutorial 01 Prof. Dr. Walter F. de Azevedo Jr.
2015 Dr. Walter F. de Azevedo Jr. SAnDReS Tutorial 01 Prof. Dr. Walter F. de Azevedo Jr. 1 Running in the Windows On the Windows, left click on Command Prompt. Go to SAnDReS directory (c:\sandres) and
Automated TLS group determination in Phenix
Computational Crystallography Initiative Automated TLS group determination in Phenix Pavel Afonine Computation Crystallography Initiative Physical Biosciences Division Lawrence Berkeley National Laboratory,
BCS HIGHER EDUCATION QUALIFICATIONS Level 6 Professional Graduate Diploma in IT. March 2013 EXAMINERS REPORT. Knowledge Based Systems
BCS HIGHER EDUCATION QUALIFICATIONS Level 6 Professional Graduate Diploma in IT March 2013 EXAMINERS REPORT Knowledge Based Systems Overall Comments Compared to last year, the pass rate is significantly
H 2O gas: molecules are very far apart
Non-Covalent Molecular Forces 2/27/06 3/1/06 How does this reaction occur: H 2 O (liquid) H 2 O (gas)? Add energy H 2O gas: molecules are very far apart H 2O liquid: bonding between molecules Use heat
REMOTE CONTROL by DNA as a Bio-sensor -antenna.
REMOTE CONTROL by DNA as a Bio-sensor -antenna. "Piezoelectric quantum transduction is a fundamental property of at- distance induction of genetic control " Paolo Manzelli: [email protected] ; www.edscuola.it/lre.html;www.egocreanet.it
Protein Studies Using CAChe
Protein Studies Using CAChe Exercise 1 Building the Molecules of Interest, and Using the Protein Data Bank In the CAChe workspace, click File / pen, and navigate to the C:\Program Files\Fujitsu\ CAChe\Fragment
Geometric Transformations Grade Four
Ohio Standards Connection Geometry and Spatial Sense Benchmark I Describe, identify and model reflections, rotations and translations, using physical materials. Indicator 7 Identify, describe and use reflections
8-3 The Reactions of Photosynthesis Slide 1 of 51
8-3 The of Photosynthesis 1 of 51 Inside a Chloroplast Inside a Chloroplast In plants, photosynthesis takes place inside chloroplasts. Plant Chloroplast Plant cells 2 of 51 Inside a Chloroplast Chloroplasts
Neural Network and Genetic Algorithm Based Trading Systems. Donn S. Fishbein, MD, PhD Neuroquant.com
Neural Network and Genetic Algorithm Based Trading Systems Donn S. Fishbein, MD, PhD Neuroquant.com Consider the challenge of constructing a financial market trading system using commonly available technical
