The Quixote Project: a pioneering work in managing Computational Chemistry research data

Size: px
Start display at page:

Download "The Quixote Project: a pioneering work in managing Computational Chemistry research data"

Transcription

1 1 The Quixote Project: a pioneering work in managing Computational Chemistry research data Pablo Echenique echenique.p@gmail.com

2 2 The protein folding problem Folding Native state Human Albumin sequence 585 amino acid residues, 4496 atoms

3 2 The protein folding problem Folding Native state human protein sequences Human Albumin sequence 585 amino acid residues, 4496 atoms 1385 human protein structures * Data extracted on 11/14/2008

4 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, 2009.

5 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms

6 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms During: ~1ms = 10-3s

7 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms During: ~1ms = 10-3s Using Anton, a specific-purpose computer, and a code specially adapted to it

8 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms During: ~1ms = 10-3s Using Anton, a specific-purpose computer, and a code specially adapted to it They report calculations in larger systems of a size similar to the typical one and achieving 5μs/day

9 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms During: ~1ms = 10-3s Using Anton, a specific-purpose computer, and a code specially adapted to it They report calculations in larger systems of a size similar to the typical one and achieving 5μs/day They see hundreds of folding events

10 The computational requirements of the protein folding problem 3 State-of-the-art example in a dedicated computer: D. E. Shaw et al., Millisecond-scale molecular dynamics simulations on Anton, Proceedings of the ACM/IEEE Conference on Supercomputing (SC09), November 14-20, residue protein with explicit water molecules: ~ atoms During: ~1ms = 10-3s Using Anton, a specific-purpose computer, and a code specially adapted to it They report calculations in larger systems of a size similar to the typical one and achieving 5μs/day They see hundreds of folding events Simulating a 106-atoms system for 1s in Anton would still take more than 500 wall-clock years

11 4 Present-day energy functions have to be improved Force fields comparison: S. Piana, K. Lindorff-Larsen, and D. E. Shaw, How Robust Are Protein Folding Simulations with Respect to Force Field Parameterization?, Biophys. J. 100 (2011)

12 4 Present-day energy functions have to be improved Force fields comparison: S. Piana, K. Lindorff-Larsen, and D. E. Shaw, How Robust Are Protein Folding Simulations with Respect to Force Field Parameterization?, Biophys. J. 100 (2011) residue protein with explicit water molecules: ~5 103 atoms

13 4 Present-day energy functions have to be improved Force fields comparison: S. Piana, K. Lindorff-Larsen, and D. E. Shaw, How Robust Are Protein Folding Simulations with Respect to Force Field Parameterization?, Biophys. J. 100 (2011) residue protein with explicit water molecules: ~5 103 atoms During: ~0.5ms (different runs for different force fields)

14 4 Present-day energy functions have to be improved Force fields comparison: S. Piana, K. Lindorff-Larsen, and D. E. Shaw, How Robust Are Protein Folding Simulations with Respect to Force Field Parameterization?, Biophys. J. 100 (2011) residue protein with explicit water molecules: ~5 103 atoms During: ~0.5ms (different runs for different force fields) Four force fields: 2 CHARMM variants and 2 AMBER ones

15 4 Present-day energy functions have to be improved Force fields comparison: S. Piana, K. Lindorff-Larsen, and D. E. Shaw, How Robust Are Protein Folding Simulations with Respect to Force Field Parameterization?, Biophys. J. 100 (2011) residue protein with explicit water molecules: ~5 103 atoms During: ~0.5ms (different runs for different force fields) Four force fields: 2 CHARMM variants and 2 AMBER ones Native state and folding rates are coincident between force fields, but folding pathways are not:

16 5 Present-day energy functions have to be improved Mecánica cuántica no relativista = química cuántica

17 5 Present-day energy functions have to be improved Mecánica cuántica no relativista = química cuántica Peptides

18 Where do you start? 6

19 Mmm, soon no human will be able to keep the pace 7

20 OK, I still want to do my study. How do I find and get the data? 8 To find and select the data

21 OK, I still want to do my study. How do I find and get the data? To find and select the data 8 To get the data

22 OK, I still want to do my study. How do I find and get the data? To find and select the data 8 To get the data

23 9 2011? Internet??!?

24 9 2011? Internet??!??!?

25 And... what if I am interested in crystal structures of proteins? 10

26 And... what if I am interested in crystal structures of proteins? 11

27 And... what if I am interested in crystal structures of proteins? 12

28 So... why not in computational chemistry? 13 Why not in computational chemistry?

29 13 So... why not in computational chemistry? Why not in computational chemistry? Because there is little data and it can be searched in a 20th Century fashion? Because nobody needs to do such a search?

30 13 So... why not in computational chemistry? Why not in computational chemistry? Because there is little data and it can be searched in a 20th Century fashion? Because nobody needs to do such a search? Because the data is cheaper to obtain? The data I am going to show later can be estimated to cost approximately 5000 (then), not including electricity or salaries.

31 13 So... why not in computational chemistry? Why not in computational chemistry? Because there is little data and it can be searched in a 20th Century fashion? Because nobody needs to do such a search? Because the data is cheaper to obtain? The data I am going to show later can be estimated to cost approximately 5000 (then), not including electricity or salaries. Because it is theory and not experiment? Quantum chemistry calculations have been shown to be more precise than experiments in several cases.

32 13 So... why not in computational chemistry? Why not in computational chemistry? Because there is little data and it can be searched in a 20th Century fashion? Because nobody needs to do such a search? Because the data is cheaper to obtain? The data I am going to show later can be estimated to cost approximately 5000 (then), not including electricity or salaries. Because it is theory and not experiment? Quantum chemistry calculations have been shown to be more precise than experiments in several cases. Because nobody has decided to build the infrastructure?

33 Quixote 14

34 15 Real use case P. Echenique and J. L. Alonso, Efficient model chemistries for peptides. I. General framework and a study of the heterolevel approximation in RHF and MP2 with Pople split-valence basis sets, Journal of Computational Chemistry 29 (2008)

35 Real use case 16

36 Real use case 17

37 Put the data in an Apache server and link from the paper 18

38 Put the data in an Apache server and link from the paper 19 Pros Instead of the CD, I can send the link to you.

39 Put the data in an Apache server and link from the paper 19 Pros Instead of the CD, I can send the link to you. Cons I have to maintain and curate the server. Persistence of the link? No metadata. No searchability and indexing (metadata-wise). No searchability, indexing or analysis (QC-wise). Little interoperability with other services.

40 Put the data in an Institutional Repository 20

41 Put the data in an Institutional Repository 21

42 22 Put the data in an Institutional Repository Pros Instead of the CD, I can send the link to you. The server is maintained and curated for me. The funding for the infrastructure is typically long-term. The link is persistent. Rich metadata. Searchability and indexing (metadata-wise). Interoperability with other services (metadata-wise).

43 22 Put the data in an Institutional Repository Pros Instead of the CD, I can send the link to you. The server is maintained and curated for me. The funding for the infrastructure is typically long-term. The link is persistent. Rich metadata. Searchability and indexing (metadata-wise). Interoperability with other services (metadata-wise). Cons No searchability, indexing or analysis (QC-wise). No interoperability with other services (QC-wise). Lack of specificity. Motivation to upload?

44 Put the data in a thematic repository 23

45 24 Put the data in an thematic repository Pros Instead of the CD, I can send the link to you. Rich metadata. Searchability and indexing (metadata-wise). Interoperability with other services (metadata-wise). Searchability, indexing or analysis (QC-wise). Interoperability with other services (QC-wise). Specific (and hence motivation?). The server is maintained and curated for me (hopefully). The link is persistent (hopefully).

46 24 Put the data in an thematic repository Pros Instead of the CD, I can send the link to you. Rich metadata. Searchability and indexing (metadata-wise). Interoperability with other services (metadata-wise). Searchability, indexing or analysis (QC-wise). Interoperability with other services (QC-wise). Specific (and hence motivation?). The server is maintained and curated for me (hopefully). The link is persistent (hopefully). Cons The funding for the infrastructure can be unstable.

47 25 Quixote versatile, lightweight, modular, open source,...

48 Chempound 26

49 Chempound 27

50 50 Some of the people involved José Luis Alonso UZ-BIFI Pablo de Castro SONEX Pablo Echenique CSIC-UZ-BIFI Jorge Peter Estrada Murray-Rust CSIC-UZ-ZCAM Cambridge Thank you for your attention

DFN Project Chemie.DE: Building an Internet Information Service for Chemistry

DFN Project Chemie.DE: Building an Internet Information Service for Chemistry Building an Internet Information Service for Chemistry B. Kirste, H. Busse, W. Dreißig, H. Benedict, T. Richter, C. Schröter Fachbereich Chemie der Freien Universität Berlin URL for this talk: http://www.chemie.fu

More information

M.Sc. in Nano Technology with specialisation in Nano Biotechnology

M.Sc. in Nano Technology with specialisation in Nano Biotechnology M.Sc. in Nano Technology with specialisation in Nano Biotechnology Nanotechnology is all about designing, fabricating and controlling materials, components and machinery with dimensions on the nanoscale,

More information

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function Papers listed: Cell2 During the semester I will speak of information from several papers. For many of them you will not be required to read these papers, however, you can do so for the fun of it (and it

More information

Molecular Dynamics Simulations

Molecular Dynamics Simulations Molecular Dynamics Simulations Yaoquan Tu Division of Theoretical Chemistry and Biology, Royal Institute of Technology (KTH) 2011-06 1 Outline I. Introduction II. Molecular Mechanics Force Field III. Molecular

More information

1. Free energy with controlled uncertainty 2. The modes of ligand binding to DNA

1. Free energy with controlled uncertainty 2. The modes of ligand binding to DNA 1. Free energy with controlled uncertainty 2. The modes of ligand binding to DNA Tomáš Kubař Institute of Organic Chemistry and Biochemistry Praha, Czech Republic Thermodynamic Integration Alchemical change

More information

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices overview Pipeline Pilot Enterprise Server Pipeline Pilot Enterprise Server (PPES) is a powerful client-server platform that streamlines the integration and analysis of the vast quantities of data flooding

More information

Structure Check. Authors: Eduard Schreiner Leonardo G. Trabuco. February 7, 2012

Structure Check. Authors: Eduard Schreiner Leonardo G. Trabuco. February 7, 2012 University of Illinois at Urbana-Champaign NIH Resource for Macromolecular Modeling and Bioinformatics Beckman Institute Computational Biophysics Workshop Structure Check Authors: Eduard Schreiner Leonardo

More information

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group.

Amino Acids. Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain. Alpha Carbon. Carboxyl. Group. Protein Structure Amino Acids Amino acids are the building blocks of proteins. All AA s have the same basic structure: Side Chain Alpha Carbon Amino Group Carboxyl Group Amino Acid Properties There are

More information

MassMatrix Web Server User Manual

MassMatrix Web Server User Manual MassMatrix Web Server User Manual Version 2.2.3 or later Hua Xu, Ph. D. Center for Proteomics & Bioinformatics Case Western Reserve University August 2009 Main Navigation Bar of the Site MassMatrix Web

More information

Chapter 3 Molecules of Cells

Chapter 3 Molecules of Cells Bio 100 Molecules of cells 1 Chapter 3 Molecules of Cells Compounds containing carbon are called organic compounds Molecules such as methane that are only composed of carbon and hydrogen are called hydrocarbons

More information

What is molecular dynamics (MD) simulation and how does it work?

What is molecular dynamics (MD) simulation and how does it work? What is molecular dynamics (MD) simulation and how does it work? A lecture for CHM425/525 Fall 2011 The underlying physical laws necessary for the mathematical theory of a large part of physics and the

More information

YILUN SHANG. e λi. i=1

YILUN SHANG. e λi. i=1 LOWER BOUNDS FOR THE ESTRADA INDEX OF GRAPHS YILUN SHANG Abstract. Let G be a graph with n vertices and λ 1,λ,...,λ n be its eigenvalues. The Estrada index of G is defined as EE(G = n eλ i. In this paper,

More information

Recognizing Organic Molecules: Carbohydrates, Lipids and Proteins

Recognizing Organic Molecules: Carbohydrates, Lipids and Proteins Recognizing Organic Molecules: Carbohydrates, Lipids and Proteins Oct 15 8:05 PM What is an Organic Molecule? An Organic Molecule is a molecule that contains carbon and hydrogen and oxygen Carbon is found

More information

Mass Spectrometry Signal Calibration for Protein Quantitation

Mass Spectrometry Signal Calibration for Protein Quantitation Cambridge Isotope Laboratories, Inc. www.isotope.com Proteomics Mass Spectrometry Signal Calibration for Protein Quantitation Michael J. MacCoss, PhD Associate Professor of Genome Sciences University of

More information

Myoglobin and Hemoglobin

Myoglobin and Hemoglobin Myoglobin and Hemoglobin Myoglobin and hemoglobin are hemeproteins whose physiological importance is principally related to their ability to bind molecular oxygen. Myoglobin (Mb) The oxygen storage protein

More information

Yuri Pevzner. 18136 Lembrecht way Tampa, FL 33647 Phone: (610) 216-5465 E-mail:yuri_pevzner@email.com

Yuri Pevzner. 18136 Lembrecht way Tampa, FL 33647 Phone: (610) 216-5465 E-mail:yuri_pevzner@email.com Yuri Pevzner 18136 Lembrecht way Tampa, FL 33647 Phone: (610) 216-5465 E-mail:yuri_pevzner@email.com Career Objective: Computational Chemistry/Chemoinformatics Software Engineer position in a scientific

More information

Peptide Bonds: Structure

Peptide Bonds: Structure Peptide Bonds: Structure Peptide primary structure The amino acid sequence, from - to C-terminus, determines the primary structure of a peptide or protein. The amino acids are linked through amide or peptide

More information

Statistical Inference for Big Data Problems in Molecular Biophysics

Statistical Inference for Big Data Problems in Molecular Biophysics Statistical Inference for Big Data Problems in Molecular Biophysics Arvind Ramanathan 1, Andrej Savol 2,4, Virginia Burger 2,4, Shannon Quinn 2,4, Pratul K. Agarwal 3, Chakra Chennubhotla 4 1 Computational

More information

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011 Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear

More information

Data Integration Extravaganza

Data Integration Extravaganza EPM Suite (Hyperion) Data Integration Extravaganza Technologies Hyperion Application Link (HAL) Data Integration Management (DIM) Oracle Data Integrator (ODI) Enterprise Performance Management Architect

More information

CSC 2427: Algorithms for Molecular Biology Spring 2006. Lecture 16 March 10

CSC 2427: Algorithms for Molecular Biology Spring 2006. Lecture 16 March 10 CSC 2427: Algorithms for Molecular Biology Spring 2006 Lecture 16 March 10 Lecturer: Michael Brudno Scribe: Jim Huang 16.1 Overview of proteins Proteins are long chains of amino acids (AA) which are produced

More information

ProteinPilot Report for ProteinPilot Software

ProteinPilot Report for ProteinPilot Software ProteinPilot Report for ProteinPilot Software Detailed Analysis of Protein Identification / Quantitation Results Automatically Sean L Seymour, Christie Hunter SCIEX, USA Pow erful mass spectrometers like

More information

Guide for Bioinformatics Project Module 3

Guide for Bioinformatics Project Module 3 Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first

More information

Peptide Bond Amino acids are linked together by peptide bonds to form polypepetide chain.

Peptide Bond Amino acids are linked together by peptide bonds to form polypepetide chain. Peptide Bond Peptide Bond Amino acids are linked together by peptide bonds to form polypepetide chain. + H 2 O 2 Peptide bonds are strong and not broken by conditions that denature proteins, such as heating.

More information

Hydrogen Bonds The electrostatic nature of hydrogen bonds

Hydrogen Bonds The electrostatic nature of hydrogen bonds Hydrogen Bonds Hydrogen bonds have played an incredibly important role in the history of structural biology. Both the structure of DNA and of protein a-helices and b-sheets were predicted based largely

More information

2010, Post-doctoral researcher, Institute of Organic and Pharmaceutical Chemistry, National Hellenic Research Foundation, Athens, Greece.

2010, Post-doctoral researcher, Institute of Organic and Pharmaceutical Chemistry, National Hellenic Research Foundation, Athens, Greece. Curriculum Vitae LEONIS Georgios, Ph.D Post-doctoral researcher Institute of Organic and Pharmaceutical Chemistry Tel: (30) 210-922-4931 Email : gleonis@eie.gr Area of specialization Computational bio-physical

More information

Phase determination methods in macromolecular X- ray Crystallography

Phase determination methods in macromolecular X- ray Crystallography Phase determination methods in macromolecular X- ray Crystallography Importance of protein structure determination: Proteins are the life machinery and are very essential for the various functions in the

More information

NMR and other Instrumental Techniques in Chemistry and the proposed National Curriculum.

NMR and other Instrumental Techniques in Chemistry and the proposed National Curriculum. NMR and other Instrumental Techniques in Chemistry and the proposed National Curriculum. Dr. John Jackowski Chair of Science, Head of Chemistry Scotch College Melbourne john.jackowski@scotch.vic.edu.au

More information

Selvita Integrated drug discovery collaborations

Selvita Integrated drug discovery collaborations Selvita Integrated drug discovery collaborations Magdalena Żabka Selvita overview Origins Polish biotechnology research company headquartered in Krakow Mission Resources We deliver comprehensive solutions

More information

PROTEINS THE PEPTIDE BOND. The peptide bond, shown above enclosed in the blue curves, generates the basic structural unit for proteins.

PROTEINS THE PEPTIDE BOND. The peptide bond, shown above enclosed in the blue curves, generates the basic structural unit for proteins. Ca 2+ The contents of this module were developed under grant award # P116B-001338 from the Fund for the Improvement of Postsecondary Education (FIPSE), United States Department of Education. However, those

More information

file:///c /Documents%20and%20Settings/terry/Desktop/DOCK%20website/terry/Old%20Versions/dock4.0_faq.txt

file:///c /Documents%20and%20Settings/terry/Desktop/DOCK%20website/terry/Old%20Versions/dock4.0_faq.txt -- X. Zou, 6/28/1999 -- Questions on installation of DOCK4.0.1: ======================================= Q. Can I run DOCK on platforms other than SGI (e.g., SparcStations, DEC Stations, Pentium, etc.)?

More information

Investor Newsletter. Storage Made Easy Cloud Appliance High Availability Options WHAT IS THE CLOUD APPLIANCE?

Investor Newsletter. Storage Made Easy Cloud Appliance High Availability Options WHAT IS THE CLOUD APPLIANCE? Investor Newsletter Storage Made Easy Cloud Appliance High Availability Options WHAT IS THE CLOUD APPLIANCE? The SME Cloud Appliance is a software platform that enables companies to enhance their existing

More information

Free web-based solution to manage photographs that could be used to manage collection items online if there is a photo of every item.

Free web-based solution to manage photographs that could be used to manage collection items online if there is a photo of every item. Review of affordable Collections Database options Our wish list and needs for the Anna Maria Island Historical Society: - Free, or inexpensive - Web-based, cloud storage solution, no server exists at the

More information

Life as a scientific database curator

Life as a scientific database curator Life as a scientific database curator Sandra Orchard EBI is an Outstation of the European Molecular Biology Laboratory. What is a database curator Curator OED - a keeper of a museum or other collection

More information

Research Article Cloud Computing for Protein-Ligand Binding Site Comparison

Research Article Cloud Computing for Protein-Ligand Binding Site Comparison BioMed Research International Volume 213, Article ID 17356, 7 pages http://dx.doi.org/1.1155/213/17356 Research Article Cloud Computing for Protein-Ligand Binding Site Comparison Che-Lun Hung 1 and Guan-Jie

More information

CNAS ASSESSMENT COMMITTEE CHEMISTRY (CH) DEGREE PROGRAM CURRICULAR MAPPINGS AND COURSE EXPECTED STUDENT LEARNING OUTCOMES (SLOs)

CNAS ASSESSMENT COMMITTEE CHEMISTRY (CH) DEGREE PROGRAM CURRICULAR MAPPINGS AND COURSE EXPECTED STUDENT LEARNING OUTCOMES (SLOs) CNAS ASSESSMENT COMMITTEE CHEMISTRY (CH) DEGREE PROGRAM CURRICULAR MAPPINGS AND COURSE EXPECTED STUDENT LEARNING OUTCOMES (SLOs) DEGREE PROGRAM CURRICULAR MAPPING DEFINED PROGRAM SLOs Course No. 11 12

More information

INTRODUCTION: SQL SERVER ACCESS / LOGIN ACCOUNT INFO:

INTRODUCTION: SQL SERVER ACCESS / LOGIN ACCOUNT INFO: INTRODUCTION: You can extract data (i.e. the total cost report) directly from the Truck Tracker SQL Server database by using a 3 rd party data tools such as Excel or Crystal Reports. Basically any software

More information

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge

More information

CHEM 451 BIOCHEMISTRY I. SUNY Cortland Fall 2010

CHEM 451 BIOCHEMISTRY I. SUNY Cortland Fall 2010 CHEM 451 BIOCHEMISTRY I SUNY Cortland Fall 2010 Instructor: Dr. Frank Rossi Office: Bowers 135 Office Hours: Mon. 2:30-4:00, Wed. 4:00-5:30, Friday 2:30-3:00, or by appointment. Extra evening office hours

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

Laboration 1. Identifiering av proteiner med Mass Spektrometri. Klinisk Kemisk Diagnostik

Laboration 1. Identifiering av proteiner med Mass Spektrometri. Klinisk Kemisk Diagnostik Laboration 1 Identifiering av proteiner med Mass Spektrometri Klinisk Kemisk Diagnostik Sven Kjellström 2014 kjellstrom.sven@gmail.com 0702-935060 Laboration 1 Klinisk Kemisk Diagnostik Identifiering av

More information

Ahsay Online Backup Suite v5.1. Whitepaper In-File Delta

Ahsay Online Backup Suite v5.1. Whitepaper In-File Delta Suite v5.1 Version 5.1 June 2006 Table of Content 1 Introduction...3 1.1 Differential Delta Mode...3 1.2 Incremental Delta Mode...3 2 Delta Generation...4 3 Block Size Setting...4 4 During Backup...5 5

More information

Digital Asset Management. Content Control for Valuable Media Assets

Digital Asset Management. Content Control for Valuable Media Assets Digital Asset Management Content Control for Valuable Media Assets Overview Digital asset management is a core infrastructure requirement for media organizations and marketing departments that need to

More information

Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors

Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors Joe Davis, Sandeep Patel, and Michela Taufer University of Delaware Outline Introduction Introduction to GPU programming Why MD

More information

Guidance for Industry

Guidance for Industry Guidance for Industry Interpreting Sameness of Monoclonal Antibody Products Under the Orphan Drug Regulations U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation

More information

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Dai Lu, Ph.D. dlu@tamhsc.edu Tel: 361-221-0745 Office: RCOP, Room 307 Drug Discovery and Development Drug Molecules Medicinal

More information

Introduction to Research Data Management

Introduction to Research Data Management Introduction to Research Data Management Marta Teperek, Veronica Phillips 30/10/2015 University of Cambridge TODAY: Mixture of activities and talking Introduction 1. Backup and exchange strategies 2. How

More information

Algorithm and computational complexity of Insulin

Algorithm and computational complexity of Insulin Algorithm and computational complexity Insulin Lutvo Kurić Bosnia and Herzegovina, Novi Travnik, Kalinska 7 Abstract:This paper discusses cyberinformation studies the amino acid composition insulin, in

More information

Organic Molecules of Life - Exercise 2

Organic Molecules of Life - Exercise 2 Organic Molecules of Life - Exercise 2 Objectives -Know the difference between a reducing sugar and a non-reducing sugar. -Distinguish Monosaccharides from Disaccharides and Polysaccharides -Understand

More information

Workshop IIc. Manual interpretation of MS/MS spectra. Ebbing de Jong. Center for Mass Spectrometry and Proteomics Phone (612)625-2280 (612)625-2279

Workshop IIc. Manual interpretation of MS/MS spectra. Ebbing de Jong. Center for Mass Spectrometry and Proteomics Phone (612)625-2280 (612)625-2279 Workshop IIc Manual interpretation of MS/MS spectra Ebbing de Jong Why MS/MS spectra? The information contained in an MS spectrum (m/z, isotope spacing and therefore z ) is not enough to tell us the amino

More information

When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want

When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want 1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very

More information

Tips & Tricks GPC/SEC: Analysis and Calibration For Low Molar Mass Macromolecules

Tips & Tricks GPC/SEC: Analysis and Calibration For Low Molar Mass Macromolecules GPC/SEC: Analysis and Calibration For Low Molar Mass Macromolecules Daniela Held, Silvia Fugmann, and Peter Kilz, PSS Polymer Standards Service GmbH, Mainz, Germany. Macromolecules can range from several

More information

Non-Stop for Apache HBase: Active-active region server clusters TECHNICAL BRIEF

Non-Stop for Apache HBase: Active-active region server clusters TECHNICAL BRIEF Non-Stop for Apache HBase: -active region server clusters TECHNICAL BRIEF Technical Brief: -active region server clusters -active region server clusters HBase is a non-relational database that provides

More information

The Deployment Production Line

The Deployment Production Line The Deployment Production Line Jez Humble, Chris Read, Dan North ThoughtWorks Limited jez.humble@thoughtworks.com, chris.read@thoughtworks.com, dan.north@thoughtworks.com Abstract Testing and deployment

More information

Information Technology: Principles and Strategic Aims

Information Technology: Principles and Strategic Aims Information Technology: Principles and Strategic Aims As observed in the University of Cambridge IT Strategy, the University is a complex and diverse organization whose IT requirements vary depending on

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland EISCAT User Meeting, Uppsala,6 May 2013 2 Exponential growth Data trends Zettabytes

More information

Molecular Genetics. RNA, Transcription, & Protein Synthesis

Molecular Genetics. RNA, Transcription, & Protein Synthesis Molecular Genetics RNA, Transcription, & Protein Synthesis Section 1 RNA AND TRANSCRIPTION Objectives Describe the primary functions of RNA Identify how RNA differs from DNA Describe the structure and

More information

Management of Proteomics Data: 2D Gel Electrophoresis and Other Methods

Management of Proteomics Data: 2D Gel Electrophoresis and Other Methods Management of Proteomics Data: 2D Gel Electrophoresis and Other Methods Philip Andrews National Resource for Proteomics & Pathway Mapping Michigan Proteome Consortium University of Michigan Outline of

More information

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are put together. 1 A more detailed view of a single protein

More information

Open Access to Manuscripts, Open Science, and Big Data

Open Access to Manuscripts, Open Science, and Big Data Open Access to Manuscripts, Open Science, and Big Data Progress, and the Elsevier Perspective in 2013 Presented by: Dan Morgan Title: Senior Manager Access Relations, Global Academic Relations Company

More information

Foundations of software engineering

Foundations of software engineering Foundations of software engineering Waterfalls, V s and Spirals: Standard SE Methodologies Dr. Julie Greensmith G51 Objectives To introduce three of the major software process models: Waterfall methods

More information

Fedora Distributed data management (SI1)

Fedora Distributed data management (SI1) Fedora Distributed data management (SI1) Mohamed Rafi DART UQ Outline of Work Package To enable Fedora to natively handle large datasets. Explore SRB integration at the storage level of the repository

More information

Scientific versus Business Workflows

Scientific versus Business Workflows 2 Scientific versus Business Workflows Roger Barga and Dennis Gannon The formal concept of a workflow has existed in the business world for a long time. An entire industry of tools and technology devoted

More information

Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping

Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping Thermo Scientific PepFinder Software A New Paradigm for Peptide Mapping For Conclusive Characterization of Biologics Deep Protein Characterization Is Crucial Pharmaceuticals have historically been small

More information

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Prof. Dr. Martin Hofmann-Apitius Head of the Department of Bioinformatics Fraunhofer Institute for Algorithms and

More information

INTRODUCTION TO PROTEIN STRUCTURE

INTRODUCTION TO PROTEIN STRUCTURE Name Class: Partner, if any: INTRODUCTION TO PROTEIN STRUCTURE PRIMARY STRUCTURE: 1. Write the complete structural formula of the tripeptide shown (frame 10). Circle and label the three sidechains which

More information

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water

Lecture Overview. Hydrogen Bonds. Special Properties of Water Molecules. Universal Solvent. ph Scale Illustrated. special properties of water Lecture Overview special properties of water > water as a solvent > ph molecules of the cell > properties of carbon > carbohydrates > lipids > proteins > nucleic acids Hydrogen Bonds polarity of water

More information

AppsLab Library. A new online platform for Thermo Fisher Scientific applications and methods. Susanne Kramer, Ph.D. Product Manager AppsLab Library

AppsLab Library. A new online platform for Thermo Fisher Scientific applications and methods. Susanne Kramer, Ph.D. Product Manager AppsLab Library AppsLab Library A new online platform for Thermo Fisher Scientific applications and methods Susanne Kramer, Ph.D. Product Manager AppsLab Library PO71675-EN 0515S 1 The world leader in serving science

More information

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1 Protein Physics A. V. Finkelstein & O. B. Ptitsyn LECTURE 1 PROTEINS Functions in a Cell MOLECULAR MACHINES BUILDING BLOCKS of a CELL ARMS of a CELL ENZYMES - enzymatic catalysis of biochemical reactions

More information

MS Amanda Standalone User Manual

MS Amanda Standalone User Manual MS Amanda Standalone User Manual February 2014 Viktoria Dorfer, Peter Pichler, Stephan Winkler, and Karl Mechtler 1. Installation of MS Amanda Standalone In order to install MS Amanda Standalone please

More information

data.bris: collecting and organising repository metadata, an institutional case study

data.bris: collecting and organising repository metadata, an institutional case study Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris

More information

IO Informatics The Sentient Suite

IO Informatics The Sentient Suite IO Informatics The Sentient Suite Our software, The Sentient Suite, allows a user to assemble, view, analyze and search very disparate information in a common environment. The disparate data can be numeric

More information

18.2 Protein Structure and Function: An Overview

18.2 Protein Structure and Function: An Overview 18.2 Protein Structure and Function: An Overview Protein: A large biological molecule made of many amino acids linked together through peptide bonds. Alpha-amino acid: Compound with an amino group bonded

More information

Analytical Information Management for the IODP Riserless Drilling Vessel

Analytical Information Management for the IODP Riserless Drilling Vessel Analytical Information Management for the IODP Riserless Drilling Vessel Paul Foster David Fackler Peter Blum David Houpt USIO Analytical Services May 8, 2006 Project Framework Scientific Ocean Drilling

More information

LifeTein in Industrial Production of Therapeutic Peptides. Phil Moore, PhD Director of Business Development LifeTein LLC, NJ, USA

LifeTein in Industrial Production of Therapeutic Peptides. Phil Moore, PhD Director of Business Development LifeTein LLC, NJ, USA LifeTein in Industrial Production of Therapeutic Peptides Phil Moore, PhD Director of Business Development LifeTein LLC, NJ, USA 1 Outline Market and Technology Trend LifeTein s Technology portfolio LifeTein

More information

UV-Vis Vis spectroscopy. Electronic absorption spectroscopy

UV-Vis Vis spectroscopy. Electronic absorption spectroscopy UV-Vis Vis spectroscopy Electronic absorption spectroscopy Absortpion spectroscopy Provide information about presence and absence of unsaturated functional groups Useful adjunct to IR Determination of

More information

Structure of proteins

Structure of proteins Structure of proteins Primary structure: is amino acids sequence or the covalent structure (50-2500) amino acids M.Wt. of amino acid=110 Dalton (56 110=5610 Dalton). Single chain or more than one polypeptide

More information

High-Performance, Low-Cost Computational Chemistry: Servers in a Stick, Box, and Cloud. Nathan Vance Polik Group Hope College February 19, 2015

High-Performance, Low-Cost Computational Chemistry: Servers in a Stick, Box, and Cloud. Nathan Vance Polik Group Hope College February 19, 2015 High-Performance, Low-Cost Computational Chemistry: Servers in a Stick, Box, and Cloud Nathan Vance Polik Group Hope College February 19, 2015 Outline The use and history of computing in chemistry The

More information

Committee on WIPO Standards (CWS)

Committee on WIPO Standards (CWS) E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION

More information

Molecular Modelling of DNA. Charlie Laughton University of Nottingham

Molecular Modelling of DNA. Charlie Laughton University of Nottingham Molecular Modelling of DNA Charlie Laughton University of Nottingham Why model DNA? Sequences/structures you can t crystallise or get by NMR Drug design Understanding protein-dna recognition Quantitating

More information

Student Text and E-Book ISBN: 0-8053-6624-5

Student Text and E-Book ISBN: 0-8053-6624-5 Course Syllabus Advanced Biology A Syllabus Required Student Text: Campbell Biology (6 th edition) Student Text and E-Book ISBN: 0-8053-6624-5 Developer: Judith S. Nuno Email: jdenuno@mhs-la.org Course

More information

Lecture Data Warehouse Systems

Lecture Data Warehouse Systems Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in

More information

Some terms: An antigen is a molecule or pathogen capable of eliciting an immune response

Some terms: An antigen is a molecule or pathogen capable of eliciting an immune response Overview of the immune system We continue our discussion of protein structure by considering the structure of antibodies. All organisms are continually subject to attack by microorganisms and viruses.

More information

Assessment Method 1: Oral seminar presentation

Assessment Method 1: Oral seminar presentation 2013-2014 Assessment Report College of Sciences & Mathematics Chemistry & Biochemistry Chemistry, Master's Expected Outcome 1: Effective Oral Communication Skills Students in M.S. degree Program will demonstrate

More information

How To Make A Drug From A Peptide

How To Make A Drug From A Peptide MODERN PERSPECTIVES ON PEPTIDE SYNTHESIS INTRODUCTION WHITEPAPER www.almacgroup.com The complexity of synthetic peptide products, whether as reagents used in research or as therapeutic APIs, is increasing.

More information

CLARIN-NL Second Open Call. Jan Odijk CLARIN-NL Call 2 Info-session Amsterdam, 26 Aug 2010

CLARIN-NL Second Open Call. Jan Odijk CLARIN-NL Call 2 Info-session Amsterdam, 26 Aug 2010 CLARIN-NL Second Open Call Jan Odijk CLARIN-NL Call 2 Info-session Amsterdam, 26 Aug 2010 Overview Background Project Types Project Goals Roles Resource Curation Projects Demonstrator Projects CLARIN Centres

More information

Application Note # LCMS-62 Walk-Up Ion Trap Mass Spectrometer System in a Multi-User Environment Using Compass OpenAccess Software

Application Note # LCMS-62 Walk-Up Ion Trap Mass Spectrometer System in a Multi-User Environment Using Compass OpenAccess Software Application Note # LCMS-62 Walk-Up Ion Trap Mass Spectrometer System in a Multi-User Environment Using Compass OpenAccess Software Abstract Presented here is a case study of a walk-up liquid chromatography

More information

Checklist and guidance for a Data Management Plan

Checklist and guidance for a Data Management Plan Checklist and guidance for a Data Management Plan Please cite as: DMPTuuli-project. (2016). Checklist and guidance for a Data Management Plan. v.1.0. Available online: https://wiki.helsinki.fi/x/dzeacw

More information

The Data Management Plan with. Dataverse. Mercè Crosas, Ph.D. Director of Product Development

The Data Management Plan with. Dataverse. Mercè Crosas, Ph.D. Director of Product Development The Data Management Plan with Dataverse Mercè Crosas, Ph.D. Director of Product Development The Dataverse The Data Management Plan The Data Management Plan with Dataverse The Dataverse The Data Management

More information

Document Management Solutions

Document Management Solutions TM SCAN SOLUTIONS Efficient, Flexible, Powerful, Cost-Effective Document Management Solutions Introduction About Pearl Scan Pearl Scan Solutions Ltd is one of the leading providers of document management

More information

Hands-on exercises on solvent models & electrostatics EMBnet - Molecular Modeling Course 2005

Hands-on exercises on solvent models & electrostatics EMBnet - Molecular Modeling Course 2005 Hands-on exercises on solvent models & electrostatics EMBnet - Molecular Modeling Course 2005 Exercise 1. The purpose of this exercise is to color the solvent accessible surface of a protein according

More information

Integrating Research Information: Requirements of Science Research

Integrating Research Information: Requirements of Science Research Integrating Research Information: Requirements of Science Research Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory brian.matthews@stfc.ac.uk The science

More information

Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery

Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery Center for Information Services and High Performance Computing (ZIH) Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery Richard Grunzke*, Jens Krüger, Sandra Gesing, Sonja

More information

Structure Tools and Visualization

Structure Tools and Visualization Structure Tools and Visualization Gary Van Domselaar University of Alberta gary.vandomselaar@ualberta.ca Slides Adapted from Michel Dumontier, Blueprint Initiative 1 Visualization & Communication Visualization

More information

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article

More information

http://faculty.sau.edu.sa/h.alshehri

http://faculty.sau.edu.sa/h.alshehri http://faculty.sau.edu.sa/h.alshehri Definition: Proteins are macromolecules with a backbone formed by polymerization of amino acids. Proteins carry out a number of functions in living organisms: - They

More information

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton? Problem 1. (12 points total, 4 points each) The molecular weight of an unspecified protein, at physiological conditions, is 70,000 Dalton, as determined by sedimentation equilibrium measurements and by

More information

Introduction to Proteomics

Introduction to Proteomics Introduction to Proteomics Åsa Wheelock, Ph.D. Division of Respiratory Medicine & Karolinska Biomics Center asa.wheelock@ki.se In: Systems Biology and the Omics Cascade, Karolinska Institutet, June 9-13,

More information