Bioinformatics & Protein Database Concepts. Learning Objective. Proteomics Bioinformatics and Protein Database Concepts


1 Bioinformatics & Protein Database Concepts With the emergence of high-throughput techniques for generating protein sequences, computational tools are required for storing, sharing, analyzing and updating this data. Databases and their associated features provide tools for the meaningful storage of biological data. Learning Objective In this Learning Object, the learner will be able to: recall procedures involved in wet lab and bioinformatics, and recall

2 From wet lab to Bioinformatics The cells present in the tissue culture are lysed, releasing a crude extract. This extract is centrifuged to separate the protein mixture from the cell debris. The supernatant obtained is a mixture of proteins with a variety of properties. The protein of interest must then be isolated from this mixture.

3 From wet lab to Bioinformatics The protein of interest is separated from the protein mixture present in the supernatant. This is carried out by suitable techniques such as chromatography or electrophoresis, which separate proteins on the basis of properties such as charge and mass.

4 From wet lab to Bioinformatics Edman degradation employs the reagent phenyl isothiocyanate, which reacts with the amino-terminal residue of the peptide to give the phenylthiocarbamoyl derivative of that amino acid residue. Under acidic conditions, the derivatized terminal residue is cleaved from the peptide as a cyclic derivative and converted to a stable phenylthiohydantoin (PTH) amino acid, which can then be identified by chromatographic techniques. The procedure is then repeated to identify each N-terminal amino acid sequentially.
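The cycle described above can be sketched in a few lines of Python. This is only an idealized illustration: it assumes every cycle removes exactly one N-terminal residue with perfect yield, and the function name `edman_cycles` and the example peptide are hypothetical choices, not part of the original slides.

```python
def edman_cycles(peptide, max_cycles=None):
    """Yield (cycle_number, released_residue, remaining_peptide) per cycle.

    Idealized model: each cycle labels and releases exactly one
    N-terminal residue and leaves the rest of the chain intact.
    """
    remaining = peptide
    cycle = 0
    while remaining and (max_cycles is None or cycle < max_cycles):
        cycle += 1
        # The first character plays the role of the released PTH-amino acid.
        released, remaining = remaining[0], remaining[1:]
        yield cycle, released, remaining


# Reading the released residues in order reconstructs the sequence.
sequence_read = "".join(res for _, res, _ in edman_cycles("DAHKSE"))
first_two = list(edman_cycles("DAHK", max_cycles=2))
```

Real instruments lose yield at every cycle, which is why the number of residues that can be read in practice is limited; the sketch ignores this.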

5 From wet lab to Bioinformatics The mass spectrometer is an instrument that produces charged molecular species in vacuum, separates them by means of electric and magnetic fields and measures the mass-to-charge ratios and relative abundances of the ions thus produced. A tandem mass spectrometer makes use of a combination of two mass analyzers, separated by a collision cell, in order to provide improved resolution of the fragment ions. The first mass analyzer usually operates in a scanning mode in order to select only a particular peptide ion which is further fragmented and resolved in the second analyzer. This can be used for protein sequencing studies.
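As a rough illustration of the mass-to-charge ratio measured by the instrument, the sketch below computes m/z for a protonated peptide ion as (M + z × proton mass) / z, using standard monoisotopic residue masses. The function names are hypothetical, and the model assumes an unmodified peptide ionized only by protonation.

```python
# Monoisotopic residue masses in daltons (amino acid minus one water).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # one water for the free N- and C-termini
PROTON = 1.007276  # mass of the charge carrier


def monoisotopic_mass(peptide):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(MONO[aa] for aa in peptide) + WATER


def mz(peptide, charge):
    """m/z of the peptide carrying `charge` protons (charge >= 1)."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (monoisotopic_mass(peptide) + charge * PROTON) / charge
```

For the same peptide, a higher charge state gives a lower m/z, which is why one peptide can appear at several positions in a spectrum.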

6 All data related to a protein can be divided into four broad categories: Sequence details, Source, Gene details and References. Sequence details contain the features of a protein's amino acid sequence, such as its length, location, patterns and identifiers. Source contains information about the biological source from which the protein was obtained. Gene details describe the gene from which the protein is expressed. References contain the details of the research publications in which the study was reported.
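One way to picture the four categories is as fields of a single record. The sketch below is an assumption made for illustration only: the class name, field names and placeholder values are hypothetical and do not correspond to any real database entry or schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ProteinRecord:
    # Sequence details
    accession: str
    sequence: str
    # Source
    source_organism: str
    # Gene details
    gene_name: str
    # References
    references: List[str] = field(default_factory=list)

    @property
    def length(self) -> int:
        """Sequence length, derived rather than stored."""
        return len(self.sequence)


# Placeholder values, not a real entry.
record = ProteinRecord(
    accession="EXAMPLE_1",
    sequence="MKTAYIAK",
    source_organism="example organism",
    gene_name="exampleGene",
    references=["placeholder citation"],
)
```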

7 Database design is done at various levels: Physical, Logical and View. At the physical level, we define the purpose of the database, in accordance with its prospected usage. At the logical level, we define the tables, the attributes of the tables and the relationships between tables. The logical level is the most complex and important part of the schema and requires a thorough understanding of the data, its context and its relationships. At the View level, we define the views and appearance of the database.
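The logical and view levels can be illustrated with a small in-memory SQLite database. This is a minimal sketch under assumed names: the tables `gene` and `protein`, the view `protein_summary` and the inserted values are all hypothetical and far simpler than any real protein database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Logical level: tables, their attributes and the relationship between them.
# View level: a read-only presentation built on top of the tables.
conn.executescript("""
CREATE TABLE gene (
    gene_id INTEGER PRIMARY KEY,
    name    TEXT NOT NULL
);
CREATE TABLE protein (
    protein_id INTEGER PRIMARY KEY,
    accession  TEXT UNIQUE NOT NULL,
    sequence   TEXT NOT NULL,
    organism   TEXT,
    gene_id    INTEGER REFERENCES gene(gene_id)
);
CREATE VIEW protein_summary AS
SELECT p.accession, g.name AS gene, p.organism,
       LENGTH(p.sequence) AS length
FROM protein p JOIN gene g ON g.gene_id = p.gene_id;
""")

# Placeholder rows, not real entries.
conn.execute("INSERT INTO gene (gene_id, name) VALUES (1, 'exampleGene')")
conn.execute(
    "INSERT INTO protein (accession, sequence, organism, gene_id) "
    "VALUES (?, ?, ?, ?)",
    ("EXAMPLE_1", "MKTAYIAK", "example organism", 1),
)

rows = conn.execute("SELECT * FROM protein_summary").fetchall()
```

Users querying the view see a gene name and a sequence length without needing to know how the underlying tables are related.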

8 A typical biological database can be characterized by its Type and its Tools. The Type defines the category of data that it includes, such as sequences, domains or structures. This means that the database's most prominent content is either sequences, domains or structures, and that it will primarily be used for their analysis. The Tools define the platforms that the site provides for gaining insight into the protein data.

9 For extracting protein information from a database, users can give a variety of input terms. These can be: unique ID, molecular name, amino-acid sequence, keyword, literature, gene, or taxonomy.

10 Once the user submits the query, the output can be in multiple formats. The general information that users can obtain from protein databases includes: a general description of the protein molecule; annotations of the protein; the name and description of the gene that encodes it; the ID of the same protein in other relevant databases; details of the experiments conducted to characterize the protein; details of the protein's secondary structure; details of the organism used as the source of the protein; citations of the research conducted on the protein; and patterns occurring within the sequence and their analysis.

11 This slide shows the different kinds of analysis that can be conducted on a given protein sequence. The query can be the protein name, sequence or any other identifier of the protein. In this example, we provide the protein sequence as input. Once the query sequence is entered into the analysis tool, it can return various kinds of results: identification of the protein from its sequence; physico-chemical properties such as chemical formula, half-life, isoelectric point and molecular weight; aligned sequences and structures; variable and conserved residues; predicted secondary and tertiary structures; and synonyms and scientific terminology for the protein.

12 We explain the usage of protein databases using the example of the Human Serum Albumin protein. If you want to view a specific step in the case study, click on the relevant panel. Otherwise, click on View Full Animation.

13 Open a web browser and go to the database portal. In the top right corner of the page there is a search box. Click on the drop-down arrow beside the search box to get the list of databases that can be searched. Select UniProtKB. Type the name of the protein of your choice (e.g. Serum Albumin) in the text box after the word 'for'.

14 The results page for the search shows 179 hits for our query. It is shown on the top of the page. The first 25 of them are shown on the first page, which can be viewed by scrolling down the page. Click on the entry of your choice. Here we click on the human Albumin hit (ALBU_HUMAN).

15 The top of the result page looks like this. Scroll down the page to find the heading Sequences. Click on the FASTA tab next to the sequence of interest. The FASTA sequence opens in a new tab. Save this FASTA sequence on your computer.

16 Once the FASTA sequence is retrieved, we can subject it to a variety of protein analysis tools, which are broadly classified into sequence similarity search tools, primary structural analysis tools, phylogenetic analysis tools, molecular modeling and visualisation tools, and structure prediction tools. Here we explore the web-based service called ProtParam, which belongs to the primary structural analysis tools. For exploring other such services, users can visit

17 The front-end of the tool asks you to input either the accession ID of the protein under study or its sequence. Delete the first line (the descriptive line) from your FASTA sequence so that only the amino acid sequence remains. Click on Compute Parameters. On the results page, scroll down to find the various physico-chemical parameters of this protein.
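Removing the descriptive line by hand is easy to get wrong for long sequences. The sketch below shows one way to do it in Python, assuming a single-record FASTA file in which the header starts with `>`; the function name and the example record are hypothetical.

```python
def fasta_to_sequence(fasta_text):
    """Return the bare amino acid sequence from single-record FASTA text.

    Header lines (starting with '>') and blank lines are dropped, and the
    wrapped sequence lines are joined into one string.
    """
    lines = [line.strip() for line in fasta_text.splitlines()]
    residues = [line for line in lines if line and not line.startswith(">")]
    return "".join(residues).upper()


# Placeholder record, not a real database entry.
example = """>EXAMPLE_1 hypothetical protein
MKTAYIAK
QRQISFVK
"""

bare_sequence = fasta_to_sequence(example)
```

The resulting string contains only residue letters and can be pasted into a tool that expects a raw sequence.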

18 This part of the results gives the percentage of each amino acid in the sequence. The highlighted region indicates the CSV file link. CSV stands for Comma Separated Values; such files can be opened in text editors as well as in spreadsheet programs such as Microsoft Excel. The file can be downloaded in its comma-separated format by clicking on the link.
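The composition table and its CSV export can be reproduced approximately as follows. This is a sketch of the general idea, not the tool's own code: the column names `amino_acid` and `percent` and the one-decimal rounding are assumptions.

```python
import csv
import io
from collections import Counter


def composition_percent(sequence):
    """Percentage of each residue type present in the sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    counts = Counter(sequence)
    total = len(sequence)
    return {aa: 100.0 * n / total for aa, n in sorted(counts.items())}


def composition_csv(sequence):
    """Serialize the composition as comma-separated text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["amino_acid", "percent"])
    for aa, pct in composition_percent(sequence).items():
        writer.writerow([aa, f"{pct:.1f}"])
    return buffer.getvalue()
```

Writing the returned text to a file with a `.csv` extension gives something a spreadsheet program can open directly.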

19 Other information that can be obtained includes the chemical formula of the protein, the total number of atoms in the protein, the total numbers of negatively and positively charged residues, the estimated half-life of the protein (the predicted time for half of the protein to be degraded), and the grand average of hydropathicity (GRAVY), which gives an insight into the solubility of the protein. Hydrophobic proteins have a positive GRAVY value, while hydrophilic proteins have a negative GRAVY value.
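Two of these parameters are simple enough to compute directly. The sketch below calculates GRAVY as the mean Kyte-Doolittle hydropathy value over all residues, and counts charged residues as Asp + Glu (negative) and Arg + Lys (positive). The function names are hypothetical, and only the 20 standard amino acids are assumed.

```python
# Kyte-Doolittle hydropathy scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Grand average of hydropathicity: mean hydropathy per residue."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


def charged_residue_counts(sequence):
    """Return (negative, positive) counts: (Asp + Glu, Arg + Lys)."""
    negative = sequence.count("D") + sequence.count("E")
    positive = sequence.count("R") + sequence.count("K")
    return negative, positive
```

A short hydrophobic stretch such as `"AI"` gives a positive GRAVY, while a charged stretch such as `"KD"` gives a negative one.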

20 Paste the FASTA sequence obtained in the previous steps into the input box of the server. Click on Scan.

21 The results page shows the profiles that have the highest probability of occurrence, on the basis of which they are assigned scores. Select the hit with the highest score.

22 The result displays the position of the Albumin domain highlighted in the sequence, along with its start and end positions. It also displays a graphical view in the form of a downloadable PNG image, in which the profile hits are represented as colored shapes labelled with their PROSITE names. It then displays the structure of the Albumin domain, marking the disulphide-bonded cysteine residues as C and its signature pattern as *.
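Signature patterns written in PROSITE-style syntax can be matched with ordinary regular expressions. The sketch below covers only a subset of that syntax (`x`, `[..]`, `{..}`, repeat counts and the `<` / `>` anchors), and it performs pattern matching, which is simpler than the score-based profile scanning described above. The pattern and sequence in the usage lines are made up for illustration and are not the Albumin signature.

```python
import re


def prosite_to_regex(pattern):
    """Translate a PROSITE-style pattern into a Python regular expression."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        prefix = suffix = repeat = ""
        if element.startswith("<"):      # anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):        # anchored at the C-terminus
            suffix, element = "$", element[:-1]
        if "(" in element:               # x(3) or x(2,4) repetition
            element, count = element.rstrip(")").split("(")
            repeat = "{" + count + "}"
        if element == "x":               # any residue
            core = "."
        elif element.startswith("{"):    # any residue except those listed
            core = "[^" + element[1:-1] + "]"
        else:                            # single residue or [ABC] class
            core = element
        parts.append(prefix + core + repeat + suffix)
    return "".join(parts)


def find_matches(sequence, pattern):
    """Return (start, end, match) tuples with 1-based inclusive positions.

    Only non-overlapping matches are reported, as with re.finditer.
    """
    regex = re.compile(prosite_to_regex(pattern))
    return [(m.start() + 1, m.end(), m.group()) for m in regex.finditer(sequence)]


# Hypothetical pattern and sequence.
hits = find_matches("AACGGCAAALKAA", "C-x(2,4)-C-x(3)-[LIVM]-{P}")
```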

23 Once the user enters Serum Albumin in the PDB search box, in the output page of the selected PDB entry, we find the following tabs. The horizontal tabs summarize the entire result page. The vertical tabs occur as the initial description in the first page. Each of these tabs can be explored in detail. The structural analysis of the protein can display a wide range of properties such as the description of the protein molecule including classification of the protein, the chains it contains, number of amino acids, etc.

24 The display also shows entries that are closely related to the user's query, such as the same protein characterized from a different organism.

25 Protein molecules are often structurally characterized in complex with a ligand, with the structure determined by experimental techniques. The description of these ligands is given in the result summary of the query protein.

26 The result summary displays derived data for Serum Albumin, such as the molecular and biological functions in which the protein is involved.

27 The biological aspects of Serum Albumin are also displayed as results. The unique feature of this tab is that it gives a complete list of the Single Nucleotide Polymorphisms (SNPs) affecting the protein sequence. It shows the resulting amino acid changes as well as the locations of the SNPs and the SNP IDs.

28 The 3-D visualization of Serum Albumin is given as a part of the results which can be viewed from a tool called Jmol. Along with the image analysis from Jmol, users can also study and download the structural characteristics of the protein such as its Bond Length along with the place and frequency of its occurrence. Structural results also summarize the Bond Angle and the Dihedral Angles including the chain where they occur and the frequency of its occurrence.
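The three geometric quantities mentioned here can all be computed from atomic coordinates. The sketch below uses plain Python and takes each atom as an (x, y, z) tuple; the function names are hypothetical, the dihedral uses the common atan2 formulation, and no file parsing is attempted.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _norm(a):
    return math.sqrt(_dot(a, a))


def bond_length(p1, p2):
    """Distance between two atoms."""
    return _norm(_sub(p2, p1))


def bond_angle(p1, p2, p3):
    """Angle in degrees at atom p2, formed by p1-p2-p3."""
    v1, v2 = _sub(p1, p2), _sub(p3, p2)
    cosine = _dot(v1, v2) / (_norm(v1) * _norm(v2))
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine))))


def dihedral_angle(p1, p2, p3, p4):
    """Signed torsion angle in degrees about the p2-p3 bond."""
    b1, b2, b3 = _sub(p2, p1), _sub(p3, p2), _sub(p4, p3)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = _norm(b2) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))
```

Applied to consecutive backbone atoms, `dihedral_angle` gives the torsion angles that structure viewers tabulate per chain.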

29 From wet lab to bioinformatics
1. Protein: A protein is a biomolecule made of chains of amino acid residues. The chains are formed by joining amino acids through peptide bonds, each formed with the elimination of a water molecule. Proteins perform structural, functional and regulatory roles in the cell.
2. Peptide: Small protein fragments formed by a stretch of up to around 50 amino acids are called peptides.
3. Amino acid sequence: The order of amino acids and their linear arrangement is known as the amino acid sequence. It is also known as the primary structure of the protein.
4. Edman degradation: This is a chemical method for sequencing amino acid residues in a protein or a peptide. The N-terminal residue is labelled using phenyl isothiocyanate and then cleaved from the remaining peptide chain without disrupting any of the other peptide bonds. This labelled amino acid is then detected and the procedure is repeated to identify each N-terminal amino acid sequentially.
5. Mass spectrometry: A technique for the production and detection of charged molecular species in vacuum, after their separation by magnetic and electric fields based on their mass-to-charge (m/z) ratio.

30
1. Type of data: The data stored in biological databases can be of various types, such as pure sequences, sequences with structure, metadata about the source of the sequence, experimental details, etc.
2. Prospected usage: Databases are primarily used to store all the information in a single web-based resource. They also provide analysis tools for functions such as pair-wise sequence alignment, multiple sequence alignment, homology modelling, etc.
3. Database schema: The design of the database at various levels is called a database schema. It includes the attributes of all individual tables and the relationships between them. The schema is defined at three levels, namely Physical, Logical and View.
4. Primary database: In biological database studies, primary databases store only the protein sequence information.


Guide for Bioinformatics Project Module 3

Guide for Bioinformatics Project Module 3 Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

Structure Tools and Visualization

Structure Tools and Visualization Structure Tools and Visualization Gary Van Domselaar University of Alberta gary.vandomselaar@ualberta.ca Slides Adapted from Michel Dumontier, Blueprint Initiative 1 Visualization & Communication Visualization

More information

Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics

Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics Pep-Miner: A Novel Technology for Mass Spectrometry-Based Proteomics Ilan Beer Haifa Research Lab Dec 10, 2002 Pep-Miner s Location in the Life Sciences World The post-genome era - the age of proteome

More information

PeptidomicsDB: a new platform for sharing MS/MS data.

PeptidomicsDB: a new platform for sharing MS/MS data. PeptidomicsDB: a new platform for sharing MS/MS data. Federica Viti, Ivan Merelli, Dario Di Silvestre, Pietro Brunetti, Luciano Milanesi, Pierluigi Mauri NETTAB2010 Napoli, 01/12/2010 Mass Spectrometry

More information

Bioinformatics Resources at a Glance

Bioinformatics Resources at a Glance Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences

More information

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011 Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear

More information

Separation of Amino Acids by Paper Chromatography

Separation of Amino Acids by Paper Chromatography Separation of Amino Acids by Paper Chromatography Chromatography is a common technique for separating chemical substances. The prefix chroma, which suggests color, comes from the fact that some of the

More information

GenBank, Entrez, & FASTA

GenBank, Entrez, & FASTA GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,

More information

ProteinPilot Report for ProteinPilot Software

ProteinPilot Report for ProteinPilot Software ProteinPilot Report for ProteinPilot Software Detailed Analysis of Protein Identification / Quantitation Results Automatically Sean L Seymour, Christie Hunter SCIEX, USA Pow erful mass spectrometers like

More information

MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification

MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification Gold Standard for Quantitative Data Processing Because of the sensitivity, selectivity, speed and throughput at which MRM assays can

More information

6 Characterization of Casein and Bovine Serum Albumin

6 Characterization of Casein and Bovine Serum Albumin 6 Characterization of Casein and Bovine Serum Albumin (BSA) Objectives: A) To separate a mixture of casein and bovine serum albumin B) to characterize these proteins based on their solubilities as a function

More information

ID of alternative translational initiation events. Description of gene function Reference of NCBI database access and relative literatures

ID of alternative translational initiation events. Description of gene function Reference of NCBI database access and relative literatures Data resource: In this database, 650 alternatively translated variants assigned to a total of 300 genes are contained. These database records of alternative translational initiation have been collected

More information

Methods for Protein Analysis

Methods for Protein Analysis Methods for Protein Analysis 1. Protein Separation Methods The following is a quick review of some common methods used for protein separation: SDS-PAGE (SDS-polyacrylamide gel electrophoresis) separates

More information

Biochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II

Biochemistry - I. Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II Biochemistry - I Prof. S. Dasgupta Department of Chemistry Indian Institute of Technology, Kharagpur Lecture-11 Enzyme Mechanisms II In the last class we studied the enzyme mechanisms of ribonuclease A

More information

Pesticide Analysis by Mass Spectrometry

Pesticide Analysis by Mass Spectrometry Pesticide Analysis by Mass Spectrometry Purpose: The purpose of this assignment is to introduce concepts of mass spectrometry (MS) as they pertain to the qualitative and quantitative analysis of organochlorine

More information

Mass Frontier Version 7.0

Mass Frontier Version 7.0 Mass Frontier Version 7.0 User Guide XCALI-97349 Revision A February 2011 2011 Thermo Fisher Scientific Inc. All rights reserved. Mass Frontier, Mass Frontier Server Manager, Fragmentation Library, Spectral

More information

INFRARED SPECTROSCOPY (IR)

INFRARED SPECTROSCOPY (IR) INFRARED SPECTROSCOPY (IR) Theory and Interpretation of IR spectra ASSIGNED READINGS Introduction to technique 25 (p. 833-834 in lab textbook) Uses of the Infrared Spectrum (p. 847-853) Look over pages

More information

ProSightPC 3.0 Quick Start Guide

ProSightPC 3.0 Quick Start Guide ProSightPC 3.0 Quick Start Guide The Thermo ProSightPC 3.0 application is the only proteomics software suite that effectively supports high-mass-accuracy MS/MS experiments performed on LTQ FT and LTQ Orbitrap

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

A Navigation through the Tracefinder Software Structure and Workflow Options. Frans Schoutsen Pesticide Symposium Prague 27 April 2015

A Navigation through the Tracefinder Software Structure and Workflow Options. Frans Schoutsen Pesticide Symposium Prague 27 April 2015 A Navigation through the Tracefinder Software Structure and Workflow Options Frans Schoutsen Pesticide Symposium Prague 27 April 2015 Kings day in The Netherlands 1 Index Introduction Acquisition, Method

More information

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton?

(c) How would your answers to problem (a) change if the molecular weight of the protein was 100,000 Dalton? Problem 1. (12 points total, 4 points each) The molecular weight of an unspecified protein, at physiological conditions, is 70,000 Dalton, as determined by sedimentation equilibrium measurements and by

More information

Expression and Purification of Recombinant Protein in bacteria and Yeast. Presented By: Puspa pandey, Mohit sachdeva & Ming yu

Expression and Purification of Recombinant Protein in bacteria and Yeast. Presented By: Puspa pandey, Mohit sachdeva & Ming yu Expression and Purification of Recombinant Protein in bacteria and Yeast Presented By: Puspa pandey, Mohit sachdeva & Ming yu DNA Vectors Molecular carriers which carry fragments of DNA into host cell.

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources 1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools

More information

Lab 3 Organic Molecules of Biological Importance

Lab 3 Organic Molecules of Biological Importance Name Biology 3 ID Number Lab 3 Organic Molecules of Biological Importance Section 1 - Organic Molecules Section 2 - Functional Groups Section 3 - From Building Blocks to Macromolecules Section 4 - Carbohydrates

More information

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, and Pierangelo Veltri University Magna Græcia of Catanzaro, 88100

More information

Tutorial for proteome data analysis using the Perseus software platform

Tutorial for proteome data analysis using the Perseus software platform Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information

More information

Computational Systems Biology. Lecture 2: Enzymes

Computational Systems Biology. Lecture 2: Enzymes Computational Systems Biology Lecture 2: Enzymes 1 Images from: David L. Nelson, Lehninger Principles of Biochemistry, IV Edition, Freeman ed. or under creative commons license (search for images at http://search.creativecommons.org/)

More information

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF Tutorial for Proteomics Data Submission Katalin F. Medzihradszky Robert J. Chalkley UCSF Why Have Guidelines? Large-scale proteomics studies create huge amounts of data. It is impossible/impractical to

More information

MASCOT Search Results Interpretation

MASCOT Search Results Interpretation The Mascot protein identification program (Matrix Science, Ltd.) uses statistical methods to assess the validity of a match. MS/MS data is not ideal. That is, there are unassignable peaks (noise) and usually

More information

Chapter 3. Protein Structure and Function

Chapter 3. Protein Structure and Function Chapter 3 Protein Structure and Function Broad functional classes So Proteins have structure and function... Fine! -Why do we care to know more???? Understanding functional architechture gives us POWER

More information

ProteinScape. Innovation with Integrity. Proteomics Data Analysis & Management. Mass Spectrometry

ProteinScape. Innovation with Integrity. Proteomics Data Analysis & Management. Mass Spectrometry ProteinScape Proteomics Data Analysis & Management Innovation with Integrity Mass Spectrometry ProteinScape a Virtual Environment for Successful Proteomics To overcome the growing complexity of proteomics

More information

BIOC351: Proteins. PyMOL Laboratory #1. Installing and Using

BIOC351: Proteins. PyMOL Laboratory #1. Installing and Using BIOC351: Proteins PyMOL Laboratory #1 Installing and Using Information and figures for this handout was obtained from the following sources: Introduction to PyMOL (2009) DeLano Scientific LLC. Installing

More information

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,

More information

http://faculty.sau.edu.sa/h.alshehri

http://faculty.sau.edu.sa/h.alshehri http://faculty.sau.edu.sa/h.alshehri Definition: Proteins are macromolecules with a backbone formed by polymerization of amino acids. Proteins carry out a number of functions in living organisms: - They

More information

Aiping Lu. Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn

Aiping Lu. Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn Aiping Lu Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn Proteome and Proteomics PROTEin complement expressed by genome Marc Wilkins Electrophoresis. 1995. 16(7):1090-4. proteomics

More information

Introduction to Proteomics 1.0

Introduction to Proteomics 1.0 Introduction to Proteomics 1.0 CMSP Workshop Tim Griffin Associate Professor, BMBB Faculty Director, CMSP Objectives Why are we here? For participants: Learn basics of MS-based proteomics Learn what s

More information

Section I Using Jmol as a Computer Visualization Tool

Section I Using Jmol as a Computer Visualization Tool Section I Using Jmol as a Computer Visualization Tool Jmol is a free open source molecular visualization program used by students, teachers, professors, and scientists to explore protein structures. Section

More information

Marmara Üniversitesi Fen-Edebiyat Fakültesi Kimya Bölümü / Biyokimya Anabilim Dalı PURIFICATION AND CHARACTERIZATION OF PROTEINS

Marmara Üniversitesi Fen-Edebiyat Fakültesi Kimya Bölümü / Biyokimya Anabilim Dalı PURIFICATION AND CHARACTERIZATION OF PROTEINS EXPERIMENT VI PURIFICATION AND CHARACTERIZATION OF PROTEINS I- Protein isolation and dialysis In order to investigate its structure and properties a protein must be obtained in pure form. Since proteins

More information

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.

A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage. CH 5 Structure & Function of Large Molecules: Macromolecules Molecules of Life All living things are made up of four classes of large biological molecules: carbohydrates, lipids, proteins, and nucleic

More information

Organic Molecules of Life - Exercise 2

Organic Molecules of Life - Exercise 2 Organic Molecules of Life - Exercise 2 Objectives -Know the difference between a reducing sugar and a non-reducing sugar. -Distinguish Monosaccharides from Disaccharides and Polysaccharides -Understand

More information

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are

This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are This class deals with the fundamental structural features of proteins, which one can understand from the structure of amino acids, and how they are put together. 1 A more detailed view of a single protein

More information

Analyzing A DNA Sequence Chromatogram

Analyzing A DNA Sequence Chromatogram LESSON 9 HANDOUT Analyzing A DNA Sequence Chromatogram Student Researcher Background: DNA Analysis and FinchTV DNA sequence data can be used to answer many types of questions. Because DNA sequences differ

More information

Sub menu of functions to give the user overall information about the data in the file

Sub menu of functions to give the user overall information about the data in the file Visualize The Multitool for Proteomics! File Open Opens an.ez2 file to be examined. Import from TPP Imports data from files created by Trans Proteomic Pipeline. User chooses mzxml, pepxml and FASTA files

More information

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data The Illumina TopHat Alignment and Cufflinks Assembly and Differential Expression apps make RNA data analysis accessible to any user, regardless

More information

Protein Prospector and Ways of Calculating Expectation Values

Protein Prospector and Ways of Calculating Expectation Values Protein Prospector and Ways of Calculating Expectation Values 1/16 Aenoch J. Lynn; Robert J. Chalkley; Peter R. Baker; Mark R. Segal; and Alma L. Burlingame University of California, San Francisco, San

More information

Introduction to Bioinformatics 3. DNA editing and contig assembly

Introduction to Bioinformatics 3. DNA editing and contig assembly Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 matthewb@ba.ars.usda.gov

More information

INTRODUCTION TO PROTEIN STRUCTURE

INTRODUCTION TO PROTEIN STRUCTURE Name Class: Partner, if any: INTRODUCTION TO PROTEIN STRUCTURE PRIMARY STRUCTURE: 1. Write the complete structural formula of the tripeptide shown (frame 10). Circle and label the three sidechains which

More information


