Sequence Information. Sequence information. Good web sites. Sequence information. Sequence. Sequence



Similar documents
Linear Sequence Analysis. 3-D Structure Analysis

Integrated design of antibodies for systems biology using AbDesigner

Library page. SRS first view. Different types of database in SRS. Standard query form

Antibody responses to linear and conformational epitopes

Guide for Bioinformatics Project Module 3

Bioinformatics Grid - Enabled Tools For Biologists.

A new type of Hidden Markov Models to predict complex domain architecture in protein sequences

EMBL-EBI Web Services

Lecture 8. Protein Trafficking/Targeting. Protein targeting is necessary for proteins that are destined to work outside the cytoplasm.

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

Error Tolerant Searching of Uninterpreted MS/MS Data

Introduction to Genome Annotation

Analisi in silicoe relazione tra enterotossine stafilococciche e tossine ipotetiche

Databases and mapping BWA. Samtools

Sequence homology search tools on the world wide web

Activity 7.21 Transcription factors

Module 1. Sequence Formats and Retrieval. Charles Steward

Protein annotation and modelling servers at University College London

The Pfam Protein Families Database

Problem Set 1 KEY

Vaxign Reverse Vaccinology Software Demo Introduction Zhuoshuang Allen Xiang, Yongqun Oliver He

Protein Physics. A. V. Finkelstein & O. B. Ptitsyn LECTURE 1

Chapter 3. Protein Structure and Function

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

P G DIPLOMA IN BIOINFORMATICS

A Primer of Genome Science THIRD

Lecture 1 MODULE 3 GENE EXPRESSION AND REGULATION OF GENE EXPRESSION. Professor Bharat Patel Office: Science 2, b.patel@griffith.edu.

European Medicines Agency

Discovering Bioinformatics

Protein engineering for structural biology

Specific problems. The genetic code. The genetic code. Adaptor molecules match amino acids to mrna codons

The Neuron and the Synapse. The Neuron. Parts of the Neuron. Functions of the neuron:

Distributed Data Mining in Discovery Net. Dr. Moustafa Ghanem Department of Computing Imperial College London

Protein Analysis and WEKA Data Mining

Custom Antibody Services

Transcription and Translation of DNA

RNA & Protein Synthesis

Hormones & Chemical Signaling

Lecture Outline. Introduction to Databases. Introduction. Data Formats Sample databases How to text search databases. Shifra Ben-Dor Irit Orr

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

TEMA 10. REACCIONES INMUNITARIAS MEDIADAS POR CÉLULAS.

Control of Gene Expression

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane

HUMORAL IMMUNE RE- SPONSES: ACTIVATION OF B CELLS AND ANTIBODIES JASON CYSTER SECTION 13

Teaching Bioinformatics to Undergraduates

Expression and Purification of Recombinant Protein in bacteria and Yeast. Presented By: Puspa pandey, Mohit sachdeva & Ming yu

AP BIOLOGY 2008 SCORING GUIDELINES

Searching Nucleotide Databases

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Chapter-21b: Hormones and Receptors

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Antibody Function & Structure

Proteins and Nucleic Acids

Protein Identification and Analysis Tools on the ExPASy Server

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Your partner in immunology

Processing Genome Data using Scalable Database Technology. My Background

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, Abstract. Haruna Cofer*, PhD

Review. Bioinformatics - a definition 1. As submitted to the Oxford English Dictionary

Course Outline. 1. COURSE INFORMATION Session Offered Winter 2012 Course Name Biochemistry

EMBOSS A data analysis package

Hidden Markov Models in Bioinformatics. By Máthé Zoltán Kőrösi Zoltán 2006

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

Computational Systems Biology. Lecture 2: Enzymes

Bioinformatics for Biologists. Protein Structure

Syllabus of B.Sc. (Bioinformatics) Subject- Bioinformatics (as one subject) B.Sc. I Year Semester I Paper I: Basic of Bioinformatics 85 marks

thebiotutor. AS Biology OCR. Unit F211: Cells, Exchange & Transport. Module 1.2 Cell Membranes. Notes & Questions.

Guidance for Industry

Bio-Informatics Lectures. A Short Introduction

BIOLOGICAL MEMBRANES: FUNCTIONS, STRUCTURES & TRANSPORT

Understanding the immune response to bacterial infections

INTRODUCTION TO HORMONES

LESSON 3: ANTIBODIES/BCR/B-CELL RESPONSES

Core Bioinformatics. Degree Type Year Semester Bioinformàtica/Bioinformatics OB 0 1

Software review. Vector NTI, a balanced all-in-one sequence analysis suite

Actions of Hormones on Target Cells Page 1. Actions of Hormones on Target Cells Page 2. Goals/ What You Need to Know Goals What You Need to Know

BSC Exam I Lectures and Text Pages. The Plasma Membrane Structure and Function. Phospholipids. I. Intro to Biology (2-29) II.

Antibody Structure, and the Generation of B-cell Diversity CHAPTER 4 04/05/15. Different Immunoglobulins

Lecture 19: Proteins, Primary Struture

Concluding lesson. Student manual. What kind of protein are you? (Basic)

Chapter 5: The Structure and Function of Large Biological Molecules

specific B cells Humoral immunity lymphocytes antibodies B cells bone marrow Cell-mediated immunity: T cells antibodies proteins

Integrated Rule-based Data Management System for Genome Sequencing Data

Activation and effector functions of HMI

Bioinformatics: course introduction

Protein sequence databases Rolf Apweiler 1,, Amos Bairoch 2 and Cathy H Wu 3

Cells & Cell Organelles

Intrusion Detection via Machine Learning for SCADA System Protection

Module 10: Bioinformatics

H H N - C - C 2 R. Three possible forms (not counting R group) depending on ph

Proteins. Proteins. Amino Acids. Most diverse and most important molecule in. Functions: Functions (cont d)

Transcription:

Sequence information Multiple Pair-wise SRS Entrez Comparisons Database searches Sequence Information Orthologue clusters Sequence Organell localisation Patterns Protein families Membrane attachment Bengt Persson Post-translational modifications Prosite InterPro Pfam Secondary structure Linköping University & Karolinska Institutet 2 Multiple Orthologue clusters Sequence information Comparisons Pair-wise Sequence SRS Database searches Entrez Organell localisation www.expasy.org www.ebi.ac.uk www.ncbi.nlm.nih.gov www.cbs.dtu.dk Good web sites Patterns Protein families Membrane attachment Post-translational modifications Prosite InterPro Pfam Secondary structure Linköping University & Karolinska Institutet 3 Linköping University & Karolinska Institutet 4 (c) Bengt Persson 1

Protein family databases Protein families, nomenclature Super-family Family Sub-family Linköping University & Karolinska Institutet 6 InterPro InterPro entry Prosite Amos Bairoch, Genève Pfam Erik Sonnhammer, KI and Sanger Institute, UK PRINTS Terri Attwood, UCL, London, UK ProDom Daniel Kahn, INRA, Toulouse, France SMART Peer Bork, EMBL Swissprot+TrEMBL Linköping University & Karolinska Institutet 7 Linköping University & Karolinska Institutet 8 (c) Bengt Persson 2

InterPro entry, cont. InterPro entry, cont. Linköping University & Karolinska Institutet 9 Linköping University & Karolinska Institutet 10 InterPro -- protein matches InterPro -- protein matches, graphical Linköping University & Karolinska Institutet 11 Linköping University & Karolinska Institutet 12 (c) Bengt Persson 3

Prosite Prosite Database of protein families and domains Release 16, September 1999 1035 documentation entries 1375 different patterns http://www.expasy.ch/prosite/ Amos Bairoch, University of Geneva Linköping University & Karolinska Institutet 13 Linköping University & Karolinska Institutet 14 Prosite ScanProsite Linköping University & Karolinska Institutet 15 Linköping University & Karolinska Institutet 16 (c) Bengt Persson 4

Prosite, documentation entry Example of Prosite patterns Post-translational modifications Domains DNA or RNA associated proteins Enzymes Electron transport proteins Other transport proteins Structural proteins Receptors Hormones and active peptides Toxins Inhibitors Protein secretion and chaperones Cytokines and growth factors Others Linköping University & Karolinska Institutet 17 Linköping University & Karolinska Institutet 18 Pfam A collection of protein families and domains. Pfam contains multiple protein alignments and profile-hmms of these families. Pfam is a semi-automatic protein family database, which aims to be comprehensive as well as accurate. Hidden Markov Models (HMMs) Statistical profile method Enables database searches Enables multiple alignment creation http://www.sanger.ac.uk/software/pfam/index.shtml http://www.cgr.ki.se/pfam from Yvonne Kallberg Linköping University & Karolinska Institutet 19 Linköping University & Karolinska Institutet 20 (c) Bengt Persson 5

Pfam Pfam Linköping University & Karolinska Institutet 21 Linköping University & Karolinska Institutet 22 Pfam COG--Clusters of Orthologous Groups Linköping University & Karolinska Institutet 23 Linköping University & Karolinska Institutet 24 (c) Bengt Persson 6

Functional groups of protein families COG Linköping University & Karolinska Institutet 25 Linköping University & Karolinska Institutet 26 COG Predictions of structure and post- translational modifications Linköping University & Karolinska Institutet 27 (c) Bengt Persson 7

Secondary structure Hydrophilicity Structure predictions Membrane-spanning regions Antigenicity Glycosylation Acetylation and much more... Secondary structure predictions Chou & Fasman (CF) Garnier, Osguthorpe & Robson (GOR) http://pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_gor4.html neural networks (e.g. PHD) http://dodo.cpmc.columbia.edu/predictprotein/ Linköping University & Karolinska Institutet 29 Linköping University & Karolinska Institutet 30 Artificial Neural Networks (ANNs) The PredictProtein server Statistical method Pattern recognition, e. g. secondary structure predictions Output layer Output layer Hidden layer Hidden layer Input layer Input layer modified from Yvonne Kallberg Linköping University & Karolinska Institutet 31 Linköping University & Karolinska Institutet 32 (c) Bengt Persson 8

Default submission form Hydrophilicity Kyte & Doolittle Hopp & Woods Linköping University & Karolinska Institutet 33 Linköping University & Karolinska Institutet 34 Example of hydrophilicity and secondary structure plots ProtScale A general tool for plotting sequence properties, e.g. hydrophilicity http://www.expasy.ch/cgi-bin/protscale.pl Linköping University & Karolinska Institutet 35 Linköping University & Karolinska Institutet 36 (c) Bengt Persson 9

ProtScale, selection of property to plot ProtScale, results Linköping University & Karolinska Institutet 37 Linköping University & Karolinska Institutet 38 ProtScale, Graphic view Membrane protein prediction, TMAP http://www.ifm.liu.se/bioinfo/ Linköping University & Karolinska Institutet 39 Linköping University & Karolinska Institutet 40 (c) Bengt Persson 10

Membrane protein prediction, TMAP TMAP, graphics output Linköping University & Karolinska Institutet 41 Linköping University & Karolinska Institutet 42 Prediction servers at CBS www.cbs.dtu.dk/services/ SignalP Linköping University & Karolinska Institutet 43 Linköping University & Karolinska Institutet 44 (c) Bengt Persson 11

SignalP -- Results SignalP -- Results, cont. Linköping University & Karolinska Institutet 45 Linköping University & Karolinska Institutet 46 TargetP TargetP -- Results Linköping University & Karolinska Institutet 47 Linköping University & Karolinska Institutet 48 (c) Bengt Persson 12

Phobius Phobius, results Linköping University & Karolinska Institutet 49 Linköping University & Karolinska Institutet 50 ExPASy site map Protein identification and characterisation Linköping University & Karolinska Institutet 51 Linköping University & Karolinska Institutet 52 (c) Bengt Persson 13

Post-translational modifications Primary structure analysis Linköping University & Karolinska Institutet 53 Linköping University & Karolinska Institutet 54 Secondary structure prediction Transmembrane regions & Sequence alignments Linköping University & Karolinska Institutet 55 Linköping University & Karolinska Institutet 56 (c) Bengt Persson 14