Classification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database
|
|
|
- Pamela McCormick
- 10 years ago
- Views:
Transcription
1 Classification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database Dina VISHNYAKOVA a,b,d,1, Emilie PASCHE a,b,d, Julien GOBEILL a,c,d, Arnaud GAUDINAT a,c,d, Christian LOVIS b,d a,c,d and Patrick RUCH a BiTeM Group b Division of Medical Information Sciences, University and University Hospitals of Geneva, Switzerland c Information Science Department, University of Applied Science d Geneva, Switzerland Abstract. We present a new approach to perform biomedical documents classification and prioritization for the Comparative Toxicogenomics Database (CTD). This approach is motivated by needs such as literature curation, in particular applied to the human health environment domain. The unique integration of chemical, genes/proteins and disease data in the biomedical literature may advance the identification of exposure and disease biomarkers, mechanisms of chemical actions, and the complex aetiologies of chronic diseases. Our approach aims to assist biomedical researchers when searching for relevant articles for CTD. The task is functionally defined as a classification task, where selected articles must also be ranked by order of relevance. We design a SVM classifier, which combines three main feature sets: an information retrieval system (EAGLi), a biomedical named-entity recognizer (MeSH term extraction), a gene normalization (GN) service (NormaGene) and an ad-hoc keyword recognizer for diseases and chemicals. The evaluation of the gene identification module was done on BioCreativeIII test data. Disease normalization is achieved with 95% precision and 93% of recall. The evaluation of the classification was done on the corpus provided by BioCreative organizers in The approach showed promising performance on the test data. Keywords. Information Retrieval, literature curation, ontology look-up services. Introduction Since last 10 years the interest in information retrieval and text mining applied to the biomedical literature is rapidly increasing. The biggest database of abstracts on life science and biomedical topics PubMed, in the beginning of 2012, has over millions records; around 12 millions of these articles are listed with their abstracts; about new records are added each year [1]. Information about entities such as genes, diseases, chemicals and etc, is available in articles in a free textual format, which is comprehensive for humans. Biocurators of the Comparative Toxicogenomic 1 Corresponding Author: Dina Vishnyakova; SSIM; University Hospitals of Geneva; 4, rue Gabrielle- Perret-Gentil; CH-1211 Geneva 14; Tel: ; [email protected]
2 Database (CTD) read a scientific article and convert free-text information into a structured format using official nomenclatures, incorporating external control vocabularies for chemicals, genes, diseases and organisms [2]. Free-text information is difficult to interpret for information retrieval systems and manual curation of this information is a routine and costly task. As a consequence, there have been developed many methods, which address such tasks as identification of gene/protein mentions or extraction of protein-protein interactions and etc, performing with good results. Currently, the focus of text mining community is shifting towards the interest to employ computers to assist human curators, more specifically to prioritize articles to be curated. Our approach is based on the combination of Information Retrieval approaches. The feature selection method incorporates meta-data of documents and results retrieved with EAGLi [3][4] and NormaGene [5][6] systems. 1. Data and Methods 1.1. Data overview BioCreative Workshop 2012 Triage-I provides the benchmark in the biomedical domain; we used a subset of 1059 articles. These articles have been manually curated and contain information on found entities such as genes, chemicals, diseases and their interactions. There were given four main chemicals and a subset of 1059 articles curated to these four chemicals. Table 1. Distribution of curated articles for each chemical Chemical Name Number of articles per chemical Raloxifene articles per chemical in % 2-Acetylaminofluorene ,5 Amsacrine Quercetimin The decision about curation is done on the observation of curated chemicals, diseases and genes and their interactions in the article. Distributions of curated articles referring to selected chemicals are presented in Table 1. Approximately half of the articles in the benchmark contain no information about chemicals or genes; only half of the articles have information about both genes and chemicals, and only few have information about diseases. The distribution of entities in the benchmark is shown in Table 2. Relevant data from the abstract of an article is coded using controlled vocabularies. Chemicals vocabulary is trimmed in order to remove terms that are not considered to be chemicals of interest to CTD (e.g., the Amino Acids, Peptides, and Proteins branch or the Nucleic Acids, Nucleotides, and Nucleosides branch, etc.) [7]. genes are based on the NCBI identifiers and unlike EntrezGene, a gene in CTD represents the gene for all species [8]. The diseases vocabulary is a mix of the Online Mendelian Inheritance in Man (OMIM) terms and the MeSH Disease [C] and Mental Disorders [F03] hierarchies [9].
3 Table 2. Distribution of entities in the benchmark Entity Name Number of articles Chemicals 654 Genes 643 Diseases 28 Genes and Chemicals 602 Main Chemical in titles Methods We designed a SVN classifier for the classification of articles (with curated and not curated classes). This classifier combines three main feature sets: an information retrieval system (EAGLi), a biomedical named-entity recognizer (MeSH term extraction); a gene normalization service (NormaGene) and an ad-hoc keyword recognizer for diseases and chemicals. Selected features for SVN classifier are presented in Table 3. The first feature set contains information about MeSH terms of articles extracted from the PubMed. We extract this information with the information retrieval system EAGLi. According to our observations, curated articles usually have as one of the main MeSH terms such term as pharmacology, toxicity, drug therapy, metabolism, drug effects, chemistry and chemical synthesis. Another observation justified that extracted (from PubMed) MeSH terms of a curated article often contain the name of the main chemical. The main chemical is a chemical according to which the classification is done, in our case, for example raloxifene, amsacrine and etc. In the second feature set we incorporate meta-data received from the gene normalization system NormaGene. On gene name detection step we face ambiguity of gene names, e.g. homonyms and synonyms. NormaGene approves all gene candidates by Gene Protein Synonyms Database (GPSDB) [10]. Returned results from NormaGene are compared to the CTD genes control vocabulary. Genes from this vocabulary are based upon imported gene pages from EntrezGene; however, unlike EntrezGene, a gene page in CTD represents the gene for all species. This representation of the gene eases constrains of NormaGene on a cross-species normalization. The third feature set is an ad-hoc keyword recognizer for diseases and chemicals. This keyword recognizer is based on the control vocabularies provided by CTD. The chemical vocabulary is a modified subset of descriptors from the Chemicals and Drugs category and Supplementary Concept Records from MeSH. Compare to MeSH, CTD merged the descriptors and accompanying concepts into a single hierarchy. Several branches of the original MeSH vocabulary were excluded from CTD's chemical vocabulary because they are not molecular reagents, environmental chemicals or clinical drugs (e.g., Nucleic Acids, Nucleotides, and Nucleosides and Purines ). Other branches were excluded because they are simply broad chemical classes that do not contain more specific terms (e.g., Solutions and Poisons ) [11]. The provided disease vocabulary is a modified subset of descriptors from the Diseases category of MeSH combined with genetic disorders from the OMIM database. OMIM contains
4 textual information, references related to diseases, links to MEDLINE and sequence records in the Entrez system, and links to additional related resources at NCBI[11]. Table 3. Selected Features for the SVN Classifier Features Given chemical name in the abstract Given chemical name in the title Given chemical in MeSH terms of the article Appearance of chemicals in the abstract Quantity of found chemicals in the abstract Appearance of genes in the abstract Quantity of detected genes Appearance of chemicals and genes in the abstract Appearance of diseases in the abstract Quantity of disease names Appearance of pharmacology, toxicity, drug therapy, metabolism, drug effects, chemistry and chemical synthesis as the main MeSH terms of the article Quantity of main MeSH terms containing pharmacology, toxicity, drug therapy, metabolism, drug effects, chemistry and chemical synthesis Values integer(1..n) integer (1..n) integer (1..n) integer(1..n) 2. Results and Conclusion From the BioCreative 2012 data, we evaluated the effectiveness of our ad hoc terms recognizer for diseases. Our methods achieved 95% of precision and 92% of recall when tagging diseases in the training sample. Table 5. Results of Our approach for the task-i of BioCreative Chemical/Quantity of articles Intermediate MAP Score Gene Chemical Disease Urethane/ Phenacetin/ Cyclophosphamide/ Table 6. Results of evaluation performed by our approach with different input for gene detection on the test data of task-i of BioCreative Chemical/Quantity Intermediate MAP Gene Disease of articles Score Chemical Urethane/ Phenacetin/ Carcinoma/ In order to tune our classifier, we performed ten folders cross-validation and achieved accuracy of 80.5%. We applied the optimal model on the test data and obtained an accuracy of 77%, which suggests some moderate overfitting phenomena. The results of our approach of test data are provided in Table 5.
5 According to results in Table 5, the curated gene score relatively low compared to chemicals and disease scores, which confirms that gene and gene product recognition seems more challenging than recognition of other biomedical entity recognition tasks such as chemicals. We recomputed results, applying some changes to the parameters for the gene detection task, and removing the restriction of the overlapping gene names. The recomputed results for the gene detection subtask are found in Table 6. While the Entities scores in final results of BioCreative 2012 were based on recall scores it is obvious that overlapping gene candidates in the final results do improve the Gene score. At the same time they decrease the Intermediate MAP score. This can be explained by the changes in the classification model, which reduces the ranking score of input articles, when a large amount of gene candidates is found. In conclusion, our approach showed competitive performance, in particular for the recognition of chemical compounds. Intermediate MAP score showed that the selected SVM model produced promising results on the test data. In contrast, the identification of pathologies seems nearly as difficult as the recognition of genes and gene products. Further experiments are needed to explain where is the power of the approach as well as to start explaining the differences observed regarding the recognition power of some of the entity types. Acknowledgements. The DebugIT project ( is receiving funding from the European Community s Seventh Framework Programme under grant agreement n FP , which is gratefully acknowledged. The information in this document reflects solely the views of the authors and no guarantee or warranty is given that it is fit for any particular purpose. The European Commission, Directorate General Information Society and Media, Brussels, is not liable for any use that may be made of the information contained therein. References [1] PubMed [ ( ) [2] Davis AP, Wiegers TC, Murphy CG, and Mattingly CJ The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database. Database.2011, Oxford. [3] EAGLi [ ( ) [4] Gobeill J, Pasche E, Teodoro D, Veuthey AL, Lovis C, Ruch P. Question answering for biology and medicine. Information Technology and Application in Biomedicine (ITAB 2009) [5] NormaGene [ [6] Lu Z, Kao HY, Wei CH, Huang M, Liu J, Kuo CJ, Hsu CN, Tsai RT, Dai HJ, Okazaki N, Cho HC, Gerner M, Solt I, Agarwal S, Liu F, Vishnyakova D, Ruch P, Romacker M, Rinaldi F, Bhattacharya S, Srinivasan P, Liu H, Torii M, Matos S, Campos D, Verspoor K, Livingston KM, Wilbur WJ. The gene normalization task in BioCreative III. BMC Bioinformatics Oct 3; 12 Suppl 8:S2. [7] CTD Chemicals data [ ( ) [8] CTD Genes data [ [9] CTD Disease data [ [10] Pillet V, Zehnder M, Seewald AK, Veuthey AL, Petrak J. GPSDB: a new database for synonyms expansion of gene and protein names. Bioinformatics Apr 15; 21(8): Epub 2004 Dec 21. [11] Wiegers TC, Davis AP, Cohen KB, Hirschman L, Mattingly CJ. Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD). BMC Bioinformatics Oct 8;10:326.
PPInterFinder A Web Server for Mining Human Protein Protein Interaction
PPInterFinder A Web Server for Mining Human Protein Protein Interaction Kalpana Raja, Suresh Subramani, Jeyakumar Natarajan Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar
A Knowledge-Poor Approach to BioCreative V DNER and CID Tasks
A Knowledge-Poor Approach to BioCreative V DNER and CID Tasks Firoj Alam 1, Anna Corazza 2, Alberto Lavelli 3, and Roberto Zanoli 3 1 Dept. of Information Eng. and Computer Science, University of Trento,
CENG 734 Advanced Topics in Bioinformatics
CENG 734 Advanced Topics in Bioinformatics Week 9 Text Mining for Bioinformatics: BioCreative II.5 Fall 2010-2011 Quiz #7 1. Draw the decompressed graph for the following graph summary 2. Describe the
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment
Cross-Border Challenges in Informatics with a Focus on Disease Surveillance and Utilising Big-Data L. Stoicu-Tivadar et al. (Eds.) 2014 The authors. This article is published online with Open Access by
Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences
Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Prof. Dr. Martin Hofmann-Apitius Head of the Department of Bioinformatics Fraunhofer Institute for Algorithms and
Integrating Bioinformatics, Medical Sciences and Drug Discovery
Integrating Bioinformatics, Medical Sciences and Drug Discovery M. Madan Babu Centre for Biotechnology, Anna University, Chennai - 600025 phone: 44-4332179 :: email: [email protected] Bioinformatics
BiTeM group report for TREC Medical Records Track 2011
BiTeM group report for TREC Medical Records Track 2011 J. Gobeill a, A. Gaudinat a, E. Pasche b, D. Teodoro b, D. Vishnyakova b, P. Ruch a a BiTeM group, University of Applied Sciences, Information Studies,
How to stop looking in the wrong place? Use PubMed!
How to stop looking in the wrong place? Use PubMed! 1 Why not just use? Plus s Fast! Easy to remember web address Its huge - you always find something It includes PubMed citations Downside Is simply finding
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task Jacinto Mata, Mariano Crespo, Manuel J. Maña Dpto. de Tecnologías de la Información. Universidad de Huelva Ctra. Huelva - Palos de la Frontera s/n.
Information Extraction from Patents: Combining Text- and Image-Mining. Martin Hofmann-Apitius
Information Extraction from Patents: Combining Text- and Image-Mining Martin Hofmann-Apitius Bonn-Aachen International Centre for Information Technology (B-IT) September 25, 2007 Status Report: Major Achievements
Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk
Text Mining for Health Care and Medicine Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk The Need for Text Mining MEDLINE 2005: ~14M 2009: ~18M Overwhelming information in textual,
The INFUSIS Project Data and Text Mining for In Silico Modeling
The INFUSIS Project Data and Text Mining for In Silico Modeling Henrik Boström 1,2, Ulf Norinder 3, Ulf Johansson 4, Cecilia Sönströd 4, Tuve Löfström 4, Elzbieta Dura 5, Ola Engkvist 6, Sorel Muresan
Bio-IT World 2013 Best Practices Awards
Published Resources for the Life Sciences 250 First Avenue, Suite 300, Needham, MA 02494 phone: 781-972-5400 fax: 781-972-5425 Bio-IT World 2013 Best Practices Awards Celebrating Excellence in Innovation
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives [email protected] 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis
InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis WHITE PAPER By InSyBio Ltd Konstantinos Theofilatos Bioinformatician, PhD InSyBio Technical Sales Manager August
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
Extraction and Visualization of Protein-Protein Interactions from PubMed
Extraction and Visualization of Protein-Protein Interactions from PubMed Ulf Leser Knowledge Management in Bioinformatics Humboldt-Universität Berlin Finding Relevant Knowledge Find information about Much
Experiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
Annex 6: Nucleotide Sequence Information System BEETLE. Biological and Ecological Evaluation towards Long-Term Effects
Annex 6: Nucleotide Sequence Information System BEETLE Biological and Ecological Evaluation towards Long-Term Effects Long-term effects of genetically modified (GM) crops on health, biodiversity and the
Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies
Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies Francisco Carrero 1, José Carlos Cortizo 1,2, José María Gómez 3 1 Universidad Europea de Madrid, C/Tajo s/n, Villaviciosa
Biochemistry 1 Course Specifications. First year of M.B.B.Ch. Program
Faculty of Medicine Quality Assurance Unit Al-Azhar University Assuit Faculty of Medicine Biochemistry 1 Course Specifications First year of M.B.B.Ch. Program A- Professional information: Title: Biochemistry1
Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences
Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences It s not information overload, it s filter failure. Clay Shirky Life Sciences organizations face the challenge
Extracting value from scientific literature: the power of mining full-text articles for pathway analysis
FOR PHARMA & LIFE SCIENCES WHITE PAPER Harnessing the Power of Content Extracting value from scientific literature: the power of mining full-text articles for pathway analysis Executive Summary Biological
ProteinQuest user guide
ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for
Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources
Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Ping Zhang IBM T. J. Watson Research Center Pankaj Agarwal GlaxoSmithKline Zoran Obradovic Temple University Terms and
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition
POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics
Protein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
Dr Alexander Henzing
Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander
METHODS IN MEDICAL INFORMATICS
Chapman & Hall/CRC Mathematical and Computational Biology Series METHODS IN MEDICAL INFORMATICS Fundamentals of Healthcare Programming in Perln Pythoni and Ruby Jules J- Berman TECHNISCHE INFORMATION SBIBLIOTHEK
Pharmacology skills for drug discovery. Why is pharmacology important?
skills for drug discovery Why is pharmacology important?, the science underlying the interaction between chemicals and living systems, emerged as a distinct discipline allied to medicine in the mid-19th
Regulatory Issues in Genetic Testing and Targeted Drug Development
Regulatory Issues in Genetic Testing and Targeted Drug Development Janet Woodcock, M.D. Deputy Commissioner for Operations Food and Drug Administration October 12, 2006 Genetic and Genomic Tests are Types
M110.726 The Nucleus M110.727 The Cytoskeleton M340.703 Cell Structure and Dynamics
of Biochemistry and Molecular Biology 1. Master the knowledge base of current biochemistry, molecular biology, and cellular physiology Describe current knowledge in metabolic transformations conducted
Biochemistry (Molecular and Cellular) Information Sheet for entry in 2016. What is Biochemistry?
Biochemistry (Molecular and Cellular) Information Sheet for entry in 2016 What is Biochemistry? The study of living things at the molecular level has undergone tremendous expansion in recent years, leading
HPI in-memory-based database system in Task 2b of BioASQ
CLEF 2014 Conference and Labs of the Evaluation Forum BioASQ workshop HPI in-memory-based database system in Task 2b of BioASQ Mariana Neves September 16th, 2014 Outline 2 Overview of participation Architecture
Pharmacy Technician Diploma (Part Time) - SC232
Pharmacy Technician Diploma (Part Time) - SC232 1. Special Note The Programme is designed to be a professional course, like the Diploma/BSc (Hons) Biomedical Sciences and the Diploma/BSc (Hons) Occupational
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:
Teaching Bioinformatics to Undergraduates
Teaching Bioinformatics to Undergraduates http://www.med.nyu.edu/rcr/asm Stuart M. Brown Research Computing, NYU School of Medicine I. What is Bioinformatics? II. Challenges of teaching bioinformatics
A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
11-792 Software Engineering EMR Project Report
11-792 Software Engineering EMR Project Report Team Members Phani Gadde Anika Gupta Ting-Hao (Kenneth) Huang Chetan Thayur Suyoun Kim Vision Our aim is to build an intelligent system which is capable of
Medical Biochemistry BC 362 Fall 2014
Medical Biochemistry BC 362 Fall 2014 Instructor: Julie Millard, Dorros Professor of Life Sciences Keyes 304, 859-5757; [email protected] Office hours: As announced in class each week and also by appointment.
Lecture Outline. Introduction to Databases. Introduction. Data Formats Sample databases How to text search databases. Shifra Ben-Dor Irit Orr
Introduction to Databases Shifra Ben-Dor Irit Orr Lecture Outline Introduction Data and Database types Database components Data Formats Sample databases How to text search databases What units of information
ICH Topic S 1 A The Need for Carcinogenicity Studies of Pharmaceuticals. Step 5
European Medicines Agency July 1996 CPMP/ICH/140/95 ICH Topic S 1 A The Need for Carcinogenicity Studies of Pharmaceuticals Step 5 NOTE FOR GUIDANCE ON THE NEED FOR CARCINOGENICITY STUDIES OF PHARMACEUTICALS
Protein-protein Interaction Passage Extraction Using the Interaction Pattern Kernel Approach for the BioCreative 2015 BioC Track
Protein-protein Interaction Passage Extraction Using the Interaction Pattern Kernel Approach for the BioCreative 2015 BioC Track Yung-Chun Chang 1,2, Yu-Chen Su 3, Chun-Han Chu 1, Chien Chin Chen 2 and
A Semantic Model for Multimodal Data Mining in Healthcare Information Systems
A Semantic Model for Multimodal Data Mining in Healthcare Information Systems Dimitris IAKOVIDIS 1 and Christos SMAILIS Department of Informatics and Computer Technology, Technological Educational Institute
Biological Databases and Protein Sequence Analysis
Biological Databases and Protein Sequence Analysis Introduction M. Madan Babu, Center for Biotechnology, Anna University, Chennai 25, India Bioinformatics is the application of Information technology to
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE S1A. Current Step 4 version
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE GUIDELINE ON THE NEED FOR CARCINOGENICITY STUDIES
Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
GenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
MSc in Toxicology. Master Degree Programme
Master Degree Programme MSc in Toxicology Department of Pharmaceutical Sciences, University of Basel Swiss Centre for Applied Human Toxicology (SCAHT) Master of Science in Toxicology University of Basel
How to create and interpret the predictive analysis of a compound
How to create and interpret the predictive analysis of a compound Platform with suite of tools Predict & understand biological effects of small molecules & compounds Predict targets and metabolites, potential
Ingenuity Pathway Analysis (IPA )
ProductProfile Ingenuity Pathway Analysis (IPA ) For the analysis and interpretation of omics data IPA is a web-based software application for the analysis, integration, and interpretation of data derived
BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, [email protected]) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
Abstracting the types away from a UIMA type system
Abstracting the types away from a UIMA type system Karin Verspoor, William Baumgartner Jr., Christophe Roeder, and Lawrence Hunter Center for Computational Pharmacology University of Colorado Denver School
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,
Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry
Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry The Master Degree in Medical Laboratory Sciences /Clinical Biochemistry, is awarded by the Faculty of Graduate Studies
Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments
Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, and Pierangelo Veltri University Magna Græcia of Catanzaro, 88100
Connecting Basic Research and Healthcare Big Data
Elsevier Health Analytics WHS 2015 Big Data in Health Connecting Basic Research and Healthcare Big Data Olaf Lodbrok Managing Director Elsevier Health Analytics [email protected] t +49 89 5383 600
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
1. Programme title and designation Biochemistry. For undergraduate programmes only Single honours Joint Major/minor
PROGRAMME APPROVAL FORM SECTION 1 THE PROGRAMME SPECIFICATION 1. Programme title and designation Biochemistry For undergraduate programmes only Single honours Joint Major/minor 2. Final award Award Title
Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.
Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of
Bio-DSGS: An Automated Bioinformatics Data Service Generation System
Journal of Computational Information Systems 7: 8 (2011) 2989-2996 Available at http://www.jofcis.com Bio-DSGS: An Automated Bioinformatics Data Service Generation System Shuang QIU, Yadong WANG, Liang
Bioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
How Real-time Analysis turns Big Medical Data into Precision Medicine?
Medical Data into Dr. Matthieu-P. Schapranow GLOBAL HEALTH, Rome, Italy August 27, 2014 Important things first: Where to find additional information? Online: Visit http://we.analyzegenomes.com for latest
From Data to Foresight:
Laura Haas, IBM Fellow IBM Research - Almaden From Data to Foresight: Leveraging Data and Analytics for Materials Research 1 2011 IBM Corporation The road from data to foresight is long? Consumer Reports
Programme Specification (Undergraduate) Date amended: August 2012
Programme Specification (Undergraduate) Date amended: August 2012 1. Programme Title(s) and UCAS code(s): BSc Biological Sciences C100 BSc Biological Sciences (Biochemistry) C700 BSc Biological Sciences
Fields of Education. Last updated August 2011
Fields of Education Last updated August 2011 Monash University is required to report to the Department of Education, Employment and Workplace Relations (DEEWR) the number of higher degree by research (HDR)
Integrated Data Mining Strategy for Effective Metabolomic Data Analysis
The First International Symposium on Optimization and Systems Biology (OSB 07) Beijing, China, August 8 10, 2007 Copyright 2007 ORSC & APORC pp. 45 51 Integrated Data Mining Strategy for Effective Metabolomic
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request
Technical Report. The KNIME Text Processing Feature:
Technical Report The KNIME Text Processing Feature: An Introduction Dr. Killian Thiel Dr. Michael Berthold [email protected] [email protected] Copyright 2012 by KNIME.com AG
Distributed Bioinformatics Computing System for DNA Sequence Analysis
Global Journal of Computer Science and Technology: A Hardware & Computation Volume 14 Issue 1 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
EUROPASS DIPLOMA SUPPLEMENT
EUROPASS DIPLOMA SUPPLEMENT TITLE OF THE DIPLOMA (ES) Técnico Superior en Laboratorio de análisis y de control de calidad TRANSLATED TITLE OF THE DIPLOMA (EN) (1) Higher Technician in Analysis and Quality
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
Visualization methods for patent data
Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes
A disaccharide is formed when a dehydration reaction joins two monosaccharides. This covalent bond is called a glycosidic linkage.
CH 5 Structure & Function of Large Molecules: Macromolecules Molecules of Life All living things are made up of four classes of large biological molecules: carbohydrates, lipids, proteins, and nucleic
Committee on WIPO Standards (CWS)
E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION
REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])
820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS 29 OCTOBER 2015 DR. DIRK J. EVERS BACKGROUND TreatmentMAP
Bachelor of Biomedical Science
Bachelor of Biomedical Science Have you ever wondered why we age, what causes cancer or how you inherited your mum s eyes and your dad s height? Biomedical Science answers these questions, and more. A
Master of Science in Biomedical Sciences
Master of Science in Biomedical Sciences Faculty of Medicine Good health is our greatest treasure. Understanding the human body, healthy and diseased, is the stepping stone to finding tools to improving
