Information Extraction from Patents: Combining Text- and Image-Mining. Martin Hofmann-Apitius
|
|
|
- Christopher Miller
- 10 years ago
- Views:
Transcription
1 Information Extraction from Patents: Combining Text- and Image-Mining Martin Hofmann-Apitius Bonn-Aachen International Centre for Information Technology (B-IT) September 25, 2007
2 Status Report: Major Achievements at SCAI in 2006 / 07 ProMiner Dictionary - based Named Entity Recognition 1. Successful participation in BioCreative 2006; rank 3 (F-score 0,799) out of 21 participating systems in task gene protein name recognition and normalization [rank 2: F-score 0,804; rank 1: F-score 0,810] 2. Extension of ProMiner functionalities beyond biological entities: integration of chemical and medical dictionaries; extension towards integration of ontologyderived dictionaries (GO; ANEURIST disease-ontology) 3. ProMiner installed as UIMA service in testbed installation Seite 2
3 Status Report: Major Achievements at SCAI in 2006 / 07 SCAI Machine Learning - based Named Entity Recognition 1. Successful participation in BioCreative 2006; rank 4 (F-score 0,8633) out of 21 participating systems in gene mention task [rank 1: F-score 0,8721; rank 2: F- score 0,8683; rank 3: F-score 0,8657] 2. Extension of ProMiner functionalities towards entity-types that cannot easily be represented in dictionaries (e.g. certain SNP-information; IUPAC names) 3. IUPAC and SNP identification with conditional random field approach available as ProMiner add-ons. Seite 3
4 Status Report: Major Achievements at SCAI in 2006 / Development of a Persistence-Layer for Extracted Information in (IST ) 1. Oracle-based system storing the results of ProMiner runs and integrating reference databases (UniProt; EntrezGene; dbsnp; PubChem) with textual objects 2. Used in for aggregation of disease-related information, including SNP-Information on the genes identified by ProMiner (in collaboration with the IMIM Institute in Barcelona) 3. Tagging of gene & protein names, drug trivial names, IUPAC names, disease terminology, chromosomal position information, SNP information and OMIM references in the entire MEDLINE Seite 4
5 @neulink - a disease-specific literature mining Confidential Copyright 2006
6 Drugs Associated with Aneurysm Rupture + Drug Confidential Copyright 2006
7 @neulink Document View + Confidential Copyright 2006
8 @neulink Document Confidential Copyright 2006
9 Status Report: Major Achievements at SCAI in 2006 / 07 ChemoCR reconstruction of chemical information from structure depictions 1. Partnership between InfoChem and SCAI on further development of ChemoCR 2. First large scale test runs provide basis for classification of types of depictions 3. First steps towards multi-modal information extraction: using ProMiner and ChemoCR simultaneously on patent literature Seite 9
10 Beyond Analysis of MEDLINE: Information Extraction from Patents Current challenges in text mining (at least at SCAI): 1. Information extraction from large corpora of full text documents 2. Combination of text analysis and analysis of chemical structure depictions 3. Normalization (mapping of text objects to database reference entries) issues: ProMiner normalized so far on UniProt and EntrezGene; mapping of SNPs to reference databases and mapping of chemical names and structures to chemical reference structures are completely new challenges Seite 10
11 Named Entity Recognition in Full-Text Patents Seite 11
12 Chemical Entities Related to Target Estrogen Receptor Seite 12
13 Multimodal Tagging: Embedding ChemoCR Reconstruction Seite 13
14 Linkout to PubChem through InChI Seite 14
15 Embedded Reconstruction of Reaction Schemata Seite 15
16 Potential Knowledge Gain Through ChemoCR Analysis One of the key questions associated with multi-modal patent mining is: do we gain from being able to simultaneously analyze text and chemical structure depictions? What is the gain of knowledge if we combine text analysis and image analysis? Seite 16
17 Molecules can be Found in Text 560 different molecules (fragments) identified in text Mapping to PubChem via dictionary (name to InChI) Mostly known structures Seite 17
18 Structure Depictions Frequently Contain Novel Structures Seite 18
19 Reconstruction of Synthesis Pathways from Patents: an Example 35 could be converted to structure using a single name to structure tool Seite 19
20 Biological Background Information Seite 20
21 A Synthesis Reaction Schema Seite 21
22 ChemoCR GUI Input: Bitmaps Seite 22 Output: Molecules
23 From Picture to Reaction: Pre-Processing 0: Picture Seite 23 BMP after scaling & binarization
24 From Picture to Reaction: Identification of Connected Components 0: Picture 1: Connected Components 1 component - Bondset -Letter Seite 24
25 From Picture to Reaction: Tagging of Characters 0: Picture 1: Connected Components 2: Tag Text Text or Bond? Seite 25 Text or Bond?
26 From Picture to Reaction: OCR of Identified Characters 0: Picture 1: Connected Components 2: Tag Text 3: OCR Seite 26
27 From Picture to Reaction: Identification of Lines (Bonds) Only the bonds, please 0: Picture 1: Connected Components 2: Tag Text 3: OCR 4: Vectorizer Seite 27
28 From Picture to Reaction: Applying Chemical Knowledge Associate semantics: -Bond -Atom - Reaction symbol 0: Picture 1: Connected Components 2: Tag Text 3: OCR 4: Vectorizer 5: Expert System Seite 28
29 From Picture to Reaction: Interpretation as Chemical Graphs Connect everything & mind the gaps 0: Picture 1: Connected Components 2: Tag Text 3: OCR 4: Vectorizer 5: Expert System 6: Chemical Graph Seite 29
30 From Picture to Reaction: Representation Format Molecule Reaction 0: Picture 1: Connected Components 2: Tag Text 3: OCR 4: Vectorizer 5: Expert System 6: Chemical Graph 7: Molecule Seite 30 Annotation
31 From Picture to Reaction: Error / Problem Detection Sorry, but no Markush 0: Picture 1: Connected Components 2: Tag Text 3: OCR 4: Vectorizer 5: Expert System 6: Chemical Graph 7: Molecule 8: Validation Seite 31
32 Reaction Schema Reconstructed by ChemoCR: Embedding the Resulting SDF in Patent Document View Seite 32
33 Lessons Learned from First Encounter in Patent Mining Patents are the hardest data source for text miners Text and images comprise complementary information Chemical search thus has to be combined with text search Preprocessing needs a lot of attention (e.g. scaling, rotation, cleaning) Image categorization is required (identification of images containing chemistry versus images with mice, cells and different machines) Images in patents are sometimes a nightmare; quality ranges from bad to saumaessig, need for adaptive parameter optimization Extension for Markush and tables on the wish list; reconstruction of synthesis schemata works reasonably well Seite 33
34 Next Steps on our Roadmap 1. Focus on pre-categorization of images (chemistry vs non-chemistry) 2. Generating rapid overview on entity-types in a patent and context ( this patent talks about... ) 3. Integration of dictionary-based, pattern-based and image-based information in the field of chemistry 4. Bridging from chemistry to biology and medicine: extraction of relationships between chemical and biomedical entities Seite 34
35 Summary and Take Home Message 1. ProMiner and ChemoCR work nicely on full text documents 2. ProMiner and ChemoCR can be combined to efficiently annotate full text patent documents, enabling queries that were previously impossible 3. The persistence layer based allows us to store the results of mining approaches and integrates them with other knowledge bases 4. However, we just started to adjust our machinery to the challenge of analyzing patent literature. We already face significant challenges in the area of document segmentation and image categorization 5. But: besides being a scientific challenge, mining of patents is real fun and that is why we will report on our progress in this field next year Seite 35
36 Acknowledgement In alphabetical order: Holger Dach Juliane Fluck Christoph Friedrich Tobias Gattermayer Tobias Goecke Carina Haupt Roman Klinger Corinna Kolarik Peter Kral Theo Mevissen Chia-Hao Ou A. Weihermüller Marc Zimmermann Seite 36
Information Extraction Technologies in Chemistry A Critical Review
Information Extraction Technologies in Chemistry A Critical Review Martin Hofmann-Apitius The International Conference in Trends for Scientific Information Professionals Sitges (Barcelona), Spain. 21-24
Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences
Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Prof. Dr. Martin Hofmann-Apitius Head of the Department of Bioinformatics Fraunhofer Institute for Algorithms and
CENG 734 Advanced Topics in Bioinformatics
CENG 734 Advanced Topics in Bioinformatics Week 9 Text Mining for Bioinformatics: BioCreative II.5 Fall 2010-2011 Quiz #7 1. Draw the decompressed graph for the following graph summary 2. Describe the
Technical Report. The KNIME Text Processing Feature:
Technical Report The KNIME Text Processing Feature: An Introduction Dr. Killian Thiel Dr. Michael Berthold [email protected] [email protected] Copyright 2012 by KNIME.com AG
Visual Structure Analysis of Flow Charts in Patent Images
Visual Structure Analysis of Flow Charts in Patent Images Roland Mörzinger, René Schuster, András Horti, and Georg Thallinger JOANNEUM RESEARCH Forschungsgesellschaft mbh DIGITAL - Institute for Information
Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.
Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project Janet Delve, University of Portsmouth Kuldar Aas, National Archives of Estonia Rainer Schmidt, Austrian Institute
Big Data and Text Mining
Big Data and Text Mining Dr. Ian Lewin Senior NLP Resource Specialist [email protected] www.linguamatics.com About Linguamatics Boston, USA Cambridge, UK Software Consulting Hosted content Agile,
Classification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database
Classification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database Dina VISHNYAKOVA a,b,d,1, Emilie PASCHE a,b,d, Julien GOBEILL a,c,d, Arnaud GAUDINAT a,c,d, Christian
POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition
POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics
Neural Networks in Data Mining
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V6 PP 01-06 www.iosrjen.org Neural Networks in Data Mining Ripundeep Singh Gill, Ashima Department
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources
Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Ping Zhang IBM T. J. Watson Research Center Pankaj Agarwal GlaxoSmithKline Zoran Obradovic Temple University Terms and
PPInterFinder A Web Server for Mining Human Protein Protein Interaction
PPInterFinder A Web Server for Mining Human Protein Protein Interaction Kalpana Raja, Suresh Subramani, Jeyakumar Natarajan Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar
Extraction and Visualization of Protein-Protein Interactions from PubMed
Extraction and Visualization of Protein-Protein Interactions from PubMed Ulf Leser Knowledge Management in Bioinformatics Humboldt-Universität Berlin Finding Relevant Knowledge Find information about Much
Anforderungen der Life-Science Industrie an die Hochschulen. Hans Widmer Novartis Institutes for BioMedical Research
Anforderungen der Life-Science Industrie an die Hochschulen Hans Widmer Novartis Institutes for BioMedical Research There s nothing more extraordinary than a normal life 2 What does industry expect from
Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences
Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences It s not information overload, it s filter failure. Clay Shirky Life Sciences organizations face the challenge
METHODS IN MEDICAL INFORMATICS
Chapman & Hall/CRC Mathematical and Computational Biology Series METHODS IN MEDICAL INFORMATICS Fundamentals of Healthcare Programming in Perln Pythoni and Ruby Jules J- Berman TECHNISCHE INFORMATION SBIBLIOTHEK
Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.
White Paper Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. Using LSI for Implementing Document Management Systems By Mike Harrison, Director,
LDIF - Linked Data Integration Framework
LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],
Is 20TB really Big Data?
From Big Data to Chemical Information (RSC CICAG) 22 April 2015 London 100 million compounds, 100K protein structures, 2 million reactions, 1 million journal articles, 20 million patents and 15 billion
Cellular Respiration: Practice Questions #1
Cellular Respiration: Practice Questions #1 1. Which statement best describes one of the events taking place in the chemical reaction? A. Energy is being stored as a result of aerobic respiration. B. Fermentation
Model Deployment. Dr. Saed Sayad. University of Toronto 2010 [email protected]. http://chem-eng.utoronto.ca/~datamining/
Model Deployment Dr. Saed Sayad University of Toronto 2010 [email protected] http://chem-eng.utoronto.ca/~datamining/ 1 Model Deployment Creation of the model is generally not the end of the project.
Cleaned Data. Recommendations
Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110
How to Work with a Reference Answer Set
How to Work with a Reference Answer Set Easily identify and isolate references of interest Quickly retrieve relevant information from the world s largest, publicly available reference database for chemistry
Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk
Text Mining for Health Care and Medicine Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk The Need for Text Mining MEDLINE 2005: ~14M 2009: ~18M Overwhelming information in textual,
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
SciFinder. Essential Content. Proven Results. Are You Ready for the Web? Österreichischer Bibliothekartag Innsbruck 2011
SciFinder Essential Content. Proven Results. Are You Ready for the Web? Österreichischer Bibliothekartag Innsbruck 2011 What is SciFinder? Simply put SciFinder is the first choice for chemists and related
ProteinQuest user guide
ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for
Healthcare Analytics. Aryya Gangopadhyay UMBC
Healthcare Analytics Aryya Gangopadhyay UMBC Two of many projects Integrated network approach to personalized medicine Multidimensional and multimodal Dynamic Analyze interactions HealthMask Need for sharing
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
testo dello schema Secondo livello Terzo livello Quarto livello Quinto livello
Extracting Knowledge from Biomedical Data through Logic Learning Machines and Rulex Marco Muselli Institute of Electronics, Computer and Telecommunication Engineering National Research Council of Italy,
SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies)
SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies) Luca Falciola, IP Manager, Promethera Biosciences Sardegna Ricerche-Univ. Cagliari (Sep. 15 th 2014) Databases
Information Management
Information Management Dr Marilyn Rose McGee-Lennon [email protected] What is Information Management about Aim: to understand the ways in which databases contribute to the management of large amounts
How many of you have checked out the web site on protein-dna interactions?
How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss
From Data to Foresight:
Laura Haas, IBM Fellow IBM Research - Almaden From Data to Foresight: Leveraging Data and Analytics for Materials Research 1 2011 IBM Corporation The road from data to foresight is long? Consumer Reports
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
STUDY GUIDE AGRICULTURAL SCIENCES GRADE 11
STUDY GUIDE AGRICULTURAL SCIENCES GRADE 11 A publication of Impak Onderwysdiens (Pty) Ltd Copyright reserved. Apart from any fair dealing for the purpose of research, criticism or review as permitted under
Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2
Oracle 11g DB Data Warehousing ETL OLAP Statistics Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2 Data Mining Charlie Berger Sr. Director Product Management, Data
Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University
Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge http://euadr-project.org PEDRO LOPES [email protected] University of Manchester October
Extracting value from scientific literature: the power of mining full-text articles for pathway analysis
FOR PHARMA & LIFE SCIENCES WHITE PAPER Harnessing the Power of Content Extracting value from scientific literature: the power of mining full-text articles for pathway analysis Executive Summary Biological
An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data
n Introduction to the Use of ayesian Network to nalyze Gene Expression Data Cristina Manfredotti Dipartimento di Informatica, Sistemistica e Comunicazione (D.I.S.Co. Università degli Studi Milano-icocca
Drexel-SDP GK-12 ACTIVITY
Drexel-SDP GK-12 ACTIVITY Subject Area(s): Biology Associated Unit: None Associated Lesson: None Activity Title : Plant or Animal Cell? Grade Level: 7 and 8 (7-9) Activity Dependency: None Time Required:
Translation Study Guide
Translation Study Guide This study guide is a written version of the material you have seen presented in the replication unit. In translation, the cell uses the genetic information contained in mrna to
Provalis Research Text Analytics and the Victory Index
point Provalis Research Text Analytics and the Victory Index Fern Halper, Ph.D. Fellow Daniel Kirsch Senior Analyst Provalis Research Text Analytics and the Victory Index Unstructured data is everywhere
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources
Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software
11-792 Software Engineering EMR Project Report
11-792 Software Engineering EMR Project Report Team Members Phani Gadde Anika Gupta Ting-Hao (Kenneth) Huang Chetan Thayur Suyoun Kim Vision Our aim is to build an intelligent system which is capable of
Workshop. Neil Barrett PhD, Jens Weber PhD, Vincent Thai MD. Engineering & Health Informa2on Science
Engineering & Health Informa2on Science Engineering NLP Solu/ons for Structured Informa/on from Clinical Text: Extrac'ng Sen'nel Events from Pallia've Care Consult Le8ers Canada-China Clean Energy Initiative
Guidelines for Establishment of Contract Areas Computer Science Department
Guidelines for Establishment of Contract Areas Computer Science Department Current 07/01/07 Statement: The Contract Area is designed to allow a student, in cooperation with a member of the Computer Science
How to stop looking in the wrong place? Use PubMed!
How to stop looking in the wrong place? Use PubMed! 1 Why not just use? Plus s Fast! Easy to remember web address Its huge - you always find something It includes PubMed citations Downside Is simply finding
Data, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project
Seminar on Dec 19 th Abstracts & speaker information The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Eleni Bozia (USA) Angelos Barmpoutis (USA)
Next Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013
Next Generation Sequencing: Adjusting to Big Data Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013 Outline Human Genome Project Next-Generation Sequencing Personalized Medicine
life science data mining
life science data mining - '.)'-. < } ti» (>.:>,u» c ~'editors Stephen Wong Harvard Medical School, USA Chung-Sheng Li /BM Thomas J Watson Research Center World Scientific NEW JERSEY LONDON SINGAPORE.
Service Road Map for ANDS Core Infrastructure and Applications Programs
Service Road Map for ANDS Core and Applications Programs Version 1.0 public exposure draft 31-March 2010 Document Target Audience This is a high level reference guide designed to communicate to ANDS external
Big Data Healthcare. Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut
Big Data Healthcare Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut Healthcare Is in Crisis Hersh, W., Jacko, J. A., Greenes,
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and
Are you ready for more efficient and effective ways to manage discovery?
LexisNexis Early Data Analyzer + LAW PreDiscovery + Concordance Software Are you ready for more efficient and effective ways to manage discovery? Did you know that all-in-one solutions often omit robust
Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
Concept and Applications of Data Mining. Week 1
Concept and Applications of Data Mining Week 1 Topics Introduction Syllabus Data Mining Concepts Team Organization Introduction Session Your name and major The dfiiti definition of dt data mining i Your
Integrating Bioinformatics, Medical Sciences and Drug Discovery
Integrating Bioinformatics, Medical Sciences and Drug Discovery M. Madan Babu Centre for Biotechnology, Anna University, Chennai - 600025 phone: 44-4332179 :: email: [email protected] Bioinformatics
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
Understanding and Supporting Intersubjective Meaning Making in Socio-Technical Systems: A Cognitive Psychology Perspective
Understanding and Supporting Intersubjective Meaning Making in Socio-Technical Systems: A Cognitive Psychology Perspective Sebastian Dennerlein Institute for Psychology, University of Graz, Universitätsplatz
Dr Alexander Henzing
Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S.
Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S. Definition of Game Changer A newly introduced element or factor that changes an existing situation
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
Bio-IT World 2013 Best Practices Awards
Published Resources for the Life Sciences 250 First Avenue, Suite 300, Needham, MA 02494 phone: 781-972-5400 fax: 781-972-5425 Bio-IT World 2013 Best Practices Awards Celebrating Excellence in Innovation
Final Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications Gaël de Chalendar CEA LIST F-92265 Fontenay aux Roses [email protected] 1 Introduction The main data sources
Interpretation of Data (IOD) Score Range
These Standards describe what students who score in specific score ranges on the Science Test of ACT Explore, ACT Plan, and the ACT college readiness assessment are likely to know and be able to do. 13
Medical Information-Retrieval Systems. Dong Peng Medical Informatics Group
Medical Information-Retrieval Systems Dong Peng Medical Informatics Group Outline Evolution of medical Information-Retrieval (IR). The information retrieval process. The trend of medical information retrieval
HELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
A Knowledge-Poor Approach to BioCreative V DNER and CID Tasks
A Knowledge-Poor Approach to BioCreative V DNER and CID Tasks Firoj Alam 1, Anna Corazza 2, Alberto Lavelli 3, and Roberto Zanoli 3 1 Dept. of Information Eng. and Computer Science, University of Trento,
Types of Engineering Jobs
What Do Engineers Do? Engineers apply the theories and principles of science and mathematics to the economical solution of practical technical problems. I.e. To solve problems Often their work is the link
What s New in Pathway Studio Web 11.1
1 1 What s New in Pathway Studio Web 11.1 Elseiver is pleased to announce the release of Pathway Studio Web 11.1 for all database subscriptions (Mammal, Mammal+ChemEffect+DiseaseFx, Plant). This release
Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social
Voluntary Genomic Data Submissions at the U.S. FDA
Voluntary Genomic Data Submissions at the U.S. FDA International Conference on Harmonization Chicago, IL November 9-10, 9 2005 Felix W. Frueh, PhD Associate Director for Genomics Office of Clinical Pharmacology
Chemistry Add-in for Word User s Guide
Chemistry Add-in for Word User s Guide Abstract This document describes how to use the Chemistry Add-in for Word, an add-in for Microsoft Word that provides a simple and flexible way to include chemical
Predicting the Stock Market with News Articles
Predicting the Stock Market with News Articles Kari Lee and Ryan Timmons CS224N Final Project Introduction Stock market prediction is an area of extreme importance to an entire industry. Stock price is
PREDICTIVE ANALYTICS: PROVIDING NOVEL APPROACHES TO ENHANCE OUTCOMES RESEARCH LEVERAGING BIG AND COMPLEX DATA
PREDICTIVE ANALYTICS: PROVIDING NOVEL APPROACHES TO ENHANCE OUTCOMES RESEARCH LEVERAGING BIG AND COMPLEX DATA IMS Symposium at ISPOR at Montreal June 2 nd, 2014 Agenda Topic Presenter Time Introduction:
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
