The Preparation of Information in Data Science

Similar documents
THE SEMANTIC WEB AND IT`S APPLICATIONS

An Application Ontology to Support the Access to Data of Medical Doctors and Health Facilities in Brazilian Municipalities

DISCOVERING RESUME INFORMATION USING LINKED DATA

Evangelia Mitsopoulou, St George s University of London Panagiotis Bamidis, Aristotle University of Thessaloniki Daniela Giordano, University of

DRUM Distributed Transactional Building Information Management

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

Linking Maritime Datasets to Dutch Ships and Sailors Cloud - Case studies on Archangelvaart and Elbing. J.A. Entjes July 10th, 2015

Linked Open Data A Way to Extract Knowledge from Global Datastores

Joshua Phillips Alejandra Gonzalez-Beltran Jyoti Pathak October 22, 2009

How To Write A Drupal Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post

Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction

Too Much Data or Too Little Cooperation? Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science

Converging Web-Data and Database Data: Big - and Small Data via Linked Data

7/15/2015 THE CHALLENGE. Amazon, Google & Facebook have Big Data problems. in Oncology we have a Small Data Problem!

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA

DataBridges: data integration for digital cities

Enhancing the University s Knowledge Management Using VIVO

Secure Semantic Web Service Using SAML

Semantic Interoperability

Templates and Archetypes: how do we know what we are talking about?

Reason-able View of Linked Data for Cultural Heritage

FHIM Model Content Overview

Semantic Modeling of Mortgage Backed Securities: Case Study. Mike Bennett, Enterprise Data Management Council Yefim Zhuk, Sallie Mae

Web Services - Consultant s View. From IT Stategy to IT Architecture. Agenda. Introduction

Siemens Future HANNOVER MESSE Internet of Things and Services Guido Stephan

LINKED OPEN DRUG DATA FROM THE HEALTH INSURANCE FUND OF MACEDONIA

A Risk Management Approach to Data Preservation

JOURNAL OF OBJECT TECHNOLOGY

Introduction to Service Oriented Architectures (SOA)

Algorithms, Flowcharts & Program Design. ComPro

Java Programming (10155)

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

Publishing Linked Data Requires More than Just Using a Tool

Semantic Search in Portals using Ontologies

Lightweight Data Integration using the WebComposition Data Grid Service

Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data

HL7 NCPDP e-prescribing harmonization: using the v3 HDF for as a basis for semantic interoperability

ONTOLOGY FOR MOBILE PHONE OPERATING SYSTEMS

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

Health Data Analysis Specialty Track Curriculum Competencies

Ontology quality and fitness: A survey of so6ware support

ONEM2M SERVICE LAYER PLATFORM

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

Towards a reference architecture for Semantic Web applications

What s a BA to do with Data? Discover and define standard data elements in business terms. Susan Block, Program Manager The Vanguard Group

Cross-Sectional Integration of LAM Resources on the Basis of Authority Data

Open Source egovernment Reference Architecture Osera.modeldriven.org. Copyright 2006 Data Access Technologies, Inc. Slide 1

Visual Analysis of Statistical Data on Maps using Linked Open Data

Data Quality Committee Mission Statement

María Elena Alvarado gnoss.com* Susana López-Sola gnoss.com*

Service Oriented Architecture

Information Technology for KM

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

LinksTo A Web2.0 System that Utilises Linked Data Principles to Link Related Resources Together

Mining the Web of Linked Data with RapidMiner

Draft URI Strategy for the NL Public Sector

Linked2Safety FP

The Trellis Dynamic Infrastructure Optimization Platform for Data Center Infrastructure Management (DCIM)

SmartLink: a Web-based editor and search environment for Linked Services

Overview. Essential Questions. Grade 2 Mathematics, Quarter 4, Unit 4.4 Representing and Interpreting Data Using Picture and Bar Graphs

Week 1: Introduction. Transcript of Week 1 Podcast

A bright picture for biomedical informatics in the 21 st century

Presente e futuro del Web Semantico

In this Lecture you will Learn: Systems Development Methodologies. Why Methodology? Why Methodology?

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

Software Engineering Transfer Degree

Integrating FLOSS repositories on the Web

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

A Collaborative System Software Solution for Modeling Business Flows Based on Automated Semantic Web Service Composition

Industry 4.0 and Big Data

Classifying Adverse Events From Clinical Trials

MEng, BSc Computer Science with Artificial Intelligence

Linked Open Government Data Analytics

Leveraging existing Web frameworks for a SIOC explorer to browse online social communities

The Value of Taxonomy Management Research Results

Towards A Semantic & Domain-agnostic Scientific Data Management System

What is a metamodel: the OMG s metamodeling infrastructure

A generic approach for data integration using RDF, OWL and XML

Benjamin Heitmann Digital Enterprise Research Institute, National University of Ireland, Galway

Clinical Quality Improvement

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

Data Modeling Basics

Context Model Based on Ontology in Mobile Cloud Computing

How to Publish Linked Data on the Web

From Data to Foresight:

Visible Business Templates An Introduction

Database Resources. Subject: Information Technology for Managers. Level: Formation 2. Author: Seamus Rispin, current examiner

Open issues regarding legal metadata: IP licensing and management of different cognitive levels

California Enterprise Architecture Framework

Seeking Open Educational Resources to Compose Massive Open Online Courses in Engineering Education An Approach Based on Linked Open Data

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Independent Insight for Service Oriented Practice. An SOA Roadmap. John C. Butler Chief Architect. A CBDI Partner Company.

Storage Technology. Standards Trends

White Paper. An Introduction to Informatica s Approach to Enterprise Architecture and the Business Transformation Toolkit

Application of ontologies for the integration of network monitoring platforms

Andreas Harth, Katja Hose, Ralf Schenkel (eds.) Linked Data Management: Principles and Techniques

Chapter 12. The Product Coordination Team

Semantische webtechnologieën voor digitaal erfgoed en de geschiedwetenschap. Victor de Boer Web & Media the Network Institute

Transcription:

The Preparation of Information in Data Science

The Role of Ontologies in Unlocking Big Data Big Data holds the potential of revealing great insights from large diverse data sets if properly exploited with the right analytics To better realize this potential a shift needs to occur from representations of individual data sets to representations that enable interoperability across all data sets 2

The Common Core Development Method Rule governed development of an extensible set of ontologies to which data from sub-domains can be aligned and linked together Combines principles from the Linked Open Data Initiative, Open Biological and Biomedical Ontologies (OBO) Foundry, and object-oriented programming 3

Linked Open Data Initiative Began as a means for integrating data on the world wide web Based on a simple set of guiding principles* Use Universal Resource Identifiers (URIs) as names of things Use HTTP URIs so that people can look up those names When someone looks up a URI provide useful information Include links to other URIs so they can discover other things *Tim Berners-Lee Linked Open Data h:ps://www.w3.org/designissues/linkeddata 4

A Linked Open Data Success Story DBPedia Pages accessed from web browsers that link data from Wikipedia 5

Linked Open Data Issue - A Profusion of Ontologies Linking Open Data cloud diagram 2014, by Max Schmachtenberg, ChrisPan Bizer, Anja Jentzsch and Richard Cyganiak. h:p://lod-cloud.net/ 6

Effects of Profusion Costs increase relative to the amount of duplicative effort relative to the number of mappings relative to the number of vernaculars Effectiveness decreases Searches have low recall and precision Re-use creates ambiguities 7

OBO Foundry The Open Biological and Biomedical (OBO) Foundry is a collaborative group of organizations devoted to establishing best practices in ontology development Leverages the lessons learned from over $300M investment in ontology development 8

An OBO Foundry Best Practice Use a Common Upper EnPty Object Quality OrganizaPo n Physical ArPfact bearer_of Quality of OrganizaPo n Quality of Physical ArPfact has_quality has_quality Produces common patterns within ontologies Reuse of mappings from the sources Easier to include new sources of data Enables reuse of queries and analytics Structure of data stays constant Easier to transition to new domains of interest 9

Basic Formal An upper ontology with not more than 40 class terms and 20 relationships Provides an extensible structure for the interrelationships between basic entities Used as the upper ontology in hundreds of ontologies, primarily in the biomedical domain Used by at least one hundred different project 10

An OBO Foundry Best Practice - Truth as a Development Guideline Strive towards creating a digital copy of the world Adds the constraint that every assertion within an ontology must be true Reduces perspective from the ontology enabling links to many sources Provides an objective means for settling disputes over terminology 11

OBO Foundry Issue - Ontologies with Too Wide a Scope Good practice of reusing existing terminology But the of Biomedical Investigations (OBI) is not a logical choice for where the term Organization is maintained 12

Object Oriented Programming - Modularity as a Development Guideline One axis of modularity in the CCO is level of generality Upper and midlevel ontologies are stable and of manageable scale Upper Ontologies Describe the Structure of the World Mid-Level Ontologies Add General Content to the Structure Content and structure is inherited from higher levels Domain Level Ontologies Add Content Relevant to a Community 13

Object Oriented Programming - Modularity as a Development Guideline The second axis of modularity in the CCO is content parpcipates in Process Physical Object occurs on occurs at contained in has Temporal Region Site Site A:ribute 14

The Common Core Ontologies in Practice The Common Core Ontologies (CCO) are intended to serve as a vocabulary that can describe objects and processes that are common to many domains of interest The remaining objects and processes that are unique to particular domains of interest are described by ontologies that extend from the CCO in a repeatable, rule governed process 15

The Common Core and Domain Ontologies Basic Formal (BFO) Upper : Extended RelaPon Common Core : Domain : Event Agent Quality ArPfact GeospaPal Time Affec%ve State Ethnicity Occupa%on Ci%zenship Curriculum Sensor Undersea Warfare WatercraC Hydrographic Feature Agent Informa%on Physiographic Feature InformaPon EnPty Units of Measure Space Object Currency Unit 16

The Benefits of the Common Core Development Process 17