Creating Knowledge From Interlinked Data From Big Data to Smart Data



Similar documents
Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

PICASSO Big Data Expert Group

Creating Data Value Chains by Linking Enterprise Data

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

Publishing Linked Data Requires More than Just Using a Tool

LDIF - Linked Data Integration Framework

Semantic Interoperability

DISCOVERING RESUME INFORMATION USING LINKED DATA

Open Data Integration Using SPARQL and SPIN

Linked Statistical Data Analysis

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

GetLOD - Linked Open Data and Spatial Data Infrastructures

Lightweight Data Integration using the WebComposition Data Grid Service

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

How To Build A Cloud Based Intelligence System

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe

Visual Analysis of Statistical Data on Maps using Linked Open Data

Geospatial Data and the Semantic Web. The GeoKnow Project. Sebastian Hellmann AKSW/KILT research group, Leipzig University & DBpedia Association

THE SEMANTIC WEB AND IT`S APPLICATIONS

We have big data, but we need big knowledge

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

Semantic Web Tool Landscape

LiDDM: A Data Mining System for Linked Data

RDF Dataset Management Framework for Data.go.th

DataOps: Seamless End-to-end Anything-to-RDF Data Integration

LINKED OPEN DRUG DATA FROM THE HEALTH INSURANCE FUND OF MACEDONIA

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability

Data Publishing with DaPaaS

TopBraid Insight for Life Sciences

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint

STAR Semantic Technologies for Archaeological Resources.

Mining the Web of Linked Data with RapidMiner

Andreas Harth, Katja Hose, Ralf Schenkel (eds.) Linked Data Management: Principles and Techniques

From a World-Wide Web of Pages to a World-Wide Web of Things

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

Linked Data Publishing with Drupal

Towards the Integration of a Research Group Website into the Web of Data

A Big Picture for Big Data

COLINDA: Modeling, Representing and Using Scientific Events in the Web of Data

ELIS Multimedia Lab. Linked Open Data. Sam Coppens MMLab IBBT - UGent

Towards a reference architecture for Semantic Web applications

The Catalogus Professorum Lipsiensis

Introduction to Ontologies

Scope. Cognescent SBI Semantic Business Intelligence

A Semantic web approach for e-learning platforms

Integrating Open Sources and Relational Data with SPARQL

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Dude, where s my graph? RDF Data Cubes for Clinical Trials Data.

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

Mining Big Data with RDF Graph Technology:

DBpedia German: Extensions and Applications

Semantics and Ontology of Logistic Cloud Services*

ON DEMAND ACCESS TO BIG DATA. Peter Haase fluid Operations AG

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

Publishing Relational Databases as Linked Data

Network Graph Databases, RDF, SPARQL, and SNA

Pragmatic Web 4.0. Towards an active and interactive Semantic Media Web. Fachtagung Semantische Technologien September 2013 HU Berlin

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

SMART Apps. Rob Tweed M/Gateway Developments

Integrating Linked Open Data into Open Source Web Mapping

Big Data Analytics. Rasoul Karimi

The Ontological Approach for SIEM Data Repository

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD

Programming the Semantic Web

Transcription:

Creating Knowledge From Interlinked Data From Big Data to Smart Data Prof. Dr. Sören Auer 2 5. 0 9. 2 0 1 4 Fraunhofer-Institut für Intelligente 1

The Web evolves into a Web of Data Linked Open Data Facebook Open Graph Fraunhofer-Institut für Intelligente 2

Die Evolution des Web & Intranets 1990 1995 2000 2005 2010 2015 Web 1.0 - Hypertext Static Web pages Hyperlinks Link directories Web 2.0 Social Apps Social Web Crowd-sourcing Mashups Web 3.0 Linked Data REST APIs, RDF, JSON-LD Vocabularies Rich-snippets, Semantic Search Intranet 1.0 - Hypertext Static Intranet pages Keyword search Hyperlinks Intranet 2.0 Social Enterprise Apps Salesforce Crowd-sourcing Mashups Intranet 3.0 Enterprise Data Intranet URI Scheme Enterprise taxonomies / knowledge bases RDB2RDF Mapping Fraunhofer-Institut für Intelligente 3

Towards Information Oriented Architectures IBM DeveloperWorks: The role of semantic models in smarter industrial operations http://www.ibm.com/developerworks/xml/library/x-ind-semanticmodels/ Fraunhofer-Institut für Intelligente 4

Linked Data Principles 1. Use URIs to identify the things in your data 2. Use http:// URIs so people (and machines) can look them up on the web 3. When a URI is looked up, return a description of the thing (in RDF format) 4. Include links to related things http://www.w3.org/designissues/linkeddata.html Fraunhofer-Institut für Intelligente 5

Linked Data in a Nutshell 1. Uses RDF Data Model organizes NAF LAC Subject Predicate Object starts takesplacein 27.11.2014 Nieuwegein 2. Is serialised in triples: NAF organizes LAC. LAC starts 20141127 ^^xsd:date. LAC takesplaceat dbpedia:nieuwegein. 3. Publish in the Intranet (or on the Web) Fraunhofer-Institut für Intelligente

Encode Relational Data in RDF Primary keys become subjects Columns become properties Cells become objects Mapping Standard: W3C R2RML Relational to RDF Mapping Language Many mature implementations: SparqlMap, Sparqlify, OnTop Fraunhofer-Institut für Intelligente

Encode Taxonomic Data in RDF Vehicle Car Truck Hatchback Cabrio Sedan Light Rigid Heavy Rigid Combination Hot Hatch Vehicle a rdfs:class Car rdfs:subclassof Vehicle Truck rdfs:subclassof Vehicle Cabrio rdfs:subclassof Car. Fraunhofer-Institut für Intelligente 8

RDF is the Lingua Franca of Data Integration RDF is simple We can easily encode and combine all kinds of data models (relational, taxonomic, graphs, object-oriented, ) RDF supports distributed data and schema We can seamlessly ev olv e simple semantic representations (vocabularies) to more complex ones (e.g. ontologies) Small repres entational units (URI/IRIs, triples) facilitate mixing and mashing RDF can be viewed from many perspectives : facts, graphs, ER, logical axioms, graphs, objects RDF integrates well with other formalisms - HTML (RDFa), XML (RDF/XML), JSON (JSON-LD), CSV, Linking and referencing between different knowledge bases, systems and platforms facilitates the creation of sustainable data ecos y s tem s Fraunhofer-Institut für Intelligente 9

The emerging Web of Data 2007 20082008 2008 2008 2009 2009 2010 Fraunhofer-Institut für Intelligente Linking Open Data cloud diagram,analyseby und Informationssysteme IAIS Richard Cyganiak and Anja Jentzsch.

Distributed Social Semantic Networking Fraunhofer-Institut für Intelligente

Linked Enterprise Data Principles 1. Evolve existing existing taxonomies into enterprise knowledge bases/hubs 2. Establish a enterprise wide URI scheme 3. Equip existing information systems in your intranet with Linked Data interfaces 4. Establish links between related information Fraunhofer-Institut für Intelligente 12

Linked Enterprise Data Advantages Light-weight linked data integration complements more complex SOA architectures Unified data (access) model simplifies data integration Increase standardization while preserving diversity Facilitate information flows along supply and value creation chains Dramatically reduce data integration costs, increase enterprise flexibility Fraunhofer-Institut für Intelligente 13

From Intranet to Enterprise Data Web around a knowledge hub Auer, Frischmuth, Klímek, Unbehauen, Holzweißig, Marquardt: Linked Data in Enterprise Inform ation Integration S ubm itted to S em antic Web Journal 2012. Fraunhofer-Institut für Intelligente

Creating Knowl edg e out of Interlinked Data Facilitates data integration along value-chains within and across enterprises When just data shall be exchanged and integrated SOA is quite expensive PricewaterhouseCoopers, Technology Forecast, 2009

Creating Knowl edg e out of Interlinked Data Linked Enterprise Intra Data Webs fill the gap between Intra-/Extranets and EIS/ERP Unstructured Information Management Structured Information Management Human-resources Requirements engineering Supply-chains Support the long tail of enterprise information domains

Towards Data Value Chains Data Value Chains today Extraction, Curation, Quality, Linking, Integration, Publication, Visualization, Analysis Increased Specialization Separation of concerns More interdisciplinarity Data Value Chains tomorrow Extraction, Curation Quality, Linking, Integration Publication, Visualization, Analysis Extraction Curation Quality Linking Integration Publication Visualization Analysis

Interlinking/ Fus ing Manual revision/ authoring Classification/ Enrichm ent Storage/ Query ing Linked Data Lifecycle Quality Analysis Ex traction Evolution / Repair Fraunhofer-Institut für Intelligente Search/ Browsing/ Ex ploration

Creating Knowledge out of Interlinked Data LOD2 Stack Components Limes Silk Silk-latc RDFAuthor OntoWiki Interlinking/ Fusing Sieve Sparqlify LOD2 Authentication Component SIREn CKAN (source) Dbpedia (source) sparqlproxy SPARQLed PoolParty Manual revision/ authoring lodms Classification/ Enrichmen t dl-learner LOD2 Provenance Component Data Cube Validation Tool rdf-dataset-integration Mondeca Sparql Endpoint Status Virtuoso RDF Store (conductor, sparql, isparql, ods) Storage/ Querying LOD2 demonstrator LOD2 stat. Workbench Quality Analysis ORE Web Linkage Validator CSVImpor t Dbpedia Spotlight UI R2R PoolParty Extractor stanbol D2R Virtuoso sponger Dbpedia Spotlight valiant Extraction Semantic Spatial Browser Search/ Browsing/ Exploratio n lod2webapi SigmaEE Evolution / Repair CubeViz Sindice (source) Data Cube Merging Tool LOD open refine Data Cube Slicing Tool http://lod2.eu

Creating Knowledge out of Interlinked Data Eccenca Linked Data Stack Core compontent of the LOD2 stack including support, maintenance, industrial-grade In production used bz: Wolters Kluwer, Daimler, EU Commission Strategic further development in R&D projects http://lod2.eu

Linked Enterprise Data Fraunhofer-Institut für Intelligente 21

Daimler CIO Michael Gorriz receives European Data Innovator Award Fraunhofer-Institut für Intelligente 22

Classic search Fraunhofer-Institut für Intelligente 23

Fraunhofer-Institut für Intelligente Creation of a knowledge base consisting of: 1. car models database

Subject Predicate Object Fraunhofer-Institut für Intelligente

Creation of a knowledge base: 2. Daim ler Corporate Thes aurus Fraunhofer-Institut für Intelligente Management of Enterprise Taxonomies with OntoWiki Based on the W3C SKOS standard Corporate Language Management at Daimler: 500k

Fraunhofer-Institut für Intelligente Semantic search suggestions based on the interlinked knowledge base

Fraunhofer-Institut für Intelligente

Fraunhofer-Institut für Intelligente

Fraunhofer-Institut für Intelligente

Cortex - Semantic Search Fraunhofer-Institut für Intelligente 31

Semantische Suchplattform IAIS-Cortex Herz der Deutschen Digitalen Bibliothek Flexible Data Integration (Import) Powerful toolbox for aggregation of heterogenous data sources >2.000 Data Providers Reliable Data Management Based on filesystem or cloud technology >10.000 concurrent users Individual Access Powerful semantic search and performant access of objects Fraunhofer-Institut für Intelligente 32

Deutsche Digitale Bibliothek Fraunhofer-Institut für Intelligente 33

Fraunhofer-Institut für Intelligente

Fraunhofer-Institut für Intelligente

Fraunhofer-Institut für Intelligente 36

Fraunhofer-Institut für Intelligente 37

Kontakt Fraunhofer-Institut IAIS Schloss Birlinghoven 53754 Sankt Augustin www.iais.fraunhofer.de Prof. Dr. Sören Auer Tel: 02241-14 1915 email: soeren.auer@iais.fraunhofer.de Die Rechte der Logos und Bilder verbleiben bei ihren Eigentümern. Fraunhofer-Institut für Intelligente 38