Best practices for Linked Data
|
|
|
- Gilbert James
- 10 years ago
- Views:
Transcription
1 Best practices for Linked Data Asunción Gómez-Pérez Facultad de Informática, Universidad Politécnica de Madrid Avda. Montepríncipe s/n, Boadilla del Monte, Madrid Phone: , Fax: Acknowledgements: M. Poveda, V. Rodríguez-Doncel, D. Vila BabeLData: TIN
2 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December Linked Data: why it is important? Facilitate data integration From heterogeous sources In different formats Different granularity In different languages From different countries Slide adapted from 5min Introduction to Linked Data - Olaf Hartig
3 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December 3 3 BD BNE BD VIAF BD AEMET BD IGN BD Prisa BD DBpedia Data Integration BNE Ubicado en Alcalá de Henares 1605 El Quijote Año de Publicación Autor birthplace Same as M. Cervantes M. Cervantes Alcalá de Henares M. Cervantes Year of publication creator Don Quixote 1960 Translated into Hebrew VIAF located Alcalá de Henares guía Tapas Siglo de Oro Alcalá de Henares Temperatura 20º
4 RDF(S) models Unique identifiers: URI identify or name a resource Foundations Equivalence links to other datasets Same As Data navigation Person Is creator of Cer Work Is a Is a Cervantes Is creator of Cer El Quijote Same As Same As Cervantes Cervantes Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December 4
5 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December 5 5 The model (Ontology) and the data for humans Idiom Year translation Publication date Work Is creator of Person birthplace Place Ontology Located at Library Has subject Catalán 1960 translation Publication date El Quijote Is creator of Cervantes birthplace Alcalá de Henares Located in Has subject Vida de Cervantes Data BNE
6 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December 6 6 The model and the data for Machines Language Ontology translation work Is creator of Person Año Publication date birthplace Located in Has subject Biblioteca Catalán translation de Henares 1960 Publication date Don Quijote de la Mancha Es autor Cervantes Saavedra, Miguel de birthplace Has subject BNE Located in Vida de Miguel de Cervantes Saavedra Data
7 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December Linked Data is to be processed by machines
8 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December The generation process Providers Domains Sources Languages
9 The Linked Data Generation Process Specification Data Curation Exploitation Modelling Publication Generation Linking 9 There is no One-Size-Fits-All Formula
10 Lot of data in many domains Music On-line activities E-Gov Cross-domains Publications Geographic Life Sciences
11 I want to use Linked Open Data Who generated the LD dataset? When the LD dataset was created? How the LD dataset was created? Is the latest version of the LD dataset? Is the license information clearly stated in the LD dataset? How is LD licenses offered? Is the LD dataset monolingual or multilingual?
12 LOD observations How the LD generation process influence the use of the data by third parties? Vocabularies Licenses Language Provenance
13 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December How to prevent GIGO GARBAGE PROCESS
14 Vocabularies 14 th
15 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December Cervantes at the data level Same as URI URI URI URI URI Cervantes Same as Same as D. Quijote Author Phone Date of Birth #People Same as 1547 Size ,4 km²
16 rdf:type Cervantes and a bit of semantics rdf:type Person Retaurant URI URI URI URI URI Cervantes (Person) rdf:type Same as rdf:type Street Author D. Quijote Date of Birth rdf:type Municipality Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December
17 17 Cervantes foaf foaf:agent foaf:group foaf:organization foaf:document foaf:person foaf:publications foaf:image foaf:mbox - foaf:firstname - foaf:surname - foaf:birthday foaf:img owl:thing foaf:knows foaf:depiction Miguel de Cervantes Saavedra foaf:firstname foaf:surname instanceof bibliothek:cervantes instanceof foaf:homepage instanceof instanceof foaf:birthday foaf:img /images/quixote.tif foaf:publications foaf:depiction
18 18 License Information
19 How Open is the Open Linked Data Cloud? LOD observations: Licenses
20 An example: the British National Bibliography
21 License Information is not up to date
22 Metadata information without license information
23 License information provided as XML
24 Linked Data Rights pattern
25 Lenguage 25
26 Rationale: LOD is dominated by the English Language Questions: 1. Searching resources in a particular language 2. Distribution of natural languages across RDF datasets? 3. Usage of language tags to indicate the natural language of RDF tags? 1. Distribution of usage of language tags 2. Distribution of literals tagged as English vs other languages 3. Distribution of literals tagged in languages other than English 26
27 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December 27 Example of multilingual library resource The dataset publisher does not tag the language of the content of different fields Ernest Hemingway and El viejo y el mar MARC 21 records
28 Asunción Gómez-Pérez Spain 2013 Madrid, 18 th December Multilingualism and the Linked Data Process How to represent language information for datasets? # VoiD description :bne a void:dataset; dcterms:language < # DCAT description :bne a dcat:dataset; dcterms:language < How to represent language information in Linked Data? Traditional annotation properties for most cases dbpedia:miguel_de_cervantes rdfs:label "Miguel de Cervantes"@es. "ミゲル デ セルバンテス"@ja. " "@ko. Richer models for more demanding applications # LEMON isbd:t1001 lemon:isreferenceof [lemon:issenseof :cartographic]. :cartographic a lemon:lexicalentry; lemon:form [lemon:writtenrep isocat:grammaticalgender isocat:masculine]; lemon:form [lemon:writtenrep isocat:grammaticalgender isocat:feminine]. isocat:grammaticalgender rdfs:subpropertyof lemon:property.
29 Implementation of the recording of data and metadata provenance Generation process Resource provenance DC File.txt creator creadondate John rights GPL used Revision Process generatedby PROVENANCE Model (RDF(S)) Filev1. txt RDF Store 29 1
30 Asuncion Gomez-Perez Spain 2013 Madrid, 18 th December Conclusions The use of Data curated Use vocabularies widely known License metadata in RDF Language metadata in RDF Provenance metadata in RDF Will influence the use of the linked data by third parties
31 Thanks for your attention! Asuncion Gomez-Perez Guidelines for Multilingual Linked Data. WIMS 2013 Madrid, June 31
32 There is no One-Size-Fits-All Formula Phase BNE IGN AEMET PRISA INE Modeling DC hydrontology Wgs84 time SSN ontology SIOC Scovo Data cube RDF generation MARiMbA geometry2rdf NOR2O CSV parser CSV parser NOR2O Links generation DNB VIAF LIBRIS DBPEDIA Silk Silk Silk DBPEDIA DBPEDIA Geolinkeddata.es Geonames Geolinkeddata.es NOR2O Geolinkeddata.es Publication Pubby sitemap4rdf Exploitation map4rdf SPARQL
33 The multilingual Web of Data: Current state Monolingual datasets Multilingual datasets RDF literals without language tag RDF literals with language tag ,567,324 3,154,779 3,365,930 1,906 2,201 1,984 10,250,936 10,594,338 12,272,806 January 2012 June 2012 December Number of Monolingual and multilingual datasets January 2012 June 2012 December Current usage of language tagging capabilities in RDF RDF literals with English tag RDF literals with other language tag 431, , ,785 2,135,664 2,751,065 2,808,145 January 2012 June 2012 December English tags versus other languages' tags 4. Evolution of top-10 languages 33
Publishing Linked Data There is no One-Size-Fits-All Formula
Publishing Linked Data There is no One-Size-Fits-All Formula Asunción Gómez-Pérez Facultad de Informática, Universidad Politécnica de Madrid Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
Open Data. Asunción Gómez-Pérez Ontology Engineering Group Artificial Intelligence Department Universidad Politécnica de Madrid [email protected].
Open Data Asunción Gómez-Pérez Ontology Engineering Group Artificial Intelligence Department Universidad Politécnica de Madrid [email protected] @asungomezperez Acknowledgements: Oscar Corcho, Raul García,
Introduction to the Semantic Web
Introduction to the Semantic Web Asunción Gómez-Pérez {asun}@fi.upm.es http://www.oeg-upm.net Omtological Engineering Group Laboratorio de Inteligencia Artificial Facultad de Informática Universidad Politécnica
Towards the Integration of a Research Group Website into the Web of Data
Towards the Integration of a Research Group Website into the Web of Data Mikel Emaldi, David Buján, and Diego López-de-Ipiña Deusto Institute of Technology - DeustoTech, University of Deusto Avda. Universidades
Drupal. http://www.flickr.com/photos/funkyah/2400889778
Drupal 7 and RDF Stéphane Corlosquet, - Software engineer, MGH - Drupal 7 core RDF maintainer - SemWeb geek Linked Data Ventures, MIT, Oct 2010 This work is licensed under a Creative
Publishing Linked Data Requires More than Just Using a Tool
Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,
Developing Web 3.0. Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/
Developing Web 3.0 Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/ Henry Story http://blogs.sun.com/bblfish 2007 JavaOne SM Conference Session
Publishing Relational Databases as Linked Data
Publishing Relational Databases as Linked Data Oktie Hassanzadeh University of Toronto March 2011 CS 443: Database Management Systems - Winter 2011 Outline 2 Part 1: How to Publish Linked Data on the Web
Joint Steering Committee for Development of RDA
Page 1 of 11 To: From: Subject: Joint Steering Committee for Development of RDA Gordon Dunsire, Chair, JSC Technical Working Group RDA models for authority data Abstract This paper discusses the models
Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo
DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF
IAAA Grupo de Sistemas de Información Avanzados
Upgrading maps with Linked Data Lopez Pellicer Pellicer, Francisco J Lacasta, Javier Rentería, Walter, Universidad de Zaragoza Barrera, Jesús Lopez de Larrinzar, Juan Agudo, Jose M GeoSpatiumLab The Linked
How To Create A Federation Of A Federation In A Microsoft Microsoft System (R)
Fed4FIRE / Open-Multinet Resource Description Playground Alexander Willner Overview 2014-05-21 Overall Goal Federated Infrastructure Description and Discovery Language (FIDDLE) Context Assumptions and
Linked Statistical Data Analysis
Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,
DISCOVERING RESUME INFORMATION USING LINKED DATA
DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India [email protected] 2 Department of Computer
Evaluation experiment for the editor of the WebODE ontology workbench
Evaluation experiment for the editor of the WebODE ontology workbench Óscar Corcho, Mariano Fernández-López, Asunción Gómez-Pérez Facultad de Informática. Universidad Politécnica de Madrid Campus de Montegancedo,
Programming the Semantic Web with Java. Taylor Cowan Travelocity 8982
Programming the Semantic Web with Java Taylor Cowan Travelocity 8982 AGENDA 2 > Semant ic Web Introduct ion > RDF basics > Coding Towards Jena s Semantic Web Framework API > Java to Model Binding with
EAC-CPF Ontology and Linked Archival Data
EAC-CPF Ontology and Linked Archival Data Silvia Mazzini 1, Francesca Ricci 2 1 Regesta.exe (Rome, Italy) [email protected] 2 Istituto per i beni artistici culturali e naturali della Regione Emilia-Romagna
An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials
ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity
City Data Pipeline. A System for Making Open Data Useful for Cities. [email protected]
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
Integration of Polish National Bibliography within the repository platform for science and humanities
Marcin Roszkowski Integration of Polish National Bibliography within the repository platform for science and humanities The best thing to do to your data will be thought of by somebody else W3C LLD Agenda
Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible
Taming Big Data Variety with Semantic Graph Databases Evren Sirin CTO Complexible About Complexible Semantic Tech leader since 2006 (née Clark & Parsia) software, consulting W3C leadership Offices in DC
María Elena Alvarado gnoss.com* [email protected] Susana López-Sola gnoss.com* [email protected]
Linked Data based applications for Learning Analytics Research: faceted searches, enriched contexts, graph browsing and dynamic graphic visualisation of data Ricardo Alonso Maturana gnoss.com *Piqueras
- a Humanities Asset Management System. Georg Vogeler & Martina Semlak
- a Humanities Asset Management System Georg Vogeler & Martina Semlak Infrastructure to store and publish digital data from the humanities (e.g. digital scholarly editions): Technically: FEDORA repository
Visual Analysis of Statistical Data on Maps using Linked Open Data
Visual Analysis of Statistical Data on Maps using Linked Open Data Petar Ristoski and Heiko Paulheim University of Mannheim, Germany Research Group Data and Web Science {petar.ristoski,heiko}@informatik.uni-mannheim.de
Linked Open Data Infrastructure for Public Sector Information: Example from Serbia
Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.
DISIT Lab, competence and project idea on bigdata. reasoning
DISIT Lab, competence and project idea on bigdata knowledge modeling, OD/LD and reasoning Paolo Nesi Dipartimento di Ingegneria dell Informazione, DINFO Università degli Studi di Firenze Via S. Marta 3,
GeoLinked Data. An application case/ Un caso de aplicación. Vilches Blázquez, Luis Manuel; Villazón-Terrazas, Boris; Corcho, O.; Gómez Pérez, Asunción
GeoLinked Data An application case/ Un caso de aplicación Vilches Blázquez, Luis Manuel; Villazón-Terrazas, Boris; Corcho, O.; Gómez Pérez, Asunción Resumen La Web de los Datos enlazados, del inglés Web
The Ontology and Architecture for an Academic Social Network
www.ijcsi.org 22 The Ontology and Architecture for an Academic Social Network Moharram Challenger Computer Engineering Department, Islamic Azad University Shabestar Branch, Shabestar, East Azerbaijan,
GetLOD - Linked Open Data and Spatial Data Infrastructures
GetLOD - Linked Open Data and Spatial Data Infrastructures W3C Linked Open Data LOD2014 Roma, 20-21 February 2014 Stefano Pezzi, Massimo Zotti, Giovanni Ciardi, Massimo Fustini Agenda Context Geoportal
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Nelson Piedra, Jorge López, Janneth Chicaiza, Universidad Técnica Particular de Loja, Ecuador [email protected],
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data
Submitted on: 6/20/2014 Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data Tadahiko Oshiba Library Support Division, Kansai-kan of the National Diet Library, Kyoto,
New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability
New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability Liana Razmerita 1, Martynas Jusevičius 2, Rokas Firantas 2 Copenhagen Business School, Denmark
Industry 4.0 and Big Data
Industry 4.0 and Big Data Marek Obitko, [email protected] Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and
Open Data Integration Using SPARQL and SPIN
Open Data Integration Using SPARQL and SPIN A Case Study for the Tourism Domain Antonino Lo Bue, Alberto Machi ICAR-CNR Sezione di Palermo, Italy Research funded by Italian PON SmartCities Dicet-InMoto-Orchestra
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister
Connecting the Smithsonian American Art Museum to the Linked Data Cloud
Connecting the Smithsonian American Art Museum to the Linked Data Cloud Pedro Szekely, Craig A. Knoblock, Fengyu Yang, Xuming Zhu, Eleanor E. Fink, Rachel Allen, and Georgina Goodlander, Los Angeles, California,
Dendro: collaborative research data management built on linked open data
Dendro: collaborative research data management built on linked open data João Rocha da Silva João Aguiar Castro Faculdade de Engenharia da Universidade do Porto/INESC TEC, Portugal, {joaorosilva,joaoaguiarcastro}@gmail.com
Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy)
Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy) Multilingual Word Sense Disambiguation and Entity Linking on the Web based on BabelNet Roberto Navigli, Tiziano
CASRAI, eurocris, Lattes, and VIVO: Four Perspectives on Research Information Standards
CASRAI, eurocris, Lattes, and VIVO: Four Perspectives on Research Information Standards David Baker, Keith Jeffery, José Salm, and Jon Corson-Rikert Laure Haak, Moderator August 24, 2012 1 Format A round
UNIMARC, RDA and the Semantic Web
Date submitted: 04/06/2009 UNIMARC, and the Semantic Web Gordon Dunsire Depute Director, Centre for Digital Library Research University of Strathclyde Glasgow, Scotland Meeting: 135. UNIMARC WORLD LIBRARY
Open Data collection using mobile phones based on CKAN platform
Proceedings of the Federated Conference on Computer Science and Information Systems pp. 1191 1196 DOI: 10.15439/2015F128 ACSIS, Vol. 5 Open Data collection using mobile phones based on CKAN platform Katarzyna
Network Graph Databases, RDF, SPARQL, and SNA
Network Graph Databases, RDF, SPARQL, and SNA NoCOUG Summer Conference August 16 2012 at Chevron in San Ramon, CA David Abercrombie Data Analytics Engineer, Tapjoy [email protected] About me
We have big data, but we need big knowledge
We have big data, but we need big knowledge Weaving surveys into the semantic web ASC Big Data Conference September 26 th 2014 So much knowledge, so little time 1 3 takeaways What are linked data and the
ELIS Multimedia Lab. Linked Open Data. Sam Coppens MMLab IBBT - UGent
Linked Open Data Sam Coppens MMLab IBBT - UGent Overview: Linked Open Data: Principles Interlinking Data LOD Server Tools Linked Open Data: Principles Term Linked Data was first coined by Tim Berners Lee
13 RDFS and SPARQL. Internet Technology. MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication.
MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication Davide Eynard nternet Technology 13 RDFS and SPARQL 2 RDF - Summary Main characteristics of RDF: Abstract syntax based
RDF y SPARQL: Dos componentes básicos para la Web de datos
RDF y SPARQL: Dos componentes básicos para la Web de datos Marcelo Arenas PUC Chile & University of Oxford M. Arenas RDF y SPARQL: Dos componentes básicos para la Web de datos Valladolid 2013 1 / 61 Semantic
Multilingual and Localization Support for Ontologies
Multilingual and Localization Support for Ontologies Mauricio Espinoza, Asunción Gómez-Pérez and Elena Montiel-Ponsoda UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain {jespinoza,
Proceedings of the SPDECE-2012. Ninth nultidisciplinary symposium on the design and evaluation of digital content for education
Proceedings of the SPDECE-2012. Ninth nultidisciplinary symposium on the design and evaluation of digital content for education 13 15 June 2011 Universidad de Alicante Alicante, Spain Edited by Manuel
DATA MANAGEMENT PLAN DELIVERABLE NUMBER RESPONSIBLE AUTHOR. Co- funded by the Horizon 2020 Framework Programme of the European Union
DATA MANAGEMENT PLAN Co- funded by the Horizon 2020 Framework Programme of the European Union DELIVERABLE NUMBER DELIVERABLE TITLE D7.4 Data Management Plan RESPONSIBLE AUTHOR DFKI GRANT AGREEMENT N. PROJECT
Mining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de
A generic approach for data integration using RDF, OWL and XML
A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6
CitationBase: A social tagging management portal for references
CitationBase: A social tagging management portal for references Martin Hofmann Department of Computer Science, University of Innsbruck, Austria [email protected] Ying Ding School of Library and Information Science,
Infrastructures, Pla/orms and Services for the Mul8lingual Digital Single Market
Panel Discussion 2 Infrastructures, Pla/orms and Services for the Mul8lingual Digital Single Market Par8cipants: Stelios Piperidis (Ins8tute for Language and Speech Processing, Greece) Khalid Choukri (ELRA/ELDA,
LDIF - Linked Data Integration Framework
LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],
The Manuscript as Cultural Heritage: Digitisation ++
The Manuscript as Cultural Heritage: Digitisation ++ A Digital Humanities Point of View Not so much a conservation point of view slides on http://www.slideshare.net/gradmans Prof. Dr. Stefan Gradmann Humboldt-Universität
