Connecting the Smithsonian American Art Museum to the Linked Data Cloud



Similar documents
AAC Road Map. Introduction

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

The Rijksmuseum Collection as Linked Data

Smithsonian American Art Museum

Europeana and schema.org

Towards the Russian Linked Culture Cloud: Data Enrichment and Publishing

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

Open Data Integration Using SPARQL and SPIN

STAR Semantic Technologies for Archaeological Resources.

Data Publishing with DaPaaS

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

Definition of the Europeana Data Model v5.2.6

PlanetData Showcases: Linked/Open/Big Data in Smart Cities Data Integration

DISIT Lab, competence and project idea on bigdata. reasoning

Best practices for Linked Data

DBpedia German: Extensions and Applications

STAR Semantic Technologies for Archaeological Resources.

Visual Analysis of Statistical Data on Maps using Linked Open Data

A Case Study of Question Answering in Automatic Tourism Service Packaging

LOVER: Support for Modeling Data Using Linked Open Vocabularies

The WissKI Project A scholarly communication infrastructure

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Towards the Integration of a Research Group Website into the Web of Data

SAP CRM RAPID DEPLOYMENT SOLUTION. Package Overview

ON DEMAND ACCESS TO BIG DATA. Peter Haase fluid Operations AG

Exposing Open Street Map in the Linked Data cloud

Data Integration and Fusion using RDF

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

María Elena Alvarado gnoss.com* Susana López-Sola gnoss.com*

Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM)

Semantic Interoperability

Modelling «Base Bibliotek» as Linked Data

DC Proposal: Automation of Service Lifecycle on the Cloud by Using Semantic Technologies

CitationBase: A social tagging management portal for references

Publishing Linked Data Requires More than Just Using a Tool

Automating Cloud Service Level Agreements using Semantic Technologies

Mining the Web of Linked Data with RapidMiner

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

Federated Data Management and Query Optimization for Linked Open Data

We have big data, but we need big knowledge

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS

Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy)

Short Paper: Enabling Lightweight Semantic Sensor Networks on Android Devices

QASM: a Q&A Social Media System Based on Social Semantics

COLINDA: Modeling, Representing and Using Scientific Events in the Web of Data

Sieve: Linked Data Quality Assessment and Fusion

excellent graph matching capabilities with global graph analytic operations, via an interface that researchers can use to plug in their own

Data Validation with OWL Integrity Constraints

Introduction to SKOS. Bob DuCharme October 6, 2011

Data Quality in Information Integration and Business Intelligence

How To Make Sense Of Data With Altilia

Transcription:

Connecting the Smithsonian American Art Museum to the Linked Data Cloud Pedro Szekely, Craig A. Knoblock, Fengyu Yang, Xuming Zhu, Eleanor E. Fink, Rachel Allen, and Georgina Goodlander, Los Angeles, California, USA Nanchang Hangkong University, Nanchang, China Smithsonian American Art Museum, Washington, DC, USA

The The Smithsonian Smithsonian American American Art Art Museum Museum is is a a museum museum in in Washington, Washington, D.C. D.C. which which has has one one of of the the world's world's largest largest and and most most inclusive inclusive collections collections of of art, art, from from the the colonial colonial period period to to the the present, present, made made in in the the United United States. States. Wikipedia Wikipedia

Big Picture

Problem SAAM Data What ontology to use? Data consistency Structure mismatches What to link to? 100% precision How to enable museums to do this themselves?

Steps to Create Linked Data Map data to RDF select ontologies define mappings Link to external resources identify the links Curate the Linked Data museums demand 100% correctness

select ontologies

61 pages 35 pages 37 pages 138 pages

138 pages Complicated 37 pages 35 pages Many irrelevant classes and properties 61 pages

ore:aggregation edm:europeanaaggregation crm:e89_propositional_object edm:webresource edm:hasview edm:aggregatedcho edm:providedcho, E22_Man_Made_Object aac:culturalheritageobject dcterms:creator edm:agent/crm:e39_actor, foaf:person aac:person aac:associatedplace rdagr2:placeofbirth rdagr2:placeofdeath edm:place/crm:e53_place aac:place schema:address schema:postaladdress

skos:preflabel skos:concept skos:narrower dcterms:title dcterms:description dcterms:medium saam:objectid skos:preflabel saam:objectnumber saam:constituentid skos:preflabel skos:altlabel skos:concept edm:hastype rdagr2:biographicalinformation ore:aggregation edm:europeanaaggregation edm:aggregatedcho edm:providedcho, E22_Man_Made_Object aac:culturalheritageobject dcterms:creator edm:hasview edm:agent/crm:e39_actor, foaf:person aac:person rdagr2:placeofbirth aac:associatedplace edm:place/crm:e53_place aac:place dcterms:subject crm:e89_propositional_object edm:webresource dcterms:created dcterms:rights dcterms:format dcterms:date dcterms:provenance rdagr2:dateofbirth rdagr2:dateofdeath rdagr2:dateassociated WithThePerson rdagr2:placeofdeath schema:name schema:country skos:preflabel schema:addresscountry schema:addressregion schema:address schema:postaladdress schema:addresslocality

mapping the data to the ontologies how to enable museums to do this themselves?

Karma Interactive tool for rapidly extracting, cleaning, transforming, integrating, and publishing data Tabular Sources Karma Hierarchical Sources Services Model Database [ Knoblock, Szekely, et al. Semi automatically mapping structured sources into the semantic web. ISWC 2012 ]

specifying transformations and mapping to properties with Karma

aac ont:person rdf:type saam:person/15 aac ont:marriedname Alice Stanley Archeson rdf:type saam:person/2 aac ont:variantname George M. Aarons

http://isi.edu/integration/karma

mapping to object properties using Karma

http://isi.edu/integration/karma

Evaluation of Data Mapping Using Karma SAAM database 8 tables 29 columns Ontologies 407 classes 105 data properties 229 object properties

identifying and curating links

Multiple John Singer Sargent cb:person_john_singer_sargent a aac-ont:person ; ont0:dateofbirth "1879", "1885" ; ont0:dateofdeath "1925" ; foaf:name "John Singer Sargent". ima:person_john_singer_sargent a aac-ont:person ; dct:date "1856-1925" ; foaf:name "John Singer Sargent". dallas:person_john_singer_sargent a aac-ont:person ; ont0:dateofbirth "1856" ; ont0:dateofdeath "1925" ; foaf:name "John Singer Sargent". saam:person_4253 a aac-ont:person ; met:person_john_singer_sargent aac-ont:associatedplace a aac-ont:person ; saam:saamplace_1357324439768t1r13950_0, ont0:placeofresidence saam:saamplace_1357324439768t1r13951_0 ; "North and Central America", saam:constituentid "4253" ; "United States" ; rdagr2:biographicalinformation foaf:name "John Singer Sargent". Painter. Sargent traveled " ; rdagr2:dateassociatedwiththeperson "1990-10-1, "1995-5-8" ; rdagr2:dateofbirth "1856-1-12" ; rdagr2:dateofdeath "1925-4-15" ; rdagr2:placeofbirth saam:saamplace_1357324439768t1r13952_0 ; rdagr2:placeofdeath saam:saamplace_1357324439768t1r13953_0 ; foaf:name "John S. Sargent" ; skos:altlabel "John S. Sargent" ; skos:preflabel "John Singer Sargent".

John Singer Sargent cb:saamperson_john_singer_sargent a saam:saamperson ; ont0:dateofbirth "1879", "1885" ; ont0:dateofdeath "1925" ; skos:preflabel "John Singer Sargent". ima:saamperson_john_singer_sargent a saam:saamperson ; dct:date "1856-1925" ; foaf:name "John Singer Sargent". dallas:saamperson_john_singer_sargent a saam:saamperson ; ont0:dateofbirth "1856" ; ont0:dateofdeath "1925" ; foaf:name "John Singer Sargent". saam:saamperson_4253 a saam:saamperson ; met:saamperson_john_singer_sargent saam:associatedplace a saam:saamperson ; saam:saamplace_1357324439768t1r13950_0, ont0:placeofresidence saam:saamplace_1357324439768t1r13951_0 ; "North and Central America", saam:constituentid "4253" ; "United States" ; rdagr2:biographicalinformation foaf:name "John Singer Sargent". Painter. Sargent traveled " ; rdagr2:dateassociatedwiththeperson "1990-10-1, "1995-5-8" ; rdagr2:dateofbirth "1856-1-12" ; rdagr2:dateofdeath "1925-4-15" ; rdagr2:placeofbirth saam:saamplace_1357324439768t1r13952_0 ; rdagr2:placeofdeath saam:saamplace_1357324439768t1r13953_0 ; skos:altlabel "John S. Sargent" ; skos:preflabel "John Singer Sargent".

Linking John Singer Sargent saam:person_4253 owl:sameas cb:person_john_singer_sargent ; owl:sameas dallas:person_john_singer_sargent ; owl:sameas ima:person_john_singer_sargent ; owl:sameas met:person_john_singer_sargent ; owl:sameas dbpedia:john_singer_sargent ; owl:sameas nytimes:n49129220686803623753 ; owl:sameas w-flick:john_singer_sargent ;....

Intuition Estimate discrimination power of properties, e.g., of name, birth and death dates every combination of dates birth date death date # of people 1800 1820 147 1800 1821 284 1800 1822 213 similar idea to Song, D., Heflin, J.: Domain independent entity coreference for linking ontology instances. ACM Journal of Data and Information Quality (ACM JDIQ) (2012)

Evaluation of Automatic Linking SAAM names starting with A matched by hand 535 people 176 matches

Results of Automatic Linking DBPedia 2,194 New York Times 70 estimate 30 missing links to DBpedia Getty ULAN 2,110 Rijksmuseum 551 Geonames 3,068

Curating Links with Karma

Linking with Karma

results of automated linking and interactive curation recorded using PROV owl:sameas statements constructed using SPARQL CONSTRUCT queries over PROV records

deployment

L Q R A P S c i l b u p t n i o p d en University of Southern California Pedro Szekely and Craig Knoblock

University of Southern California Pedro Szekely and Craig Knoblock

s e i r e u q L Q SPAR t n i o p d n e r to ou University of Southern California Pedro Szekely and Craig Knoblock

s e i r e u q L Q SPAR t n i o p d n e r to ou University of Southern California Pedro Szekely and Craig Knoblock

Related Work Europeana 17 million items, 1,500 institutions Require exports in Europeana format Amsterdam Museum, Museum Finland Rich ontology, RDF to RDF mapping rules LODAC museums in Japan 114 museums, simple ontology Research Space, British Museum CIDOC CRM ontologies, complex mappings We focused significantly on Linking identification and curation

Next Steps Applications leveraging linked data Virtual museum Tools to create multimedia stories about art Tools to find inconsistencies Feed data to wikidata American Art Collective: a linked data consortium of museums

Merci