CultureSampo A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0 CIDOC CRM Meeting, Jan 28, 2010, Helsinki



Similar documents
CultureSampo Finnish Culture on the Semantic Web: The Vision and First Results

Ontology-Based Query Expansion Widget for Information Retrieval

Finnish Fiction Literature as Linked Open Data - the BookSampo Dataset

Publishing and Using Ontologies as Mash-Up Services

Building a National Ontology Infrastructure

Date submitted: 1 July 2012 Eetu Mäkelä (1) Kaisa Hypén (2) and Eero Hyvönen (1)

Linked Open Ontology Cloud Managing a System of Interlinked Cross-domain Light-weight Ontologies

Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network

Reason-able View of Linked Data for Cultural Heritage

ONKI-SKOS Publishing and Utilizing Thesauri in the Semantic Web

Spatio-temporal Semantic Modeling of Historical Content

Combining Context Navigation with Semantic Autocompletion to Solve Problems in Concept Selection

HealthFinland a National Semantic Publishing Network and Portal for Health Information

Towards the Russian Linked Culture Cloud: Data Enrichment and Publishing

dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data

Designing and Creating a Web Site Based on RDF Content

Annotea and Semantic Web Supported Collaboration

Elements of a National Semantic Web Infrastructure - Case Study Finland on the Semantic Web

Data-Gov Wiki: Towards Linked Government Data

Enabling the Semantic Web with Ready-to-Use Web Widgets

Deep Image Annotation: Making a Difference in Knowledge Organization

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

STAR Semantic Technologies for Archaeological Resources.

Enabling the Semantic Web with Ready-to-Use Web Widgets

Efficient Content Creation on the Semantic Web Using Metadata Schemas with Domain Ontology Services (System Description)

BirdWatch Supporting Citizen Scientists for Better Linked Data Quality for Biodiversity Management

Developing geospatial ontologies: SUO and SAPO ontologies

AAC Road Map. Introduction

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

View-Based User Interfaces for Information Retrieval on the Semantic Web

Ms Satu Savia, project manager of Museum 2015, The Finnish Museums Association.

María Elena Alvarado gnoss.com* Susana López-Sola gnoss.com*

Cross-Sectional Integration of LAM Resources on the Basis of Authority Data

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Evangelia Mitsopoulou, St George s University of London Panagiotis Bamidis, Aristotle University of Thessaloniki Daniela Giordano, University of

online University of Helsinki shaping Session:

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

Links, languages and semantics: linked data approaches in The European Library and Europeana.

The European Digital Library - An Introduction

CitationBase: A social tagging management portal for references

A Tool for Creating Semantic Web Portals

GetLOD - Linked Open Data and Spatial Data Infrastructures

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Cataloguing is riding the waves of change Renate Beilharz Teacher Library and Information Studies Box Hill Institute

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

Publishing Linked Data Requires More than Just Using a Tool

Short Paper: Enabling Lightweight Semantic Sensor Networks on Android Devices

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

Open Data Integration Using SPARQL and SPIN

Publishing Europe s Television Heritage on the Web.

Standards for Big Data in the Cloud

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

Visual Analysis of Statistical Data on Maps using Linked Open Data

Mapping VRA Core 4.0 to the CIDOC/CRM ontology

Linked Statistical Data Analysis

ON DEMAND ACCESS TO BIG DATA. Peter Haase fluid Operations AG

Knowledge as a Service for Agriculture Domain

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG

Geospatio-temporal Semantic Web for Cultural Heritage

Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology

Europeana Core Service Platform

Pragmatic Web 4.0. Towards an active and interactive Semantic Media Web. Fachtagung Semantische Technologien September 2013 HU Berlin

Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries

Innovations for researchers in cultural and scientific heritage Milagros del Corral

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

Mining the Web of Linked Data with RapidMiner

How To Write A Drupal Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post

An adaptable domain specific dissemination infrastructure for enhancing the visibility of complementary and thematically related research information

Linked Open Data A Way to Extract Knowledge from Global Datastores

Pattern based mapping and extraction via CIDOC CRM

How to port a collection to the Semantic Web. presenter: Borys Omelayenko contributors: A. Tordai, G. Schreiber

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

LDIF - Linked Data Integration Framework

Semantic Interoperability

DISCOVERING RESUME INFORMATION USING LINKED DATA

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Recipes for Semantic Web Dog Food. The ESWC and ISWC Metadata Projects

Achille Felicetti" VAST-LAB, PIN S.c.R.L., Università degli Studi di Firenze!

Acronym: Data without Boundaries. Deliverable D12.1 (Database supporting the full metadata model)

BauDenkMalNetz Creating a Semantically Annotated Web Resource of Historical Buildings

Ontology-based Modeling and Visualization of Cultural Spatio-temporal Knowledge

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA

Strategic, Methodological and Technical Solutions for the Creation of Seamless Cultural Heritage Content and Access: Lithuanian Approach

[JOINT WHITE PAPER] Ontos Semantic Factory

Types and Annotations for CIDOC CRM Properties

DataBridges: data integration for digital cities

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

ChemCloud - Chemical e-science Information Cloud. Adrian Paschke, Freie Universitaet Berlin Stephan Heineke, FIZ CHEMIE

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

Towards the Integration of a Research Group Website into the Web of Data

Experiences from a Large Scale Ontology-Based Application Development

How To Create A Web Of Knowledge From Data And Content In A Web Browser (Web)

Collaborative Development of Knowledge Bases in Distributed Requirements Elicitation

How semantic technology can help you do more with production data. Doing more with production data

New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability

A Collaborative Approach to Building Personal Knowledge Networks or How to Build a Knowledge Advantage Machine?

A Semantic web approach for e-learning platforms

DEVELOPING A VISUAL DIGITAL IMAGE COLLECTION. Calgary Collections 2005: The Changing Collections Environment CLA Pre-Conference June 15, 2005

Semantic Technologies for Archaeology Resources: Results from the STAR Project

Transcription:

CultureSampo A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0 CIDOC CRM Meeting, Jan 28, 2010, Helsinki Prof. Eero Hyvönen Aalto University, School of Science and Technology and University of Helsinki Semantic Computing Research Group (SeCo) http://www.seco.tkk.fi/ 1

Background Projects 2003- National Ontology Project in Finland (FinnONTO) 2003-2007, 2008-2009, 2010-2011 (?) Funded by Tekes and 38 companies and organizations Finnish Cultural Foundation 2008-2010 SmartMuseum EU FP7 2008-2010 Semantic ubicom services 2010-2011 Joint work of some 30 researchers 2

Antikvaria-ryhmä 3

History MuseumFinland Finnish Museums on the Semantic Web Public service since 2004 http://www.museosuomi.fi CultureSampo I 2005 Research proto only CultureSampo 2 Research proto only 2006-2007 CultureSampo Public service since 2008 http://www.kulttuurisampo.fi 4

Contents Vision: Semantic Web 2.0 of Cultural Heritage Challenges: Content Complexity & Production Solution: Semantic Web + Web 2.0 Realization: CultureSampo -- Finnish Culture on the Semantic Web 2.0 5

The Vision: Semantic Web 2.0 of Cultural Heritage 6

Wouldn titbeniceif cultural organizations could publish easily their contents together on the web just by pushing a button, the contents would be automatically linked with other publishers contents and get semantically enriched, researchers and citizens could contribute with their own contents and knowledge, contents could be accessed easily from different thematic perspectives and contexts, intelligent search and browsing systems could find answers to questions in addition to data records, the aggregated contents and services could be reused in external applications easily, language barriers could be overcome? 7

Yes! This is the vision of CultureSampo 8

Challenges: Content Complexity & Production 9

Problem 1: Cultural Content Compexity - Heterogenous and Interlinked Artifacts Maps Encyclopedia Videos Buildings Narratives Literature Music Cultural sites Biographies Fine arts 10

Problem 2: Cultural Content Production System - Distributed and Independent Land survey Museums Web 2.0 sites Archieves Media Linked Data Citizens Libraries 11

Solution: the Semantic Web 2.0 12

CultureSampo in a Nutshell Land survey Museums Content Providers Archieves Semantic Metadata FinnONTO Ontology Infrastructure Web 2.0 sites Media Linked Data Citizens Libraries

Sampo -- The Magic Finnish Machine of Wisdom (Kalevala epic) National Ontology Infrastucture

Metadata + Ontologies = Web of Data of Cultural Heritage = A Collective Memory of Finnish Cultural Heritage 15

Biografiakeskus ja kirjastot keräävät henkilöhistoriaa henkilö nimi ammatti syntymapaikka... H1 Akseli Gallen-Kallela taiteilija Lemu H2 Gustaf Mannerheim marsalkka Askainen... nimi Akseli Gallen-Kallela ihminen tyyppi H1 ammatti s-paikka taiteiija Lemu Biography Center tyyppi nimi Gustaf Mannerheim H2 ammatti marsalkka s-paikka Askainen 16

Museo luetteloi maalauksia teos nimi tekijä aika aihe... T1 Mannerheimin muotokuva Akseli Gallen-Kallela 1929 Gustaf Mannerheim T2 Aino-triptyykki Akseli Gallen-Kallela 1891 Aino, Kalevala... nimi Akseli Gallen-Kallela tekijä T1 tyyppi maalaus... Art Museum Collection aika 1929 aihe nimi Gustaf Mannerheim 17

Maanmittauslaitos ylläpitää paikkarekistereitä kunta Askainen Helsinki Lemu Turku... lääni Varsinais-Suomen lääni Uudenmaan lääni Varsinais-Suomen lääni Varsinais-Suomen lääni kunta Lemu tyyppi tyyppi lääni Land Survey part-of tyyppi part-of... part-of Varsinais-Suomen lääni Suomi Askainen Turku part-of 18

FinnONTO kehittää ontologioita KOKO-ontologia pysyvä yläluokka yläluokka käsite muuttuva yläluokka abstrakti fyysinen objekti yläluokka yläluokka ajanjakso ammatti paikka yläluokka kunta ihminen taiteiija FinnONTO lääni maalaus marsalkka 19

Semanttinen RDF-verkko yhdistää kaiken: Web of Data käsitteet pysyvä yläluokka muuttuva yläluokka abstrakti yläluokka fyysinen objekti yläluokka paikka yläluokka yläluokka nimi Akseli Gallen-Kallela ajanjakso tyyppi ammatti kunta CultureSampo ihminen tyyppi tyyppi H1 ammatti taiteiija s-paikka Lemu tekijä tyyppi maalaus T1 tyyppi lääni yläluokka tyyppi... aihe aika 1929 part-of Varsinais-Suomen lääni part-of Suomi H2 nimi Gustaf Mannerheim ammatti marsalkka part-of part-of s-paikka Askainen Turku 20

Realization: CultureSampo -- Finnish Culture on the Semantic Web 2.0 21

CultureSampo = Three Components Portal http://www.kulttuurisampo.fi/ - Humans: user-interface - Machines: AJAX widgets, Web Services CultureSampo System Content Production System - Model - Standards - National Ontology Service ONKI - Annotation and other tools Content Infrastructure - W3C etc. standards - FinnONTO Infrastructure - Metadata schemas - Ontology system 22

Portal http://www.kulttuurisampo.fi/ - Humans: user-interface - Machines: AJAX widgets, Web Services CultureSampo System Content Production System - Model - Standards - National Ontology Service ONKI - Annotation and other tools Content Infrastructure - W3C etc. standards - FinnONTO Infrastructure - Metadata schemas - Ontology system 23

CultureSampo Component 1/3 Infrastructure 24

KOKO: Finnish Collaborative Holistic Ontology : Ontology developer groups view KOKO Ontology YSO... AFO VALO MAO TAO KOKO... 25

End-users view KOKO Ontology YSO AFO MAO TAO VALO KOKO 26

Finnish Ontology Library Service ONKI: http://www.yso.fi 75 vocabularies, taxonomies, and ontologies online as services 27

CultureSampo Component 2/3 Content Creation Process 28

CultureSampo Content Providers (28+) : museums, libraries, archieves, researcher organizations, media companies + citizens Finnish Content Providers 1 Agricola Suomen historiaverkko 2 Espoon kaupunginmuseo 3 Helsingin kaupunginkirjasto 4 Hiihtomuseo 5 Jyväskylän yliopisto, musiikin laitos 6 Kansallisbiografia 7 Kansallismuseo 8 Kuopion kulttuurihistoriallinen museo 9 Laatokan-Karjalan museo 10 Lahden kaupunginmuseo 11 Museovirasto 12 Pohjois-Karjalan museo 13 Radio- ja TV-museo 14 Seurasaaren ulkomuseo 15 Suomalaisen Kirjallisuuden Seura SKS 16 Suomen maatalousmuseo Sarka 17 Suomen merimuseo 18 Taideteollisen korkeakoulun kirjasto 19 Valtion taidemuseo 20 Veljekset Karhumäki Oy 21 Viipurin historiallinen museo 22 Yleisradio Oy International Content Providers 1 Geonames 2 Google (Maps) 3 Iconclass (vocab.) 4 Panoramio 5 Paul J. Getty Foundation (vocab.) 6 Wikipedia Thanks for cooperation! 29

Metadata Schemas Metadata schema Content type 1 artifact artifacts 2 art paintings, sculpture, drawings, abstract art 3 literature novels, short stories, comics 4 WWW page WWW pages 5 poetry 3 subtypes of poetry 6 fictive object places and persons in Kalevala 7 folk music 5 subtypes of folk music 8 photograph photographs 9 aerial photograph aerial photos 10 actors persons, organizations 11 biography biographies 12 historical event historical events 13 skill cultural process descriptions 14 video documented processes 15 built objects buildings etc. in nature 16 archeological sites archeological sites 30

Content Integration Dublin Core like metadata schemas Sharing elements Element subproperty-of hierarchies Dump-down priciple Harmozined element values Taken from large shared domain ontologies» Derived from thesauri in use Actions, Objects, Actors, Places,... Event-based subject descriptions Historical events (history ontology HISTO) Events in stories Events in videos Narrative event chains and decompositions 31

Changing Metadata into Events (CS 2007) Idea: Represent heterogeneous metadata in terms of homogenous real world semantics ->Interoperability ->Simple basis for semantic search and browsing 32

RDF Knowledge Base (March 17, 2009) Metadata 134,000 cultural collection items (artifacts, books, videos etc.) 285,000 other resources (places, persons etc.) 204 property types in metadata Ontologies KOKO ontologies (ca. 37,000 concepts) Additional international vocabularies» AAT, ULAN, Iconclass 253 property types in ontologies Size 11,4 million triples» 2,7 million triplets» 8,7 million additional reasoned triplets 33

Next CultureSampo RDF Repository Version 2010 Vocabularies & Ontologies International vocabularies/ontologies National vocabularies/ontologies WordNet SUO Finnish Place Ontology Geonames.org SAPO Finnish Spatio-temporal Ontology GNS Geonet Names Server KOKO (General Finnish Ontology) OpenStreetMap MAO Cultural Heritage Ontology TGN Thesaurus of Georaphic Names (Getty) AFO Agri-forestry Ontology AAT Art and Archtecture Thesaurus (Getty) VALO Photography Ontology ULAN Universal List of Artits Names (Getty) TAO Applied Art Ontology AVIO Bird Ontology of the World MAMO Mammal Ontology of the World Lingvoj Languages of the World Iconclass 34

New datasets BDPedia in different languages (341 million triples) (Wikipedia) Museum exhibitions and events (Museum Association) Cultural exhibitions and events related to history (Agricola network) Finnish history ontology HISTO (FinnONTO) New collections will be included in CultureSampo Entity Amount Literals 47 000 000 Language codes 238 URI resources 84 000 000 * as subject 24 700 000 * with a label 13 500 000 * as object 66 500 000 * with coordinates 19 500 000 SameAs triples 10 000 000 Triples in deductive closure 29 000 000 000 35

CultureSampo Contents: Web 2.0 Interfaces 1 Wikipedia articles On the Google Maps Geonames.org service Write an article -> it will be seen in CultureSampo Selected articles annotated (automatically) by ourselves 36

CultureSampo Contents: Web 2.0 Interfaces 2 Panoramio photo service On the Google Maps www.panoramio.com Take a photo -> it will be seen in CultureSampo 37

CultureSampo Contents: Web 2.0 Interfaces 3 Annotation channel for content items 38

CultureSampo Component 3/3 The Semantic Web 2.0 Portal 39

Portal Users Humans Google-like but semantic search Nine thematic perspectives into cultural heritage Machines! Semantic AJAX widgets 40

Limitations of Non-semantic Web NBA-H26069-467 :object cup and plate ; :material porcelain ; :creationplace Germany ; :creator Meissen. This metadata cannot answer the following questions: Find all vessels? Find all ceramic products? Find artifacts manufactured in Europe? Does the city of Meissen manufacture ceramics? 41

Semantic Web Solution 1: Ontology-based Semantic Search NBA-H26069-467 :object cup and plate ; :object_concept object:cup ; :object_concept object:plate ; :material porcelain ; :material_concept object:porcelain ; :creationplace Germany ; :creationplace_concept place:germany ; :creator Meissen :creator_concept actor:meissen. Find all vessels? Find all ceramic products? Find artifacts manufactured in Europe? Does the city of Meissen manufacture ceramics? NBA-H26069-467 place:germany creationlocation_concept object_concept object_concept material_concept... material:porcelain place ontology loc:partof object:cup place:europe place:meissen rdfs:subclassof object:plate... object ontology... object:vessel rdfs:subclassof actor ontology material ontology actor:meissen 42

Semantic Widget: Using CultureSampo like Google Maps in Mash-up Applications (e.g. SPARQL end-point) 43

Semantic Keyword Search with Autocompetion & Result Categorization 44

Nine Thematic Perspectives into Cultural Heritage Nine perspectives Three languages Lately viewed items Lately commented items

1. Map Browse and Search 1/4: Tab Search for Items on the Map 46

1. Map Browse and Search 2/4: Tab Historical Areas Historical Helsinki 1640-1945 Semantic recommendations 47

1. Map Browse and Search 3/4: Tab Historical Maps on Google Maps Wikipedia articles Panoramio photos Links to historical places 48

1. Map Browse and Search 4/4: Tab Find Near-by Sites of Interest 49

2. Relational Search: How is A. Gallen-Kallela related to Napolean I? 50

3. Search and Organize New Idea: Domain-centric Faceted Search for Dealing with Multiple Metadata Schemas 51

4. Collections Organizational Perspective 52

5. Finnish History Co-operation with History Researchers Timeline of historical Events External web page embedded in CultureSampo 53

6. Skill Documentations A Semantic Video Viewer Semantic process description Semantic recommendations Dynamic information about the video scene 54

7. Biography Perspective National Biography on the Semantic Web Semantically annotated biographies Links to related works, persons, places etc. 55

8. Semantic Kalevala: The computer understands the national epic Kalevala Translation into modern Finnish Semantically annotated 50 poems of the national epic - Events and narratives Links to related art etc. 56

Visualizing the Narrative Events of Kalevala 57

9. Karelia Perspective A Regional and Political View Semantically annotated Wikipedia pages Semantic recommendation links 58

Conclusions: Seven Points that Make CultureSampo Unique 1. Highly cross-domain: 26 content types and 16 metadata schemas 2. Sophisticated semantic annotation models (including events and processes) 3. New kind of semantic search and recommendation techniques 4. Versatile selection of semantic visualizations (map views, timelines, graphs, process visualization, semantic video viewing) 5. Based on a large nation wide collaboratively maintained infrastructure of ontologies and ontology services 6. Includes models of and tools for collaborative semantic content creation 7. Services are available for machines, too. 59

CultureSampo and CIDOC CRM Focus has been on domain ontologies and value/vocabulary integration Not central for CIDOC CRM In earlier CS versions (2005-2007) an event-based harmonization and integration approach was experimented (Ruotsalo, Hyvönen, ISWC-2007) E.g. DC painting metadata -> Painting event Focus on describing the real world, not other metadata schemas DC application approach is used in the public CS DC like metadata schemas mapped onto each other» Original DC is needed anyway for the end-user Values are harmonized by shared domain ontologies Also CIDOC CRM content is used in this way 60

Events are used in subject descriptions for describing the real world (semantic roles, narratives) We believe in events! Provenance events are not considered in CS Our search and recommendation engines could be applied to CIDOC CRM content in RDF easily (?) Are there open datasets with harmonized/aligned values available? 61

Publications & demos & more information: http://www.seco.tkk.fi/applications/kulttuurisampo 62

Questions? Demos, Publications, Services, Content, Projects: http://www.seco.tkk.fi/ Try CultureSampo yourself: http://www.kulttuurisampo.fi/ http://www.seco.tkk.fi 63