Online multilingual generation of Cultural Heritage content
|
|
|
- Marylou McDonald
- 10 years ago
- Views:
Transcription
1 Online multilingual generation of Cultural Heritage content Dana Dannélls Språkbanken, Department of Swedish Language University of Gothenburg MOLTO
2 Motivation New developments in technologies (e.g. Semantic Web) provide sophisticated information access to cultural heritage material e.g. enables users to broaden/narrow search based on multiple criteria at once Emerging European project initiatives provide cross-collection, cross-museum and cross-subject access to bigger sets of data collections MultimediaN E-culture, Europeana, Cornucopia, Michael
3 Direct access to cultural heritage objects
4 The CIDOC Conceptual Reference Model (CIDOC-CRM) The CIDOC Conceptual Reference Model (CIDOC CRM), developed by the International Committee for Documentation (CIDOC) of the International Council of Museums (ICOM) (Crofts et al., 2008). ISO standard since classes and 130 relationships Available in OWL
5 The CIDOC Conceptual Reference Model (CIDOC-CRM)
6 Project goals To build an ontology-based multilingual grammar for museum information starting from the CIDOC-CRM ontology for artefacts at Gothenburg City Museum To cover 15 languages for baseline functionality and 5 languages with a more complete coverage To build a prototype of a cross-language retrieval and representation system to be tested with objects in the museum, and automatically generate Wikipedia articles for museum artefacts in 5 languages
7 A record from the Gothenburg City Museum database Field name Value Field nr Prefix GIM Object nr Search word painting Class Class 2 Gothenburg portrait Amount 1 Producer E.Glud Produced year 1984 Length cm 106 Width cm 78 Description oilpainting represents a studio indoors History Up to 1986 belonged to Datema AB, Flöjelbergsg 8, Gbg Material oil colour Current keeper 2 Location Polstjärnegatan 4 Package nr. 299 Registration date Signature BI Search field BO:BU Bilder:TAVLOR PICT:GIM
8 The Painting ontology Purpose: to support integration and interoperability of the CIDOC-CRM ontology with other ontologies and schemata, including: CIDOC-CRM SUMO: Merge and Mid-Level Ontology Swedish Open Cultural Heritage (SOCH) The painting ontology contains: 197 classes 24 stems from CRM, 15 equivalent to SOCH, 45 equivalent to SUMO concepts 107 properties 17 are subproperties of the CRM properties
9 Integration of Gothenburg City Museum data
10 Museum Reason-able View Environment 8 thousand museum artifacts from the Gothenburg city museum database. ar
11 Ontology verbalization in GF Straightforward from the ontology: isa (Object, Painting) Guernica is a painting. createdby (Painting, Creator) Guernica is created by Pablo Picasso. hascreationdate (Painting, TimeSpan) Guernica was created in 1937.
12 A case study on English and Swedish The corpus data for analysis 40 parallel texts extracted from Wikipedia under the category Painting 300 object descriptions for each language extracted from museums online databases The results of the analysis a list of syntactic structures a list of discourse patterns
13 Syntactic structures PN -> NP Van Gogh Det -> CN -> NP The portrait The countess of Carnarvon NP -> Adv -> NP The bell in London V2 -> PP -> VP displayed at the Paris Salon painted by Jamie Wyeth V2 -> Adv -> VP displayed here suggest the hand of an artist V2 -> NP -> VP displays painting of tulip bearing her signature
14 Discourse patterns DP0 : painting painter year -> Text DP1 : painting museum painter size -> Text DP2 : painting painter repesented museum -> Text DP3 : painting material year painter -> Text DP4 : painting painter year museum colour size -> Text
15 Discourse patterns generation in GF I DP0 (eng) Sommer Joy was painted by Anders Zorn. (swe) Sommarnöje blev målad av Anders Zorn. DP1 (eng) Sommer Joy was painted in It measures 349 by 776 cm. (swe) Sommarnöje blev målad år Den är av storlek 349 och 776 cm. DP3 (eng) Sommer Joy is painted on paper in 1886 by Anders Zorn. (swe) Sommarnöje blev målad på papper 1886 av Anders Zorn.
16 Discourse patterns generation in GF II DP2 (eng) Sommer Joy is a painting made by Anders Zorn. The work depicts a view from Lilla Bommen at Hisingen. (swe) Sommarnöje är en målning av Anders Zorn. Den föreställer en utsikt från Lilla Bommen mot Hisingen. DP4 (eng) Sommer Joy was painted by Anders Zorn in the year It is of size 349 by 776 cm and is painted on paper. The painting is displayed at the Museum of World Culture. (swe) Sommarnöje blev målad av Anders Zorn år Den är av storlek 349 och 776 cm och är målad på paper. Målningen återfinns på Världskulturmuseet.
17 A description of a museum object
18 Current state of work Implementing more patterns for discourse generation Building a lexicon to cover the content of all object descriptions Translate lexical entities and write grammar for Finnish, French and Germen
Natural Language Interaction with Semantic Web Knowledge Bases and LOD
Natural Language Interaction with Semantic Web Knowledge Bases and LOD Mariana Damova, Dana Dannélls, Ramona Enache, Maria Mateva, Aarne Ranta Ontotext, AD, Sofia, Bulgaria Chalmers University and GU,
Definition of the CIDOC Conceptual Reference Model
Definition of the CIDOC Conceptual Reference Model Produced by the ICOM/CIDOC Documentation Standards Group, continued by the CIDOC CRM Special Interest Group Version 4.2.4 January 2008 Editors: Nick Crofts,
Definition of the CIDOC Conceptual Reference Model
Definition of the CIDOC Conceptual Reference Model Produced by the ICOM/CIDOC Documentation Standards Group, Continued by the CIDOC CRM Special Interest Group Version 5.1(draft) November 2012 Current Main
CRM dig : A generic digital provenance model for scientific observation
CRM dig : A generic digital provenance model for scientific observation Martin Doerr, Maria Theodoridou Institute of Computer Science, FORTH-ICS, Crete, Greece Abstract The systematic large-scale production
M LTO Multilingual On-Line Translation
O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in
Mapping VRA Core 4.0 to the CIDOC/CRM ontology
1 st Workshop on Digital Information Management March 30-31, 2011 Mapping VRA Core 4.0 to the CIDOC/CRM ontology Panorea Gaitanou, Manolis Gergatsoulis Database and Information Systems Group (DBIS) Laboratory
From MARC21 and Dublin Core, through CIDOC CRM: First Tenuous Steps towards Representing Library Data in FRBRoo
From MARC21 and Dublin Core, through CIDOC CRM: First Tenuous Steps towards Representing Library Data in FRBRoo Cezary Mazurek, Krzysztof Sielski, Justyna Walkowska, Marcin Werla Poznań Supercomputing
Semantic Web in Cultural Heritage After 2020
Semantic Web in Cultural Heritage After 2020 Konstantinos N. Vavliakis 1,2, Georgios Th. Karagiannis 2, and Pericles A. Mitkas 1,3 1 Department of Electrical and Computer Engineering Aristotle University
INFORMATION INTEGRATION: MAPPING CULTURAL HERITAGE METADATA INTO CIDOC CRM CARRASCO, L. B., THALLER, M., CARVALHO, J. R. ***
INFORMATION INTEGRATION: MAPPING CULTURAL HERITAGE METADATA INTO CIDOC CRM CARRASCO, L. B., THALLER, M., CARVALHO, J. R. *** GT3: Organização da informação e do conhecimento no século XXI Abstract: Cultural
ARIADNE CONSERVATION DOCUMENTATION SYSTEM: CONCEPTUAL DESIGN AND PROJECTION ON THE CIDOC CRM. FRAMEWORK AND LIMITS
ARIADNE CONSERVATION DOCUMENTATION SYSTEM: CONCEPTUAL DESIGN AND PROJECTION ON THE CIDOC CRM. FRAMEWORK AND LIMITS Department of Conservation of Antiquities & Works of Art, TEI Athens Ag Spiridonos 12210
Syntactic Theory on Swedish
Syntactic Theory on Swedish Mats Uddenfeldt Pernilla Näsfors June 13, 2003 Report for Introductory course in NLP Department of Linguistics Uppsala University Sweden Abstract Using the grammar presented
Towards the Russian Linked Culture Cloud: Data Enrichment and Publishing
Towards the Russian Linked Culture Cloud: Data Enrichment and Publishing Dmitry Mouromtsev 1, Peter Haase 2,1, Eugene Cherny 1,3, Dmitry Pavlov 4,1, Alexey Andreev 1, and Anna Spiridonova 1 1 ITMO University,
STAR Semantic Technologies for Archaeological Resources. http://hypermedia.research.glam.ac.uk/kos/star/
STAR Semantic Technologies for Archaeological Resources http://hypermedia.research.glam.ac.uk/kos/star/ Project Outline 3 year AHRC funded project Started January 2007, finish December 2009 Collaborators
Ontology-Based Multilingual Information Retrieval
Ontology-Based Multilingual Information Retrieval Jacques Guyot * Saïd Radhouani *,** Gilles Falquet * * Centre universitaire d informatique 24, rue Général-Dufour, CH-1211 Genève 4, Switzerland ** Laboratoire
CultureSampo Finnish Culture on the Semantic Web: The Vision and First Results
CultureSampo Finnish Culture on the Semantic Web: The Vision and First Results Eero Hyvönen, Tuukka Ruotsalo, Thomas Häggström, Mirva Salminen, Miikka Junnila, Mikko Virkkilä, Mikko Haaramo, Eetu Mäkelä,
Timeline (1) Text Mining 2004-2005 Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?
Text Mining 2004-2005 Master TKI Antal van den Bosch en Walter Daelemans http://ilk.uvt.nl/~antalb/textmining/ Dinsdag, 10.45-12.30, SZ33 Timeline (1) [1 februari 2005] Introductie (WD) [15 februari 2005]
IMPRESSIONIST PAINTERS
IMPRESSIONISM. Impressionism was born in Paris (France) in 19th century and in 1874 the Impressionists held their first show. The Impressionists painted not a landscape but the immediate impression of
CIDOC-CRM Extensions for Conservation Processes: A Methodological Approach
CIDOC-CRM Extensions for Conservation Processes: A Methodological Approach Evgenia Vassilakaki 1,a), Daphne Kyriaki- Manessi 1,b), Spiros Zervos 1,c) and Georgios Giannakopoulos 1,d) 1 Dept. Library science
Ontology-based Archetype Interoperability and Management
Ontology-based Archetype Interoperability and Management Catalina Martínez-Costa, Marcos Menárguez-Tortosa, J. T. Fernández-Breis Departamento de Informática y Sistemas, Facultad de Informática Universidad
KHRESMOI. Medical Information Analysis and Retrieval
KHRESMOI Medical Information Analysis and Retrieval Integrated Project Budget: EU Contribution: Partners: Duration: 10 Million Euro 8 Million Euro 12 Institutions 9 Countries 4 Years 1 Sep 2010-31 Aug
Semantic annotation of requirements for automatic UML class diagram generation
www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute
Development of an Ontology for the Document Management Systems for Construction
Development of an Ontology for the Document Management Systems for Construction Alba Fuertes a,1, Núria Forcada a, Miquel Casals a, Marta Gangolells a and Xavier Roca a a Construction Engineering Department.
Methodology for CIDOC CRM based data integration with spatial data
CAA'2010 Fusion of Cultures Francisco Contreras & Fco. Javier Melero (Editors) Methodology for CIDOC CRM based data integration with spatial data G. Hiebel 1, K. Hanke 1, I. Hayek 2 1 Surveying and Geoinformation
LOD2014 Linked Open Data: where are we? 20 th - 21 st Feb. 2014 Archivio Centrale dello Stato. SBN in Linked Open Data
LOD2014 Linked Open Data: where are we? 20 th - 21 st Feb. 2014 Archivio Centrale dello Stato SBN in Linked Open Data Istituto Centrale per il Catalogo Unico delle biblioteche italiane (ICCU) 20-21/02/2014
Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology
Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Hong-Linh Truong Institute for Software Science, University of Vienna, Austria [email protected] Thomas Fahringer
Björn Lundquist UiT The Arctic University of Norway
Nordic Atlas of Language Structures (NALS) Journal, Vol. 1, 149 153 C opyright Björn Lundquist 2014 Licensed under a C reative C ommons Attribution 3.0 License Prefixed negation Björn Lundquist UiT The
Types and Annotations for CIDOC CRM Properties
Types and Annotations for CIDOC CRM Properties Vladimir Alexiev Ontotext Corp, 135 Tsarigradsko Shosse Blvd, Sofia, Bulgaria [email protected] Abstract. The CIDOC CRM provides an extensive
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
Multilingual and Localization Support for Ontologies
Multilingual and Localization Support for Ontologies Mauricio Espinoza, Asunción Gómez-Pérez and Elena Montiel-Ponsoda UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain {jespinoza,
Concept for an Ontology Based Web GIS Information System for HiMAT
Concept for an Ontology Based Web GIS Information System for HiMAT Gerald Hiebel Klaus Hanke University of Innsbruck Surveying and Geoinformation Unit {gerald.hiebel; klaus.hanke}@uibk.ac.at Abstract The
Search Result Diversification Methods to Assist Lexicographers
Search Result Diversification Methods to Assist Lexicographers Lars Borin Markus Forsberg Karin Friberg Heppin Richard Johansson Annika Kjellandsson Språkbanken, Department of Swedish, University of Gothenburg
FRBR. object-oriented definition and mapping to FRBR ER (version 1.0)
FRBR object-oriented definition and mapping to FRBR ER (version 1.0) International Working Group on FRBR and CIDOC CRM Harmonisation supported by Delos NoE Editors: Chryssoula Bekiari Martin Doerr Patrick
STAR Semantic Technologies for Archaeological Resources. http://hypermedia.research.glam.ac.uk/kos/star/
STAR Semantic Technologies for Archaeological Resources http://hypermedia.research.glam.ac.uk/kos/star/ Project Outline 3 year AHRC funded project Started January 2007, finish December 2009 Collaborators
Integration of Heterogeneous Metadata in Europeana. Cesare Concordia [email protected] Institute of Information Science and Technology-CNR
Integration of Heterogeneous Metadata in Europeana Cesare Concordia [email protected] Institute of Information Science and Technology-CNR Outline What is Europeana The Europeana data model The
UNIMARC, RDA and the Semantic Web
Date submitted: 04/06/2009 UNIMARC, and the Semantic Web Gordon Dunsire Depute Director, Centre for Digital Library Research University of Strathclyde Glasgow, Scotland Meeting: 135. UNIMARC WORLD LIBRARY
Lesson 8: The Post-Impressionists. Pages 44-51
Lesson 8: The Post-Impressionists Pages 44-51 Post-Impressionist Artists: Cezanne Seurat Van Gogh Gauguin Munch Klimt Picasso Rousseau Child with a Dove, Pablo Picasso Paul Cezanne Oldest Post- Impressionist
TEI and Cultural Heritage Ontologies
TEI and Cultural Heritage Ontologies Interchange of information? Øyvind Eide & Christian-Emil Ore Unit for Digital Documentation, University of Oslo, Norway Motivation: Grey literature in Museums 1 Original
Integration of Cultural Information
The CIDOC CRM, a Standard for the Integration of Cultural Information Stephen Stead CIDOC Conceptual Reference Model Special Interest Group ICS-FORTH, Crete, Greece November, 2008 1 Slide 1 Welcome to
An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration
Proceedings of Student-Faculty Research Day, CSIS, Pace University, May 3 rd, 2013 An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration Srinivasan Shanmugam and
METS and the CIDOC CRM a Comparison
METS and the CIDOC CRM a Comparison Martin Doerr February 2011 Acknowledgments The work was commissioned and financed by Cultural Heritage Imaging (http://www.c-h-i.org) with majority funding from the
Implementing the CIDOC CRM with a relational database
MCN Spectra. 24 (1), Spring 1999 Implementing the CIDOC CRM with a relational database Introduction The CIDOC Conceptual Reference Model (CRM) is an object oriented semantic reference model for cultural
Arches: An Open Source GIS for the Inventory and Management of Immovable Cultural Heritage
Arches: An Open Source GIS for the Inventory and Management of Immovable Cultural Heritage David Myers 1, Alison Dalgity 1, Ioannis Avramides 2, and Dennis Wuthrich 3 1 The Getty Conservation Institute,
Introduction. Philipp Koehn. 28 January 2016
Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)
Formalization of the CRM: Initial Thoughts
Formalization of the CRM: Initial Thoughts Carlo Meghini Istituto di Scienza e Tecnologie della Informazione Consiglio Nazionale delle Ricerche Pisa CRM SIG Meeting Iraklio, October 1st, 2014 Outline Overture:
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
Semantic Transformation of Web Services
Semantic Transformation of Web Services David Bell, Sergio de Cesare, and Mark Lycett Brunel University, Uxbridge, Middlesex UB8 3PH, United Kingdom {david.bell, sergio.decesare, mark.lycett}@brunel.ac.uk
A Generic Database Schema for CIDOC-CRM Data Management
A Generic Database Schema for CIDOC-CRM Data Management Kai Jannaschk 1, Claas Anders Rathje 1, Bernhard Thalheim 1 and Frank Förster 2 1 Christian-Albrechts-University at Kiel, Information Systems Engineering
Formal Ontologies in Model-based Software Development
Formal Ontologies in Model-based Software Development Hele-Mai Haav, Andres Ojamaa, Vahur Kotkas, Pavel Grigorenko, Jaan Penjam Institute of Cybernetics at TUT About In general, ontologies as formal models
Integrating data from The Perseus Project and Arachne using the CIDOC CRM An Examination from a Software Developer s Perspective
Integrating data from The Perseus Project and Arachne using the CIDOC CRM An Examination from a Software Developer s Perspective Robert Kummer, Perseus Project at Tufts University and Research Archive
ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU
ONTOLOGIES p. 1/40 ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU Unlocking the Secrets of the Past: Text Mining for Historical Documents Blockseminar, 21.2.-11.3.2011 ONTOLOGIES
Facilitating access to cultural heritage content in Czechia: National Authority Files and INTERMI project
Submitted on: 18.06.2015 Facilitating access to cultural heritage content in Czechia: National Authority Files and INTERMI project Marie Balíková National Library of the Czech Republic, Prague, Czechia
Structure of the talk. The semantics of event nominalisation. Event nominalisations and verbal arguments 2
Structure of the talk Sebastian Bücking 1 and Markus Egg 2 1 Universität Tübingen [email protected] 2 Rijksuniversiteit Groningen [email protected] 12 December 2008 two challenges for a
Creating an RDF Graph from a Relational Database Using SPARQL
Creating an RDF Graph from a Relational Database Using SPARQL Ayoub Oudani, Mohamed Bahaj*, Ilias Cherti Department of Mathematics and Informatics, University Hassan I, FSTS, Settat, Morocco. * Corresponding
Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies
Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies Francisco Carrero 1, José Carlos Cortizo 1,2, José María Gómez 3 1 Universidad Europea de Madrid, C/Tajo s/n, Villaviciosa
Following a guiding STAR? Latest EH work with, and plans for, Semantic Technologies
Following a guiding STAR? Latest EH work with, and plans for, Semantic Technologies Presented by Keith May Based on research work of English Heritage staff especially Paul Cripps & Phil Carlisle (NMR DSU)
Innovations for researchers in cultural and scientific heritage Milagros del Corral
Innovations for researchers in cultural and scientific heritage Milagros del Corral Directora de la Biblioteca Nacional de España Vicepresidenta del Comité Ejecutivo, The European Library Europeana: the
Semantic Indexing via Knowledge Organization Systems: Applying the CIDOC-CRM to Archaeological Grey Literature
Semantic Indexing via Knowledge Organization Systems: Applying the CIDOC-CRM to Archaeological Grey Literature Andreas Vlachidis A thesis submitted in partial fulfilment of the requirements of the University
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION
EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION Anna Goy and Diego Magro Dipartimento di Informatica, Università di Torino C. Svizzera, 185, I-10149 Italy ABSTRACT This paper proposes
AAC Road Map. Introduction
AAC Road Map Introduction The American Art Collaborative (AAC), comprised of thirteen museums, has spent the past nine months engaged in learning about Linked Open Data (LOD) and planning how to move forward
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
Cultural Heritage and Metabolism
The use of CRM Core in Multimedia Annotation Patrick Sinclair 1, Matthew Addis 2, Freddy Choi 2, Martin Doerr 3, Paul Lewis 1 and Kirk Martinez 1 1 Electronics and Computer Science, University of Southampton,
Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval
Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information
