The WissKI Project A scholarly communication infrastructure




Georg Hohmann
Germanisches Nationalmuseum Nürnberg, Department of Museum Informatics
CIDOC 2009, Santiago de Chile, 30 September 2009

Overview

Project funded by the DFG (German Research Foundation), 2009-2011

Partners
- Friedrich-Alexander-University of Erlangen-Nuremberg (FAU), Chair of Computer Science 8, Artificial Intelligence (Prof. Günther Görz)
- Germanisches Nationalmuseum (GNM) Nuremberg, Head of Museum Informatics (Dr. Siegfried Krause)
- Zoologisches Forschungsmuseum Alexander Koenig (ZFMK) Bonn, Head of Biodiversity Informatics (Dr. Karl-Heinz Lampe)

Staff
- Dipl.-Inf. Martin Scholz (FAU)
- Georg Hohmann M.A. (GNM)
- Dipl.-Inf. Mark Fichtner (ZFMK)
- 2 student assistants

30.09.2009 Georg Hohmann - Germanisches Nationalmuseum Nürnberg - Department of Museum Informatics - CIDOC 2009 - The WissKI Project

Goal

To create a wiki-based software system that
- supports scientific communication
- supports scientific documentation in memory institutions
- provides long-term availability of research results
- assures the identity of authorship
- assures the authenticity of information
- enables the persistence of citations
- supports quality management
- supports the preparation of scientific publications
- supports semantic content analysis based on ontologies
- serves as a platform for curated knowledge

Use Cases

ZFMK
- Biodat specimen database
- Database of the research diaries of Wilhelm Aerts (1885-1964)

GNM
- Database of Goldsmith's Art in Nuremberg, 16th to 19th century
- Database of the Early Duerer Research Project

Unique Features

Other projects dealing with wiki technology and knowledge management, e.g.
- KiWi, the "Knowledge in a Wiki" project (http://www.kiwi-project.eu)
- OntoWiki (http://ontowiki.net)
- WIKINGER, WIKI Next Generation Enhanced Repository (http://www.wikinger-escience.de)

Unique features of WissKI
- Pure OWL-DL architecture
- Emphasis on communicational aspects
- The only project with a focus on humanities data
- Tools for scientific documentation
- Automatic text analysis for semantic annotation

Workpackages 2009

- Creating an integrated overall concept
- Selecting a software framework to build on
- Preprocessing of data from the use cases
- Creating suitable ontologies
- Incorporating common authority files
- Creating local authority files
- Developing a first software prototype

Overall Concept

[Diagram: overall concept. Labelled elements: Erlangen CRM/OWL; Base Ontology = Museumdat/OWL enhanced; local data models Biodat/OWL and GNM-DMS/OWL, each with a mapping; instances; local name authorities (names, places, ...); global name authorities (AAT, TGN, ULAN, ...). Import and export run through validation and Web APIs. Exchange formats on both sides: OWL-DL/XML (instances & ontologies), OAI-PMH Museumdat/XML (instances), OAI-PMH SKOS/XML, REST/SOAP. Legend: base components, local data models, data, interfaces, hierarchical ontology connection (is-a), equivalent ontology connection (same-as), instance of, data flow.]

Software Framework

Drupal
- Core modules
- Third-party modules: CCK, Views, WikiTools, WysiwygAPI, ImageAPI, ...

WissKI
- OWL/RDF System
- Discussion System
- Automatic Text Annotator
- Authority Files Management
- Import/Export API
- ...

Storage
- Database
- Triple Store

All software used is available under free software licences.
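The overall concept names OAI-PMH as one of the exchange interfaces. As a rough illustration of what a harvesting request against such an interface looks like, the sketch below builds a standard OAI-PMH `ListRecords` URL. The base URL and the `museumdat` metadata prefix are assumptions made for this example; the slides do not state the actual endpoint or prefix names.

```python
from typing import Optional
from urllib.parse import urlencode


def oai_list_records_url(base_url: str, metadata_prefix: str,
                         from_date: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL.

    'verb', 'metadataPrefix' and 'from' are request arguments defined by
    the OAI-PMH protocol; the values passed in below are hypothetical.
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date is not None:
        # Optional selective harvesting: only records changed since this date.
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint and prefix, for illustration only.
url = oai_list_records_url("https://example.org/oai", "museumdat", "2009-09-30")
print(url)
```

A harvester would fetch this URL and parse the XML response; that part is omitted here because it depends on the record format actually exposed.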

Ontologies

Available as first drafts
- Erlangen CRM / OWL: OWL-DL 1.0 implementation of the CIDOC CRM 5.0.1
- WissKI Base Ontology: enhanced OWL-DL 1.0 implementation of Museumdat
- Biodat OWL: OWL-DL 1.0 ontology of the ZFMK Biodat database & Aerts diaries
- Duerer OWL: OWL-DL 1.0 ontology of the Early Duerer Research Project database

In development
- Gold OWL: OWL-DL 1.0 ontology of the GNM Goldsmith's Art database
- GNM-DMS OWL: OWL-DL 1.0 ontology of the GNM documentation database

Content Creation

Each item is an instance of a class in a given ontology.
Each instance is represented by RDF triples and stored in a triple store.
Instances can be created in three ways:
- by submitting free text
- via traditional web forms
- by uploading an image or other multimedia file
Every manual or automatic data entry is assisted by authority files.
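To make "an instance represented by RDF triples" concrete, here is a minimal sketch using a toy in-memory triple store written in plain Python. It is not the WissKI implementation; the `ex:` identifiers, the `ecrm:` prefix and the helper names are assumptions chosen for illustration. The class and property follow CIDOC CRM naming (E31 Document, P70 documents).

```python
from typing import Dict, List, NamedTuple, Optional, Set


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


class TripleStore:
    """Toy in-memory triple store (illustrative stand-in for a real one)."""

    def __init__(self) -> None:
        self._triples: Set[Triple] = set()

    def add(self, subject: str, predicate: str, obj: str) -> None:
        self._triples.add(Triple(subject, predicate, obj))

    def match(self, s: Optional[str] = None, p: Optional[str] = None,
              o: Optional[str] = None) -> List[Triple]:
        """Return all triples matching the given pattern (None = wildcard)."""
        return sorted(
            t for t in self._triples
            if (s is None or t.subject == s)
            and (p is None or t.predicate == p)
            and (o is None or t.obj == o)
        )


RDF_TYPE = "rdf:type"


def create_instance(store: TripleStore, instance_id: str, class_id: str,
                    properties: Dict[str, str]) -> None:
    """Store one instance: a type triple plus one triple per property."""
    store.add(instance_id, RDF_TYPE, class_id)
    for predicate, value in properties.items():
        store.add(instance_id, predicate, value)


# Hypothetical identifiers, for illustration only.
store = TripleStore()
create_instance(
    store,
    "ex:diary_entry_1",
    "ecrm:E31_Document",
    {"ecrm:P70_documents": "ex:determination_event_1"},
)
for triple in store.match(s="ex:diary_entry_1"):
    print(triple)
```

The pattern query in `match` mirrors how a real triple store is asked for "everything known about this instance".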

Automatic Semantic Text Annotation

Assumption
- Scientists are used to communicating and recording data by writing texts (papers, articles, books, etc.).

Approach
- WYSIWYG editor enhanced with capabilities for automatic semantic markup.
- Automatic recognition of named entities (place names, person names, calendar dates etc.) based on name authorities.
- Users should be motivated to revise annotations.
- The text itself is treated as an instance of E31.Document.

Automatic Semantic Text Annotation: Screenshot 1
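The recognition step described above can be approximated by a simple lookup against an authority list. The sketch below is only that: a dictionary-based matcher, not the WissKI annotator, which is not described in detail in the slides. The authority entries and identifiers are invented for the example.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical authority file: surface form -> (entity type, authority ID).
AUTHORITY: Dict[str, Tuple[str, str]] = {
    "Nuremberg": ("place", "place:nuremberg"),
    "Albrecht Dürer": ("person", "person:duerer"),
}

Annotation = Tuple[int, int, str, str, str]  # start, end, surface, type, ID


def annotate(text: str, authority: Dict[str, Tuple[str, str]]) -> List[Annotation]:
    """Find every authority name in the text and return its span and ID."""
    if not authority:
        return []
    # Longest names first, so multi-word names win over shorter overlaps.
    names = sorted(authority, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(n) for n in names) + r")\b")
    annotations = []
    for match in pattern.finditer(text):
        entity_type, authority_id = authority[match.group(1)]
        annotations.append(
            (match.start(), match.end(), match.group(1), entity_type, authority_id)
        )
    return annotations


sample = "Albrecht Dürer worked in Nuremberg."
for annotation in annotate(sample, AUTHORITY):
    print(annotation)
```

Because such matching produces false positives and misses variants, the suggested annotations would still need the user revision that the slide calls for.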

Automatic Semantic Text Annotation: Screenshot 2

Manual Data Entry

Assumption
- Some scientists are used to (and like) recording data by entering it manually in a database with a forms-based interface (MS Access etc.).

Approach
- Forms are built dynamically from the concept descriptions in the ontologies in use.
- Available properties are shown depending on their definition.
- Data is standardized with regard to the loaded ontology.
- Input of values is supplemented by data from authority files.
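One way to picture forms built from an ontology: collect every property whose domain is the chosen class or one of its superclasses, and derive an input widget from the property's range. The sketch below does this on a tiny hand-written ontology. All class names, property names and widget names are hypothetical, and the real system reads its definitions from OWL rather than Python structures.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class PropertyDef:
    name: str
    domain: str  # class the property applies to
    range: str   # "literal" or the class of permitted values


# Hypothetical mini ontology, for illustration only.
SUBCLASS_OF: Dict[str, str] = {
    "ex:Painting": "ex:ManMadeObject",
    "ex:ManMadeObject": "ex:Thing",
}
PROPERTIES: List[PropertyDef] = [
    PropertyDef("ex:has_title", "ex:Thing", "literal"),
    PropertyDef("ex:was_produced_by", "ex:ManMadeObject", "ex:Person"),
    PropertyDef("ex:depicts", "ex:Painting", "ex:Thing"),
]


def ancestors(cls: str) -> List[str]:
    """Return the class followed by all of its superclasses."""
    chain = [cls]
    while chain[-1] in SUBCLASS_OF:
        chain.append(SUBCLASS_OF[chain[-1]])
    return chain


def build_form(cls: str) -> List[Dict[str, str]]:
    """Derive form fields from the properties applicable to a class."""
    applicable = set(ancestors(cls))
    fields = []
    for prop in PROPERTIES:
        if prop.domain in applicable:
            # Assumption: literals get a text box, everything else an
            # autocomplete field that could be backed by an authority file.
            widget = "text" if prop.range == "literal" else "autocomplete"
            fields.append({"property": prop.name, "widget": widget, "range": prop.range})
    return fields


for form_field in build_form("ex:Painting"):
    print(form_field)
```

Requesting the form for `ex:ManMadeObject` instead drops the `ex:depicts` field, which is the sense in which available properties depend on their definition.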

Manual Data Entry: Screenshot 1

Manual Data Entry: Screenshot 2

Communication

- The chosen software framework itself provides several community functions such as mailing lists, forums, blogs, support for multimedia files, user profiles etc.
- Instances can be discussed. Data generated by these discussions can replace or modify the previous data.
- A curator decides which statements represent the official statements of a project.
- Facilities for cooperative preparation of scientific publications.
- Automatic data exchange between different installations of WissKI.
- Import/export of data via widely adopted standards and based on standardized vocabulary.

Discussion: Screenshot 1
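The curation step above (discussions yield competing statements, and a curator marks which one is official) could be modelled roughly as follows. This is a sketch under assumptions: the slides do not describe the data model, and the rule that only one statement per subject and predicate may be official is introduced here purely for illustration, as are all names and values.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Statement:
    subject: str
    predicate: str
    value: str
    author: str
    official: bool = False


class CuratedRecord:
    """Collects proposed statements and tracks which ones are official."""

    def __init__(self) -> None:
        self.statements: List[Statement] = []

    def propose(self, subject: str, predicate: str, value: str, author: str) -> Statement:
        """Record a statement arising from data entry or a discussion."""
        statement = Statement(subject, predicate, value, author)
        self.statements.append(statement)
        return statement

    def approve(self, statement: Statement) -> None:
        """Curator action: make this statement the official one.

        Assumption: any earlier official statement about the same subject
        and predicate is demoted, but kept for the record.
        """
        for other in self.statements:
            if (other.subject, other.predicate) == (statement.subject, statement.predicate):
                other.official = False
        statement.official = True

    def official_value(self, subject: str, predicate: str) -> Optional[str]:
        for statement in self.statements:
            if statement.official and statement.subject == subject \
                    and statement.predicate == predicate:
                return statement.value
        return None


# Hypothetical example data.
record = CuratedRecord()
first = record.propose("ex:goblet_1", "ex:dated", "about 1600", "researcher_a")
second = record.propose("ex:goblet_1", "ex:dated", "1595", "researcher_b")
record.approve(first)
record.approve(second)  # the discussion result replaces the earlier dating
print(record.official_value("ex:goblet_1", "ex:dated"))
```

Keeping superseded statements rather than deleting them preserves the history of the discussion, which fits the stated goals of authorship identity and citation persistence.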

Achievements so far

- A prototype is already in development.
- Progress in most work packages is on schedule.
- Automatic semantic text annotation (proof of concept).
- Automatic form creation based on ontology concepts (proof of concept).
- Integration of authority files. Byproduct: a tool to convert Getty authority XML to SKOS.
- 6 ontologies are available in different stages of development.
- Data of some use cases is already prepared to test the system, e.g. the Aerts Diaries: 177 places, 12829 determination events, 1069 examination events; approx. 1.4 million triples.
- Project website online (http://wiss-ki.eu).
- Papers published and talks given related to the project.