Metadata Workshop The way to open science, June 22, 2016 Abstracts Roberto Barbera... 2 From Metadata to Open Data to Open Science... 2 Tim Head... 2 How to get to a bright future of reusable science... 2 Anna Fensel... 3 Towards Semantic APIs for Research Data Services... 3 Walter Koch... 3 Aggregation and Management of Metadata in the Context of Europeana... 3 Matej Durco... 3 Metadata for the Humanities a use case from ÖAW... 3 Georg Neubauer... 4 Visualization of typed links in linked data... 4 Barbara Petritsch... 4 Metadata for research data in practice... 4 Andreas Jeitler... 4 Accessibility Metadata - Metadata accessibility... 4 Sarah Teetor... 5 The Challenges of Building and Maintaining an Image Database... 5 Sandor Kopacsi... 5 Classification Server... 5 Stefan Szepe... 5
Metadata Challenges at the University of Music and Performing Arts... 5 José Luis Preza... 5 Automated Information Enrichment for a Better Search... 5 Roberto Barbera From Metadata to Open Data to Open Science The plethora of new services and capabilities of what is currently called Web 2.0 or 3.0 is having a big influence on science. In all disciplines, methods of doing research and creating and sharing knowledge are becoming more and more digital and they have recently been grouped under the terms Science 2.0 or Open Science. Open Science is a means and not an end in itself and it is much more than just open access to publications. It includes and requires full and federated access to datasets, data collections, algorithms, configurations, tools and applications, codes, workflows, scripts, libraries, services, system software, infrastructure, compilers, hardware, etc. Open Science is considered a key enabler of full reproducibility and re- usability of scientific results, which in turn are the base of an Open Innovation Economy and of a Knowledge Society. Transparency, openness, and reproducibility are recognized as vital features of science and pillars of the Scientific Method and it is strongly believed that an Open Science attitude should be taught to researchers since the very early stages of their careers. This keynote speech presents the issue of the so- called "reproducibility crisis" and outlines the components of a federated Open Science Platform able to tackle and cure it. Authorship of scientific results is also discussed. Tim Head How to get to a bright future of reusable science The fast progress of science is only possible because we stand on the shoulders of giants. However this is becoming more and more difficult as the use of code to perform research increases. Getting someone else's code to run often takes days or weeks. With such a large upfront investment required it is not surprising that most people do not try to do this. Instead they reimplement parts or all of the work. Akin to building your own printing press in order to distribute a questionnaire. We can do better. This talk will discuss the road towards a future in which reusing someone else's research code takes minutes, not days. I will discuss some of the key technologies involved, cultural changes needed and open- source projects that are tackling this problem
Anna Fensel Towards Semantic APIs for Research Data Services Rapid development of Internet and Web technology is changing the state of the art in communication of knowledge, or results of research activities. Particularly, Semantic technology, linked and open data become key enablers for successful and efficient progress in research. At first, I define the research data service (RDS) and discuss typical current and possible future usage scenarios involving RDS. Further, I discuss the state of the art in the areas of semantic service and data annotation and API construction, as well as infrastructural solutions, applicable for RDS realisation. At last, innovative methods of online dissemination, promotion and efficient communication of research are discussed. Walter Koch Aggregation and Management of Metadata in the Context of Europeana The European Digital Library, Europeana, has introduced in 2010 an open Data model that integrates and re- uses a variety of international standards for metadata. This open model supports the joint indexing of metadata from diverse cultural and scientific domains and recognizes the individual requirements for the various application areas. This way, spezialised metadata, f.e. for audio content, can be processed and aggregated with dedicated versions of the Europeana Data Model. Furthermore, the usage of controlled vocabularies in SKOS standard supports the enriched metadata presentation and linkage of data, f.e. for the biodiversity heritage or the local heritage domain. During the data enrichment process a variety of micro services is used that allow for customised data ingest, data annotation and data linking. In addition, new content provider- directed applications like the DISMARC Documentation and Annotation Platform allow for self- organized metadata monitoring and provide advanced annotation features for semantic linkage and enhanced visibility of data. Matej Durco Metadata for the Humanities a use case from ÖAW Lately, we have observed a new drive and considerable enthusiasm at the Academy: many new digital humanities projects are starting at the numerous humanities institutes. Very diverse in their nature and goals, they all have one thing in common: the need to discover, describe and manage research data both pre- existing and newly created in the course of their projects. Being part of a wide range of diverse communities of practice, the researchers need to be able to structure this information in a diverse array of formats. The newly created institute ACDH- OEAW is dedicated to support these digital humanities projects, providing technical expertise and consulting regarding conceptualisation and modelling of data.
In this presentation, we want to share our experiences, discussing the encountered metadata formats, issues and the implications for repositories and research infrastructures. Georg Neubauer Visualization of typed links in linked data The presentation concerns the visualization of typed links in Linked Data. Its content mentions a part about Linked Data and the Semantic Web, founded by Tim Berners Lee in 2001. After the description of the LOD cloud which is a gigantic global graph system interconnecting many databases, appropriate models for graph based data visualizations are described. The presentation then treats the construction of two different visualization views for Linked Data triples which compose relations to given input nodes. Relation Finder, a web app with a node- link diagram in short RelFinder is compared concerning design and usability to another visualization that has a linear or list diagram of the nodes. To find out the preferences of the visulaizations a test was applied within the result that the design of the linear version is more favourable handling Linked Data triples in visualizations. Barbara Petritsch Metadata for research data in practice As of Ausgust 2015 the IST Austria Library has been operating an institutional data repository: IST DataRep. The conception of the repository was determined by two main objectives: Eligibility for publishing data and usability. The talk will illustrate how these objectives affected the decisions on and the underlying reasons for the chosen metadata schema. Andreas Jeitler Accessibility Metadata - Metadata accessibility Accessible digital documents like e- books, electronic magazines or other kinds of digital media are a precondition for a self- determined participation of people with disabilities within the scientific community. But the most accessible document would be of no use, if you are not able to locate it within a repository. As metadata support the localisation process we must reflect how people with certain disabilities may use them. Are the accessible themselves? Accessibility metadata on the other hand are a special set of metadata to describe the accessibility features of a digital document.
Sarah Teetor The Challenges of Building and Maintaining an Image Database Because of the complex and multi- level nature of archival images, it is an ongoing challenge to establish a metadata architecture and metadata standards that are easy to navigate currently and take into consideration the users' future requirements. This contribution will present a use case from the material and visual side of the humanities based on the efforts of the Digital Research Archive for Byzantium (DiFAB), housed at the Department of Art History at the University of Vienna. Using the example of one monument and its photographic documentation, this talk will highlight some issues concerning metadata for images of material culture, such as: the various analog and digital forms of documentation; the use of available thesauri or other controlled vocabularies especially problems of historical geography, multilingualism, and culturally specific architectural and iconographic terminologies ; and the importance of both precise and fuzzy dating for cultural historians and their research archives. Sandor Kopacsi Classification Server This presentation will describe the implementation of a Classification Server that helps in metadata harmonization in a long term Preservation System of digital objects. After a short introduction to classifications and knowledge organizations, the requirements of the system to be implemented will be summarized. Some SKOS (Simple Knowledge Organization System) management tools will be briefly presented, that we have examined, including Skosmos, the solution we have selected for the implementation. Skosmos is an open source, web- based SKOS browser based on Jena Fuseki SPARQL server. The presentation will show the main steps of the installation of the applied tools, and some potential problems with the classifications to be used, as well as possible solutions. Stefan Szepe Metadata Challenges at the University of Music and Performing Arts This presentation summarizes the experience of applying various metadata standards to audio- visual digital assets in the institutional repository at the University of Music and Performing Arts in Wien. I focus on the challenges encountered satisfying the metadata needs of the various disciplines (classification requirements), address the processes implemented to solve these challenges and outline the status quo of the institutional repository in terms of metadata management. José Luis Preza Automated Information Enrichment for a Better Search
The process of adding the Metadata when uploading a digital object onto a repository is usually manual. This means that the user has to have already at hand the keywords and all the other information about the asset. This paper addresses the possibility of enriching the manual metadata by generating automated metadata using the cognitive services provided by technologies like IBM Watson platform.