From concepts to ontologies in metadata: a happy explication of a cognitive scenario for discoverability



Similar documents
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

The Importance of Bioinformatics and Information Management

Biomedical Informatics: Computer Applications in Health Care and Biomedicine

Building a National Ontology Infrastructure

Lightweight Data Integration using the WebComposition Data Grid Service

MULTILINGUAL ACCESS TO CONTENT THROUGH CIDOC CRM ONTOLOGY

Ontology-Based Query Expansion Widget for Information Retrieval

How to get started with research data management training services for the academic library?

CREDIT TRANSFER: GUIDELINES FOR STUDENT TRANSFER AND ARTICULATION AMONG MISSOURI COLLEGES AND UNIVERSITIES

Explorer's Guide to the Semantic Web

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

POSITION DETAILS. Digitisation & Digital Services

MED 2400 MEDICAL INFORMATICS FUNDAMENTALS

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining

Developing Data Workshops

Methodological Issues for Interdisciplinary Research

HydroDesktop Overview

Translation and Localization Instruments of Improving the Economic Mechanism

CREATING LEARNING OUTCOMES

MS Health Informatics ( )

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

Modern foreign languages

Introduction wikinomics

Biomedical Informatics: Its Scientific Evolution and Future Promise. What is Biomedical Informatics?

SharePoint 2013 Site Owner and Power User Boot Camp SP31; 3 Days, Instructor-led

How To Create A Knotworking In The Library

RFI Summary: Executive Summary

Modularising Multilingual and Multicultural Academic Communication Competence for BA and MA level MAGICC CONSULTATION INTERVIEW STUDENTS

LANGUAGE LEARNING CENTRES

Program Overview. Updated 06/13

INVITES APPLICATIONS FOR POST DOCTORAL FELLOWSHIP

Why SBVR? Donald Chapin. Chair, OMG SBVR Revision Task Force Business Semantics Ltd

Report of the Ad Hoc Committee for Development of a Standardized Tool for Encoding Archival Finding Aids

Multilingual and Localization Support for Ontologies

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives

The Value of Taxonomy Management Research Results

Structure of Presentation. The Role of Programming in Informatics Curricula. Concepts of Informatics 2. Concepts of Informatics 1

Master in Digital Humanities

Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web

FACULTY STUDY PROGRAMME FOR POSTGRADUATE STUDIES

PRACTICAL ASPECTS OF TRANSLATION: BRIDGING THE GAP BETWEEN THEORY AND PRACTICE. Andrea LIBEG, Petru Maior University, Tg Mureş, Romania.

Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint

Creating User-Friendly Web Sites

Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG )

Certified Information Professional 2016 Update Outline

ENHANCEMENT OF UDC DATA FOR USE AND SHARING IN A NETWORKED ENVIRONMENT. Aida Slavic Maria Ines Cordeiro Gerhard Riesthuis

MEng, BSc Applied Computer Science

APPENDIX to CAHIIM 2012 Curriculum Requirements Health Informatics Master s Degree

Writing learning objectives

Healthcare Measurement Analysis Using Data mining Techniques

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

CHAPTER 1 INTRODUCTION

Master of Science in Artificial Intelligence

College of Communication and Information. Library and Information Science

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Syllabus Information Architecture (LIS )

Presented to The Federal Big Data Working Group Meetup On 07 June 2014 By Chuck Rehberg, CTO Semantic Insights a Division of Trigent Software

Date submitted: June 1, 2011

MEng, BSc Computer Science with Artificial Intelligence

Cognitive Interaction Toolkit

a Data Science Univ. Piraeus [GR]

Telecommunication (120 ЕCTS)

ElŜbieta GAJEWSKA : Levels of language proficiency and teaching LSP

An Agent-Based Concept for Problem Management Systems to Enhance Reliability

Case Study: Institution-Based Information Systems, Egypt National Agricultural Research Information Management System (NARIMS)

Social Script Theory and Cross-Cultural Communication

Brief Course Descriptions for Academic English Cluster

Text Analytics with Ambiverse. Text to Knowledge.

Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery

Faculty of Engineering and Science Curriculum - Aalborg University

Categories of Study: Curriculum Goals and Requirements

CODE OF ETHICS FOR COMMUNITY INTERPRETERS

Secure Semantic Web Service Using SAML

SoberIT Software Business and Engineering Institute

The astronomical Virtual Observatory : lessons learnt, looking forward. Françoise Genova - Forum VO-PDC d après ADASS XXI, Paris, nov.

Competencies for Secondary Teachers: Computer Science, Grades 4-12

What is intelligent content?

International Relations Program Integrated Assessment Plan April 27, 2007

Knowledge Management Challenges in Web-Based Adaptive e-learning Systems

2QWRORJ\LQWHJUDWLRQLQDPXOWLOLQJXDOHUHWDLOV\VWHP

CRITICAL AND CREATIVE THINKING RUBRIC GRADUATE PROGRAMS

MASTER S DEGREE PROGRAMME IN BIOINFORMATICS

Build Your Knowledge!

Guidelines for Best Practices in Data Management Roles and Responsibilities

ONTOLOGY BASED FEEDBACK GENERATION IN DESIGN- ORIENTED E-LEARNING SYSTEMS

LONG BEACH CITY COLLEGE MEMORANDUM

Why are Organizations Interested?

Turning three overlapping thesauri into a Global Agricultural Concept Scheme

Organizational Behavior Terminology and Concepts. By 1September 04

Linked Statistical Data Analysis

Introductory Guide to the Common European Framework of Reference (CEFR) for English Language Teachers

escidoc: una plataforma de nueva generación para la información y la comunicación científica

LabelTranslator - A Tool to Automatically Localize an Ontology

BS Environmental Science ( )

PROGRAMME SPECIFICATION POSTGRADUATE PROGRAMME

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

Indiana University School of Informatics and Computing

SavvyDox Publishing Augmenting SharePoint and Office 365 Document Content Management Systems

Transcription:

Template for the preparation of full texts for the online conference proceedings From concepts to ontologies in metadata: a happy explication of a cognitive scenario for discoverability Raisa Iivonen 1, Päivi Lipsanen 1, Ari Muhonen 2 and Esko Siirala 1 1 Viikki Campus Library, Helsinki University Library, Helsinki, Finland 2 Jyväskylä University Library, Jyväskylä, Finland Corresponding author: Raisa Iivonen, raisa.iivonen@helsinki.fi Abstract The start of a research project is the key point of the basic process of discovery. It is one of the most important phases to study when we want to understand the information seeking behavior of a scientist. Metadata embedded with ontologies will be Santa s best help in the future, assuming the ground work is thoroughly done by scientists themselves and also by metadata professionals. In this presentation we will explicate a scenario how to support a collaborative method in order to create a reliable bank of terminology where the concepts are defined, validated and accepted by the scientific community. Scientific knowledge is available only if it is found or discovered. In computer science and information science, ontology formally represents knowledge as a set of concepts within a domain, using a shared vocabulary to denote the types, properties and interrelationships of those concepts. Ontologies are the structural frameworks for organizing information and are used in artificial intelligence, the Semantic Web, biomedical informatics, library science, and information architecture as a form of knowledge representation about the world or some part of it. The core of the work is to translate and create scientific terms into Finnish a language with no structural relations to Indo-European languages such as English, German or French. The Finnish Medical Society Duodecim has a consistent, systematic and relatively long tradition of translating e.g. medical entries into Finnish. Its publications anchor the new medical terms into the Finnish language. The outcome of this work is e.g. the database FinMeSH, which is used through the interface Termix, created and maintained by the medical library Terkko. When the semantical part of the work is reliably done, then it is possible to create user-friendly applications with the help of an ontology. A case study will be presented: a reasonable way to create a Finnish veterinary terminology is needed. Crowd sourcing and wiki are means to solve the acute problem of translating e.g. MeSH terms those which are veterinary medical concepts - into Finnish. (1) Key words: Ontologies; Information Storage and Retrieval; Metaphysics.

Aim In this paper we will explicate a scenario how to support a collaborative method in order to create a reliable bank of terminology where the concepts are defined, validated and accepted by the scientific community. Scientific knowledge is not available if it is not found or discovered. Academic libraries bring happiness by producing discoverability. Discoverability or findability is an essential part of availability. Huge collections of important scientific publications are useless if they are not accessible. Availability is naturally indispensable, but insufficient if the contents of the publications stay in the shade. According to Peter Morville (2), findability is the quality of being locatable or navigable the degree to which a particular object is easy do discover or locate the degree to which a system or environment supports navigation and retrieval The design of a tool or toolkit which helps to solve the problems of finding or discovering relevant information is one of our goals, and therefore a well designed plan or scenario how to achieve this is needed. As a case we represent the project of compiling veterinary terminology in Finnish. Methods Semantic web Ontologies Natural semantic metalanguage The Semantic Web is the extension of the World Wide Web that enables people to share content beyond the boundaries of applications and websites Ontologies are considered one of the pillars of the Semantic Web, although they do not have a universally accepted definition. A (Semantic Web) vocabulary can be considered as a special form of (usually light-weight) ontology, or sometimes also merely as a collection of URIs with an (usually informally) described meaning. The Natural semantic metalanguage (NSM) is a linguistic theory and a practical, meaning-based approach to linguistic analysis. The NSM approach is based on evidence that there is a small core of basic, universal meanings, known as semantic primes, which can be expressed by words or other linguistic expressions in all languages. This common core of meaning can be used as a tool for linguistic and cultural analysis: to explain the meanings of complex and culture-specific words and grammatical constructions (using semantic explications), and to articulate culture-specific values and attitudes (using cultural scripts). The theory also provides a semantic foundation for universal grammar and for linguistic typology. Using NSM allows us to formulate analyses which are clear, precise, crosstranslatable, non-anglocentric, and intelligible to people without specialist linguistic training

Semantic explications are explanatory paraphrases, framed in the metalanguage of simple and universal semantic primes. They can range in length from two or three words to literally dozens of interrelated clauses. They are essentially "texts" composed in a specified subset of ordinary language. Example of a semantic explication: The happy explication The explication for the English emotion term 'happy' (with the verb 'to be'), shows how a prototypical cognitive scenario can be incorporated into an explication. The feeling experienced by X is not described directly; rather it is described as LIKE the good feeling experienced by a person who thinks certain prototypical thoughts someone X is happy (at this time): someone X thinks like this at this time: "many good things are happening to me as I want I can do many things now as I want this is good" because of this, this someone feels something good at this time like someone can feel when they think like this This approach to emotion semantics allows a great deal of subtle differentiation between closely related emotions (e.g. 'happy', 'joyful', 'pleased', 'content', 'related', 'jubilant', and so on). To see this, here is a parallel explication for the word 'contented'. Notice that it follows the same overall structure or "semantic template": someone X is contented (at this time): someone X thinks like this at this time: "something good is happening to me now< I want this I don't want anything else now" because of this, this someone feels something good at this time like someone can feel when they think like this A well designed ontology is needed for the indexing purposes and a well planned scenario is needed to create such multipurpose system with many users and specialists (too many cooks spoil the broth) We define cognitive scenario as all the conditions through which information processing passes, distributed between the human and the environmental side. (3) In the performing arts, a scenario (from Italian: that which is pinned to the scenery is a synoptical collage of an event or series of actions and events. In the Commedia dell'arte it was an outline of entrances, exits, and action describing the plot of a play, and was literally pinned to the back of the scenery. The cognitive scenario will be a plan to use semantic web and its tools natural semantic metadata and ontologies to create a toolkit for better indexing. Case: Veterinary terminology On Viikki Campus the campus library is responsible for indexing journal articles in national databases. Appropriate thesauri are always needed; as a solution the library has created a Finnish terminology Agriforest, which contains e.g. of FAO s Agrovoc. Now it has been ontologized and now goes by the name of AFO. It is available via the Finnish Ontology Service Finto. AFO includes only a few veterinary terms. MeSH (FinMesh) is used when medical terms are needed in indexing veterinary books and articles, but MeSH is not comprehensive from the

veterinary point of view (CABI s subject headings have not been used in Finland). Results The benefit of using ontologies is the crosstranslatability built in the system. The end user is able to make searches in one language, and the ontologized vocabulary gives a systematic tree where one can choose terms needed. E.g. Medical Subject Headings MeSH is translated to European languages and is searchable in Finnish via the interface Termix which includes FinMeSH. The terms are translated by a board of experts in the Medical Society Duodecim and the interface is served by the medical library Terkko. Besides medicine, other disciplines need validation of terms, too. A tool for this is the Bank of Finnish Terminology in Arts and Sciences (BFT), where terms are defined and discussed using wiki. In Finland the veterinary community, e.g. the Faculty of Veterinary Medicine, the Finnish Veterinary Association and the Finnish Food Safety Authority, creates and uses veterinary terminology while doing their everyday work. These terms should be presented on the wiki platform so that the veterinary community can discuss about the validation of the terms. The library is a natural coordinator for this collaboration. Discussion Finnish language differs from other European languages in all these aspects Vocabulary Grammar Practice Examples:

In Europe we speak many languages, in some countries we have several more or less official languages. The European Union has 24 official languages. Scientific results are nowadays published mainly in English which has become a lingua franca in the academic environment. A need to acquire information in one s native language still exists: many journals publish original scientific peer reviewed articles written in national languages. The audience wants to get new, reliable information from their colleagues in their mother tongue. E.g. the Finnish Veterinary Journal has monitored the information needs of the veterinary society in Finland, and every five years the results give the same message: veterinary practitioners want to read about recent clinical research, preferably at home, on the couch and from printed journals. The journal is a 120 year old success story and it is still going strong! At the same time the library has to deal with the problem of not having an appropriate terminology when indexing these peer reviewed articles. Fulfilling the information needs of the busy veterinarians the library creates user friendly interfaces; indexed articles will remain in shade if the indexing terms are not accurate. The future lies on the ontologized vocabularies like MeSH (FinMeSH), AFO (Agriforest in Finnish) and YSO (the Finnish General Upper Ontology). Crowd sourcing like wiki will help the validation, which need discussion in the scientific society, BFT ( the Bank of Finnish Terminology in Arts and Sciences) offers the media for carrying out the process. Conclusion Did we get a happy explication? Scientific knowledge is not available if it is not discovered. With skillful and reliable indexing the publications are found. Academic libraries bring happiness by producing findability and discoverability. Acknowledgments Finnish Veterinary Association Finnish Veterinary Journal Faculty of Veterinary Medicine, University of Helsinki

Viikki Campus Library (Helsinki University Library) Jyväskylä University Library Bank of Finnish Terminology in Arts and Sciences National Library of Finland REFERENCES 1. Iivonen, R. & al. Finnish Veterinary Terminology: translating medical and veterinary subject headings using collaborative process http://sites-final.uclouvain.be/eahil2012/conference/?q=node/904 2. Morville, P. Ambient findability. O Reilly, 2005. 3. Veneziano, V. Cognitive Scenarios for Problem Solving in S/W Engineering http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.92.6358&rep=rep1&type=pdf