A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches
|
|
|
- Horace Bryant
- 10 years ago
- Views:
Transcription
1 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 105 A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria Computer Science and Systems Engineering Department University of Zaragoza {jnog banares jlacasta javy}@unizar.es ABSTRACT Thesauri have been widely used to improve the precision and recall of information retrieval in digital libraries for decades. They provide a uniform and consistent vocabulary for indexing metadata ("description of the data holdings ) and for supplying users with a suitable vocabulary for the retrieval. Therefore, a thesaurus management tool becomes a vital component in the development of any kind of digital library and this also extensible in the context of a Spatial Data Infrastructure (SDI), a digital library specialised in geographic information resources. This work presents a thesauri management tool with two main objectives: a basic management and browsing of thesauri; and providing some mechanisms to enhance cross-discipline interoperability between different thesauri. INTRODUCTION According to ISO (1986), a thesaurus is a set of terms that describe the vocabulary of a controlled indexing language, formally organized so that the a priori relationships between concepts (for example synonymous terms, broader terms, narrower terms and related terms) are made explicit. Thesauri have been widely used to improve the precision and recall of information retrieval in digital libraries for decades. They provide a uniform and consistent vocabulary for indexing metadata ("description of the data holdings ) and for supplying users with a suitable vocabulary for the retrieval. Moreover, they can be used to expand users queries by automatically adding new terms to the query; or they can improve metadata interoperability for accessing information within networked knowledge organisation systems. Therefore, a thesaurus management tool becomes a vital component in the development of any kind of digital library. This is also extensible in the context of a Spatial Data Infrastructure (SDI). The main objective of an SDI, apart from involving many other issues, is to provide the discovery, evaluation and access to spatial data for a community of users. And hence,
2 106 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria an SDI can be considered as digital library specialised in geographic information resources. This work presents a tool which has been developed at the University of Zaragoza with two main objectives: a basic management and browsing of thesauri; and providing some mechanisms to enhance cross-discipline interoperability between different thesauri. Following section sketches the the overall architecture of this tool. Then, section three presents the basic capabilities of the tool, basically the management, and browsing of thesauri according to the ISO norms for monolingual and multilingual thesauri (ISO (1986) and ISO (1985) respectively). Section four explains the enhanced capabilities of this thesaurus tool. These enhanced capabilities are mainly oriented to the semantic disambiguation of thesauri against an upper-level ontology as WordNet (Miller 1990). Around this semantic desambiguation, the tool provides facilities to expand automatically thesaurus terms with new terms from other thesauri having an equivalent meaning. Finally, this work ends with a conclusions and future lines section. ARCHITECTURE The tool has been developed in Java and it is deployed with two levels of operation: a simplified version which stores the thesaurus structure (only BT, NT relationships) on an Access 2000 database; and a complete version with full functionality that stores thesauri in an Oracle 9i database. The application allows to access to different databases via JDBC. The complete version takes advantage of the Oracle Intermedia Text package (CTX_THES) capabilities. This package implements the ISO-2788 norm for monolingual thesauri and also provides language translation relationships.
3 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 107 Level 3. Application level ThesaurusMngmt Thesauri Application Level 2. Interface Thesaurus.gui Graphical Interface Level 1. Object Model Keywords Keywords Thesaurus.model Thesaurus Polysemy Polysemy Wordnet Wordnet Level 0. Database Metadata keywords Thesaurus Wordnet Fig. 1: Layered architecture of Thesaurus Tool Fig. 1 shows the architecture of the application. As it can be observed, a four-layer architecture has been used to separate gradually the final user application from the lowest level, the database layer. The database level contains WordNet lexical knowledge base, the thesauri stored in Oracle 9i, and a repository of metadata records (whose keywords may be expanded, see section Enhanced Capabilities ). The next level provides the object model that represents the information entities stored at the database level and the methods to access them. And in addition to this, this level also includes the classes for the extraction of polysemy, the semantic disambiguation of thesauri and the expansion of keywords due to the lowlevel services provided. The second level contains generic graphic interface components for the visualization of thesauri. And finally, the third level is the application level that provides the integrated tool for thesauri management, browsing, semantic disambiguation and keyword expansion functionality. BASIC CAPABILITIES This tool facilitates the edition of thesauri according to the ISO norms for monolingual and multilingual thesauri (ISO (1986) and ISO (1985) respectively). That is to say, it supports the definition of concepts and relationships between concepts including synonym terms (SYN,USE), broader terms (BT), narrower terms (NT), related terms (RT), preferred terms (PT)
4 108 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria and scope notes (SN) and language translations (TR). The visualization and browsing of thesaurus terms is possible with several graphical interface presentations. At present an alphabetical and a hierarchical presentation of thesauri is available. And to facilitate the discovery and visualization of terms, it is also possible to perform queries of "like style queries. Another remarkable feature of the tool is its multilingual access support. Apart from the typical internationalisation of the application, the thesauri are browsed (if exists translation) according to the default system language or the language selected by the user. Fig. 2 provides an overview of the graphical user interface of the application. There, it is shown how a user has displayed the UNESCO thesaurus (UNESCO 2003) and the GEMET thesaurus (GEMET 1999), selecting different presentations (alphabetical presentation for UNESCO and hierarchical for GEMET) and different languages (English for UNESCO and German for GEMET). Besides, the term BIOCHEMISTRY (or BIOCHEMIE in German) has been browsed in both thesauri, showing a different hierarchical path of terms in every case. Fig. 2: Overview of the graphical user interface
5 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 109 As concerns import and export capabilities, this tool facilitates the exchange of thesauri by means of text files. These files may use two formats to encode the thesaurus phrases that belong to each thesaurus branch, where a branch represents the hierarchical tree (BT/NT relationships) whose root is a term with no broader terms in the thesaurus. With first format (see Fig. 5), each line contains the succession of narrower terms (separated by dots) from the branch root to the new term. Additionally, other relationships (SYN, TR,...) are on the right side of the new term with a +<relationship>+<value>. Second format is similar to the previous one with the difference the branch terms are hierarchically numbered. The main limitation of these formats is that they are not standardized. Future improvements of this capability should be directed to the exchange of thesauri in interoperable formats like RDFS/XML. Fig. 3: Import/Export of thesauri Finally, another feature of the basic operation of this tool is the incorporation of a users control system. Thanks to this, it is possible to deploy different versions of this application that prevent users without the required permission from updating the edition of thesauri and other restricted operations. For instance, we have included this tool as an additional component
6 110 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria to browse thesauri for a metadata edition tool (see Zarazaga, Lacasta et al. 2003) and there the user only has privileges to visualize the thesaurus terms, not to modify the content of the thesaurus. ENHANCED CAPABILITIES As it has been mentioned in the introduction, the enhanced capabilities of this tool are based on the semantic disambiguation of thesauri against the concepts of an upper level ontology (representation of concepts and their relationships in a particular application domain). Following subsections detail these enhanced capabilities. Browsing of WordNet ontology First of all, this tool allows the visualization of WordNet ontology, as if it were another thesaurus created by the tool (see Fig. 4). WordNet can be considered as an ontology of general level, which is structured in a hierarchy of synsets, where synsets are defined as set of synonyms representing a particular concept. Fig. 4: Visualization of WordNet ontology This functionality is provided by the Java wordnet package depicted in the architecture. This package facilitates the access to the libraries able to browse lexical database (see to
7 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 111 download WordNet). As the software of these libraries is implemented in C language and our application has been developed in Java, we had to implement the crosswalk that access to WordNet native libraries via JNI (Java Native Interface) and returns the information of synsets in the same way as the information related with thesauri created by the tool. Search of polysemic senses in Wordnet Given that this tool provides access to WordNet, it also facilitates the possibility to find the senses of a term (single world or set of related words) in WordNet. This functionality is provided at low level by the package Polisemy, which was presented in the architecture. Given a term, this package looks up it in the WordNet database and extracts all the possible synsets. In case a term is a compound term (more than one word) and is not directly included in WordNet, the Polisemy component would extract all synsets corresponding to each word in the compound term. Furthermore, this component uses morphological techniques to reduce the number of not-found words and to search the senses of adjectives which are associated with a noun. For instance, given the adjective administrative, the component will look the synsets associated with administration. Fig. 5 displays the window that facilitates the extraction of WordNet synsets given a term or phrase. In this case, Fig. 5 shows the polisemic senses of term administration. Fig. 5: Extraction of polisemic senses of term administration
8 112 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria Semantic disambiguation of thesauri Perhaps, the most outstanding capability of this tool is the semantic disambiguation of thesauri in relation to WordNet concepts (or synsets). The tool applies an unsupervised disambiguation method that takes advantage of the thesaurus hierarchical structure (broader and narrower terms), which is used as the word context for a voting algorithm to find the closest sense. That is to say, given a thesaurus term with several synsets (or senses), we must determine what is the closest sense in relation with the rest of terms that conform a thesaurus branch (a branch represents the hierarchical tree of BT/NT terms whose root is a term with no broader terms). For that purpose, the method extracts the synsets of every term in the branch so as to confront each other and compute their semantic distance. For each ambiguous thesaurus term, the method will choose the synset having the highest degreee of similarity with respect to the synsets of other terms in the branch. More details about the semantic disambiguation method can be found in (Mata, Bañares et al. 2002). biosphere It is not found in WordNet 1.6. *** biosphere.anatomy alternative names for the body of a human being; "Leonardo studied the human body"; "he has a strong physique"; "the spirit is willing but the flesh is weak" the branch of morphology that deals with the structure of animals a detailed analysis; "he studied the anatomy of crimes" Wrong assignment. *** biosphere.anatomy.cell biology the basic structural and functional unit of all organisms; cells may exist as independent units of life as in monads or may form colonies or tissues as in higher plants and animals a device that delivers an electric current as the result of a chemical reaction small room is which a monk or nun lives a room where a prisoner is kept any small compartment; "the cells of a honeycomb" a small unit serving as the nucleus of a larger political movement *** all the plant and animal life of a particular region characteristic life processes and phenomena of living organisms: "the biology of viruses" the science that studies living organisms *** Fig. 6: Tracing the disambiguation process. Cursive text has been manually included 4. The disambiguation method can be applied whenever a new thesaurus is imported. The WordNet synsets that are finally associated with the thesaurus terms are stored as a TR relationship (using SYNSET as language). This 4 Biosphere is not a noun of the WordNet 1.6, however it is included in the WordNet 1.7.
9 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 113 TR relationship is used to indicate the translation of a term and in this case SYNSET language it is interpreted as the disambiguation language. Fig. 6 shows a piece of the debug file which traces the disambiguation process whenever it is applied. Such file stores the possible synsets associated with each thesaurus term and the weight which was assigned by the disambiguation method. Finally, once the disambiguation method is applied, the user will be able to visualize the synset that was finally assigned (see Fig. 7) to each term in the same way as other relationships. In fact, we could update manually the synset assigned to a thesaurus term. Fig. 7: Disambiguated senses are also shown by the thesaurus viewer Keywords expansion The final capability of this tool is the automatic expansion of a set of keywords. Given a set of keywords belonging to an initial set of thesauri, this tool will obtain a set of related terms belonging to a new thesaurus. This functionality is based on the semantic disambiguation of thesauri and is oriented to enhance classifications and description of resources. For instance, digital libraries can make profit of this method to expand the keywords stored in their metadata records and improve the performance of discovery services. On the other hand, although metadata creators usually select terms from well recognised thesauri, very different thesauri may have been used if metadata records describe resources from separate application domains.
10 114 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria Therefore, this method facilitates an homogenous classification of all metadata records. The expansion will generate terms in other thesauri, which share a similar sense with the ones selected by metadata creator. This way, a user query containing a term in a specific domain, do not need to be enhanced with synonyms or semantic equivalent terms in other domains since, reducing as well the response time of discovery services. Additionally, this automatic expansion method could also be applied to the user query. The method to expand the keyword section is based on a basic routine which estimates the probability to expand an original keyword section with a new term belonging to a new different thesaurus, not used in the original keyword section yet. This basic routine is composed of two main steps. First step is the collection of all the synsets corresponding to the terms, which were selected by metadata creator. As a result of this first step, we obtain an initial collection of synsets. Secondly, a comparison between the synsets of the new term and the initial collection of synsets is performed. This comparison consists of the computation of a reliability percentage for the new term, which is calculated as the number of synset coincidences divided by the number of synsets of the new term and multiplied by 99. The reason to use a final factor of 99 and not 100 is to obtain a maximum reliability percentage of 99 for automatically expanded terms, reserving uniquely a 100-reliability percentage for the terms which were originally selected by metadata creators. If this reliability percentage is greater than a threshold reliability percentage, which was defined previously by the user who performed the expansion, this new term is added in the keyword section.
11 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 115 Fig. 8: Keywords expansion window Fig. 8 displays the window that facilitates the expansion of keywords. As an example of this capability, the expansion of terms appearing in Tab. 1 (also introduced in Fig. 8) will be shown. These terms could correspond to the manual classification of a resource, which is included within the keywords section of a metadata record. The terms were selected from CEO- DISPLINE (a controlled list of 30 terms proposed to identify disciplines) and CEOPARAMETER (a controlled list of 1037 terms proposed to identify the types of features contained in a geospatial data resource) thesauri that are defined in (CEO 1999). Tab. 1: Manually introduced classifications Thesaurus Original term Reliabilit y CEOPARAMETER earth science atmosphere 100 CEODISCIPLINE weather & climate 100 Tab. 2 shows the results of the expansion method for the input terms is in Tab. 1, all of them having a reliability value over 49. Summing up, 11 new terms were found, which belonged to three better structured thesauri: GE- MET (the General General European Multilingual Thesaurus (GEMET 1999), the Alexandria Digital Library Feature Type Thesaurus (ADL 1999, referenced as ADL-FTT in Tab. 2), and a list thematic keywords proposed by the "NASA Master Directory project (NASA 1996). Tab. 2: Terms automatically expanded
12 116 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria Thesaurus Expanded term Reliabilit y GEMET atmosphere climate climate weather climate weather weather condition climatic issue climatic issue weather climatic issue.weather weather condition ADL-FTT regions climatic regions 49.5 NASA atmospheric science atmospheric science atmospheric pressure atmospheric science atmospheric temperature CONCLUSIONS AND FUTURE LINES This paper has presented a flexible tool to manage thesauri and providing enhanced functionality for improvement of classifications. In fact, this tool has been incorporated as an additional component of a metadata edition tool (Zarazaga-Soria, Lacasta et al. 2003) that enables metadata creators to select the appropriate term for the distinct metadata fields. Future lines of this work will be directed to the creation of a Thesaurus Web Service providing some of the functionality offered by this tool. This Web service will reuse the software components presented in this paper in order to offer on-line thesaurus browsing, WordNet polysemy extraction or keywords expansion. As a last remark, it should be mentioned that we are also using the semantic disambiguation of thesauri to test different information retrieval strategies for geographic data catalogs. These geographic catalogs manage metadata records compliant with some metadata scheme that includes a keywords section or subject metadata element. Besides, it is assumed that the terms used to fill the content of this keyword section are selected from disambiguated thesauri. Therefore, it is possible to index metadata records according to a unified system, which is precisely the synsets associated with the disambiguated thesaurus terms. And finally, thanks to this homogenous indexing of metadata records, classical information retrieval algorithms can be applied to compute the similarity between the metadata records and the concepts expressed by the user queries.
13 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 117 ACKNOWLEDGEMENTS The basic technology of this work has been partially supported by the Spanish Ministry of Science and Technology through the project TIC C03-01 from the National Plan for Scientific Research, Development and Technology Innovation, and by the project P089/2001 from the Aragón Government. Javier Lacasta s work has been partially supported by a grant from the Aragón Government and the European Social Fund. REFERENCES ADL (2003): Homepage of the Alexandria Digital Library Project: (last accessed, May 2003). CEO (1999): Recommendations on Metadata. Describing the data, services and information you have available! (version 2.0). A User Guide provided by the Center for Earth Obser-vation Programme (CEO programme) of the European Commission. February GEMET (1999): General European Multilingual Thesaurus. Version 2.0. ETC/CDS (last accessed, May 2003). ISO (1985): ISO Documentation: Guidelines for the establishment and development of multilingual thesauri. ISO (1986): ISO Documentation: Guidelines for the establishment and development of monolingual thesauri. Mata, E., Bañares, J.A., Gutierrez, J., Muro-Medrano, P.R., Rubio, J. (2002): Semantic Disambiguation of Thesaurus as a Mechanism to Facilitate Multilingual and Thematic Interoperability of Geographical Information Catalogues. Proc. of the 5th AGILE Conference: Palma de Mallorca (Spain), April Miller, G.A. (1990): WordNet: An on-line lexical database. International Journal of Lexicography, 3(4) (Special Issue). NASA (1996): Draft geospatial thematic keywords from the NASA Master Directory in short and long format for CSDGM of FGDC. Version available from (last accessed, May 2003). UNESCO (2003): UNESCO Thesaurus. United Nations Educational, Scientific and Cultural Organization. Available from (last accessed, May 2003).
14 118 J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria Zarazaga-Soria, F.J., Lacasta, J., Nogueras-Iso, J., Torres, M.P., Muro- Medrano, P.R. (2003): A Java Tool for Creating ISO/FGDC Geographic Metadata. Proceedings of GI-days Münster (Germany), June 2003.
15
16
17
A Java Tool for Creating ISO/FGDC Geographic Metadata
F.J. Zarazaga-Soria, J. Lacasta, J. Nogueras-Iso, M. Pilar Torres, P.R. Muro-Medrano17 A Java Tool for Creating ISO/FGDC Geographic Metadata F. Javier Zarazaga-Soria, Javier Lacasta, Javier Nogueras-Iso,
D 2.2.3 EUOSME: European Open Source Metadata Editor (revised 2010-12-20)
Project start date: 01 May 2009 Acronym: EuroGEOSS Project title: EuroGEOSS, a European Theme: FP7-ENV-2008-1: Environment (including climate change) Theme title: ENV.2008.4.1.1.1: European Environment
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web Corey A Harper University of Oregon Libraries Tel: +1 541 346 1854 Fax:+1 541 346 3485 [email protected]
A GENERALIZED APPROACH TO CONTENT CREATION USING KNOWLEDGE BASE SYSTEMS
A GENERALIZED APPROACH TO CONTENT CREATION USING KNOWLEDGE BASE SYSTEMS By K S Chudamani and H C Nagarathna JRD Tata Memorial Library IISc, Bangalore-12 ABSTRACT: Library and information Institutions and
ThManager: An Open Source Tool for creating and visualizing SKOS
ThManager: An Open Source Tool for creating and visualizing SKOS J. Lacasta, J. Nogueras-Iso, F.J. López-Pellicer, P.R. Muro-Medrano, F. J. Zarazaga-Soria Computer Science and Systems Engineering Department,
Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company
Semantic SharePoint Technical Briefing Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company What is Semantic SP? a joint venture between iquest and Semantic Web Company, initiated in
CatMDEdit Metadata editor
Advanced Information Systems Laboratory GeoSpatiumLab S.L. CatMDEdit Metadata editor University of Zaragoza Computer Science and Systems Engineering Department Advanced Information Systems Laboratory (IA3)
T HE I NFORMATION A RCHITECTURE G LOSSARY
T HE I NFORMATION A RCHITECTURE G LOSSARY B Y K AT H AGEDORN, ARGUS A SSOCIATES M ARCH 2000 I NTRODUCTION This glossary is intended to foster development of a shared vocabulary within the new and rapidly
How To Write An Inspire Directive
INSPIRE Infrastructure for Spatial Information in Europe Detailed definitions on the INSPIRE Network Services Title Detailed definitions on the INSPIRE Network Services Creator Date 2005-07-22 Subject
A terminology model approach for defining and managing statistical metadata
A terminology model approach for defining and managing statistical metadata Comments to : R. Karge (49) 30-6576 2791 mail [email protected] Content 1 Introduction... 4 2 Knowledge presentation...
Application of ontologies for the integration of network monitoring platforms
Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
Weblogs Content Classification Tools: performance evaluation
Weblogs Content Classification Tools: performance evaluation Jesús Tramullas a,, and Piedad Garrido a a Universidad de Zaragoza, Dept. of Library and Information Science. Pedro Cerbuna 12, 50009 [email protected]
Product Navigator User Guide
Product Navigator User Guide Table of Contents Contents About the Product Navigator... 1 Browser support and settings... 2 Searching in detail... 3 Simple Search... 3 Extended Search... 4 Browse By Theme...
EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER
EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER Subashini Panchanathan Malaysian Centre For Geospatial Data Infrastructure ( MaCGDI ) Ministry of National Resources and Environment
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata Ing. Emanuele Tajariol, GeoSolutions Ing. Simone Giannecchini, GeoSolutions GeoSolutions GeoSolutions GeoNetwork
ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca
USING ONTOLOGIES FOR GEOGRAPHIC INFORMATION INTEGRATION Frederico Torres Fonseca The Pennsylvania State University, USA Keywords: ontologies, GIS, geographic information integration, interoperability Contents
Multilingual and Localization Support for Ontologies
Multilingual and Localization Support for Ontologies Mauricio Espinoza, Asunción Gómez-Pérez and Elena Montiel-Ponsoda UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain {jespinoza,
Flattening Enterprise Knowledge
Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it
Catalogue or Register? A Comparison of Standards for Managing Geospatial Metadata
Catalogue or Register? A Comparison of Standards for Managing Geospatial Metadata Gerhard JOOS and Lydia GIETLER Abstract Publication of information items of any kind for discovery purposes is getting
Selecting a Taxonomy Management Tool. Wendi Pohs InfoClear Consulting #SLATaxo
Selecting a Taxonomy Management Tool Wendi Pohs InfoClear Consulting #SLATaxo InfoClear Consulting What do we do? Content Analytics Strategy and Implementation, including: Taxonomy/Ontology development
Andrea Buffam, Natural Resources Canada Canadian Metadata Forum National Library of Canada Ottawa, Ontario September 19 20, 2003
Geospatial Metadata Andrea Buffam, Natural Resources Canada Canadian Metadata Forum National Library of Canada Ottawa, Ontario September 19 20, 2003 The Presentation - Geospatial Metadata This presentation
Enabling Semantic Search in Geospatial Metadata Catalogue to Support Polar Sciences
Enabling Semantic Search in Geospatial Metadata Catalogue to Support Polar Sciences Wenwen Li 1 and Vidit Bhatia 2 1 GeoDa Center for Geospatial Analysis and Computation School of Geographical Sciences
Ontology-Based Query Expansion Widget for Information Retrieval
Ontology-Based Query Expansion Widget for Information Retrieval Jouni Tuominen, Tomi Kauppinen, Kim Viljanen, and Eero Hyvönen Semantic Computing Research Group (SeCo) Helsinki University of Technology
Introduction to Geospatial Web Services
Geospatial Web Services Introduction to Geospatial Web Services An introduction and inventory of geospatial web services and their importance to interoperability in the geospatial domain. vers. 20100604
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Portal Version 1 - User Manual
Portal Version 1 - User Manual V1.0 March 2016 Portal Version 1 User Manual V1.0 07. March 2016 Table of Contents 1 Introduction... 4 1.1 Purpose of the Document... 4 1.2 Reference Documents... 4 1.3 Terminology...
HydroDesktop Overview
HydroDesktop Overview 1. Initial Objectives HydroDesktop (formerly referred to as HIS Desktop) is a new component of the HIS project intended to address the problem of how to obtain, organize and manage
ISO 19119 and OGC Service Architecture
George PERCIVALL, USA Keywords: Geographic Information, Standards, Architecture, Services. ABSTRACT ISO 19119, "Geographic Information - Services," has been developed jointly with the Services Architecture
Information and documentation The Dublin Core metadata element set
ISO TC 46/SC 4 N515 Date: 2003-02-26 ISO 15836:2003(E) ISO TC 46/SC 4 Secretariat: ANSI Information and documentation The Dublin Core metadata element set Information et documentation Éléments fondamentaux
Distributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
Ontology for Home Energy Management Domain
Ontology for Home Energy Management Domain Nazaraf Shah 1,, Kuo-Ming Chao 1, 1 Faculty of Engineering and Computing Coventry University, Coventry, UK {nazaraf.shah, k.chao}@coventry.ac.uk Abstract. This
The Value of Taxonomy Management Research Results
Taxonomy Strategies November 28, 2012 Copyright 2012 Taxonomy Strategies. All rights reserved. The Value of Taxonomy Management Research Results Joseph A Busch, Principal What does taxonomy do for search?
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation.
CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. Miguel Ruiz, Anne Diekema, Páraic Sheridan MNIS-TextWise Labs Dey Centennial Plaza 401 South Salina Street Syracuse, NY 13202 Abstract:
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task Jacinto Mata, Mariano Crespo, Manuel J. Maña Dpto. de Tecnologías de la Información. Universidad de Huelva Ctra. Huelva - Palos de la Frontera s/n.
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute
I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION
Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University
A Workbench for Prototyping XML Data Exchange (extended abstract)
A Workbench for Prototyping XML Data Exchange (extended abstract) Renzo Orsini and Augusto Celentano Università Ca Foscari di Venezia, Dipartimento di Informatica via Torino 155, 30172 Mestre (VE), Italy
A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS
A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS Ionela MANIU Lucian Blaga University Sibiu, Romania Faculty of Sciences [email protected] George MANIU Spiru Haret University Bucharest, Romania Faculty
WordNet Website Development And Deployment Using Content Management Approach
WordNet Website Development And Deployment Using Content Management Approach N eha R P rabhugaonkar 1 Apur va S N ag venkar 1 Venkatesh P P rabhu 2 Ramdas N Karmali 1 (1) GOA UNIVERSITY, Taleigao - Goa
European Forest Information and Communication Platform
1 Metadata Model for the European Forest Information and Communication Platform D. Tilsner 1, C. Figueiredo 1, H. Silva 2, B. Chartier 3, J. San-Miguel 4, A. Camia 4, M. Millot 4 1 EDISOFT, S.A., Lisbon,
Selbo 2 an Environment for Creating Electronic Content in Software Engineering
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 9, No 3 Sofia 2009 Selbo 2 an Environment for Creating Electronic Content in Software Engineering Damyan Mitev 1, Stanimir
Taxonomy Development Business Classification Schemes Charmaine M. Brooks, CRM
Taxonomy Development Business Classification Schemes Taxonomy Thesaurus - Classification Taxonomy is the practice and science of taxonomic classification. Thesaurus more commonly means a listing of words
Integrating catalog and GIS tools: access to resources from CatMDEdit thanks to gvsig
Integrating catalog and GIS tools: access to resources from CatMDEdit thanks to gvsig Javier Nogueras Iso 1, Jesús Barrera 1, Fernando Gracia Crespo 2, Silvia Laiglesia 2, Pedro R. Muro Medrano 1 1 Computer
Development of an Ontology for the Document Management Systems for Construction
Development of an Ontology for the Document Management Systems for Construction Alba Fuertes a,1, Núria Forcada a, Miquel Casals a, Marta Gangolells a and Xavier Roca a a Construction Engineering Department.
CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES
CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES M. Laka-Iñurrategi a, I. Alberdi a, K. Alonso b, M. Quartulli a a Vicomteh-IK4, Mikeletegi pasealekua 57,
Semantic Jira - Semantic Expert Finder in the Bug Tracking Tool Jira
Semantic Jira - Semantic Expert Finder in the Bug Tracking Tool Jira Velten Heyn and Adrian Paschke Corporate Semantic Web, Institute of Computer Science, Koenigin-Luise-Str. 24, 14195 Berlin, Germany
Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery
Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery Dimitrios Kourtesis, Iraklis Paraskakis SEERC South East European Research Centre, Greece Research centre of the University
Turning Emergency Plans into Executable
Turning Emergency Plans into Executable Artifacts José H. Canós-Cerdá, Juan Sánchez-Díaz, Vicent Orts, Mª Carmen Penadés ISSI-DSIC Universitat Politècnica de València, Spain {jhcanos jsanchez mpenades}@dsic.upv.es
Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval
Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information
Cloud Storage Standards Overview and Research Ideas Brainstorm
Cloud Storage Standards Overview and Research Ideas Brainstorm Mark Carlson, SNIA TC and Sun Chair, SNIA Cloud Storage TWG CMU SDI Lecture 12 th November 2009 Abstract! Cloud Storage is a new business
An Integrated Approach to Learning Object Sequencing
An Integrated Approach to Learning Object Sequencing Battur Tugsgerel 1, Rachid Anane 2 and Georgios Theodoropoulos 1 1 School of Computer Science, University of Birmingham, UK. {b.tugsgerel, g.k.theodoropoulos}@cs.bham.ac.uk
An Efficient Database Design for IndoWordNet Development Using Hybrid Approach
An Efficient Database Design for IndoWordNet Development Using Hybrid Approach Venkatesh P rabhu 2 Shilpa Desai 1 Hanumant Redkar 1 N eha P rabhugaonkar 1 Apur va N agvenkar 1 Ramdas Karmali 1 (1) GOA
Concept for an Ontology Based Web GIS Information System for HiMAT
Concept for an Ontology Based Web GIS Information System for HiMAT Gerald Hiebel Klaus Hanke University of Innsbruck Surveying and Geoinformation Unit {gerald.hiebel; klaus.hanke}@uibk.ac.at Abstract The
Building a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
Semaphore Overview. A Smartlogic White Paper. Executive Summary
Semaphore Overview A Smartlogic White Paper Executive Summary Enterprises no longer face an acute information access challenge. This is mainly because the information search market has matured immensely
Semantic Web based e-learning System for Sports Domain
Semantic Web based e-learning System for Sports Domain S.Muthu lakshmi Research Scholar Dept.of Information Science & Technology Anna University, Chennai G.V.Uma Professor & Research Supervisor Dept.of
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world.
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world. Pierre Venuat, University of Poitiers Mentors: Brian Schlining and Nancy Jacobsen Stout
Taxonomy learning factoring the structure of a taxonomy into a semantic classification decision
Taxonomy learning factoring the structure of a taxonomy into a semantic classification decision Viktor PEKAR Bashkir State University Ufa, Russia, 450000 [email protected] Steffen STAAB Institute AIFB,
Taxonomies for Auto-Tagging Unstructured Content. Heather Hedden Hedden Information Management Text Analytics World, Boston, MA October 1, 2013
Taxonomies for Auto-Tagging Unstructured Content Heather Hedden Hedden Information Management Text Analytics World, Boston, MA October 1, 2013 About Heather Hedden Independent taxonomy consultant, Hedden
Final Report - HydrometDB Belize s Climatic Database Management System. Executive Summary
Executive Summary Belize s HydrometDB is a Climatic Database Management System (CDMS) that allows easy integration of multiple sources of automatic and manual stations, data quality control procedures,
METADATA STANDARDS AND METADATA REGISTRIES: AN OVERVIEW
METADATA STANDARDS AND METADATA REGISTRIES: AN OVERVIEW Bruce E. Bargmeyer, Environmental Protection Agency, and Daniel W. Gillman, Bureau of Labor Statistics Daniel W. Gillman, Bureau of Labor Statistics,
The University of Jordan
The University of Jordan Master in Web Intelligence Non Thesis Department of Business Information Technology King Abdullah II School for Information Technology The University of Jordan 1 STUDY PLAN MASTER'S
GeoNetwork User Manual
GeoNetwork User Manual Release 2.10.4-0 GeoNetwork July 06, 2015 Contents 1 Preface 3 1.1 About this Project..................................... 3 1.2 License Information....................................
The ORIENTGATE data platform
Seminar on Proposed and Revised set of indicators June 4-5, 2014 - Belgrade (Serbia) The ORIENTGATE data platform WP2, Action 2.4 Alessandra Nuzzo, Sandro Fiore, Giovanni Aloisio Scientific Computing and
Software Architecture Document
Software Architecture Document Project Management Cell 1.0 1 of 16 Abstract: This is a software architecture document for Project Management(PM ) cell. It identifies and explains important architectural
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo courtesy Andrew Mahoney NSF Vision What is AON? a
HELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001
A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS
Information Services for Smart Grids
Smart Grid and Renewable Energy, 2009, 8 12 Published Online September 2009 (http://www.scirp.org/journal/sgre/). ABSTRACT Interconnected and integrated electrical power systems, by their very dynamic
GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington
GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise
An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials
ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity
Databases and computerized information retrieval
**** 1 Databases and computerized information retrieval Introduction **** What is a database? 2 A database is a collection of similar data records stored in a common file (or collection of files). ****
SOLUTION BRIEF CA ERwin Modeling. How can I understand, manage and govern complex data assets and improve business agility?
SOLUTION BRIEF CA ERwin Modeling How can I understand, manage and govern complex data assets and improve business agility? SOLUTION BRIEF CA DATABASE MANAGEMENT FOR DB2 FOR z/os DRAFT CA ERwin Modeling
Building a SDI for small countries the Portuguese example
Building a SDI for small countries the Portuguese example Rui Pedro Julião Instituto Geográfico Português Deputy Director-General [email protected] Abstract Portugal was one of the SDI pioneers in the beginning
This document is no longer current. Please go to the following URL for more information:
This document is no longer current. Please go to the following URL for more information: http://www.nationalarchives.gov.uk/electronicrecords/reqs2002/default.htm Requirements for Electronic Records Management
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata Ing. Simone Giannecchini, GeoSolutions Ing. Emanuele Tajariol, GeoSolutions Outline GeoNetwork Introduction
