A GENERALIZED APPROACH TO CONTENT CREATION USING KNOWLEDGE BASE SYSTEMS



Similar documents
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web

Advanced Solutions of Microsoft SharePoint Server 2013

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

A Recommendation Framework Based on the Analytic Network Process and its Application in the Semantic Technology Domain

EndNote Beyond the Basics

Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint

JMulTi/JStatCom - A Data Analysis Toolkit for End-users and Developers

Master of Science in Healthcare Informatics and Analytics Program Overview

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

Adlib Library. Software for the professional management of collections in libraries and information centres. Comprehensive, Flexible, User-friendly

CatDV Pro Workgroup Serve r

Databases in Organizations

A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches

Business Insight Report Authoring Getting Started Guide

Microsoft SharePoint 2010 Site Collection and Site Administration Course 50547A; 5 Days, Instructor-led

US Patent and Trademark Office Department of Commerce

Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries

Chapter 1. Dr. Chris Irwin Davis Phone: (972) Office: ECSS CS-4337 Organization of Programming Languages

An Ontology-based e-learning System for Network Security

Reusable Knowledge-based Components for Building Software. Applications: A Knowledge Modelling Approach

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

Weblogs Content Classification Tools: performance evaluation

Semantic Search in Portals using Ontologies

Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology

TOWARD A THESAURUS OF RADIOCARBON DATING AND RELATED TERMS. DILETTE POLACH Radiocarbon Dating Research, PO Box 43, Garran, ACT, 2605, Australia

Adobe Flash Catalyst CS5.5

Integrated Library Systems (ILS) Glossary

The Prolog Interface to the Unstructured Information Management Architecture

Structured Content: the Key to Agile. Web Experience Management. Introduction

Enterprise Application Development in SharePoint 2010

Contents. Launching FrontPage Working with the FrontPage Interface... 3 View Options... 4 The Folders List... 5 The Page View Frame...

INSPIRE Dashboard. Technical scenario

Service Overview. KANA Express. Introduction. Good experiences. On brand. On budget.

ProClarity Analyst Training

Fast and Easy Delivery of Data Mining Insights to Reporting Systems

A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS

The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform

Secure Semantic Web Service Using SAML

How To Build A Connector On A Website (For A Nonprogrammer)

LinksTo A Web2.0 System that Utilises Linked Data Principles to Link Related Resources Together

NeMO - NeDiMAH Methods Ontology. Use Case manual: How to report your case using throughout the Excel template

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

How To Use Noetix

Language Evaluation Criteria. Evaluation Criteria: Readability. Evaluation Criteria: Writability. ICOM 4036 Programming Languages

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives

Programmabilty. Programmability in Microsoft Dynamics AX Microsoft Dynamics AX White Paper

SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package Data Federation Administration Tool Guide

Administrator's Guide

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

Business Intelligence Using SharePoint 2013 and Office365

OpenText Information Hub (ihub) 3.1 and 3.1.1

Dynamic Decision-Making Web Services Using SAS Stored Processes and SAS Business Rules Manager

Frequently Asked Questions Sage Pastel Intelligence Reporting

2 AIMS: an Agent-based Intelligent Tool for Informational Support

An Ontology Model for Organizing Information Resources Sharing on Personal Web

Flattening Enterprise Knowledge

Annotea and Semantic Web Supported Collaboration

Semantic Technology Accelerates Document Search: How LMI Implements Semantic Search with OpenPolicy

Using Microsoft Business Intelligence Dashboards and Reports in the Federal Government

MatchPoint Technical Features Tutorial Colygon AG Version 1.0

Semaphore Overview. A Smartlogic White Paper. Executive Summary

The Advantages of Using NCL 2.3

The Intelligence Engine.

Course Code NCS2013: SharePoint 2013 No-code Solutions for Office 365 and On-premises

BUILDING WEB JOURNAL DIRECTORY AND ITS ARTICLES WITH DRUPAL

Studio. Rapid Single-Source Content Development. Author XYLEME STUDIO DATA SHEET

An Ontology in Project Management Knowledge Domain

Web Design Competition College of Computing Science, Department of Information Systems. New Jersey Institute of Technology

Semantic Web Services for e-learning: Engineering and Technology Domain

300 Intelligence Reporting. Sage Intelligence Reporting Customer Frequently asked questions

Big Data Visualization with JReport

CONCEPTCLASSIFIER FOR SHAREPOINT

SMART CRM Desk for Service Sector. Solution for Customer Relationship Mgmt (CRM) in Service Industry

Leverage SharePoint with PSI:Capture

FreeForm Designer. Phone: Fax: POB 8792, Natanya, Israel Document2

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca

Deciding When to Deploy Microsoft Windows SharePoint Services and Microsoft Office SharePoint Portal Server White Paper

ifinder ENTERPRISE SEARCH

An Ontology Based Text Analytics on Social Media

Analytics Canvas Tutorial: Cleaning Website Referral Traffic Data. N m o d a l S o l u t i o n s I n c. A l l R i g h t s R e s e r v e d

Metadata Hierarchy in Integrated Geoscientific Database for Regional Mineral Prospecting

PRODUCTIVITY PACK FOR PIVOTAL CRM

WHAT'S NEW IN SHAREPOINT 2013 WEB CONTENT MANAGEMENT

Workshop on Good Practice of Documentation Rome,ISS

ONTOLOGY-BASED MULTIMEDIA AUTHORING AND INTERFACING TOOLS 3 rd Hellenic Conference on Artificial Intelligence, Samos, Greece, 5-8 May 2004

Reporting Services. White Paper. Published: August 2007 Updated: July 2008

ELPUB Digital Library v2.0. Application of semantic web technologies

Simplify survey research with IBM SPSS Data Collection Data Entry

A guide to the lifeblood of DAM:

HELP DESK SYSTEMS. Using CaseBased Reasoning

Design and Development of Ontology for Risk Management in Software Project Management

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

MS 50547B Microsoft SharePoint 2010 Collection and Site Administration

PRODUCTIVITY PACK FOR PIVOTAL CRM

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Rotorcraft Health Management System (RHMS)

Intelligent Manage for the Operating System Services

FINDservice program will 'CREATE' your customer

Qlik Sense Enterprise

Transcription:

A GENERALIZED APPROACH TO CONTENT CREATION USING KNOWLEDGE BASE SYSTEMS By K S Chudamani and H C Nagarathna JRD Tata Memorial Library IISc, Bangalore-12 ABSTRACT: Library and information Institutions and centers all over the world are attempting to convert their holding into electronic storage. Collections of library document are accessed by users for their analytical and interpretational activities in research, academic consultation occupations and developmental work. Quickly finding an article in the Microsoft Knowledge Base through search by using content keywords and query words is an essential requirement of the present era. Using the keywords and query words that are listed in one article may help find other articles that have similar content. However, some query words are only used for older content and may not help you find the most current information.protégé is a free, open source ontology editor and knowledgebase framework.the Protégé platform supports two main ways of modeling ontology s via the Protégé-Frames and Protégé-OWL editors. Protégé ontologies can be exported into a variety of formats. It works on the principles of using classes, slots, forms, instances, queries. Classes can have subclasses. Slots are the templates for each class through which data can be entered. Forms set the indexed instance set up. Instances are the data entry display forms using the defined slots. Queries are the search mechanisms through the metadata. Content creation is the ultimate unit of information. It has tremendous use in day-to-day life.it has significant role in decision making. The need of content management system started appearing effectively and efficiently since time immemorial. For creating an knowledge based system, the system designer has to select 4 basic elements. They are: Cards (Records), Thesaurus, Class no, Indexing. These components of the system are connected to the scope and application of the subject matter to which the content belongs to. The components are data about the content assertions. In this way it is easier to store, organize, process and retrieve the information. The Content Knowledge Base guided search allows users to find precise content technical information, Developer Centered articles, and information service. The institutional guided search functionality lets users fine-tune their queries based on established categories. INTRODUCTION Library and information Institutions and centers all over the world are attempting to convert their holding into electronic storage. Collections of library document are accessed by users for their analytical and interpretational activities in research, academic, consultation occupations and developmental work. One has to go through contents first and then accessing text on computer or manual system through linking of underlined words. Content creation from institution through simple to sophisticated indexing system plays a crucial role. Its bring together users of the emerging standards in this arena. Quickly finding an article in the Microsoft Knowledge Base through search by using content keywords and query words is an essential requirement of the present era. Using the

keywords and query words that are listed in one article may help find other articles that have similar content. However, some query words are only used for older content and may not help you find the most current information. This paper explores the use of protege2000 as a tool for content creation. KNOWLEDGE-BASED SYSTEMS Simple knowledge base system identifiers are communities of practice for developing standard vocabularies for specific local needs. User s access the contents of knowledge, the patterns they contain, textures and related layout and position information which allows users to query the system based on perceptual features. The lack of standardized access and interchange formats impedes wider use of knowledge base systems resources. We have developed a demonstrator of Protege2000 ( http://protégé. Stanford.edu) for the precise project that explored thesaurus-based query expansion in protégé. Protégé is a free, open source ontology editor and knowledge-base framework. The Protégé platform supports two main ways of modeling ontology s via the Protégé- Frames and Protégé-OWL editors. Protégé ontologies can be exported into a variety of formats. Protégé is based on Java, is extensible, and provides a plug-and-play environment that makes it a flexible base for rapid prototyping and application development. (more).protégé is supported by a strong community of developers and academic, government and corporate users, who are using Protégé for knowledge solutions in areas as diverse as biomedicine, intelligence gathering, and corporate modeling. Protégé's flexible architecture makes it easy to configure and extend the tool. Protégé has an open-source Java API for the development of custom-tailored user interface components or arbitrary Semantic Web services. PROTÉGÉ 2000 AS A TOOL FOR CONTENT CREATION Protégé 2000 is a knowledge based system software. It works on the principles of using classes, slots, forms, instances, queries. Classes can have subclasses. Slots are the templates for each class through which data can be entered. Forms set the indexed instance set up. Instances are the data entry display forms using the defined slots. Queries are the search mechanisms based on the metadata. For creating an knowledge based system, the system designer has to select 4 basic elements They are: 1. Cards (Records) 2. Thesaurus 3. Class no 4. Indexing

These components of the system are connected to the scope and application of the subject matter to which the content belongs to. The components are data about the content assertions. In this way it is easier to store, organize, process and retrieve the information. The content is managed with the system that consists of database record file containing content components. Knowledge of context, not only makes information more meaningful, but it helps different individuals to use the same information at different times. Creating knowledge base system requires cards, thesaurus, Class no, indexing rules which are to be developed through an expert or artificial intelligent system to create concept based subject indexing strings of keywords covering semantics and syntax aspects. This knowledge based system that acts as a front end. Expert systems provide techniques to perform design making tasks based on a programmed set of rules and logic with in specific subject areas.

Content creation is the ultimate unit of information. It has tremendous use in day-to-day life. It has significant role in decision making. The need of content management system started appearing effectively and efficiently since time immemorial. Quality wise some content is conformed which is hence reliable, and the rest unconfirmed and therefore not much useful. It is available in several forms or types necessitating a content management system that reflect the changes that take place in the content. The system created at JRD Tata Memorial Library, IISc has three components, Namely, 1. Document description 2. vocabulary 3. thesaurus. Application, as an initial experiment with the protégé, a rich client browser, displaying detail for thesaurus concepts, Controlled vocabularies, Document description, thesaurus have their own usefulness. Organizations continue to require efficient, effective access to controlled content for creating consistent data through metadata for their collections. All these are described in the following paragraphs. Fig.1 is an description of the metadata used to store document description. Only six important metadata elements have been chosen to represent the documents. Each metadata is repeatable. FIG 1

The metadata elements are Title Thesterms Series Authors Class No Keywods Many controlled vocabularies are available for representing the content of documents and other resources.as applicable for each term retrieved, the above data is displayed in the research pane as expandable. The controlled vocabulary used here is the Library of Congress Subject Headings The classification scheme used can be any scheme and multiple assignments can be made from the same scheme. In this context, for the project undertaken here which is a generalized approach to content creation multiple class numbers are given from DDC. Similarly thesaurus terms are also given from Library of Congress subject headings. This field is also repeatable. Also, the Instances can be viewed from a selected data element of the metadata. Fig 2 is an display of the data entry form. The data entered are multiple. The screen opens out to accommodate multiple entries in the box. Data can be both numeric or text. FIG 2

The protégé 2000 terms list is a vocabulary with some broader and narrower term relationships. We added more hierarchical structure to this list with concept form the documents to develop the metadata thesaurus. In the example above, the concept scheme moved from a term list to a thesaurus. Fig 3 is a display of the screen from the thesaurus maintenance system of the software. Its elements are the usual thesaurus metadata like lead in term, RT terms, BT terms, NT terms, etc. FIG 3 FIG 4 is a query module.. It has the ability to retrieve from any field and also any combination of fields. Each cell has the capacity to select the metadata and combine metadata for searching. It searches the given term anywhere in the specified fields.

FIG 4 Once the query is searched the result will be displayed as a record. By clicking on the record the full data in the metadata fields is displayed. This is presented in FIG 5 FIG 5

To enhance search capability a vocabulary is also created. This presents to the user the mapping of a common vocabulary through the thesaurus and also its class number. This is helpful as an entry point for the vocabulary to the user who is not aware of the system terminology. Fig 6 presents the data structure for this. It has the following elements: Lead-in terms Classnoin Ddc Expansionsifany Mappedtothesaurus FIG 6 This approach can be followed even in Greenstone digital library software and it can be web enabled. This approach will serve information searching with a more powerful and user friendly system. However, expanding the search requires interlinking of the databases appropriately.

Conclusion The Content Knowledge Base guided search allows users to find precise content i.e. that is keyword or technical information Developer Centered articles, and information service. The institutional guided search functionality lets users fine-tune their queries based on established categories. Each search that a user performs automatically provides structured feedback to database administrator about the technical quality of the knowledge base content. This allows us to contribute new technical information continually to make the knowledge base and the guided search more effective. The Content Knowledge Base guided search is a self-service user information search tool that allows searchers to access information quickly and efficiently from contents for precise information, Expert systems Develop content of articles books, and Knowledge base Management service, notes on policies and procedures. Each search that a user performs automatically provides structured feedback to information about the technical quality of the knowledge base. This allows us to contribute new technical information continually to make the knowledge base more effective.. Bibliography: 1.Information and knowledge management : by D Kamalavijayan, Macmillan: 2005 2. http://protege.stanford.edu/plugins/owl/ 3. http://protégé. Stanford.edu 4. Information management in e-libraries. Ed by S.Parthan and V.K.J.Jeevan.: Allied pub. 2002