K@ A collaborative platform for knowledge management



Similar documents
LKMS/Mnemosyne: Semantic Web technologies and Markup techniques for a Legal Knowledge Management System.

The Open Source CMS. Open Source Java & XML

Content Management Systems: Drupal Vs Jahia

Tool-Assisted Knowledge to HL7 v3 Message Translation (TAMMP) Installation Guide December 23, 2009

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform

aloe-project.de White Paper ALOE White Paper - Martin Memmel

Software Architecture Document

Data Management for Biobanks

White Paper Converting Lotus Notes Applications to the Cloud Using the CIMtrek converter Product

Quick start. A project with SpagoBI 3.x

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

MANDARAX + ORYX An Open-Source Rule Platform

Software Architecture Document

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

Category: Business Process and Integration Solution for Small Business and the Enterprise

Front-End Performance Testing and Optimization

EUR-Lex 2012 Data Extraction using Web Services

GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata

Graph Database Performance: An Oracle Perspective

IBM Rational Asset Manager

Internet Engineering: Web Application Architecture. Ali Kamandi Sharif University of Technology Fall 2007

Oracle Identity Analytics Architecture. An Oracle White Paper July 2010

The Electronic Document Management Application (EDM)

Comparison of Triple Stores

Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology

A Java proxy for MS SQL Server Reporting Services

MEGA Web Application Architecture Overview MEGA 2009 SP4

Content Management Systems: Drupal Vs Jahia

Prognoz Payment System Data Analysis. Description of the solution

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

Pentaho Reporting Overview

Business Process Management

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

System requirements. Java SE Runtime Environment(JRE) 7 (32bit) Java SE Runtime Environment(JRE) 6 (64bit) Java SE Runtime Environment(JRE) 7 (64bit)

MULTICULTURAL CONTENT MANAGEMENT SYSTEM

Technical Information Abstract

Configuring Apache HTTP Server as a Reverse Proxy Server for SAS 9.3 Web Applications Deployed on Oracle WebLogic Server

Web 2.0-based SaaS for Community Resource Sharing

An introduction to creating Web 2.0 applications in Rational Application Developer Version 8.0

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Towards a Semantic Wiki Wiki Web

Lightweight Data Integration using the WebComposition Data Grid Service

Deploying Oracle Business Intelligence Publisher in J2EE Application Servers Release

WHITE PAPER. Domo Advanced Architecture

Introduction: Database management system

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

data.bris: collecting and organising repository metadata, an institutional case study

HP Systinet. Software Version: Windows and Linux Operating Systems. Concepts Guide

How to make a good Software Requirement Specification(SRS)

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Feature Overview Signavio products. Version 9.8.1

HOW TO DO A SMART DATA PROJECT

Reference Software Workshop Tutorial - The Basics

full file at

A Generic Transcoding Tool for Making Web Applications Adaptive

GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata

Energy Management Web-based embedded solution for monitoring of distributed conventional energy applications Type Em 2 -Server

A Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing

DataDirect XQuery Technical Overview

<Insert Picture Here> Michael Hichwa VP Database Development Tools Stuttgart September 18, 2007 Hamburg September 20, 2007

Integration Platforms Problems and Possibilities *

Configuring Apache HTTP Server as a Reverse Proxy Server for SAS 9.2 Web Applications Deployed on BEA WebLogic Server 9.2

Feature Overview Signavio products. Version 9.3

Portals, Portlets & Liferay Platform

Semantic Stored Procedures Programming Environment and performance analysis

Combining Unstructured, Fully Structured and Semi-Structured Information in Semantic Wikis

An Oracle White Paper May Creating Custom PDF Reports with Oracle Application Express and the APEX Listener

Oracle Warehouse Builder 10g

Last Updated: July STATISTICA Enterprise Server Security

Introduction to Apache Roller. Matt Raible Apache Roller Committer June 2007

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

BI Publisher Reporting in Release 12 Tips and Techniques

COURSE CONTENT FOR WINTER TRAINING ON Web Development using PHP & MySql

Using Oracle Data Integrator with Essbase, Planning and the Rest of the Oracle EPM Products

Building Views and Charts in Requests Introduction to Answers views and charts Creating and editing charts Performing common view tasks

Typo3_tridion. SDL Tridion R5 3/21/2008

APPENDIX A Web Redesign Infrastructure. Deployment Overview

CERTIFIED MULESOFT DEVELOPER EXAM. Preparation Guide

Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

ANSYS EKM Overview. What is EKM?

Web Programming. Robert M. Dondero, Ph.D. Princeton University

Publishing Linked Data Requires More than Just Using a Tool

Xythos WebFile Server Architecture A Technical Guide to the Core Technology, Components, and Design of the Xythos WebFile Server Platform

Semantic Search in Portals using Ontologies

Maximizing ROI on Test and Durability

MERMIG The advanced collaboration software

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Transcription:

White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA

Index 1. Summary... 3 2. K@ main features...4 2.1 Document Management... 4 2.2 User Tracking... 4 2.3 Ontologies and Annotations... 4 3. Technology... 5 2

1. Summary K@ (to be read kat ) is a collaborative web-based platform for knowledge management. With K@ users can access and share a common repository of documents, web links and notes while the system keeps track of people interaction. It supports organisation along user-defined hierarchies of categories (the Knowledge Area Tree, KAT hereafter), against which documents are classified (manually or automatically by means of external classifier engines), and provides a set of instruments for users to browse and modify the KAT and to insert and search for documents. It supplies functionalities of a standard document management system, in which documents can be uploaded or referenced by an URL, and also provides free editable text areas to share comments and ideas in the form of Wiki pages. K@ is able to maintain the association between documents and semantic annotations with respect to a formal ontology according to Semantic Web standards. It provides a number of tools tracking users actions and behaviour in order to provide a better user experience and to facilitate sharing. K@ is currently actively maintained and extended, and used both within Quinary and at selected end user sites. 3

2. K@ main features 2.1 Document Management K@ manages several KATs (trees of classification), with versions; supports multilingual names and descriptions; provides security management. It works with different document sources and files. It supports classification in more than one category, with a ranking parameter. It provides interfaces enabling the integration of external classifiers. The search engine, based on Apache Lucene, is in charge of indexing documents of various formats, including MS Word, RTF and PDF. Searches can be filtered by context; a search engine manager has been integrated to provide concurrent searches on external engines. Kzilla, an addon for Mozilla browsers, is also provided: it is a sidebar that communicates with K@ while you are browsing the web. It provides list of related documents enriching your browsing context with information from your internal repository. 2.2 User Tracking K@ stores static and dynamic information about users, making them available in order to help people in collaboration and ease of use. A profiling system is in charge of tracking users behavior to create a better user experience: favorite areas and documents are dynamically rendered in the context supporting the fastest access to a large repository; users can see who is doing what and look for areas and documents starting from other users experience. We are currently experimenting a Collaborative Filtering Engine based on [CoFE] to use data from user tracking to generate suggestion of interest ('this new document may be of interest for you' please note that unlike in conventional recommendation system, we do not use explicit ratings but rather implicit ones derived from user behaviour). 2.3 Ontologies and Annotations To allow connecting annotations to documents, the SemantiK plugin is provided. SemantiK is a platform featuring integration, searching, presentation and editing of knowledge 4

expressed through the RDF language. It stores the whole set of RDF triples in the RDF repository [Sesame] in order to obtain inferencing capabilities. Tipically the ontology is domain specific and must be expressed in RDFS. Java classes reflecting the ontology can be defined in order to specify a particular rendering, searching or knowledge integration behaviour for instances of the corresponding RDF Class. Then, semantic annotations can be manually inserted by means of a web-based user interface, featuring an ontology driven search engine that tries to lessen the burden of finding resources in the Knowledge Base. To final users, the process consists in adding metadata to documents through smart web forms. Annotations may also be created by means of external knowledge brokers, such as Information Extraction tools, dedicated services accessing external databases, or wrappers for HTML scrapers: SemantiK can push document s content to the knowledge broker and store the RDF answer in the KB. Experimental activites with Knowledge Brokers are currently ongoing in the framework of EC sponsored DotKom project. In any case, the Knowledge Integration layer of SemantiK is responsible of checking data being stored in the KB in order to avoid duplication of resources; it relies on a fuzzy measure of closeness between instances. Annotation can be exploited in many ways: they can be browsed and queried through the K@ web interface; they enrich the document repository with a Semantic Web layer, for external agents to access; moreover a similarity measure based on semantic features can be computed between documents and a classification engine motivated by it can be plugged in K@. 3. Technology K@ is a standard J2EE web application, deployed in a Tomcat application server. It relies on several Java open source components. It requires a SQL DBMS, currently supporting Oracle, MySql, and Firebird. The document repository is stored in the filesystem and is indexed by a search engine derived from Apache Lucene. It integrates a JSPWiki via XML-RPC protocol. The Semantic Annotation Module, called SemantiK, runs within K@ and provides communication with Sesame through HTTP calls. Sesame stores RDF triples in an RDBMS. 5

K@ is known to work effectively with any recent browser (MS Explorer, Firefox, Mozilla, Safari). K@ supports output as RSS feeds and provides and interface to create custom feeds. XML import and output modules for KATs, documents and users are provided. RSS Aggregator Browser Knowledge addon (Kzilla) Information Extraction Tools (Gate) semantic web recommender system (CoFE) external classifier K@ user module search engine (lucene) SemantiK RDF repository/reasoner (Sesame) J2EE App.Server (Tomcat) Wiki (JSPWiki) document filters filesystem search indexes filesystem docs repository RDBMS metadata, users db, sesame triples filesystem wiki repository Figure 1 - K@ Architecture References CoFe - COllaborative Filtering Engine: http://eecs.oregonstate.edu/iis/cofe DotKom: http://www.dot-kom.org Firebird: http://firebird.sourceforge.net/ Gate - General Architecture for Text Engineering: http://gate.ac.uk JSPWiki: http://www.jspwiki.org/ Lucene: http://jakarta.apache.org/lucene MySql: http://www.mysql.com Quinary, K@, SemantiK: http://www.quinary.com Semantic Web, RDF, RDFS: http://www.w3.org/2001/sw/ Tomcat: http://jakarta.apache.org/tomcat 6