A semantic extension of a hierarchical storage management system for small and medium-sized enterprises.



Similar documents
Data Warehouses in the Path from Databases to Archives

Automatic Annotation Wrapper Generation and Mining Web Database Search Result

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Ontology for Home Energy Management Domain

I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION

Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari

Content Management Using Rational Unified Process Part 1: Content Management Defined

Intinno: A Web Integrated Digital Library and Learning Content Management System

Miracle Integrating Knowledge Management and Business Intelligence

Lightweight Data Integration using the WebComposition Data Grid Service

2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (

Knowledge-based Approach in Information Systems Life Cycle and Information Systems Architecture

Intelligent Manage for the Operating System Services

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Secure Semantic Web Service Using SAML

Big Data and Analytics: Challenges and Opportunities

estatistik.core: COLLECTING RAW DATA FROM ERP SYSTEMS

Semantic Concept Based Retrieval of Software Bug Report with Feedback

Self Organizing Maps for Visualization of Categories

Flattening Enterprise Knowledge

Web-based Multimedia Content Management System for Effective News Personalization on Interactive Broadcasting

Chapter 1: Introduction

Information Management

EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION

THE QUALITY OF DATA AND METADATA IN A DATAWAREHOUSE

Intelligent Human Machine Interface Design for Advanced Product Life Cycle Management Systems

A Framework for Collaborative Project Planning Using Semantic Web Technology

Deriving Business Intelligence from Unstructured Data

Workflow Object Driven Model

Analysis of Social Media Streams

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

An Approach Towards Customized Multi- Tenancy

Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1

Personalized e-learning a Goal Oriented Approach

OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY

Content Management Using the Rational Unified Process By: Michael McIntosh

Distributed Framework for Data Mining As a Service on Private Cloud

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Active Repository and Active Migration Manager. Connection to Tamper-Proof Storage Systems. The Architectur. Archiving Policies

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Report on the Dagstuhl Seminar Data Quality on the Web

Databases in Organizations

CSE 132A. Database Systems Principles

Enforcing Data Quality Rules for a Synchronized VM Log Audit Environment Using Transformation Mapping Techniques

A Knowledge Management Framework Using Business Intelligence Solutions

FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING *

Analysis of Data Mining Concepts in Higher Education with Needs to Najran University

Recognition and Privacy Preservation of Paper-based Health Records

Soar Technology Knowledge Management System (KMS)

Search Result Optimization using Annotators

Pragmatic Web 4.0. Towards an active and interactive Semantic Media Web. Fachtagung Semantische Technologien September 2013 HU Berlin

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

Context Aware Mobile Network Marketing Services

Semantic Search in Portals using Ontologies

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA

International Journal of Engineering Technology, Management and Applied Sciences. November 2014, Volume 2 Issue 6, ISSN

International Journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Managing and Tracing the Traversal of Process Clouds with Templates, Agendas and Artifacts

Community Edition 3.3. Getting Started with Alfresco Explorer Document Management

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

Monitoring Web Browsing Habits of User Using Web Log Analysis and Role-Based Web Accessing Control. Phudinan Singkhamfu, Parinya Suwanasrikham

Modeling Temporal Data in Electronic Health Record Systems

Extension of a SCA Editor and Deployment-Strategies for Software as a Service Applications

FIPA agent based network distributed control system

Novel Data Extraction Language for Structured Log Analysis

Optimization of Image Search from Photo Sharing Websites Using Personal Data

Master s Program in Information Systems

New Web tool to create educational and adaptive courses in an E-Learning platform based fusion of Web resources

SavvyDox Publishing Augmenting SharePoint and Office 365 Document Content Management Systems

Yet Another Triple Store Benchmark? Practical Experiences with Real-World Data

MULTI AGENT-BASED DISTRIBUTED DATA MINING

A Survey on Data Warehouse Architecture

EaseTag Cloud Storage Solution

Bringing Business Objects into ETL Technology

Transcription:

Faculty of Computer Science Institute of Software- and Multimedia Technology, Chair of Multimedia Technology A semantic extension of a hierarchical storage management system for small and medium-sized enterprises. Axel Schröder, Ronny Fritzsche, Sandro Schmidt, Annett Mitschick and Klaus Meißner Berlin, 09/29/2011

Outline Problem Motivation Existing approaches Concept Prototype Conclusion and future work TU Dresden, 26.09.11 SDA 2011 Folie 2 von 18

Problem Amount of data stored in companies increases every year [1] Fast memory is very expensive Current storage management systems treat every file individually without regarding relations between stored files Most data is unstructured and content wise unorganized [2] Every person has its own understanding of schemas to organize data TU Dresden, 26.09.11 SDA 2011 Folie 3 von 18

Motivation Using a storage management system (SMS) can help to save costs Even a SMS has to handle a tradeoff between costs, capacity and access time è Using inherent and additional semantic information is the key to a more efficient and more intelligent way to store information è Introduction of a semantic-aware component to enable a SMS to use semantic information (semantic storage extension - SSE) TU Dresden, 26.09.11 SDA 2011 Folie 4 von 18

Existing approaches Wide range of projects dealing with semantics in text and multimedia documents [such as 3,4] Common used approach: separate meta data from files to increase system performance [e. g. 5] Distinction between Semantic Information in File Systems Semantic Information in Additional Systems Such as information from metadata TU Dresden, 26.09.11 SDA 2011 Folie 5 von 18

Existing approaches Semantic Information in File Systems TagFS à annotation of files [6] Automatically and user-controlled annotation of files with keywords (containing metadata, folder names, paths, ) Semantic Vectors [7] Meta data converted in vectors spanning a feature space Capture events to link related documents [8] TU Dresden, 26.09.11 SDA 2011 Folie 6 von 18

Existing approaches Semantic Information in Additional Systems Information from metadata Implicit semantic knowledge (e. g. CBIR, face recognition, ) è consequences for our approach - using of: Metadata / attributes from file system Explicit knowledge (e. g. from a domain expert) Implicit semantic knowledge TU Dresden, 26.09.11 SDA 2011 Folie 7 von 18

Concept Architecture TU Dresden, 26.09.11 SDA 2011 Folie 8 von 18

Concept Semantic Service Component TU Dresden, 26.09.11 SDA 2011 Folie 9 von 18

Concept Modifications in the SMS Requesting the SSE Simple request (file id only) Directed request (file id + planned action) Response from SSE Just an advice / recommendation Complete analysis and partial analysis of stored data Semantic Service Component need access to stored data TU Dresden, 26.09.11 SDA 2011 Folie 10 von 18

Concept Request the SSE Directed request for migration SSE returns true if no requested file has relations to used files Directed request for retrieval SSE returns true if there are relations to active files Directed request for deletion SSE returns false if there is a high relevance to active files TU Dresden, 26.09.11 SDA 2011 Folie 11 von 18

Concept Design of the SSE Policies to model explicit semantic knowledge Basic policies to model processes / project and temporal boundaries Relational and constraint policies to associate projects with persons, places, TU Dresden, 26.09.11 SDA 2011 Folie 12 von 18

Concept Design of the SSE TU Dresden, 26.09.11 SDA 2011 Folie 13 von 18

Prototype SSE Implemented in Java Using AXIS2 SSC Implemented in Java Using AXIS2 Admin interface Implemented in WPF HSM (SMS not yet connected to the prototype) Implemented in C# and WPF TU Dresden, 26.09.11 SDA 2011 Folie 14 von 18

Conclusion Great lack of using semantics in SMS Introduction of the SSE Domain-independent architecture through using a SSC and policies to model explicit knowledge Future work to: Improve prototype Evaluate concept (especially polices and ontology) TU Dresden, 26.09.11 SDA 2011 Folie 15 von 18

References (1) Troopens, U., Erkens, R., Muller-Friedt, W., Wolafka, R., Haustein, N.: Storage Networks Explained. John Wiley & Sons Ltd., Mainz, 2 edn. (2009) (2) Gnasa, M.: Fraunhofer IAIS: Text Mining und Information Retrieval (2009), http://www.iais.fraunhofer.de/4862.html (3) Orio, N.: Music Retrieval: A Tutorial and Review. Foundations and Trends R in Information Retrieval 1(1), 1{96 (2006) (4) Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1-47 (2002) (5) Hackl, G., Pausch, W., Schönherr, S., Specht, G., Thiel, G.: Synchronous Metadata Management of Large Storage Systems. Proceedings of the Fourteenth International Database Engineering & Applications Symposium pp. 2-7 (2010) (6) Bloehdorn, S., Gorlitz, O., Schenk, S.: TagFS - Tag Semantics for Hierarchical File Systems. Proceedings of the 6th International Conference on Knowledge Management I-KNOW'06 (2006) TU Dresden, 26.09.11 SDA 2011 Folie 16 von 18

References (7) Mahalingam, M., Tang, C., Xu, Z.: Towards a semantic, deep archival file system. The Ninth IEEE Workshop on Future Trends of Distributed Computing Systems pp. 115-121 (2003) (8) Weippl, E.R., Klemen, M., Linnert, M., Fenz, S., Goluch, G., Tjoa, A.M.: Semantic Storage : A Report on Performance and Flexibility. Lecture Notes in Computer Science 3588, 586-595 (2005) TU Dresden, 26.09.11 SDA 2011 Folie 17 von 18

Thank you for your attention. TU Dresden, 26.09.11 SDA 2011 Folie 18 von 18