Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology



Similar documents
Information Technology for KM

OWL based XML Data Integration

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

OWL Ontology Translation for the Semantic Web

An Ontology-based e-learning System for Network Security

Information Services for Smart Grids

Grids, Logs, and the Resource Description Framework

Secure Semantic Web Service Using SAML

ONTOLOGY-ORIENTED INFERENCE-BASED LEARNING CONTENT MANAGEMENT SYSTEM

Artificial Intelligence & Knowledge Management

Semantic Interoperability

Theme 6: Enterprise Knowledge Management Using Knowledge Orchestration Agency

Semantics and Ontology of Logistic Cloud Services*

Application of ontologies for the integration of network monitoring platforms

SCALEA-G: a Unified Monitoring and Performance Analysis System for the Grid

MERGING ONTOLOGIES AND OBJECT-ORIENTED TECHNOLOGIES FOR SOFTWARE DEVELOPMENT

Semantic Stored Procedures Programming Environment and performance analysis

Extending SOA Infrastructure for Semantic Interoperability

Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD

Publishing Linked Data Requires More than Just Using a Tool

A Semantic web approach for e-learning platforms

P ERFORMANCE M ONITORING AND A NALYSIS S ERVICES - S TABLE S OFTWARE

ONTOLOGY-BASED MULTIMEDIA AUTHORING AND INTERFACING TOOLS 3 rd Hellenic Conference on Artificial Intelligence, Samos, Greece, 5-8 May 2004

A Semantic Service-Oriented Architecture for Business Process Fusion

Implementing Ontology-based Information Sharing in Product Lifecycle Management

Ontology and automatic code generation on modeling and simulation

Semantic Web Technologies and Data Management

Basic Scheduling in Grid environment &Grid Scheduling Ontology

Service Oriented Architecture

UNIVERSIDAD DE LAS AMÉRICAS PUEBLA ESCUELA DE INGENIERÍA. Departamento de Ingeniería en Sistemas Computacionales

A Secure Mediator for Integrating Multiple Level Access Control Policies

DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance

Open Source egovernment Reference Architecture Osera.modeldriven.org. Copyright 2006 Data Access Technologies, Inc. Slide 1

FIPA agent based network distributed control system

Knowledge Management

A Semantic Web Based Approach to Knowledge Management for Grid Applications

Semantic Information on Electronic Medical Records (EMRs) through Ontologies

A generic approach for data integration using RDF, OWL and XML

Semantic Business Process Management

Supporting Change-Aware Semantic Web Services

A Framework for Ontology-Based Knowledge Management System

technische universiteit eindhoven WIS & Engineering Geert-Jan Houben

Experiences from a Large Scale Ontology-Based Application Development

A Framework for Collaborative Project Planning Using Semantic Web Technology

An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration

Open Data Integration Using SPARQL and SPIN

Managing enterprise applications as dynamic resources in corporate semantic webs an application scenario for semantic web services.

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

A Comparative Study Ontology Building Tools for Semantic Web Applications

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

Integrating data from The Perseus Project and Arachne using the CIDOC CRM An Examination from a Software Developer s Perspective

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

Incorporating Semantic Discovery into a Ubiquitous Computing Infrastructure

Model Driven Interoperability through Semantic Annotations using SoaML and ODM

RDF Dataset Management Framework for Data.go.th

Introduction to the Semantic Web

Introduction to ontologies and tools; some examples

Defining a benchmark suite for evaluating the import of OWL Lite ontologies

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

The Ontological Approach for SIEM Data Repository

Improving EHR Semantic Interoperability Future Vision and Challenges

Characterizing Knowledge on the Semantic Web with Watson

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

Semantic Integration in Enterprise Information Management

A COLLABORATIVE PERSPECTIVE OF CRM

Simplifying e Business Collaboration by providing a Semantic Mapping Platform

Graph Database Performance: An Oracle Perspective

CHAPTER 6 EXTRACTION OF METHOD SIGNATURES FROM UML CLASS DIAGRAM

Transcription:

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Hong-Linh Truong Institute for Software Science, University of Vienna, Austria truong@par.univie.ac.at Thomas Fahringer Institute for Computer Science, University of Innsbruck, Austria Thomas.Fahringer@uibk.ac.at http://www.par.univie.ac.at/~truong/projects/perfonto 1 ICCS 2004, Krakow 9 June, 2004

Outline Motivation Our Approach Ontology & Ontology Languages PERFONTO Architecture of ontology-based performance analysis, data sharing and tools integration Prototype overview 2 Conclusions and Future work

Motivation Perfomance data and tools integration Performance data mostly is proprietary to a performance tool Not to be shared and understood by various tools/services How do we share and reuse performance data? Developing wrapper libraries for converting data Making data available in relational database, XML (even just a few did!) Data integration based on structural approach, not semantic approach Performance monitoring tools collect raw data but do not directly model and clarify the relevant aspects of objects monitored Grid performance monitoring and analysis No single source provides monitoring and performance data Performance monitoring and analysis are conducted across multiple Grid sites High level services (e.g. scheduler, resource matching) have to utilize data from many sources A world wide of performance monitoring tools and data Diversity and autonomy of performance and monitoring tools Syntactic and semantic heterogeneity 3

Motivation (const( const.) Performance monitoringand analysis resources and applications in Virtual Organizations Both syntactic and semantic interoperability are required Well-defined service/tool operations are needed We believe adding more semantics into data collected will Make data less propriatery to a specific tool, enhancing knowledge sharing and reuse Foster to automatically detect, correct and predict behavior of systems and applications at runtime (e.g. self-organizing) Allow to move analysis components as close to monitored source as possible Support intelligence performance analysis 4

Our Approach Well-defined service/tool operations achieved by employing Grid/Web Services Rich semantics and shared vocabularies achieved by using ontology We advocate Semantic Grid But what is Ontology? Most widely-accepted and -used definition "Formal, explicit specification of a shared conceptualization" Gruber, 1993 Ontology in Semantic Web Computer-usable definitions of basic concepts in a domain and the relationships among them. 5

Ontology Primitives Definitions Classes (Things) General things in a domain of interest Properties (Attributes) The properties those things may have Relationships The relationships that can exist among things name Performance Metric value RegionSummary Axioms Constrain the interpretation and the well-formed use of these definitions CodeRegion Processing Unit 6 Loop Function Call

Expected Benefits of Using Ontology in Performance Analysis Domain Performance data and tools integration Describe and model data with rich semantics, highly expressiveness Provide shared vocabularies Map and translate data of disparate representations Semantic interactions between tools in an automatic performance analysis systems Enhance automatic performance analysis High level performance data query Enable rule-based performance analysis 7

Ontology Languages Existing Languages RDF (Resource Description Framework), DAML (DARPA Agent Markup Language), OIL (Ontology Inference Layer), DAML+OIL, OWL (WebOnt) OWL (Web Ontology Language): being standardized by W3C Extension of RDF and derived from DAML+OIL Object-oriented approach Set of definitions of classes and properties Properties: object properties and data properties Transitive, symmetric, inverse, functional Use XML schema data types Class axiom: specifies necessary and/or sufficient characteristics of a class (e.g. sub class, equivalent) Property axiom: defines additional characteristics of a property, e.g. range, domain, relations to other properties 8

PERFONTO Currently initial work on using ontology for system monitoring and management, but not for performance experiments of applications so far PERFONTO (ontology for performance analysis domain) Domain of interest: performance monitoring, analysis and data integration in parallel and distributed computing. Using OWL (Web Ontology Language) Model and represent performance data both sytem resources and application experiments Provide shared vocabularies on performance analysis domain PERFONTO comprises two parts 9 Experiment-related concept Resource-related concept

Experiment-related related concept Describe application experiments and their associated performance data 10

Excerp OWL for RegionSummary Property definition Class definition 11

Resource-Related Related Concept Describe static, benchmarked and dynamic (performance) information of computing and network systems Current descriptions: capabilities static and benchmarked information 12

Shared Vocabularies/Terminologies Performance metric catalog Example: similarity in meanings of vocabularies <papi:metric rdf:id="papi_flops"> <owl:sameas rdf:resource="perfonto:flops" /> /> </papi:metric> 13

System Architecture for Ontology-based Performance Analysis, Data Sharing and Tools Integration 14

Prototype Implementation Ontology-related tasks done by Jena Toolkit A Semantic Web Toolkit, HP Lab at UK APIs for processing RDF, RDFS, OWL Network APIs for accessing remove RDF database Data storage: file or persistent database (e.g. PostgreSQL) Query with RDQL (RDF Query Data Language) Rule-based inference engine 15

Prototype Implementation (const) Ontology-based performance data repository service Grid serivices based on GT 3.2 (Globus Tookit) Persistent data storage powered by PostgreSQL Store, retrieve and query operations for ontology and instances (individuals) Reasoning done at client side Ontology-based performance analysis service(opas) First support search on performance data Validating performance and monitoring data collected with PERFONTO Work on rule-based performance analysis 16

Search on Ontological Data Ontology-based performance data search High-level, more intelligent search model based on ontological data Query easily understood and written by end users, not only by tool developers RDQL (RDF Data Query Language) SQL-alike query language for RDF SELECT vars, FROM documents, WHERE expressions, AND filters, USING namespace declarations Query with triple patterns and constraints over RDF model Implemented in Jena search engine 17

Search on Ontological Data (const.) Q: Search all code regions executed in node gsr410 with wallclock time >=3E8 microsecond SELECT?regionsummary WHERE (?regionsummary perfonto:inprocessingunit?processingunit) (?processingunit perfonto:innode "gsr410 ) (?regionsummary perfonto:hasmetric?metric) (?metric perfonto:hasmetricname "wtime") (?metric perfonto:hasmetricvalue?value) AND (?value >=3E8) USING perfonto FOR <http://www.par.univie.ac.at/project/scalea/perfonto#> 18

Reasoning about Ontological Data Ontology allows additional facts to be inferred by using axioms and rules Reasoning on monitoring and performance ontological data Intelligence resource matching Rule-based interferences Validating data Automatic/autonomic monitoring and performance analysis Automatic performance analysis based on rules Runtime self-healing, -management 19

Reasoning Example Q: if a MPI Q: if a MPI code region has big message size then prints it out [rule_detect_bigmessages: (?regionsummary perfonto:ofcoderegion?coderegion), (?coderegion rdf:type perfonto:mpcoderegion), (?coderegion perfonto:hascrtype "CR_MPIP2P"), (?regionsummary perfonto:hasmetric?metric), (?metric perfonto:hasmetricname "AvgMessageLength"), (?metric perfonto:hasmetricvalue?length), greaterthan(?length, BIG_MESSAGES_THREADHOLD) -> print(?regionsummary,"big message hold!")] 20

Search OPAS GUI: Example 21

Conclusion and Future work Conclusion Investigating the application of ontology to the domain of Grid performance analysis and monitoring System architecture for ontology-based performance data analysis, data sharing and tools integration in the Grid Ontology would be a good solution for seamlessly utilizing monitoring and performance data and integrating different performance monitoring and measurement tools in Grids Ontology could help to simplify performance analysis (e.g. performance data search) and to enhance automatic performance analysis (e.g. rule-based reasoning on performance data) Future work Implementing full prototype, reevaluating and enhancing PERFONTO Ontology-based monitoring Task-based ontology for automatic performance analysis Shared conceptualization looking for community work! http://www.par.univie.ac.at/~truong/projects/perfonto 22