Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1
|
|
|
- William Scott
- 10 years ago
- Views:
Transcription
1 Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1 Hassina Bounif, Stefano Spaccapietra, Rachel Pottinger Database Laboratory, EPFL, School of Computer and Communication Science Lausanne, Switzerland Data Management and Mining Laboratory, University of British Columbia Vancouver, Canada [email protected], [email protected], [email protected] Abstract. With the emergence of enterprise wide information systems, ontologies have become by definition a valuable aid for efficient database schemas modeling and integration besides their disseminated use in other important disciplines such as semantic web and natural language processing. This paper presents another important utilization field of ontology related to database schemas and which is schema evolution topic. More specifically, our research work concentrates on a new three layered approach for schema evolution based on domain ontology that we have called a requirements ontology and multi strategies to a powerful change management and costeffective evolution. This a priori approach for schema evolution, in contrast with existing a posteriori solutions, can be employed for any data model and for both 1) design from scratch and evolution and 2) redesign and evolution. The paper focuses on the two main foundations of this approach which are the requirements ontology and the multi representation strategy based on stamping mechanism. Keywords: Ontological Engineering, multi representation strategy, Schema Evolution 1 Introduction With the emergence of enterprise wide information systems, the number of ontologies in semantic driven data access and processing is increasing. It is the case of the use of ontologies in semantic web and natural language processing. In addition to that, ontologies have become by definition a valuable aid for efficient database schemas modeling and integration. In this work, we have investigated another area in which ontologies have a colossal potential of utilization and which is related to information systems. That is database schema evolution, an important, complex and very active research issue. Several solutions have been proposed and much progress has been made in data structures, rules, constraints, schemata models and meta models. In this 1 This work is carried out as part of IM2 ( Interactive Multimodal Information Management) ( Swiss National Competence Centre in Research (NCCR), supported by the Swiss National Research Fund
2 context, we advocate a new approach for schema evolution in which potential changes are inspected and integrated into the schema for future use. This approach is based on prediction. Generally, prediction means to envision with exactitude or not the changes that could occur over time. However, in this research work, we intend to detect the changes that are plausible to carry out on a database schema and are important for the database users. Our intent is to move one step towards, developing multi disciplinary and a priori approaches for database schema evolution, in contrast with existing a posteriori solutions that track the change instead of planning it. Our approach relies on the use of both 1) requirements ontology that contains the changes that are plausible to carry out on a database schema and are important for the database users and 2) multi strategies to a powerful Change management and compatibility with any conventional data model. We have chosen to focus on one of the multiple strategies we have defined, i.e. the multi representation strategy based on stamping mechanisms. The objectives of this section is to explain the problem of schema evolution as well as the principles and the characteristics of a new approach that we propose to resolve it 1.1 Problem Description Intuitively, schema evolution means the ability of a schema to undergo changes over time without any loss of the extant data. However, besides managing the changes to the schema, applications and data linked to it need to be adapted as well. Changes to the schema are divided into three categories depending on their consequences on it[1]: 1 Additive : extra semantic knowledge is modelled 2 Subtractive: less semantic knowledge is modelled 3 Descriptive: the same semantic knowledge is modelled in a different manner To better understand the general problem of database schema evolution, we need to consider it from two different sides, depending on the kind of solution we choose: 1) From a posteriori solution side, 2) From a priori solution side. 1) From a posteriori solution side: Historically, from this side, to resolve the schema evolution problem, one should take into consideration two major criteria, which are respectively [2]: a) the semantic of change, i.e. the understanding of the change that has taken place because of several reasons such as the new perceptions of the real world over time and technology development and performance strategies and b) the propagation of this change on the schema immediately or at deferred time fixed by the database administrator. There is a posterior order in which the change must be received after by the schema and its components. Schema evolution is resolved either by versioning the original schema, by modifying it using restricted evolution primitives, by adopting views on the top of it or by refining it by accommodating the exceptional information in the database [3]. All these solutions react to changes that could occur on the schema. However, they are insufficient solutions, especially when the schema is facing complex changes. For instance, in the modification approach, changing the schema may lead to a loss of information. Whereas in the versioning approach, replication of the schema avoids
3 data loss; it however creates complex navigation through the different generated versions and slows down the DBMS (Database Management System). As for the combining solution, i.e. a solution that includes two existing approaches, for example the work presented in [4], it allows to avoid the above mentioned problems. It is at the same time, characterized by the complexity and the onerous mechanisms to be executed. 2) From a priori solution side: To resolve the problem of schema evolution from this perspective, one must clearly take into account these imperative criteria: Understand the current database structure and content Identify the dependencies among the current database schema, data and applications because the impact of one element on another needs to be known and accounted for before making changes on the database Detect potential changes that are plausible to occur on the schema Understand the potential future changes and new applications and identify their impacts on the current schema Consider the two possible situations, related to the database, that are respectively: 1) the situation in which the initial database schema is not created yet and 2) the situation in which the initial schema has already been created; however, it needs to be redesigned. Compared to the previous side, here, the order of applicability of the changes has been modified. The changes are incorporated before they really occur. There is what is called an a priori order in which the potential change must be received before by the schema and its components. 1.2 Predictive Approach for Schema Evolution This new a priori approach for schema evolution is characterized by the following: A priori Approach Dual Applicability for 1) design from scratch and evolution and 2) re design and evolution Requirements analysis and their categorization into two categories: current and future Multiple strategies for data management and compatibility with any data Evolution cost evaluation a) A priori Approach This approach, in contrast with existing posterior solutions for evolution such as modification or versioning approaches, tackles the problem of evolution before the changes occurred effectively on the database schema. It tends to react preventively by incorporating the changes in the schemas for future use. However, it can also be applied when the changes need to be materialized on the schema. b) Dual Applicability
4 This approach takes into consideration two situations of the database: 1) the situation in which the schema is designed from scratch and 2) the situation in which the schema is redesigned. Therefore, the proposed approach can be applied for both: Design and evolution of the schema Re design and evolution of the schema c) Requirements Analysis and Categorization The predictive approach does not work in the same way as existing a posteriori solutions for evolution. As a matter of fact it explicitly includes a requirement analysis phase in which, besides the current user requirements that are performed with the help of database user s feedbacks and comments, additional requirements called potential future requirements are investigated. These new requirements, representing potential future needs that might emerge during the lifecycle of the database are inspected inside a schema repository. We propose to model all these requirements with a structured method based on domain ontology called a requirements ontology. This method is complex and includes the use of data mining techniques and the schema repository. We are not interested in the requirements process and their associated specification templates that are applied usually in the construction of a new system. However, we require that the resulting output of the requirements modeling be defined in terms of requirement ontology. d) Multiple Strategies to Change Management and Compatibility with any Conventional Data Model This approach for evolution is based on three modeling levels: the ontological level, the conceptual level and finally the logical level. At the ontological level, requirements ontology is used. This is illustrated in figure 1. A detailed description of the role, the structure and the construction of this ontology is presented later in this paper. Fig. 1. The requirements ontology with the two kinds of concepts and relationships (links) At the conceptual level, a part or selected parts of the requirements ontology representing the potential future requirements are included into the conceptual schema of the database to be designed or redesigned using one of the strategies in figure 2 presented below. 1 Direct conversion strategy: in this strategy, the selected part(s) of the requirements ontology are converted to a conceptual schema with a direct mapping. 2 Multiple representations based on view or stamping mechanisms 3 Combined approach (Direct and Multi representation strategies)
5 The formalism presented in the figure 1 is used in the Motivating Examples section to show how the database designer takes into account the ontological level represented by the requirements ontology for an anticipated schema evolution. Fig. 2. The multi representation strategy with views and stamping mechanism at the conceptual layer At the logical level, the conceptual schema is converted as it is into a logical schema. However, the tables and attributes that correspond to future requirements are not functional yet. They are included in the schema but cannot be acceded to by the DBMS. However, in the evolution phase, i.e. when the database schema needs to be changed, these accompanying structures are accepted or rejected based on their relevance with the future state. e) Evolution Cost Evaluation Another important objective of this approach is to estimate the cost related to the planning of the evolution using requirements ontology. We have defined a simple cost model based on two factors: the importance of a concept, respectively a relationship in the requirements ontology and their tendencies to the changes. The details of this cost model are not developed in this paper due to the lack of space The advantages of this approach are various. Indeed, it reinforces the conceptual schemas and makes them ready for evolution, it is compatible with any data model and finally, it facilitates the work of database designers and helps them save time and fund on the evolution of their databases. 1.3 Contribution of the Paper The contributions of this paper are as follows:
6 1. Presentation of the predictive approach for database schema evolution, including the characteristics and the differences with other existing approaches for schema evolution 2. Presentation of the requirements ontology, including its role, construction and structure 3. Presentation of the multi representation strategy with the two defined mechanisms views and stamping 4. Presentation of examples showing how the predictive approach works and outlining the role of both the requirements ontology and the multirepresentation strategy. 1.4 Outline of the Rest of the Paper This paper explains the articulation of this new approach for database schema evolution and outlines the use of domain ontology and a multi representation strategy. The paper comprises five main sections. Section 2 presents 1) the role of the requirements ontology in the predictive approach for evolution and 2) the structure and how it is built from the schema repository. Section 3 describes the multirepresentation strategy and how it is used for schema evolution. Section 4 presents some motivating examples to demonstrate the feasibility of the proposed approach as well as the two categories of database designers that can use it. Section 5 is a conclusion and a summary of the important points dealt with in this paper and introduces perspectives on the future work. 2 Requirements Ontology for Schema Evolution The requirements ontology is a domain ontology in which requirements are expressed with concepts (terms), relationships and constraints. For example, if a database designer needs to create a database for meetings, the requirements ontology associated to this database contains concepts, relationships and constraints related to meeting domain such as MEETING, PARTICIPANT, ROOM and AGENDA concepts and IS, HAS relationship types and so on. This is illustrated in figure 3. Kind of Kind of Kind of Fig. 3. Presentation of a part of requirements ontology of meeting domain
7 The requirements ontology looks like a global entity relationship model however it is not an entity relationship model because it contains more semantics related to a specific domain. 2.1 Requirements Ontology Role There are two ways in which this ontology is used in the predictive approach for evolution. These two primary functions are 1) design and evolution and 2) redesign and evolution. 1) Design and evolution In case the initial database schema is not created yet, the ontology fulfils several tasks, as presented in [5] and [6]: it generates a design from scratch using the defined terms and relationships as a representative model of the domain. It suggests possible missing entities and relationships in the case just a part or selected parts of it is/are considered by the database designer. In addition, the inclusion of synonyms helps to identify the most appropriate label for an entity or a relationship. The requirements ontology offers additional features; it includes terms, relationships and constraints that might represent potential future requirements and identifies in advance their dependencies with terms and relationships representing current requirements. Consequently, it facilitates the work of database designer when changes should be made on the schema. 2) Redesign and evolution In case the initial schema has already been created and however needs to be redesigned, then the ontology fulfils other important tasks as presented in [5] and [6]. For example, it is used to check for missing entities or relationships or inconsistencies in an existing or partial design because the data model produced in the redesign process, called a reverse engineered (RE) data model [7], cannot be considered as a conceptual schema. The RE model converts all the logical schema tables to entities without making distinction between data tables and the other tables of the schema whose function is to join tables. This model is not a logical schema because some important schema information is lost during the conversion process, such as in the case of foreign keys. Besides all these tasks, the requirements ontology allows to understand the entities that have undergone changes and to identify their dependencies with existing entities that have not changed in the schema. 2.2 Requirements Ontology Construction The requirements ontology is developed using both a schema repository in which the main concepts are extracted and WordNet ontology [8] for extracting their corresponding synonyms and antonyms. The requirements ontology consists of two kinds of partitions, the ones representing current requirements called Current subdomains and the ones corresponding to potential future requirements called Future sub domains. This is illustrated in figure 4.
8 The process of the requirements ontology creation is iterative and complex some how compared to existing approaches. It consists on four main phases: knowledge acquisition, Data mining and informal conceptualisation, Evaluation for Refinement or Revision and Formal Conceptualization 1 Knowledge acquisition and pre processing: consists of schemas collection and preparation 2 Data mining algorithms and informal conceptualization: in which concepts and relationships are extracted from schema data sets repository in an unsupervised way and used as output for the informal conceptualization of the ontology from scratch. 3 Evaluation for Refinement or Revision: means to test the validity of the concepts belonging to the taxonomy and to decide to keep or reject them using qualitative and quantitative methods. 4 Formal Conceptualization: consists in building formally the requirements ontology using OWL and description logic. These phases are not very developed in this paper because they necessitate a considerable space. Fig. 4. The requirements ontology divided into two main domains: the current domain for the current requirements and the future domain for the future requirements The schema repository contains many different schemas that model a specific domain. These schemas and their related versions may be of different types, such as ER, relational, object and object relational schemas. The XML databases can be included as well as the ontological schemas expressed in OWL technology [9]. The schema repository has a dual role in building the requirements ontology [10]: (1) the repository serves in the data mining process to identify and analyze trends on different kinds of schemas collected. (2) The repository contains selected concepts and relationships to be included in the requirements ontology. 2.3 Requirements Ontology Structure The structure of this ontology includes: a) Concepts, b) Relationships, c) constraints d) Current versus Future Labels, described below: a) Concepts (Terms) Description: Each term has one or several attributes with one or several values and one or several synonyms and antonyms. We have chosen to define the terms of this ontology by using the analogy method which is a form of Case Based Reasoning (CBR) [11] with small modifications to adapt it to our case study. The analogy method is based on the use of historic data, i.e. past data and similar data to retrieve the information and knowledge in order to solve a problem. In our case, besides the terms that represent current requirements, we need to define the terms that represent potential future
9 requirements. Therefore, terms are extracted from the schema repository as follows. either: 1) from schemas belonging to the same domain of the database to be modeled 2) from schemas belonging to a similar domain to the domain of the database to be modelled therefore it is referred as first domain in similarity 3) from schemas belonging to a similar domain of the similar domain of the domain of the database to be modelled. This domain is referred as second domain in similarity. b) Relationships Description Relations are between two concepts. There are six kinds of relations: hierarchic identified by the label kind of, which expresses the specialization of one concept regarding another and inherits attributes from this super concept; composition identified by the label has, which expresses that a concept is a part of another concept; descriptive, when it is possible to define several types of relations and is identified by a verb form; reflexive allows self loops in which an arc whose endpoints are the same concept and, finally synonym and antonym identified respectively by the labels synon and antony. We consider the previous example of meeting domain to show some relationships among concepts: kind of (meeting, conference) is a hierarchic relationship, has (meeting, utterances) is a composition relationship, lives (Person, Country), originates (Person, Country) and represents (Person, Country) are descriptive relationships and finally invites (Person, Person) is a reflexive relationship. c) Constraints Similar to the work presented in [5] and [6], we use four types of constraints which are respectively: 1) pre requisite constraint, 2) mutually inclusive constraint, 3) mutual exclusive constraint and 4) temporal constraint. d) Current versus future labels The requirements ontology is a labeled graph: special labels are added and exploited in order to indicate whether a concept, respectively a relationship belongs to current or future requirements. A concept respectively, a relationship belongs to either current requirements or future requirements but not to both at the same time. This is main structure characteristic that distinguishes the requirements ontology from the remaining domain ontologies. This classification of concepts/relationships is needed in the evaluation of the evolution cost. The details are not presented of this work because of lack of space. 3 Multi representation Strategy The multi representation strategy is well known in the object modeling field, as well as in the spatial databases. In [12], the multi representation strategy based on stamping in geographic databases is presented. On the other hand, in the object modeling, the multi representation is called semantic object views. It allows to make the object visible for certain applications and to hide it to others using the views mechanism. In this work, we focus on the multi presentation based on stamping. This
10 strategy consists in using stamps at the conceptual level in order to have different representations for the modeling of the same universe of discourse i.e. the modeling of the same real world. A stamp S is defined as a vector S=<s1, s2, Sn> where each s represents the representation of the real world. For example, in the following simple example, we have defined a stamp S =<S1, S2> in which, according to the element S1 of the stamp S, the conceptual schema contains the entities E1, E1 and the relationships A1. However, according to the element S2 of the stamp S, the conceptual schema contains the entities E1 and E2 and one relationship A1. This is illustrated in figure 5. Fig.5. A Simple Example using Multi Representation Based on Stamping The stamping mechanism is not a simple mechanism as it may appear. For example, in the case of successive evolutions on the database schema, the stamp components and the constraints on the stamps should be studied carefully in order to avoid any potential contradiction among them. 4 Motivating Examples In this section, we present examples of schema changes to illustrate how the predictive approach for evolution discussed in this paper works. The examples portray the three categories of changes presented previously, which are 1) the additive evolution, 2) the subtractive evolution and 3) the descriptive evolution. Integrity constraints are not presented in the following examples because of lack of space; however, they play an important role in enhancing the use of the ontology and in avoiding redundancy and contradiction among concepts (entities) and relationships. 4.1 Additive Evolution Adding an element on existing schema is not always obvious because there are two types of additive changes: simple and complex. For the simple additive change, the database administrator can use the functionalities of the DBMS (Database Management Systems) to add for example a table or an attribute. Whereas for the complex additive change, adding an element perturbs the dependency between existing elements and causes damaging effects on existing applications. Consequently, the logical schema is in inconsistent state and the associated applications do not work anymore. A case of complex additive change is illustrated in the side 1 of the figure 6 in which the addition of the entity E1 creates problems for existing applications. The way to resolve such a problem with the predictive consists in:
11 1 At the ontological level: the database designer examines whether the requirements ontology reveals the existence of concepts/relationships that belong to the category of future requirements and represent potential simple and complex additive changes. 2 At the conceptual level, the database designer incorporates these concepts/ relationships using the multi representation strategy based on stamping mechanism. The resulted conceptual schema represents consequently two universes of discourse (real world). This is illustrated in the side 2 of the figure 6. Ontological Layer Conceptual Layer Fig.6. Presentation of additive Evolution on a Simple Example with both Classical and Predictive approaches A case of complex and simple additive changes on the schema is illustrated in figure 7 where we consider the example that represents the modeling of a simple case of travel, in which the requirement is to determine the clients traveling around the world. S1 : Predicted Conceptual Schema S0 : initial Conceptual Schema Fig.7. Modelling a simple example of travel with both classical and predictive approaches In a classical design approach, the initial conceptual schema S0 contains four entities which are Client, Travel, City and Transport Company. However, in the predictive
12 approach, the schema S1 that has been proposed by the requirements ontology contains six entities. The two additional entities Insurance and Carrier represent two potential future changes on the schema that belong to simple and complex additive changes respectively. 4.2 Subtractive Evolution Subtractive evolution occurs when elements on the schema become negligible and are no longer required. However, deleting an element on existing schema is not always obvious because there are two types of subtractive changes: simple and complex. For the simple subtractive change, the database administrator can use the functionalities of the DBMS (Database Management Systems) to delete the non required elements. Whereas for the complex subtractive evolution, the DBMS does not offer any functions for it and the changes have direct and critical consequences on the schema and applications. A case of complex subtractive change is illustrated in the side 1 of the figure 8 in which the deletion of the entity E2 creates problems for the existing applications that need such entity. The way to resolve such a problem with the predictive consists in: 1 At the ontological level: the database designer examines whether the requirements ontology reveals the existence of concepts/relationships that belong to the category of current requirements and represent potential simple and complex subtractive changes that need to be changed in the future. 2 At the conceptual level: the database designer incorporates these concepts/ relationships using the multi representation strategy based on stamping mechanism. The resulted conceptual schema represents consequently two universes of discourse (real world). The whole process is illustrated in the side 2 of the figure 8. Ontological Layer Conceptual Layer Fig.8. Presentation of subtractive evolution on a simple example with both Classical and Predictive approaches
13 4.3 Descriptive Evolution Descriptive evolution is made for convenience or efficiency. It is the hardest to handle in traditional database systems because it implies more than one risky modification operation on the schema. The consequences of the changes on the schema are also critical, such as data loss. This is illustrated in the side 1 of the figure 9. Similarly to the previous, we follow the same steps for the resolution of this problem according to the predictive approach. The whole process is illustrated in the side 2 of the figure 9. Current Concept Current Concept Current Concept Ontological Layer Conceptual Layer Fig.9. Presentation of Descriptive evolution on a simple example with both Classical and Predictive approaches 5 Conclusion and future Work In this paper, we have presented another area where ontologies have a colossal potential of utilization and which is related to information systems. It concerns database schema evolution. The approach we propose belongs to a new tendency called the tendency of a priori approaches. It implies the investigation of potential future requirements besides the current requirements during the standard requirements analysis phase of schema design or redesign and their inclusion into the conceptual schema. Those requirements are determined with the help of a domain ontology called
14 a requirements ontology using data mining techniques and schema repository. The advantages of this approach include: 1) new perspectives in the way requirements are inspected and integrated into the schema, 2) two categories of database designers were taken into consideration, the category of those who design a schema from scratch and the category of those who redesign the schema from existing schemas using reverse engineering and dependency graphs, 3) the reinforcement of the conceptual schemas, 4) and finally the compatibility of the approach with any data model. The effectiveness of this approach for evolution is limited by the amount and the quality of the knowledge accumulated inside the requirements ontology. Therefore, we have taken into consideration the problem of the evolution of the requirements ontology as well. For this purpose, we have adopted the multirepresentation strategy based on stamping mechanism. In [13] a multi representation solution for ontologies is presented. This solution develops a language based on description logic (DL) [14] to implement the stamping mechanism. Unfortunately, this new approach is not without problems. Predictions of potential future requirements might not turn into effect. The changes that have been included might never be used; however they can be removed completely without any alteration on the database schema. Future work will proceed in both theoretical and practical directions. The theory will focus on extending the idea behind the requirements ontology and the stamping mechanism. The practical work consists in testing this approach significantly through several case studies with the use of a prototype that is under development. References 1. Connor, R. C. H., Q. I. Cutts, et al.: Using Persistence Technology to Control Schema Evolution. Proceedings of the ACM symposium on Applied computing Phoenix, Arizona, United States ACM Press New York, NY, United States, (1994) 2. Roddick, J. F :A survey of schema versioning issues for database systems. Information and Software Technology (37(7)): (1995) 3. Borgida, A. and K. E. Williamson: Accommodating Exceptions in Databases, and Refining the Schema by Learning from them. 11th International Conference on Very Large Data Bases, Stockholm, Sweden,, Morgan Kaufmann, (1985) 4. Benatallah, B. A :Unified Framework for Supporting Dynamic Schema Evolution in Object Database. 18th International Conference on Conceptual Modeling (ER99) Springer Verlag London, UK, (1999). 5. Sugumaran, V. and V. C. Storey: Ontologies for conceptual modeling: their creation, use, and management." Data Knowledge. Engineering 42(3): (2002) 6. Sugumaran, V. and V. C. Storey: An Ontology Based Framework for Generating and Improving Database Design. Natural Language Processing and Information Systems, 6th International Conference on Applications of Natural Language to Information Systems, NLDB, Stockholm, Sweden, Springer, (2002) 7. Kroenke, D. M.: Database Processing: Fundamentals, Design and Implementation. Database Processing: Fundamentals, Design and Implementation. G. S. d. Acevedo, Pearson Prentice Hall: (2004) 8. WordNet: 9. Lacy, L. W.: OWL: Representing Information Using the Web Ontology Language. (2005) 10. Bounif H., Pottinger R.: Schema Repository for Database Schema Evolution, 2 nd international workshop on Data Management in Global Data Repositories (GREP) 2006 at International Conference on Database and Expert Systems Applications 06
15 11. Aamodt, A. and E. Plaza."Case Based Reasoning: Foundational Issues, Methodological Variations, and Systems Approaches." AI Communications 7(1): (1994) 12. Spaccapietra S. et al.:supporting Multiple Representations in Spatio Temporal databases. In Proceedings of the 6th EC GI & GIS Workshop, Lyon, France, June (2000), 13. Benslimane D. et al.: Multi representation in ontologies. Proceedings of 7th East European Conference on Advances in Databases and Information Systems, ADBIS 2003, Dresden, Germany, September 3 6, (2003). 14. Baader, F. and W. Nutt: Basic Description Logics. The Description Logic Handbook: theory, implementation and application. F. Baader: 43 95, Cambridge University Press. (2003).
DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION
DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION Katalin Tóth, Vanda Nunes de Lima European Commission Joint Research Centre, Ispra, Italy ABSTRACT The proposal for the INSPIRE
BUSINESS RULES AS PART OF INFORMATION SYSTEMS LIFE CYCLE: POSSIBLE SCENARIOS Kestutis Kapocius 1,2,3, Gintautas Garsva 1,2,4
International Conference 20th EURO Mini Conference Continuous Optimization and Knowledge-Based Technologies (EurOPT-2008) May 20 23, 2008, Neringa, LITHUANIA ISBN 978-9955-28-283-9 L. Sakalauskas, G.W.
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, [email protected] Abstract: Independent
Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001
A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS
I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION
Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University
A terminology model approach for defining and managing statistical metadata
A terminology model approach for defining and managing statistical metadata Comments to : R. Karge (49) 30-6576 2791 mail [email protected] Content 1 Introduction... 4 2 Knowledge presentation...
Report on the Dagstuhl Seminar Data Quality on the Web
Report on the Dagstuhl Seminar Data Quality on the Web Michael Gertz M. Tamer Özsu Gunter Saake Kai-Uwe Sattler U of California at Davis, U.S.A. U of Waterloo, Canada U of Magdeburg, Germany TU Ilmenau,
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
DATABASE DEVELOPMENT LIFE CYCLE
DATABASE DEVELOPMENT LIFE CYCLE Pranshu Gupta 1 Ramon A. Mata-Toledo 2 Morgan D. Monger 3 Abstract A software development life cycle model (SDLC) consists of a set of processes (planning, requirements,
2 AIMS: an Agent-based Intelligent Tool for Informational Support
Aroyo, L. & Dicheva, D. (2000). Domain and user knowledge in a web-based courseware engineering course, knowledge-based software engineering. In T. Hruska, M. Hashimoto (Eds.) Joint Conference knowledge-based
A Contribution to Expert Decision-based Virtual Product Development
A Contribution to Expert Decision-based Virtual Product Development László Horváth, Imre J. Rudas Institute of Intelligent Engineering Systems, John von Neumann Faculty of Informatics, Óbuda University,
CASSANDRA: Version: 1.1.0 / 1. November 2001
CASSANDRA: An Automated Software Engineering Coach Markus Schacher KnowGravity Inc. Badenerstrasse 808 8048 Zürich Switzerland Phone: ++41-(0)1/434'20'00 Fax: ++41-(0)1/434'20'09 Email: [email protected]
Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms
Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Irina Astrova 1, Bela Stantic 2 1 Tallinn University of Technology, Ehitajate tee 5, 19086 Tallinn,
DATA QUALITY DATA BASE QUALITY INFORMATION SYSTEM QUALITY
DATA QUALITY DATA BASE QUALITY INFORMATION SYSTEM QUALITY The content of those documents are the exclusive property of REVER. The aim of those documents is to provide information and should, in no case,
Implementation of hybrid software architecture for Artificial Intelligence System
IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.1, January 2007 35 Implementation of hybrid software architecture for Artificial Intelligence System B.Vinayagasundaram and
1 File Processing Systems
COMP 378 Database Systems Notes for Chapter 1 of Database System Concepts Introduction A database management system (DBMS) is a collection of data and an integrated set of programs that access that data.
WHITE PAPER DATA GOVERNANCE ENTERPRISE MODEL MANAGEMENT
WHITE PAPER DATA GOVERNANCE ENTERPRISE MODEL MANAGEMENT CONTENTS 1. THE NEED FOR DATA GOVERNANCE... 2 2. DATA GOVERNANCE... 2 2.1. Definition... 2 2.2. Responsibilities... 3 3. ACTIVITIES... 6 4. THE
æ A collection of interrelated and persistent data èusually referred to as the database èdbèè.
CMPT-354-Han-95.3 Lecture Notes September 10, 1995 Chapter 1 Introduction 1.0 Database Management Systems 1. A database management system èdbmsè, or simply a database system èdbsè, consists of æ A collection
Facilitating Business Process Discovery using Email Analysis
Facilitating Business Process Discovery using Email Analysis Matin Mavaddat [email protected] Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Metadata Management for Data Warehouse Projects
Metadata Management for Data Warehouse Projects Stefano Cazzella Datamat S.p.A. [email protected] Abstract Metadata management has been identified as one of the major critical success factor
Software Engineering from an Engineering Perspective: SWEBOK as a Study Object
Software Engineering from an Engineering Perspective: SWEBOK as a Study Object Alain Abran a,b, Kenza Meridji b, Javier Dolado a a Universidad del País Vasco/Euskal Herriko Unibertsitatea b Ecole de technologie
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
Clinical Database Information System for Gbagada General Hospital
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 2, Issue 9, September 2015, PP 29-37 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org
Building Ontology Networks: How to Obtain a Particular Ontology Network Life Cycle?
See discussions, stats, and author profiles for this publication at: http://www.researchgate.net/publication/47901002 Building Ontology Networks: How to Obtain a Particular Ontology Network Life Cycle?
Context Capture in Software Development
Context Capture in Software Development Bruno Antunes, Francisco Correia and Paulo Gomes Knowledge and Intelligent Systems Laboratory Cognitive and Media Systems Group Centre for Informatics and Systems
The process of database development. Logical model: relational DBMS. Relation
The process of database development Reality (Universe of Discourse) Relational Databases and SQL Basic Concepts The 3rd normal form Structured Query Language (SQL) Conceptual model (e.g. Entity-Relationship
Distributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
How To Write A Diagram
Data Model ing Essentials Third Edition Graeme C. Simsion and Graham C. Witt MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF ELSEVIER AMSTERDAM BOSTON LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE
Clustering Technique in Data Mining for Text Documents
Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor
Component visualization methods for large legacy software in C/C++
Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University [email protected]
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
Improving Interoperability in Mechatronic Product Developement. Dr. Alain Biahmou, Dr. Arnulf Fröhlich, Dr. Josip Stjepandic
International Conference on Product Lifecycle Management 1 Improving Interoperability in Mechatronic Product Developement Dr. Alain Biahmou, Dr. Arnulf Fröhlich, Dr. Josip Stjepandic PROSTEP AG Dolivostr.
Chapter 8 The Enhanced Entity- Relationship (EER) Model
Chapter 8 The Enhanced Entity- Relationship (EER) Model Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Outline Subclasses, Superclasses, and Inheritance Specialization
Answers to Review Questions
Tutorial 2 The Database Design Life Cycle Reference: MONASH UNIVERSITY AUSTRALIA Faculty of Information Technology FIT1004 Database Rob, P. & Coronel, C. Database Systems: Design, Implementation & Management,
Chapter 10 Practical Database Design Methodology and Use of UML Diagrams
Chapter 10 Practical Database Design Methodology and Use of UML Diagrams Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 10 Outline The Role of Information Systems in
A Semantic Model for Multimodal Data Mining in Healthcare Information Systems
A Semantic Model for Multimodal Data Mining in Healthcare Information Systems Dimitris IAKOVIDIS 1 and Christos SMAILIS Department of Informatics and Computer Technology, Technological Educational Institute
REA Business Management Ontology: Conceptual Modeling of Accounting, Finance and Management Control
REA Management Ontology: Conceptual Modeling of Accounting, Finance and Management Control Walter S.A. Schwaiger Institute of Management Science TU Wien, Vienna, Austria [email protected] Abstract.
Module 2. Software Life Cycle Model. Version 2 CSE IIT, Kharagpur
Module 2 Software Life Cycle Model Lesson 4 Prototyping and Spiral Life Cycle Models Specific Instructional Objectives At the end of this lesson the student will be able to: Explain what a prototype is.
Chapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases
Elite: A New Component-Based Software Development Model
Elite: A New Component-Based Software Development Model Lata Nautiyal Umesh Kumar Tiwari Sushil Chandra Dimri Shivani Bahuguna Assistant Professor- Assistant Professor- Professor- Assistant Professor-
HELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
Masters in Information Technology
Computer - Information Technology MSc & MPhil - 2015/6 - July 2015 Masters in Information Technology Programme Requirements Taught Element, and PG Diploma in Information Technology: 120 credits: IS5101
Data Modeling Basics
Information Technology Standard Commonwealth of Pennsylvania Governor's Office of Administration/Office for Information Technology STD Number: STD-INF003B STD Title: Data Modeling Basics Issued by: Deputy
Busting 7 Myths about Master Data Management
Knowledge Integrity Incorporated Busting 7 Myths about Master Data Management Prepared by: David Loshin Knowledge Integrity, Inc. August, 2011 Sponsored by: 2011 Knowledge Integrity, Inc. 1 (301) 754-6350
Information Services for Smart Grids
Smart Grid and Renewable Energy, 2009, 8 12 Published Online September 2009 (http://www.scirp.org/journal/sgre/). ABSTRACT Interconnected and integrated electrical power systems, by their very dynamic
Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql
Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql Xiaofeng Meng 1,2, Yong Zhou 1, and Shan Wang 1 1 College of Information, Renmin University of China, Beijing 100872
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System Mohammad Ghulam Ali Academic Post Graduate Studies and Research Indian Institute of Technology, Kharagpur Kharagpur,
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
(Refer Slide Time: 01:52)
Software Engineering Prof. N. L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture - 2 Introduction to Software Engineering Challenges, Process Models etc (Part 2) This
KNOWLEDGE ORGANIZATION
KNOWLEDGE ORGANIZATION Gabi Reinmann Germany [email protected] Synonyms Information organization, information classification, knowledge representation, knowledge structuring Definition The term
Configuration Management Models in Commercial Environments
Technical Report CMU/SEI-91-TR-7 ESD-9-TR-7 Configuration Management Models in Commercial Environments Peter H. Feiler March 1991 Technical Report CMU/SEI-91-TR-7 ESD-91-TR-7 March 1991 Configuration Management
Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery
Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery Dimitrios Kourtesis, Iraklis Paraskakis SEERC South East European Research Centre, Greece Research centre of the University
THE ROLE OF KNOWLEDGE MANAGEMENT SYSTEM IN SCHOOL: PERCEPTION OF APPLICATIONS AND BENEFITS
THE ROLE OF KNOWLEDGE MANAGEMENT SYSTEM IN SCHOOL: PERCEPTION OF APPLICATIONS AND BENEFITS YOHANNES KURNIAWAN Bina Nusantara University, Department of Information Systems, Jakarta 11480, Indonesia E-mail:
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations Binomol George, Ambily Balaram Abstract To analyze data efficiently, data mining systems are widely using datasets
EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION
EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION Anna Goy and Diego Magro Dipartimento di Informatica, Università di Torino C. Svizzera, 185, I-10149 Italy ABSTRACT This paper proposes
Chapter 13 BUILDING INFORMATION SYSTEMS. How does building new systems produce organizational change?
MANAGING THE DIGITAL FIRM, 12 TH EDITION Learning Objectives Chapter 13 BUILDING INFORMATION SYSTEMS VIDEO CASES Case 1: IBM: Business Process Management in a Service Oriented Architecture and Managing
DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach
DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach Cécile Favre, Fadila Bentayeb, Omar Boussaid ERIC Laboratory, University of Lyon, 5 av. Pierre Mendès-France, 69676 Bron Cedex, France
On- and Off-Line User Interfaces for Collaborative Cloud Services
On- and Off-Line User Interfaces for Collaborative Cloud Services Wolfgang Stuerzlinger York University, Dept of Computer Science & Engineering 4700 Keele Street Toronto, Canada http://www.cse.yorku.ca/~wolfgang
Semantic-ontological combination of Business Rules and Business Processes in IT Service Management
Semantic-ontological combination of Business Rules and Business Processes in IT Service Management Alexander Sellner 1, Christopher Schwarz 1, Erwin Zinser 1 1 FH JOANNEUM University of Applied Sciences,
A Meta-model of Business Interaction for Assisting Intelligent Workflow Systems
A Meta-model of Business Interaction for Assisting Intelligent Workflow Systems Areti Manataki and Yun-Heh Chen-Burger Centre for Intelligent Systems and their Applications, School of Informatics, The
A semantic extension of a hierarchical storage management system for small and medium-sized enterprises.
Faculty of Computer Science Institute of Software- and Multimedia Technology, Chair of Multimedia Technology A semantic extension of a hierarchical storage management system for small and medium-sized
A Model-driven Approach to Predictive Non Functional Analysis of Component-based Systems
A Model-driven Approach to Predictive Non Functional Analysis of Component-based Systems Vincenzo Grassi Università di Roma Tor Vergata, Italy Raffaela Mirandola {vgrassi, mirandola}@info.uniroma2.it Abstract.
Database Design Methodology
Database Design Methodology Three phases Database Design Methodology Logical database Physical database Constructing a model of the information used in an enterprise on a specific data model but independent
Tool Support for Model Checking of Web application designs *
Tool Support for Model Checking of Web application designs * Marco Brambilla 1, Jordi Cabot 2 and Nathalie Moreno 3 1 Dipartimento di Elettronica e Informazione, Politecnico di Milano Piazza L. Da Vinci,
Context Model Based on Ontology in Mobile Cloud Computing
Context Model Based on Ontology in Mobile Cloud Computing Changbok Jang, Euiin Choi * Dept. Of Computer Engineering, Hannam University, Daejeon, Korea [email protected], [email protected] Abstract.
Reusable Knowledge-based Components for Building Software. Applications: A Knowledge Modelling Approach
Reusable Knowledge-based Components for Building Software Applications: A Knowledge Modelling Approach Martin Molina, Jose L. Sierra, Jose Cuena Department of Artificial Intelligence, Technical University
The Role of Requirement Engineering in Software Development Life Cycle 1
The Role of Engineering in Software Development Life Cycle 1 Abhijit Chakraborty, 2 Mrinal Kanti Baowaly, 3 Ashraful Arefin, 4 Ali Newaz Bahar 1, 2 Department of Computer Science and Telecommunication
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Business Intelligence: Recent Experiences in Canada
Business Intelligence: Recent Experiences in Canada Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies 2 Business Intelligence
Investigating Role of Service Knowledge Management System in Integration of ITIL V3 and EA
Investigating Role of Service Knowledge Management System in Integration of ITIL V3 and EA Akbar Nabiollahi, Rose Alinda Alias, Shamsul Sahibuddin Faculty of Computer Science and Information System Universiti
Reading Questions. Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented?
Reading Questions Week two Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented? 2. What sort of problems are GIS designed to solve?
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint Christian Fillies 1 and Frauke Weichhardt 1 1 Semtation GmbH, Geschw.-Scholl-Str. 38, 14771 Potsdam, Germany {cfillies,
How To Evaluate Web Applications
A Framework for Exploiting Conceptual Modeling in the Evaluation of Web Application Quality Pier Luca Lanzi, Maristella Matera, Andrea Maurino Dipartimento di Elettronica e Informazione, Politecnico di
IAI : Expert Systems
IAI : Expert Systems John A. Bullinaria, 2005 1. What is an Expert System? 2. The Architecture of Expert Systems 3. Knowledge Acquisition 4. Representing the Knowledge 5. The Inference Engine 6. The Rete-Algorithm
Databases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
COMPARING MATRIX-BASED AND GRAPH-BASED REPRESENTATIONS FOR PRODUCT DESIGN
12 TH INTERNATIONAL DEPENDENCY AND STRUCTURE MODELLING CONFERENCE, 22 23 JULY 2010, CAMBRIDGE, UK COMPARING MATRIX-BASED AND GRAPH-BASED REPRESENTATIONS FOR PRODUCT DESIGN Andrew H Tilstra 1, Matthew I
INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED 01 GLASGOW, AUGUST 21-23, 2001
INTERNATIONAL CONFERENCE ON ENGINEERING ICED 01 GLASGOW, AUGUST 21-23, 2001 REDUCING DEVELOPMENT CYCLE BY DATA MANAGEMENT WITHIN THE OFFICE Mario Storga, Davor Pavlic and Dorian Marjanovic Keywords: Product
How To Develop Software
Software Engineering Prof. N.L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture-4 Overview of Phases (Part - II) We studied the problem definition phase, with which
DATABASE MANAGEMENT SYSTEMS
CHAPTER DATABASE MANAGEMENT SYSTEMS This chapter reintroduces the term database in a more technical sense than it has been used up to now. Data is one of the most valuable assets held by most organizations.
OWL based XML Data Integration
OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,
Project Knowledge Management Based on Social Networks
DOI: 10.7763/IPEDR. 2014. V70. 10 Project Knowledge Management Based on Social Networks Panos Fitsilis 1+, Vassilis Gerogiannis 1, and Leonidas Anthopoulos 1 1 Business Administration Dep., Technological
A Business Process Services Portal
A Business Process Services Portal IBM Research Report RZ 3782 Cédric Favre 1, Zohar Feldman 3, Beat Gfeller 1, Thomas Gschwind 1, Jana Koehler 1, Jochen M. Küster 1, Oleksandr Maistrenko 1, Alexandru
