Logical and categorical methods in data transformation (TransLoCaTe)


1 Introduction to the abbreviated project description

This is an abbreviated project description of the TransLoCaTe project, with an emphasis on the project's second part (Section 2.2), for which a PhD student is currently being sought.

The TransLoCaTe project consists of two interacting parts: the first (run by H. Forssell) is more abstract and theoretical, and the second (the PhD project) is more applied. The first part will investigate and develop new techniques, perspectives, and ideas for the field of database representation outside the current paradigm, the relational model, in order to address the deficiencies of that model, especially in the area of data transformation (in a wide sense of the term, including transforming to or from ontologies). The second part will begin at the other end, with the current research frontier in data transformation, particularly in the new field of ontology-based data access. While the first part will start by developing a new, abstract framework for representing data, with the aim of benefiting current developments in the field, the second part will start with an analysis of the current challenges and methods of the field and then develop a framework for addressing them. In a manner of speaking, the first part has techniques and ideas and wants to explore what results can be obtained from them, while the second knows what results it wants to obtain and is looking for techniques to obtain them. The two parts are to interact continuously and feed off each other.

2 Background and status of knowledge

2.1 Models of databases and data transformation

Since databases are, essentially, collections of (possibly interrelated) tables of data, the foundational question is how best to represent such collections of tables mathematically, in order to study their properties and find suitable ways to manipulate them. The dominant mathematical model since its invention by E. F. Codd [5] has been the so-called relational model, which has provided a powerful, yet quite simple, theoretical tool. Although the relational model has been very successful, there are areas in which it is less adequate than in others. This is the case, for instance, in how it represents missing information and, more centrally for us, in the means it provides to compare and transform data structured in different ways. Because the paradigm is so well entrenched, it can be difficult for researchers to think along different lines in such areas. It is the task of the project's first part to think outside the relational box and to bring techniques and results from areas such as category theory, categorical logic, and logic more widely to the question of representing data and transformations of data.

The relational model of databases

A database being, then, roughly a collection of tables, its shape or schema can be specified by giving the number of tables and the number of columns in each table, e.g. as a list of table names, each with an associated list of (distinct) column names, referred to as attributes. A particular collection of tables of this shape, an instance of the database schema, is then a filling out of rows in the form of an assignment of a (finite) set of tuples, of the correct arity, to each table name. The situation can thus be represented logically by letting a database schema be a (first-order, finite, relational) signature and each instance be a (finite) structure in the usual model-theoretic sense. Queries then correspond to formulas over the signature, and constraints, at least those so-called dependencies that can be formulated as query inclusions, correspond to axioms. In practice (see [1]), it suffices to consider axioms that can be formulated as implications or sequents of a quite restricted class of formulas known as positive-primitive (in e.g. [16]) or regular (in e.g. [14]). Thus, fruitfully if somewhat simplified, the theory of databases can be seen as the (finite) model theory of regular theories (in the sense of [14]) over a finite relational signature. With this model in hand, much of the theory of databases is now quite well understood, at least as concerns the static picture, that is, the properties of instances of a fixed schema.

Data transformation

In somewhat sweeping generality, we take data transformation here to involve converting data from a source schema to a target schema in a suitable way, e.g. to produce an instance of the target schema from an instance of the source schema, or to define a set of valid target instances with respect to a given source instance, or even to answer queries formulated over the target schema with respect to source instances without moving any data. Accordingly, we take data transformation to include both the data exchange of e.g. [8] and the data integration of e.g. [15].
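As a concrete reading of the relational picture described above, the following minimal Python sketch treats a schema as a signature, an instance as a finite structure, queries as set-valued functions, and a dependency as a query inclusion. The relation names (Employee, Dept), the attribute names, and the data are hypothetical and chosen only for illustration; nothing here is taken from the project itself.

```python
# Hypothetical schema (signature): relation name -> tuple of attribute names.
SCHEMA = {
    "Employee": ("name", "dept"),
    "Dept": ("dept", "city"),
}

# An instance assigns a finite set of tuples of the right arity to each name.
INSTANCE = {
    "Employee": {("ada", "research"), ("bob", "sales")},
    "Dept": {("research", "oslo"), ("sales", "bergen")},
}


def is_instance(schema, instance):
    """Check that the instance covers exactly the schema's relations
    and that every tuple has the arity the schema demands."""
    return set(instance) == set(schema) and all(
        len(row) == len(schema[name])
        for name, rows in instance.items()
        for row in rows
    )


def employee_depts(instance):
    """Query { d | exists n. Employee(n, d) }."""
    return {d for (_, d) in instance["Employee"]}


def known_depts(instance):
    """Query { d | exists c. Dept(d, c) }."""
    return {d for (d, _) in instance["Dept"]}


def satisfies_inclusion(instance):
    """Dependency stated as a query inclusion:
    employee_depts is contained in known_depts."""
    return employee_depts(instance) <= known_depts(instance)
```

In this sketch the inclusion dependency plays the role of an axiom: an instance that mentions a department in Employee without a matching Dept row is a structure over the signature, but not a model of the theory.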
Now, in the relational model, once you fix a schema you have a well-defined and well-understood mathematical object formed by the database instances structured over that schema: you have the category of instances and homomorphisms, in which you have operations such as products and techniques such as the chase algorithm. But the model does not tell you how to compare or transform instances structured over different schemas. One immediate suggestion is to use the notion of theory translation from mathematical logic and the functors (mappings, if you prefer) between instances that these induce. However, a much-studied approach over the last decade is instead simply to define a relation on the instances of two different schemas in the familiar terms of dependencies. Such a data transformation setting is given by a source schema S, a target schema T (possibly with dependencies of its own), and a schema mapping in the form of a set Σ_st of source-to-target dependencies, that is, query inclusions in which source schema queries are included in target schema queries. As such, the approach can be seen as defining a transformation from S to T by defining a third schema S ∪ T consisting of a copy of S, a copy of T, and the new source-to-target dependencies. One can then study the mapping with the usual tried and tested relational notions and techniques (certain answers, the chase algorithm, etc.). We shall return to this picture in more detail in Section 2.2, where it forms part of the basic set-up (the target schema there being that of an ontology, which introduces additional questions and an additional need for abstraction). However, although one may need or want to define a transformation in precisely this way in certain practical situations, this approach can hardly be said to constitute a general, flexible, and principled approach to comparing, mapping, and transforming instances of different database schemas.
Rather, it is the study of how you can relate two schemas by adding extra dependencies, and notions that one would expect to be basic for a concept of schema mapping, such as composition and inverse, are highly non-trivial [7, 9].

2.2 Mappings for heterogeneous schemas for ontology-based data access

The second part of the project is directed at the recent area of research that has become known as ontology-based data access (OBDA), an approach to the problem of handling and accessing big data by means of information systems that use ontological reasoning.

The problem of handling large amounts of data from heterogeneous and distributed data sources, also known as the problem of big data, is currently a challenge in many applications. On the Web, sources of semi-structured, overlapping, and semantically related data are proliferating at a phenomenal rate. Likewise in industry, many companies amass large collections of semantically related data. This state of affairs has created a demand for more powerful and flexible information systems (ISs). This new generation of ISs will need to integrate incomplete and semi-structured information from heterogeneous sources, employ rich and flexible schemas, and answer queries by taking into account both knowledge and data.

Ontology-based data access [20] has recently been proposed as an architectural principle for such systems. The main idea is to develop a unified view of the data by describing the relevant domain in an ontology, which then provides the vocabulary used to ask queries. This forms an instance of a data transformation setting in the general sense of Section 2.1: queries are posed over a target schema, now in the form of an ontology, and the data to answer them are structured over a source (database) schema, or several. The advantage of an ontology-based IS is that it can use ontological statements, such as the concept hierarchy and other axioms, to derive new facts and thus enrich query answers with implicit knowledge. The ontology thus mediates between the different data sources, giving users a unified view of the data in a suitable language. This idea has been incorporated into systems such as QuOnto [2], Owlgres, ROWLKit [6], and REQUIEM [19], and ontology reasoners such as RACER [13], FaCT++ [23], Pellet [18], and HermiT [17].

To accomplish this, ontology-based ISs need to combine reasoning and query answering over ontologies with building and maintaining collections of mappings between ontologies and data sources. In current approaches, the notion of mapping employed is akin to that sketched in Section 2.1, in that mappings are defined in terms of rules φ → ψ relating a query φ over the data sources to a query ψ over the ontology. As such, ontology-based information systems can be seen as a variation of data integration (see e.g. [15]), where data stored under one relational schema needs to be available for query answering over a different schema. The main difference from data integration is that mappings in ontology-based data access map not between two schemas, but between a schema and an ontology. Thus the problems related to mappings in ontology-based ISs are to some extent the same as, or similar to, those in database-to-database transformations, but the new set-up also presents some unique challenges. The following three problem areas form the starting point for the second part of the project:

Mapping between heterogeneous schemas

As noted, existing research on mappings has been done in the context of data integration and data exchange, where information in a source database, with a schema S, needs to be expressed using a different target schema T [22], via mappings that govern the transformation. In data exchange, the goal is to create a new database over T containing, as far as possible, the information from the source database. In data integration, on the other hand, the data stays as is, while the system allows users to pose queries using schema T. In both cases, the schemas S and T are over the same language, usually that of relational databases [3, 12]. In this setting, a schema mapping is a first-order logic formula φ → ψ, where e.g. φ is a conjunction of atomic formulas from the source schema and ψ is a conjunction of atomic formulas from the target schema. Informally, such a mapping states that whenever a pattern of facts appears in the source database, a corresponding pattern must appear in the target. Such mappings are known as tuple-generating dependencies (tgds), and their expressive power and the complexity of working with them have been extensively studied [11, 22]. In OBDA, however, the mappings are between two schemas over different languages, usually between a database schema (first-order logic) and an ontology (description logic).
As such, it is not clear which results from data integration and exchange carry over to this setting, or to what extent the different classes of tgds that have been defined and studied in the literature make sense for OBDA. In sum, the fact that in OBDA one needs to relate two different kinds of entities, namely databases and ontologies, presents a challenge, invalidates some of the previous research, and indicates the need for a more abstract approach.
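To make the informal reading of a tgd φ → ψ concrete, the following minimal Python sketch applies one hypothetical source-to-target tgd, Emp(n, d) → ∃m. WorksIn(n, d) ∧ ManagedBy(d, m), to a small source instance. The relation names, the data, and the labelled-null naming scheme (_N1, _N2, ...) are assumptions made for illustration. The procedure is a simplified chase-style construction for this single rule in the homogeneous relational setting, not an algorithm taken from the cited literature, and it says nothing about the database-to-ontology case.

```python
from itertools import count


def chase(source):
    """Build a target instance satisfying the hypothetical tgd
    Emp(n, d) -> exists m. WorksIn(n, d) AND ManagedBy(d, m).

    The existential variable m is witnessed by a fresh labelled null
    each time the rule fires."""
    fresh = count(1)
    target = {"WorksIn": set(), "ManagedBy": set()}
    for n, d in sorted(source["Emp"]):  # sorted only for deterministic nulls
        satisfied = (n, d) in target["WorksIn"] and any(
            dept == d for dept, _ in target["ManagedBy"]
        )
        if not satisfied:
            target["WorksIn"].add((n, d))
            target["ManagedBy"].add((d, f"_N{next(fresh)}"))
    return target


def satisfies_tgd(source, target):
    """Whenever the source pattern occurs, the target pattern must occur."""
    return all(
        (n, d) in target["WorksIn"]
        and any(dept == d for dept, _ in target["ManagedBy"])
        for n, d in source["Emp"]
    )


source = {"Emp": {("ada", "research"), ("bob", "research")}}
target = chase(source)
```

The labelled nulls stand for managers whose identity the source does not record. Here both employees share a department, yet the construction introduces one null per firing of the rule, which illustrates why the resulting target instance is only one of many that satisfy the mapping.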

Query answering with mappings

Numerous query answering algorithms have been developed in both the ontology and the database settings. Mainstream RDBMSs currently employ sophisticated query optimisation techniques based on the assumption that the database instance already satisfies the dependencies, a valid assumption when dependencies are used only as checks. Query answering becomes much harder if an ontology, or a set of dependencies, needs to be taken into account. A number of (worst-case) complexity results are known for answering queries over DLs [4, 10] and dependencies [1]. Currently, all practical approaches to building ontology-based ISs rely on query answering via query rewriting: answers to a query q over an ontology O and a database instance DB are computed by first rewriting q (using O) into another query q′ and then evaluating q′ over DB. Query rewriting is particularly suitable in scenarios where the ontology-based IS has no direct control over the data and cannot modify it. The query obtained through rewriting depends to a significant extent on the choice of mappings. Realistic algorithms therefore need to work with and analyze the mappings; treating the mappings as a given is likely to lead, for example, to redundant queries being executed over the data sources. Knowing which restrictions on mappings make for efficient query answering would therefore be beneficial for OBDA. Current research in this area [21] uses fairly simple mappings, and the topic is thus still largely unexplored.

Managing mappings in the face of change

It is not clear how mappings can be maintained in the face of changes in the data sources or in the ontology. In fact, it is not even clear what operations on mappings are required. Here, again, the question of which queries φ and ψ a mapping rule φ → ψ allows becomes significant. Previous work on mapping management [3] is very recent, and has looked at mappings in the context of data integration and exchange.
As discussed above, such results may or may not apply to OBDA settings, owing to the heterogeneity of the schemas. On the one hand, some types of changes in the data sources, such as adding or altering a table, can to some extent be accommodated within existing frameworks for data integration and exchange [3]. On the other hand, other types of changes, such as the deletion of an axiom in the ontology, do not fit into this framework at all. For example, the removal of a subclass axiom may necessitate the addition of a new rule to avoid information loss. To discover and repair problems arising in this manner, a system for mapping maintenance must consider not only the mappings, but also the ontology and possibly the data sources. That is to say, in a dynamic setting, mappings will have to be maintained in the face of changing source databases as well as changing target ontologies. This calls for a conceptual framework that is able to encompass heterogeneous database-to-ontology transformations together with homogeneous database-to-database and ontology-to-ontology schema changes.

2.3 The Optique project

The Scalable End-user Access to Big Data project (short name: Optique), an FP7 Large-scale Integrating Project that runs until November 2016, aims at applying ontology-based data access to large industry use cases from Siemens and Statoil. Optique has a component that addresses ontology and mapping management, and has assembled a consortium of world-leading experts in the domain of databases and ontology-based information systems in order to implement new practical solutions to the problems of managing large collections of mappings. The TransLoCaTe project will work in close collaboration with Optique so as to exploit the results, insights, and use case information from Optique, complementing the practical focus of Optique with foundational theoretical research.

2.4 Interaction

The project consists of two parts, one top-down, starting from abstract methods and ideas, and one bottom-up, starting from the concrete problems facing an area of current research in data transformation. It is clear that the two parts are at the outset some distance apart. The first part will investigate abstract models which it believes will be suitable for data transformation settings, but it does not have techniques ready that are established as suitable for the problems facing the second part. The second part will investigate problems facing an area of data integration which it believes calls for more abstract methods, but it has not identified those methods as being precisely those that the first part sets out to investigate. Nevertheless, each part stands to benefit extensively from the other, since each fills a certain void in the other. For the first part, the idea is that one should model databases in a way more suitable for dynamic settings, but the abstractions and approaches do not spring directly from current applications. The focus on, concern for, and hands-on expertise with such applications is what the second part brings to the project. For the second part, the data transformation challenges it sets out to solve are rather clear, but they seem to call for a more interdisciplinary and more abstract approach than what is currently being employed. The focus on and experience with such approaches is what the first part brings to the project. In order to keep in close contact with database users and the database community, as well as with ongoing research in mapping management for ontology-based data access, the project will from its start work in close collaboration with the Optique project.

References

[1] S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley.
[2] A. Acciarri, D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, M. Palmieri, and R. Rosati. QuOnto: Querying Ontologies. In Proc. AAAI.
[3] M. Arenas, J. Pérez, J. L. Reutter, and C. Riveros. Foundations of schema mapping management. In J. Paredaens and D. V. Gucht, editors, Proceedings of the 29th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS '10). ACM.
[4] D. Calvanese, G. De Giacomo, and M. Lenzerini. On the Decidability of Query Containment under Constraints. In Proc. PODS.
[5] E. F. Codd. A relational model of data for large shared data banks. Communications of the ACM, 13(6).
[6] C. Corona, M. Ruzzi, and D. F. Savo. Filling the gap between OWL 2 QL and QuOnto: ROWLKit. In Proc. DL.
[7] R. Fagin. Inverting schema mappings. In PODS '06: Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 50-59.
[8] R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data exchange: Semantics and query answering. Theoretical Computer Science, 336(1):89-124.
[9] R. Fagin, P. G. Kolaitis, L. Popa, and W.-C. Tan. Composing schema mappings: Second-order dependencies to the rescue. ACM Transactions on Database Systems (TODS), 30(4).
[10] B. Glimm, I. Horrocks, C. Lutz, and U. Sattler. Conjunctive Query Answering for the Description Logic SHIQ. J. Artif. Intell. Res., 31.
[11] G. Gottlob, R. Pichler, and V. Savenkov. Normalization and optimization of schema mappings. The VLDB Journal, 20(2).
[12] G. Gottlob and P. Senellart. Schema mapping discovery from data instances. Journal of the ACM, 57(2).
[13] V. Haarslev and R. Möller. RACER System Description. In Proc. IJCAR.
[14] P. T. Johnstone. Sketches of an Elephant, volumes 43 and 44 of Oxford Logic Guides. Clarendon Press, Oxford.
[15] M. Lenzerini. Data integration: A theoretical perspective. In Proceedings of the ACM Symposium on Principles of Database Systems.
[16] M. Makkai. A theorem on Barr-exact categories, with an infinitary generalization. Annals of Pure and Applied Logic, 47.
[17] B. Motik, R. Shearer, and I. Horrocks. Hypertableau Reasoning for Description Logics. J. Artif. Intell. Res., 36.
[18] B. Parsia and E. Sirin. Pellet: An OWL-DL Reasoner. Poster at ISWC.
[19] H. Pérez-Urbina, I. Horrocks, and B. Motik. Efficient Query Answering for OWL 2. In Proc. ISWC.
[20] A. Poggi, D. Lembo, D. Calvanese, G. De Giacomo, M. Lenzerini, and R. Rosati. Linking Data to Ontologies. J. Data Semantics, 10.
[21] A. Poggi, D. Lembo, D. Calvanese, G. De Giacomo, M. Lenzerini, and R. Rosati. Linking data to ontologies. In S. Spaccapietra, editor, Journal on Data Semantics X, volume 4900 of Lecture Notes in Computer Science. Springer.
[22] B. ten Cate and P. G. Kolaitis. Structural characterizations of schema-mapping languages. Communications of the ACM, 53(1), Jan.
[23] D. Tsarkov and I. Horrocks. FaCT++ Description Logic Reasoner: System Description. In Proc. IJCAR.

Data Quality in Ontology-Based Data Access: The Case of Consistency

Data Quality in Ontology-Based Data Access: The Case of Consistency Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Data Quality in Ontology-Based Data Access: The Case of Consistency Marco Console, Maurizio Lenzerini Dipartimento di Ingegneria

More information

On Rewriting and Answering Queries in OBDA Systems for Big Data (Short Paper)

On Rewriting and Answering Queries in OBDA Systems for Big Data (Short Paper) On Rewriting and Answering ueries in OBDA Systems for Big Data (Short Paper) Diego Calvanese 2, Ian Horrocks 3, Ernesto Jimenez-Ruiz 3, Evgeny Kharlamov 3, Michael Meier 1 Mariano Rodriguez-Muro 2, Dmitriy

More information

A Hybrid Approach for Ontology Integration

A Hybrid Approach for Ontology Integration A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve

More information


INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan

More information

Completing Description Logic Knowledge Bases using Formal Concept Analysis

Completing Description Logic Knowledge Bases using Formal Concept Analysis Completing Description Logic Knowledge Bases using Formal Concept Analysis Franz Baader, 1 Bernhard Ganter, 1 Barış Sertkaya, 1 and Ulrike Sattler 2 1 TU Dresden, Germany and 2 The University of Manchester,

More information

OntoPIM: How to Rely on a Personal Ontology for Personal Information Management

OntoPIM: How to Rely on a Personal Ontology for Personal Information Management OntoPIM: How to Rely on a Personal Ontology for Personal Information Management Vivi Katifori 2, Antonella Poggi 1, Monica Scannapieco 1, Tiziana Catarci 1, and Yannis Ioannidis 2 1 Dipartimento di Informatica

More information

Composing Schema Mappings: An Overview

Composing Schema Mappings: An Overview Composing Schema Mappings: An Overview Phokion G. Kolaitis UC Santa Scruz & IBM Almaden Joint work with Ronald Fagin, Lucian Popa, and Wang-Chiew Tan The Data Interoperability Challenge Data may reside

More information

Optique System: Towards Ontology and Mapping Management in OBDA Solutions

Optique System: Towards Ontology and Mapping Management in OBDA Solutions Optique System: Towards Ontology and Mapping Management in OBDA Solutions Peter Haase 2, Ian Horrocks 3, Dag Hovland 6, Thomas Hubauer 5, Ernesto Jimenez-Ruiz 3, Evgeny Kharlamov 3, Johan Klüwer 1 Christoph

More information

Consistent Answers from Integrated Data Sources

Consistent Answers from Integrated Data Sources Consistent Answers from Integrated Data Sources Leopoldo Bertossi 1, Jan Chomicki 2 Alvaro Cortés 3, and Claudio Gutiérrez 4 1 Carleton University, School of Computer Science, Ottawa, Canada. bertossi@scs.carleton.ca

More information

Dependencies Revisited for Improving Data Quality

Dependencies Revisited for Improving Data Quality Dependencies Revisited for Improving Data Quality Wenfei Fan University of Edinburgh & Bell Laboratories Wenfei Fan Dependencies Revisited for Improving Data Quality 1 / 70 Real-world data is often dirty

More information

Logical Foundations of Relational Data Exchange

Logical Foundations of Relational Data Exchange Logical Foundations of Relational Data Exchange Pablo Barceló Department of Computer Science, University of Chile pbarcelo@dcc.uchile.cl 1 Introduction Data exchange has been defined as the problem of

More information

Data Validation with OWL Integrity Constraints

Data Validation with OWL Integrity Constraints Data Validation with OWL Integrity Constraints (Extended Abstract) Evren Sirin Clark & Parsia, LLC, Washington, DC, USA evren@clarkparsia.com Abstract. Data validation is an important part of data integration

More information

The Optique Project: Towards OBDA Systems for Industry (Short Paper)

The Optique Project: Towards OBDA Systems for Industry (Short Paper) The Optique Project: Towards OBDA Systems for Industry (Short Paper) D. Calvanese 3, M. Giese 10, P. Haase 2, I. Horrocks 5, T. Hubauer 7, Y. Ioannidis 9, E. Jiménez-Ruiz 5, E. Kharlamov 5, H. Kllapi 9,

More information

CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange

CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange Basic Information Place and Time: Fall 03, Tuesdays and Thursdays, 5:00-6:0pm Instructors: José Luis Ambite,

More information

A Tutorial on Data Integration

A Tutorial on Data Integration A Tutorial on Data Integration Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Antonio Ruberti, Sapienza Università di Roma DEIS 10 - Data Exchange, Integration, and Streaming November 7-12,

More information

OBDA for Temporal Querying and Streams

OBDA for Temporal Querying and Streams OBDA for Temporal Querying and Streams Christian Neuenstadt, Ralf Möller, and Özgür L. Özçep Institute of Information Systems (Ifis) University of Lübeck Lübeck, Germany {moeller,neuenstadt,oezcep}@ifis.uni-luebeck.de

More information

Question Answering and the Nature of Intercomplete Databases

Question Answering and the Nature of Intercomplete Databases Certain Answers as Objects and Knowledge Leonid Libkin School of Informatics, University of Edinburgh Abstract The standard way of answering queries over incomplete databases is to compute certain answers,

More information

Query Processing in Data Integration Systems

Query Processing in Data Integration Systems Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152

More information

An Efficient and Scalable Management of Ontology

An Efficient and Scalable Management of Ontology An Efficient and Scalable Management of Ontology Myung-Jae Park 1, Jihyun Lee 1, Chun-Hee Lee 1, Jiexi Lin 1, Olivier Serres 2, and Chin-Wan Chung 1 1 Korea Advanced Institute of Science and Technology,

More information

Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support

Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support Matthias Samwald 1 1 Medical University of Vienna, Vienna, Austria matthias.samwald@meduniwien.ac.at Abstract.

More information

OBDA: Query Rewriting or Materialization? In Practice, Both!

OBDA: Query Rewriting or Materialization? In Practice, Both! OBDA: Query Rewriting or Materialization? In Practice, Both! Juan F. Sequeda 1, Marcelo Arenas 2, and Daniel P. Miranker 1 1 Department of Computer Science, The University of Texas at Austin 2 Department

More information

The Recovery of a Schema Mapping: Bringing Exchanged Data Back

The Recovery of a Schema Mapping: Bringing Exchanged Data Back The Recovery of a Schema Mapping: Bringing Exchanged Data Back MARCELO ARENAS and JORGE PÉREZ Pontificia Universidad Católica de Chile and CRISTIAN RIVEROS R&M Tech Ingeniería y Servicios Limitada A schema

More information

Distributed Query Processing on the Cloud: the Optique Point of View (Short Paper)

Distributed Query Processing on the Cloud: the Optique Point of View (Short Paper) Distributed Query Processing on the Cloud: the Optique Point of View (Short Paper) Herald Kllapi 2, Dimitris Bilidas 2, Ian Horrocks 1, Yannis Ioannidis 2, Ernesto Jimenez-Ruiz 1, Evgeny Kharlamov 1, Manolis

More information

Scalable End-user Access to Big Data

Scalable End-user Access to Big Data Chapter 6 Scalable End-user Access to Big Data Martin Giese, 1 Diego Calvanese, 2 Peter Haase, 3 Ian Horrocks, 4 Yannis Ioannidis, 5 Herald Kllapi, 5 Manolis Koubarakis, 5 Maurizio Lenzerini, 6 Ralf Möller,

More information

Scalable End-User Access to Big Data

Scalable End-User Access to Big Data Scalable End-User Access to Big Data Diego Calvanese KRDB Research Centre for Knowledge and Data Free University of Bozen-Bolzano, Italy Free University of Bozen-Bolzano 16th International Conference on

More information

XML Data Integration

XML Data Integration XML Data Integration Lucja Kot Cornell University 11 November 2010 Lucja Kot (Cornell University) XML Data Integration 11 November 2010 1 / 42 Introduction Data Integration and Query Answering A data integration

More information

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure

More information



