Peer Data Management Systems Concepts and Approaches

Size: px
Start display at page:

Download "Peer Data Management Systems Concepts and Approaches"

Transcription

1 Peer Data Management Systems Concepts and Approaches Armin Roth HPI, Potsdam, Germany Nov. 10, 2010 Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

2 Agenda 1 Large-scale Information Sharing 2 PDMS Architecture 3 System Characteristics 4 Comparison of Approaches 5 Conclusion + Future Research Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

3 Large-scale Information Sharing Large-scale Information Sharing Regional hospital north Red cross capital headquarters Medication logistics company Government control center National medication inventory Capital hospital Capital medication inventory National pharmacies association Medicine company Medicines sans fronties national base Regional hospital east Regional hospital south Peer Peer schema Peer data Peer mapping Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

4 PDMS PDMS Architecture Heterogeneity Peer Autonomy Mediator: Queries passed to neighbors Flexibility High Redundancy Information Loss Regional Red cross hospital capital north headquarters Government control National center medication inventory Capital medication inventory Capital hospital National pharmacies association Medication logistics company Medicines sans fronties Medicine national base company Regional hospital east Peer Regional Peer schema hospital Peer data south Peer mapping Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

5 PDMS Architecture Distributed Information Systems Distributed DBMS Distribution P2P System PDMS DBMS Data Warehouse Heterogeneity Autonomy Mediator-based Information System Federated DBMS [OV99] Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

6 PDMS Architecture General System Model PDMS set P of peers P i with P i = {G i, S i, L i, M i }: Peer schema G i Local schema S i Local mappings L i Peer mappings M i Peer mappings m M i M j are assertions φ Gi φ Gj resp. φ Gj φ Gi with queries φ Gi and φ Gj of different arity P i P j L G i i S i M i /M j L j G j S j Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

7 PDMS Architecture Peer Mappings Different peers P i, P j heterogeneous in Data model Schema Query language Data schema interplay [BCHL05] Intens./extens. completeness Language of mapping assertions φ Gi φ Gj must bridge all these types of heterogeneity [MBDH02] Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

8 Example Example PDMS Architecture Global-as-view Mapping P 1 R 1 a b c d e 20 #tuples R 5 (a,b,c,d,e) R 1 (a,b,c,d,e), b = 'US' Source R 2 (a,b,c), R 3 (c,d,e) R 1 (a,b,c,d,e) P 5 R 5 a b c d e 90 P 2 Local-asview 60 Mapping R 4 (a,b,c,d) R 2 (a,b,c), R 3 (c,d,e), d > 1 R 2 a b c R 3 c d e 60 R 6 (c,d,e) R 3 (c,d,e), d > 10 R 6 c d e R 4 P 6 P c d e 4 10 R 5 (a,b,c,d,e) R 6 (c,d,e) 9 Armin Roth Paris, July 13, 2010 Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

9 PDMS Architecture Semantics of PDMS Query Answering [CGLR04] Special case: all queries in mapping assertions CQ Semantics of an individual peer: FOL theory T Pi (Global) source database D Set of all models of PDMS P wrt. D: sem D (P) = { I I is a model of all T Pi based on D I satisfies all M i } Meaning of I satisfying M i varies in different approaches for peer mappings Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

10 PDMS Architecture Applications for PDMS Fusion of organisations Semantic Web [HIMT03, HHNR05] Disaster Management [HIST03] Groupware [ANR07] In general: Large, loosely coupled integrated information systems Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

11 System Characteristics System Model [HRZ + 08] Category Data model Topology Mapping language Possible Alternatives Relational XML (incl. web services) RDF Arbitrary Arbitrary without cycles GLaV Subset of FOL Mapping tables Data schema interplay (e.g., HePToX) Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

12 Semantics System Characteristics Expressiveness and interpretation of mapping language determines semantics of query answering data exchange 2 principal approaches 1 Global reasoning: Mappings are interpreted as material logical implication 2 Local reasoning: Only exchange of certain answers Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

13 System Characteristics Autonomy/Modularity Important category in distributed systems with many stakeholders Types: Design autonomy (modeling, naming) Communication autonomy (decide about cooperations) Execution autonomy (scheduling of requests) Influenced by Semantics Functional requirements (e.g., update propagation, global catalog) Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

14 Piazza [HIST03] Comparison of Approaches Data model Mapping language Query language Peer autonomy Semantics of query answering Query optimization Relational, XML GLaV, definitional mappings CQ Global catalog Open-world wrt. certain peer Containment-based pruning at query planning time Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

15 Comparison of Approaches Hyper [CGL + 04, CGLR04] Data model Mapping language Query language Peer autonomy Semantics of query answering Query optimization Other Relational GLaV CQ Preserved Based on epistemic logic, exchange of certain answers none Inconsistency tolerance Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

16 Comparison of Approaches Hyperion [AKK + 03, KAM03] Data model Mapping language Query language Peer autonomy Semantics of query answering Query optimization Other Relational (others also possible) Generalization of GLaV CQ, value search Preserved Open-world and closed-world possible unknown Update propagation Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

17 Comparison of Approaches Hyperion Highly dynamic and scalable Schema mapping expressions Mapping tables: Correspondences between data values Many-to-many mappings Automatically inferring new entries Respect autonomy of the peers Supports value search (point queries) Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

18 Comparison of Approaches Hyperion: Semantics of Mapping Tables Mapping table: X Y with sets of attribute values resp. variables X, Y (many-to-many) Semantics of practical interest: closed-open-world, closed-closed-world Influences combination of mapping tables Open- Closedworld world present Any indicated X -value Y-value Y-values missing Any no X -value Y-value Y-value Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

19 Hyperion: Example Comparison of Approaches GDB id SwissProt id MIN id GDB: P GDB: O GDB: P GDB id GDB: SwissProt id O00662 GDB id MIM id GDB: Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

20 Comparison of Approaches Logical Relational Model [SGMB03] Domain relation: any subset of dom i dom j Relational space: set of local databases and a domain relation Coordination formula: CF ::= i : φ CF CF CF CF CF CF i : x.cf i : x.cf (i set of peers) Example: (Doc : fn, ln, pn, gender, pr). (Doc : Patient(1234, fn, ln, pn, gender, pr) Hospital : (hid, n, a).patient(hid, 1234, n, gender, a, Davis, pr) n = concat(fn, ln))) Query answering: coordination formulas as deductive rules Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

21 Comparison of Approaches Logical Relational Model Data model Relational Mapping language Coordination formulas: Subset of FOL (implication, conjunction, disjunction, universal and existential quantification wrt. different domains) Query language Equal to mapping language Peer autonomy Preserved (recursive local reasoning) Semantics of query answering Local reasoning (satisfyability of coordination formulas) Query optimization unknown Other Update propagation (using coordination formulas) Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

22 Comparison of Approaches Humboldt Peers [Rot07] Data model Mapping language Query language Peer autonomy Semantics of query answering Query optimization Other Relational extensionally sound GaV: x ȳ(φ S ( x, ȳ) z g( x, z)) extensionally sound LaV: x ȳ(s( x, ȳ) z φ G ( x, z)) CQ with semi-interval selections Highly preserved Exchange of certain answers Completeness-driven pruning, limitation of resource consumption Cardinality estimation based on query feedback Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

23 Comparison of Approaches Active XML [ABM08] Data model Mapping language Query language Peer autonomy Semantics of query answering Query optimization XML with web service invocations web services XQuery, XPath Limited Reasoning encapsulated by web services Several techniques considering embedded web service calls Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

24 Conclusion + Future Research Conclusion PDMS: flexible architecture for large-scale information sharing Main system characteristics: mapping and query languages, peer autonomy, semantics Semantics depend on interpretation of mappings Comparison of existing PDMS approaches Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

25 Conclusion + Future Research Future Research Reduce redundancy in query answering Considering data quality in query answering Building and optimizing of network of peers and mappings Dealing with different/varying data models and query languages Approximative query processing and non-standard query operators (e.g., top-k) Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

26 References I Conclusion + Future Research [ABM08] [AKK + 03] [ANR07] [BCHL05] [CGL + 04] S. Abiteboul, O. Benjelloun, and T. Milo. The Active XML project: an overview. VLDB J., 17(5): , M. Arenas, V. Kantere, A. Kementsietsidis, I. Kiringa, R. J. Miller, and J. Mylopoulos. The Hyperion project: From data integration to data coordination. ACM SIGMOD Record, 32(3):53 58, Alexander Albrecht, Felix Naumann, and Armin Roth. Networked PIM using PDMS. In Proc. of the Workshop on Networking Meets Databases (NetDB), A. Bonifati, Q. Chang, T. Ho, and L.V.S. Lakshmanan. HepToX: Heterogeneous peer to peer XML databases. Technical report, U. of British Columbia and Icar CNR, Italy, D. Calvanese, G. De Giacomo, M. Lenzerini, R. Rosati, and G. Vetere. Hyper: A framework for peer-to-peer data integration on grids. In Proc. of the Int. Conference on Semantics of a Networked World: Semantics for Grid Databases (ICSNW 2004), Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

27 References II Conclusion + Future Research [CGLR04] [HHNR05] [HIMT03] [HIST03] [HRZ + 08] Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, and Riccardo Rosati. Logical foundations of peer-to-peer data integration. In Proc. of the Symposium on Principles of Database Systems (PODS), Ralf Heese, Sven Herschel, Felix Naumann, and Armin Roth. Self-extending peer data management. In Proc. of the Conf. Datenbanksysteme in Business, Technologie und Web (BTW), Karlsruhe, Germany, Alon Y. Halevy, Zachary Ives, Peter Mork, and Igor Tatarinov. Piazza: Data management infrastructure for semantic web applications. In Proc. of the Int. World Wide Web Conf. (WWW), Alon Y. Halevy, Zachary Ives, Dan Suciu, and Igor Tatarinov. Schema mediation in peer data management systems. In Proc. of the Int. Conf. on Data Engineering (ICDE), Katja Hose, Armin Roth, Andrï 1 Zeitz, Kai-Uwe Sattler, and Felix Naumann. 2 A research agenda for query processing in large-scale peer data management systems. Information Systems, 33(7-8): , Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

28 References III Conclusion + Future Research [KAM03] A. Kementsietsidis, M. Arenas, and R. J. Miller. Mapping data in peer-to-peer systems: Semantics and algorithmic issues. In SIGMOD 2003, pages , [MBDH02] J. Madhavan, P. A. Bernstein, P. Domingos, and A. Y. Halevy. Representing and reasoning about mappings between domain models. In Proc. of the National Conf. on Artificial Intelligence (AAAI), [OV99] [Rot07] [SGMB03] M. T. Özsu and P. Valduriez. Principles of distributed database systems. Prentice Hall, 2nd edition, Armin Roth. Completeness-driven query answering in peer data management systems. In Proc. of the VLDB 2007 PhD Workshop, L. Serafini, F. Giunchiglia, J. Mylopoulos, and P. A. Bernstein. Local relational model: A logical formalization of database coordination. In Proc. of CONTEXT, Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, / 28

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan

More information

P2P Data Integration and the Semantic Network Marketing System

P2P Data Integration and the Semantic Network Marketing System Principles of peer-to-peer data integration Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, I-00198 Roma, Italy lenzerini@dis.uniroma1.it Abstract.

More information

Query Reformulation over Ontology-based Peers (Extended Abstract)

Query Reformulation over Ontology-based Peers (Extended Abstract) Query Reformulation over Ontology-based Peers (Extended Abstract) Diego Calvanese 1, Giuseppe De Giacomo 2, Domenico Lembo 2, Maurizio Lenzerini 2, and Riccardo Rosati 2 1 Faculty of Computer Science,

More information

Query Processing in Data Integration Systems

Query Processing in Data Integration Systems Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152

More information

Data Management in Peer-to-Peer Data Integration Systems

Data Management in Peer-to-Peer Data Integration Systems Book Title Book Editors IOS Press, 2003 1 Data Management in Peer-to-Peer Data Integration Systems Diego Calvanese a, Giuseppe De Giacomo b, Domenico Lembo b,1, Maurizio Lenzerini b, and Riccardo Rosati

More information

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure

More information

Query Answering in Peer-to-Peer Data Exchange Systems

Query Answering in Peer-to-Peer Data Exchange Systems Query Answering in Peer-to-Peer Data Exchange Systems Leopoldo Bertossi and Loreto Bravo Carleton University, School of Computer Science, Ottawa, Canada {bertossi,lbravo}@scs.carleton.ca Abstract. The

More information

A Tutorial on Data Integration

A Tutorial on Data Integration A Tutorial on Data Integration Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Antonio Ruberti, Sapienza Università di Roma DEIS 10 - Data Exchange, Integration, and Streaming November 7-12,

More information

What to Ask to a Peer: Ontology-based Query Reformulation

What to Ask to a Peer: Ontology-based Query Reformulation What to Ask to a Peer: Ontology-based Query Reformulation Diego Calvanese Faculty of Computer Science Free University of Bolzano/Bozen Piazza Domenicani 3, I-39100 Bolzano, Italy calvanese@inf.unibz.it

More information

The Piazza peer data management system

The Piazza peer data management system Department of Computer & Information Science Departmental Papers (CIS) University of Pennsylvania Year 2004 The Piazza peer data management system Alon Y. Halevy Zachary G. Ives Jayant Madhavan Peter Mork

More information

Self-Extending Peer Data Management

Self-Extending Peer Data Management elf-extending Peer Data Management Ralf Heese ven Herschel Felix Naumann Armin Roth {rheese herschel}@informatik. hu-berlin.de Humboldt-Universität zu Berlin Databases and Information ystems Berlin, Germany

More information

Grid Data Integration based on Schema-mapping

Grid Data Integration based on Schema-mapping Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

Shuffling Data Around

Shuffling Data Around Shuffling Data Around An introduction to the keywords in Data Integration, Exchange and Sharing Dr. Anastasios Kementsietsidis Special thanks to Prof. Renée e J. Miller The Cause and Effect Principle Cause:

More information

Accessing Data Integration Systems through Conceptual Schemas (extended abstract)

Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università

More information

Navigational Plans For Data Integration

Navigational Plans For Data Integration Navigational Plans For Data Integration Marc Friedman University of Washington friedman@cs.washington.edu Alon Levy University of Washington alon@cs.washington.edu Todd Millstein University of Washington

More information

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen

More information

Consistent Answers from Integrated Data Sources

Consistent Answers from Integrated Data Sources Consistent Answers from Integrated Data Sources Leopoldo Bertossi 1, Jan Chomicki 2 Alvaro Cortés 3, and Claudio Gutiérrez 4 1 Carleton University, School of Computer Science, Ottawa, Canada. bertossi@scs.carleton.ca

More information

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057 UNIVERSITY OF TRENTO DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM Albena Roshelova

More information

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Diego Calvanese Giuseppe De Giacomo Riccardo Rosati Dipartimento di Informatica e Sistemistica Università

More information

The Role of Ontologies in Data Integration

The Role of Ontologies in Data Integration The Role of Ontologies in Data Integration Isabel F. Cruz Huiyong Xiao ADVIS Lab Department of Computer Science University of Illinois at Chicago, USA {ifc hxiao}@cs.uic.edu Abstract In this paper, we

More information

Data Integration Over a Grid Infrastructure

Data Integration Over a Grid Infrastructure Hyper: A Framework for Peer-to-Peer Data Integration on Grids Diego Calvanese 1, Giuseppe De Giacomo 2, Maurizio Lenzerini 2, Riccardo Rosati 2, and Guido Vetere 3 1 Faculty of Computer Science, Free University

More information

Schema Mappings and Agents Actions in P2P Data Integration System 1

Schema Mappings and Agents Actions in P2P Data Integration System 1 Journal of Universal Computer Science, vol. 14, no. 7 (2008), 1048-1060 submitted: 1/10/07, accepted: 21/1/08, appeared: 1/4/08 J.UCS Schema Mappings and Agents Actions in P2P Data Integration System 1

More information

DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM

DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM 1 GANESAN VEERAPPAN, 2 SURESH GNANA DHAS 1 Asst. Professor, Dept. of MCA, Sri Venkateswara College of Engg. & Tech., Thiruvallur Dt. 2

More information

Peer Data Exchange. ACM Transactions on Database Systems, Vol. V, No. N, Month 20YY, Pages 1 0??.

Peer Data Exchange. ACM Transactions on Database Systems, Vol. V, No. N, Month 20YY, Pages 1 0??. Peer Data Exchange Ariel Fuxman 1 University of Toronto Phokion G. Kolaitis 2 IBM Almaden Research Center Renée J. Miller 1 University of Toronto and Wang-Chiew Tan 3 University of California, Santa Cruz

More information

Schema Mediation in Peer Data Management Systems

Schema Mediation in Peer Data Management Systems Schema Mediation in Peer Data Management Systems Alon Y. Halevy Zachary G. Ives Dan Suciu Igor Tatarinov University of Washington Seattle, WA, USA 98195-2350 {alon,zives,suciu,igor}@cs.washington.edu Abstract

More information

Schema mapping and query reformulation in peer-to-peer XML data integration system

Schema mapping and query reformulation in peer-to-peer XML data integration system Control and Cybernetics vol. 38 (2009) No. 1 Schema mapping and query reformulation in peer-to-peer XML data integration system by Tadeusz Pankowski Institute of Control and Information Engineering Poznań

More information

INTEROPERABILITY IN DATA WAREHOUSES

INTEROPERABILITY IN DATA WAREHOUSES INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content

More information

Chapter 17 Using OWL in Data Integration

Chapter 17 Using OWL in Data Integration Chapter 17 Using OWL in Data Integration Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Riccardo Rosati, and Marco Ruzzi Abstract One of the outcomes of the research work carried

More information

A Hybrid Approach for Ontology Integration

A Hybrid Approach for Ontology Integration A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve

More information

9 Collaborative Business Intelligence

9 Collaborative Business Intelligence 9 Collaborative Business Intelligence Stefano Rizzi Department of Electronics, Computer Sciences and Systems (DEIS) University of Bologna Bologna, Italy stefano.rizzi@unibo.it Summary. The idea of collaborative

More information

The Piazza Peer Data Management Project

The Piazza Peer Data Management Project The Piazza Peer Management Project Igor Tatarinov 1, Zachary Ives 2, Jayant Madhavan 1, Alon Halevy 1, Dan Suciu 1, Nilesh Dalvi 1, Xin (Luna) Dong 1, Yana Kadiyska 1, Gerome Miklau 1, Peter Mork 1 1 Department

More information

GRIDB: A SCALABLE DISTRIBUTED DATABASE SHARING SYSTEM FOR GRID ENVIRONMENTS *

GRIDB: A SCALABLE DISTRIBUTED DATABASE SHARING SYSTEM FOR GRID ENVIRONMENTS * GRIDB: A SCALABLE DISTRIBUTED DATABASE SHARING SYSTEM FOR GRID ENVIRONMENTS * Maha Abdallah Lynda Temal LIP6, Paris 6 University 8, rue du Capitaine Scott 75015 Paris, France [abdallah, temal]@poleia.lip6.fr

More information

Grid Data Integration Based on Schema Mapping

Grid Data Integration Based on Schema Mapping Grid Data Integration Based on Schema Mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

XML data integration in SixP2P a theoretical framework

XML data integration in SixP2P a theoretical framework XML data integration in SixP2P a theoretical framework Tadeusz Pankowski Institute of Control and Information Engineering Poznań University of Technology Poland Faculty of Mathematics and Computer Science

More information

Data Management for Peer-to-Peer Computing: A Vision 1

Data Management for Peer-to-Peer Computing: A Vision 1 Data Management for Peer-to-Peer Computing: A Vision 1 Philip A. Bernstein 2, Fausto Giunchiglia 3, Anastasios Kementsietsidis 4, John Mylopoulos 3, Luciano Serafini 5, and Ilya Zaihrayeu 2 Abstract. We

More information

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems Dottorato di Ricerca in Ingegneria dell Informazione e sua applicazione nell Industria e nei Servizi Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems presenter: (pense@inform.unian.it)

More information

Accessing XML Documents using Semantic Meta Data in a P2P Environment

Accessing XML Documents using Semantic Meta Data in a P2P Environment Accessing XML Documents using Semantic Meta Data in a P2P Environment Dominic Battré and Felix Heine and AndréHöing University of Paderborn Paderborn Center for Parallel Computing Fürstenallee 11, 33102

More information

Data Integration: A Theoretical Perspective

Data Integration: A Theoretical Perspective Data Integration: A Theoretical Perspective Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, I 00198 Roma, Italy lenzerini@dis.uniroma1.it ABSTRACT

More information

A Principled Approach to Data Integration and Reconciliation in Data Warehousing

A Principled Approach to Data Integration and Reconciliation in Data Warehousing A Principled Approach to Data Integration and Reconciliation in Data Warehousing Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Daniele Nardi, Riccardo Rosati Dipartimento di Informatica e Sistemistica,

More information

Probabilistic Message Passing in Peer Data Management Systems

Probabilistic Message Passing in Peer Data Management Systems Probabilistic Message Passing in Peer Data Management Systems Philippe Cudré-Mauroux School of Computer and Communication Sciences EPFL Switzerland Karl Aberer School of Computer and Communication Sciences

More information

Piazza: Data Management Infrastructure for Semantic Web Applications

Piazza: Data Management Infrastructure for Semantic Web Applications Piazza: Data Management Infrastructure for Semantic Web Applications Alon Y. Halevy Zachary G. Ives Peter Mork Igor Tatarinov University of Washington Box 352350 Seattle, WA 98195-2350 {alon,zives,pmork,igor}@cs.washington.edu

More information

Efficient Query Answering in Peer Data Management Systems

Efficient Query Answering in Peer Data Management Systems Efficient Query Answering in Peer Data Management Systems D I S S E R T A T I O N zur Erlangung des akademischen Grades doctor rerum naturalium (Dr. rer. nat.) im Fach Informatik eingereicht an der Mathematisch-Naturwissenschaftlichen

More information

XOP: Sharing XML Data Objects through Peer-to-Peer Networks

XOP: Sharing XML Data Objects through Peer-to-Peer Networks 22nd International Conference on Advanced Information Networking and Applications XOP: Sharing XML Data Objects through Peer-to-Peer Networks Itamar de Rezende, Frank Siqueira Department of Informatics

More information

Data Validation with OWL Integrity Constraints

Data Validation with OWL Integrity Constraints Data Validation with OWL Integrity Constraints (Extended Abstract) Evren Sirin Clark & Parsia, LLC, Washington, DC, USA evren@clarkparsia.com Abstract. Data validation is an important part of data integration

More information

DATA INTEGRATION AND QUERY REFORMULATION IN SERVICE-BASED GRIDS

DATA INTEGRATION AND QUERY REFORMULATION IN SERVICE-BASED GRIDS DATA INTEGRATION AND QUERY REFORMULATION IN SERVICE-BASED GRIDS Carmela Comito and Domenico Talia DEIS, University of Calabria, Italy ccomito@deis.unical.it talia@deis.unical.it Anastasios Gounaris and

More information

View-based Data Integration

View-based Data Integration View-based Data Integration Yannis Katsis Yannis Papakonstantinou Computer Science and Engineering UC San Diego, USA {ikatsis,yannis}@cs.ucsd.edu DEFINITION Data Integration (or Information Integration)

More information

SERVICE CHOREOGRAPHY FOR DATA INTEGRATION ON THE GRID

SERVICE CHOREOGRAPHY FOR DATA INTEGRATION ON THE GRID SERVICE CHOREOGRAPHY FOR DATA INTEGRATION ON THE GRID Anastasios Gounaris and Rizos Sakellariou School of Computer Science, University of Munchester; UK gounaris@cs.man.ac.uk rizos@cs.man.ac.uk Carrnela

More information

Cloud-dew architecture: realizing the potential of distributed database systems in unreliable networks

Cloud-dew architecture: realizing the potential of distributed database systems in unreliable networks Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'15 85 Cloud-dew architecture: realizing the potential of distributed database systems in unreliable networks Yingwei Wang 1 and Yi Pan 2 1 Department

More information

OWL based XML Data Integration

OWL based XML Data Integration OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,

More information

CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange

CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange CSCI 599: Foundations of Databases, Knowledge Representation, Data Integration and Data Exchange Basic Information Place and Time: Fall 03, Tuesdays and Thursdays, 5:00-6:0pm Instructors: José Luis Ambite,

More information

Virtual Data Integration

Virtual Data Integration Virtual Data Integration Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada www.scs.carleton.ca/ bertossi bertossi@scs.carleton.ca Chapter 1: Introduction and Issues 3 Data

More information

A Secure Mediator for Integrating Multiple Level Access Control Policies

A Secure Mediator for Integrating Multiple Level Access Control Policies A Secure Mediator for Integrating Multiple Level Access Control Policies Isabel F. Cruz Rigel Gjomemo Mirko Orsini ADVIS Lab Department of Computer Science University of Illinois at Chicago {ifc rgjomemo

More information

Schema Mediation and Query Processing in Peer Data Management Systems

Schema Mediation and Query Processing in Peer Data Management Systems Schema Mediation and Query Processing in Peer Data Management Systems by Jie Zhao B.Sc., Fudan University, 2003 A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of

More information

Description Logics for Conceptual Design, Information Access, and Ontology Integration: Research Trends

Description Logics for Conceptual Design, Information Access, and Ontology Integration: Research Trends Description Logics for Conceptual Design, Information Access, and Ontology Integration: Research Trends Enrico Franconi Department of Computer Science, University of Manchester Oxford Rd., Manchester M13

More information

Consistent Answers from Integrated Data Sources

Consistent Answers from Integrated Data Sources Consistent Answers from Integrated Data Sources Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada bertossi@scs.carleton.ca www.scs.carleton.ca/ bertossi Queen s University,

More information

XML and Web Services - Managing Distributed Information in a Manner

XML and Web Services - Managing Distributed Information in a Manner Distributed Information Management with XML and Web Services Serge Abiteboul INRIA-Futurs, LRI and Xyleme Serge.Abiteboul@inria.fr Abstract. XML and Web services are revolutioning the automatic management

More information

Logical and categorical methods in data transformation (TransLoCaTe)

Logical and categorical methods in data transformation (TransLoCaTe) Logical and categorical methods in data transformation (TransLoCaTe) 1 Introduction to the abbreviated project description This is an abbreviated project description of the TransLoCaTe project, with an

More information

Data Integration Using ID-Logic

Data Integration Using ID-Logic Data Integration Using ID-Logic Bert Van Nuffelen 1, Alvaro Cortés-Calabuig 1, Marc Denecker 1, Ofer Arieli 2, and Maurice Bruynooghe 1 1 Department of Computer Science, Katholieke Universiteit Leuven,

More information

DIB: DATA INTEGRATION IN BIGDATA FOR EFFICIENT QUERY PROCESSING

DIB: DATA INTEGRATION IN BIGDATA FOR EFFICIENT QUERY PROCESSING DIB: DATA INTEGRATION IN BIGDATA FOR EFFICIENT QUERY PROCESSING P.Divya, K.Priya Abstract In any kind of industry sector networks they used to share collaboration information which facilitates common interests

More information

Data Quality in Information Integration and Business Intelligence

Data Quality in Information Integration and Business Intelligence Data Quality in Information Integration and Business Intelligence Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies

More information

A Framework and Architecture for Quality Assessment in Data Integration

A Framework and Architecture for Quality Assessment in Data Integration A Framework and Architecture for Quality Assessment in Data Integration Jianing Wang March 2012 A Dissertation Submitted to Birkbeck College, University of London in Partial Fulfillment of the Requirements

More information

Data Management in Large-scale P2P Systems 1

Data Management in Large-scale P2P Systems 1 Data Management in Large-scale P2P Systems 1 Patrick Valduriez, Esther Pacitti Atlas group, INRIA and LINA, University of Nantes France Patrick.Valduriez@inria.fr Esther.Pacitti@lina.univ-nantes.fr Abstract.

More information

XML Interoperability

XML Interoperability XML Interoperability Laks V. S. Lakshmanan Department of Computer Science University of British Columbia Vancouver, BC, Canada laks@cs.ubc.ca Fereidoon Sadri Department of Mathematical Sciences University

More information

An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams

An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams Susan D. Urban 1, Suzanne W. Dietrich 1, 2, and Yi Chen 1 Arizona

More information

How To Understand Data Integration

How To Understand Data Integration Data Integration 1 Giuseppe De Giacomo e Antonella Poggi Dipartimento di Informatica e Sistemistica Antonio Ruberti Università di Roma La Sapienza Seminari di Ingegneria Informatica: Integrazione di Dati

More information

Structured and Semi-Structured Data Integration

Structured and Semi-Structured Data Integration UNIVERSITÀ DEGLI STUDI DI ROMA LA SAPIENZA DOTTORATO DI RICERCA IN INGEGNERIA INFORMATICA XIX CICLO 2006 UNIVERSITÉ DE PARIS SUD DOCTORAT DE RECHERCHE EN INFORMATIQUE Structured and Semi-Structured Data

More information

Query Reformulation: Data Integration Approach to Multi Domain Query Answering System

Query Reformulation: Data Integration Approach to Multi Domain Query Answering System Niharika Pujari, Debahuti Mishra and Kaberi Das 66 Query Reformulation: Data Integration Approach to Multi Domain Query Answering System Niharika Pujari Department of Information Technology Institute of

More information

Data Semantics Revisited: Databases and the Semantic Web. The Panelists. Next Generation Database Systems Won't Work Without Semantics!

Data Semantics Revisited: Databases and the Semantic Web. The Panelists. Next Generation Database Systems Won't Work Without Semantics! Data Semantics Revisited: Databases and the Semantic Web Alex Borgida Rutgers University John Mylopoulos University of Toronto SEBD 05, Bressanone June 19, 2005 Next Generation Database Systems Won't Work

More information

Data Quality in Ontology-Based Data Access: The Case of Consistency

Data Quality in Ontology-Based Data Access: The Case of Consistency Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Data Quality in Ontology-Based Data Access: The Case of Consistency Marco Console, Maurizio Lenzerini Dipartimento di Ingegneria

More information

Mapping discovery for XML data integration

Mapping discovery for XML data integration Mapping discovery for XML data integration Zoubida Kedad, Xiaohui Xue Laboratoire PRiSM, Université de Versailles 45 avenue des Etats-Unis, 78035 Versailles, France {Zoubida.kedad, Xiaohui.Xue}@prism.uvsq.fr

More information

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609. Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December

More information

XML Data Integration in OGSA Grids

XML Data Integration in OGSA Grids XML Data Integration in OGSA Grids Carmela Comito and Domenico Talia University of Calabria Italy comito@si.deis.unical.it Outline Introduction Data Integration and Grids The XMAP Data Integration Framework

More information

XML DATA INTEGRATION SYSTEM

XML DATA INTEGRATION SYSTEM XML DATA INTEGRATION SYSTEM Abdelsalam Almarimi The Higher Institute of Electronics Engineering Baniwalid, Libya Belgasem_2000@Yahoo.com ABSRACT This paper describes a proposal for a system for XML data

More information

OntoPIM: How to Rely on a Personal Ontology for Personal Information Management

OntoPIM: How to Rely on a Personal Ontology for Personal Information Management OntoPIM: How to Rely on a Personal Ontology for Personal Information Management Vivi Katifori 2, Antonella Poggi 1, Monica Scannapieco 1, Tiziana Catarci 1, and Yannis Ioannidis 2 1 Dipartimento di Informatica

More information

Principles of Distributed Database Systems

Principles of Distributed Database Systems M. Tamer Özsu Patrick Valduriez Principles of Distributed Database Systems Third Edition

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases

More information

Transparency and Efficiency in Grid Computing for Big Data

Transparency and Efficiency in Grid Computing for Big Data Transparency and Efficiency in Grid Computing for Big Data Paul L. Bergstein Dept. of Computer and Information Science University of Massachusetts Dartmouth Dartmouth, MA pbergstein@umassd.edu Abstract

More information

Using Ontologies to Enhance Data Management in Distributed Environments

Using Ontologies to Enhance Data Management in Distributed Environments Using Ontologies to Enhance Data Management in Distributed Environments Carlos Eduardo Pires 1, Damires Souza 2, Bernadette Lóscio 3, Rosalie Belian 4, Patricia Tedesco 3 and Ana Carolina Salgado 3 1 Federal

More information

Question Answering and the Nature of Intercomplete Databases

Question Answering and the Nature of Intercomplete Databases Certain Answers as Objects and Knowledge Leonid Libkin School of Informatics, University of Edinburgh Abstract The standard way of answering queries over incomplete databases is to compute certain answers,

More information

Virtual Data Integration

Virtual Data Integration Virtual Data Integration Helena Galhardas Paulo Carreira DEI IST (based on the slides of the course: CIS 550 Database & Information Systems, Univ. Pennsylvania, Zachary Ives) Agenda Terminology Conjunctive

More information

Data Integration: The Teenage Years

Data Integration: The Teenage Years Data Integration: The Teenage Years Alon Halevy Google Inc. halevy@google.com Anand Rajaraman Kosmix Corp. anand@kosmix.com Joann Ordille Avaya Labs joann@avaya.com 1. INTRODUCTION Data integration is

More information

Business Intelligence: Recent Experiences in Canada

Business Intelligence: Recent Experiences in Canada Business Intelligence: Recent Experiences in Canada Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies 2 Business Intelligence

More information

The composition of Mappings in a Nautural Interface

The composition of Mappings in a Nautural Interface Composing Schema Mappings: Second-Order Dependencies to the Rescue Ronald Fagin IBM Almaden Research Center fagin@almaden.ibm.com Phokion G. Kolaitis UC Santa Cruz kolaitis@cs.ucsc.edu Wang-Chiew Tan UC

More information

Database Middleware and Web Services for Data Distribution and Integration in Distributed Heterogeneous Database Systems

Database Middleware and Web Services for Data Distribution and Integration in Distributed Heterogeneous Database Systems Database Middleware and Web Services for Data Distribution and Integration in Distributed Heterogeneous Database Systems Han-Chieh Wei Computer Science Department University of Central Arkansas Conway

More information

Collaborative Business Intelligence. University of Bologna - Italy. Summary

Collaborative Business Intelligence. University of Bologna - Italy. Summary Collaborative Business Intelligence Stefano Rizzi University of Bologna - Italy Summary The challenges of BI 2.0 Approaches to collaborative BI Warehousing approaches Federative approaches P2P approaches

More information

Distributed Databases and Peer-to-Peer Databases: Past and Present

Distributed Databases and Peer-to-Peer Databases: Past and Present Distributed Databases and Peer-to-Peer Databases: Past and Present ABSTRACT Angela Bonifati Icar CNR, Italian National Research Council Via P. Bucci 41C I-87036 Rende, Italy bonifati@icar.cnr.it Aris M.

More information

Towards a Semantic Web of Relational Databases: a Practical Semantic Toolkit and an In-Use Case from Traditional Chinese Medicine

Towards a Semantic Web of Relational Databases: a Practical Semantic Toolkit and an In-Use Case from Traditional Chinese Medicine Towards a Semantic Web of Relational Databases: a Practical Semantic Toolkit and an In-Use Case from Traditional Chinese Medicine Huajun Chen 1, Yimin Wang 2, Heng Wang 1, Yuxin Mao 1, Jinmin Tang 1, Cunyin

More information

Logical Foundations of Relational Data Exchange

Logical Foundations of Relational Data Exchange Logical Foundations of Relational Data Exchange Pablo Barceló Department of Computer Science, University of Chile pbarcelo@dcc.uchile.cl 1 Introduction Data exchange has been defined as the problem of

More information

Duplicate Detection Algorithm In Hierarchical Data Using Efficient And Effective Network Pruning Algorithm: Survey

Duplicate Detection Algorithm In Hierarchical Data Using Efficient And Effective Network Pruning Algorithm: Survey www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 12 December 2014, Page No. 9766-9773 Duplicate Detection Algorithm In Hierarchical Data Using Efficient

More information

Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries

Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries Norbert Fuhr, Claus-Peter Klas University of Dortmund, Germany {fuhr,klas}@ls6.cs.uni-dortmund.de 1 Introduction

More information

Ontology-based Data Integration with MASTRO-I for Configuration and Data Management at SELEX Sistemi Integrati

Ontology-based Data Integration with MASTRO-I for Configuration and Data Management at SELEX Sistemi Integrati Ontology-based Data Integration with MASTRO-I for Configuration and Data Management at SELEX Sistemi Integrati Alfonso Amoroso 1, Gennaro Esposito 1, Domenico Lembo 2, Paolo Urbano 2, Raffaele Vertucci

More information

Integrating and Exchanging XML Data using Ontologies

Integrating and Exchanging XML Data using Ontologies Integrating and Exchanging XML Data using Ontologies Huiyong Xiao and Isabel F. Cruz Department of Computer Science University of Illinois at Chicago {hxiao ifc}@cs.uic.edu Abstract. While providing a

More information

Data Integration. May 9, 2014. Petr Kremen, Bogdan Kostov (petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz)

Data Integration. May 9, 2014. Petr Kremen, Bogdan Kostov (petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz) Data Integration Petr Kremen, Bogdan Kostov petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz May 9, 2014 Data Integration May 9, 2014 1 / 33 Outline 1 Introduction Solution approaches Technologies 2

More information

Semantic Technologies for Data Integration using OWL2 QL

Semantic Technologies for Data Integration using OWL2 QL Semantic Technologies for Data Integration using OWL2 QL ESWC 2009 Tutorial Domenico Lembo Riccardo Rosati Dipartimento di Informatica e Sistemistica Sapienza Università di Roma, Italy 6th European Semantic

More information

Scalability Issues in Data Integration 1

Scalability Issues in Data Integration 1 Scalability Issues in Data Integration 1 Arnon Rosenthal, Ph.D. Len Seligman, Ph.D. The MITRE Corporation The MITRE Corporation 202 Burlington Road 7515 Colshire Drive Bedford, MA 01730 McLean, VA 22102

More information

Enterprise Modeling and Data Warehousing in Telecom Italia

Enterprise Modeling and Data Warehousing in Telecom Italia Enterprise Modeling and Data Warehousing in Telecom Italia Diego Calvanese Faculty of Computer Science Free University of Bolzano/Bozen Piazza Domenicani 3 I-39100 Bolzano-Bozen BZ, Italy Luigi Dragone,

More information

Data Integration for Capital Projects via Community-Specific Conceptual Representations

Data Integration for Capital Projects via Community-Specific Conceptual Representations Data Integration for Capital Projects via Community-Specific Conceptual Representations Yimin Zhu 1, Mei-Ling Shyu 2, Shu-Ching Chen 3 1,3 Florida International University 2 University of Miami E-mail:

More information

Constraint-based Query Distribution Framework for an Integrated Global Schema

Constraint-based Query Distribution Framework for an Integrated Global Schema Constraint-based Query Distribution Framework for an Integrated Global Schema Ahmad Kamran Malik 1, Muhammad Abdul Qadir 1, Nadeem Iftikhar 2, and Muhammad Usman 3 1 Muhammad Ali Jinnah University, Islamabad,

More information

RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT

RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT Bilkent University 1 OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University

More information