Shuffling Data Around

Size: px
Start display at page:

Download "Shuffling Data Around"

Transcription

1 Shuffling Data Around An introduction to the keywords in Data Integration, Exchange and Sharing Dr. Anastasios Kementsietsidis Special thanks to Prof. Renée e J. Miller

2 The Cause and Effect Principle Cause: Data sources are autonomous, heterogeneous Different data models, types and schemas Different vocabularies (in data and schemas) Different requirements for what/how data is shared Effect: Integration: Provide uniform access to heterogeneous data Exchange: Move data between heterogeneous sources Sharing: Provide non-uniform access to data through each source s schema and vocabulary 2005 Anastasios Kementsietsidis and Renée e J. Miller 2

3 Data Warehousing Architecture or here? User Query Relational Database (Warehouse) What about updates here? Data Extraction/Forwarding tools GDB Database Swissprot Database Image server Outside Website 2005 Anastasios Kementsietsidis and Renée e J. Miller 3

4 Virtual Integration Architecture User Query Mediated Schema Mediator Reformulation Engine Optimization Engine Execution Engine Metadata Wrapper Wrapper Wrapper Wrapper GDB Database Swissprot Database Image server Outside Website 2005 Anastasios Kementsietsidis and Renée e J. Miller 4

5 Peer-to-Peer Architecture User Query Outside Website User Query GDB Database Image server Swissprot Database User Query 2005 Anastasios Kementsietsidis and Renée e J. Miller 5

6 What are the Metadata? Schemas (models of data) Structured or Semi-structured e.g., relational, object-oriented, XML data, Talk will not cover unstructured data e.g., documents, images, audio files, Data(base) is an instance of a schema Mappings Model relationship between schemas or data Schema mapping (e.g., views) Data mapping (e.g., aliases) Requirements for mapping specifications 2005 Anastasios Kementsietsidis and Renée e J. Miller 6

7 Metadata Lifecycle Creation Automatic discovery or creation Design tools facilitating creation Maintenance Maintain (integrated) schemas as sources change Maintain mappings as schemas change Use Query answering Data exchange (materialization), updates, etc 2005 Anastasios Kementsietsidis and Renée e J. Miller 7

8 Schema Integration Global Integrated Schema G What about schema changes? or source additions? Local Source Schema S 1 Local Source Schema S 2 Local Source Schema S n Data Data Data Schema Design Problem: Create global integrated schema G (and mappings) for a set of independently designed local schemas S i, 1 i n 2005 Anastasios Kementsietsidis and Renée e J. Miller 8

9 Mappings (a.k.a. views) A view is just a query works like a function. It accepts as input the local source instance(s) It outputs an instance of the global (target) schema Global Integrated Schema G Relation: funding (aid, amount, project, date) Relation: finances (aid, date, amount) create view funding (aid, amount,, project, date) as ( select grant.gid gid, grant.amount amount, grant.project project, received.date date from grant, received where grant.gid gid = received.gid gid) This is Global-as-View (GAV) Local Source Schema S Relation: grant (gid, amount, project) Relation: received (gid, date) 2005 Anastasios Kementsietsidis and Renée e J. Miller 9

10 This is Local-as-View (LAV) Global Integrated Schema G Relation: funding (aid, amount, project, date) Relation: finances (aid, date, amount) create view grant (gid, amount,, project) as ( select funding.aid aid, funding.amount amount, funding.project project, from funding) Local Source Schema S Relation: grant (gid, amount, project) Relation: received (gid, date) 2005 Anastasios Kementsietsidis and Renée e J. Miller 10

11 GAV vs. LAV (in plain English) GAV: LAV: Gives direct information about which data satisfy the elements of the global schema Not easily extendible on source schema changes or source additions Query answering is easy Does not give direct information about which data satisfy the global schema Easily extendible on source schema changes or source additions Query answering is hard 2005 Anastasios Kementsietsidis and Renée e J. Miller 11

12 But There Is More Global-and-Local-as-View (GLAV) Global Integrated Schema G Relation: funding (aid, amount, project, date) Relation: finances (aid, date, amount) (select funding.aid aid, funding.amount amount, funding.project project, funding.date date from funding, finances where funding.aid = finances.aid aid) (select grant.gid gid, grant.amount amount, grant.project project, received.date date from grant, received where grant.gid gid = received.gid gid) Local Source Schema S Relation: grant (gid, amount, project) Relation: received (gid, date) 2005 Anastasios Kementsietsidis and Renée e J. Miller 12

13 Creating Mappings with the help of Schema Matching grant gid amount project funding aid amount proj date received gid date financial aid date amount It uses schema-level and/or instance-level information 2005 Anastasios Kementsietsidis and Renée e J. Miller 13

14 Wrapper Database City County Database Database City And Using Them In Data Integration Virtual Integration Architecture In Data Exchange Data Warehousing Architecture User Query User Query Mediated Schema Mediator Reformulation Engine Optimization Engine Execution Engine Metadata Relational Database (Warehouse) Wrapper Wrapper Wrapper Data Extraction/Forwarding tools Image server Outside Website County Database Image server Outside Website 2005 Anastasios Kementsietsidis and Renée J. Miller Anastasios Kementsietsidis and Renée J. Miller Anastasios Kementsietsidis and Renée e J. Miller 14

15 Data Integration vs. Exchange Data Integration Global schema is a reconciled virtual view of heterogeneous sources Uses GAV or LAV mappings No constraints in the global schema are considered Query is answered using source data; integration is virtual Answer is set of tuples in query result on ALL possible target instances: certain answers Data Exchange Global schema is an independently created local source schema Uses GLAV mappings Considers the presence of constraints Query is answered using ONE materialized target Can single target give same information as source(s)? 2005 Anastasios Kementsietsidis and Renée e J. Miller 15

16 Something Slightly Different Data Mappings and Data Sharing Useful in environments where: Sources are unwilling to share schemas The schema of one source cannot be expressed as a view of another There is a need to map (data) vocabularies gid name 001 NF1 002 NGFR 003 NEU1 gene(gid, gid, name) GDB encodes for gid pid protein(pid, pid, name) SwissProt This is a mapping table pid name 101 Neurofibromin 102 p75 ICD 103 Sialidase G9 Sialidase 2005 Anastasios Kementsietsidis and Renée e J. Miller 16

17 Data Sharing Architecture article id UofE Library I have article Mapping I want info for Tables protein AARE Mapping Tables Mapping Tables GDB I have info for gene APEH gene NF1 APEH Mapping Tables Mapping Tables SwissProt protein AARE G9 Sialidase Establish mapping tables between the vocabularies of different sources. Use these tables to translate query requests between the sources 2005 Anastasios Kementsietsidis and Renée e J. Miller 17

18 Management of Tables Source S 1 Mapping Tables Mapping Tables Inferred Tables Network Consistent? Mapping Tables Mapping Tables Source S n Inferred Tables The Consistency and Inference problems, the main vehicles for managing mapping tables. Solving these problems allows us to: Infer new mapping tables from existing ones Augment existing mapping tables with new associations Validate mapping tables 2005 Anastasios Kementsietsidis and Renée e J. Miller 18

19 Closing Remarks and things to remember (other than the keywords): Integration, one of the oldest problems in database research. Research is still going strong in this area Exchange, an interesting and practical problem (e.g. B2B apps) Sharing, the latest twist in the integration problem, also of practical importance (e.g. in P2P apps) Disclaimer: This talk provides only a glimpse of the research issues in these areas. Note: If you find any of these interesting, TALK to us We are in Appleton Tower, 2nd Floor 2005 Anastasios Kementsietsidis and Renée e J. Miller 19

20 Questions!?

21 GAV vs. LAV Local Source Schema S Q S/G Global Integrated Schema G User Query Data Data (?) GAV: Q S (S) R G, where R G is a relation in G, Q S is a query on S LAV: R S Q G (G), where R s is a relation in S, Q G is a query on G 2005 Anastasios Kementsietsidis and Renée e J. Miller 21

22 Query Answering GAV uses Query/View Unfolding select project from funding where amount > $ create view funding (aid, amount,, project, date) as ( select grant.gid gid, grant.amount amount, grant.project project, received.date date from grant, received where grant.gid gid = received.gid gid) What about LAV? It uses a method called select project Query Rewriting from grant, received (not presented here) where grant.gid = received.gid AND grant.amount amount > $ Anastasios Kementsietsidis and Renée e J. Miller 22

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan

More information

Query Processing in Data Integration Systems

Query Processing in Data Integration Systems Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152

More information

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure

More information

Virtual Data Integration

Virtual Data Integration Virtual Data Integration Helena Galhardas Paulo Carreira DEI IST (based on the slides of the course: CIS 550 Database & Information Systems, Univ. Pennsylvania, Zachary Ives) Agenda Terminology Conjunctive

More information

DATA INTEGRATION CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

DATA INTEGRATION CS561-SPRING 2012 WPI, MOHAMED ELTABAKH DATA INTEGRATION CS561-SPRING 2012 WPI, MOHAMED ELTABAKH 1 DATA INTEGRATION Motivation Many databases and sources of data that need to be integrated to work together Almost all applications have many sources

More information

A Tutorial on Data Integration

A Tutorial on Data Integration A Tutorial on Data Integration Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Antonio Ruberti, Sapienza Università di Roma DEIS 10 - Data Exchange, Integration, and Streaming November 7-12,

More information

Catalogs and Data Integration for E-Commerce Applications

Catalogs and Data Integration for E-Commerce Applications Catalogs and Data Integration for E-Commerce Applications On-line catalogues Issues Advantages? Product information Information coupling security purchase process Buyers catalogue vs. Sellers catalogue

More information

OWL based XML Data Integration

OWL based XML Data Integration OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,

More information

A Framework and Architecture for Quality Assessment in Data Integration

A Framework and Architecture for Quality Assessment in Data Integration A Framework and Architecture for Quality Assessment in Data Integration Jianing Wang March 2012 A Dissertation Submitted to Birkbeck College, University of London in Partial Fulfillment of the Requirements

More information

Peer Data Management Systems Concepts and Approaches

Peer Data Management Systems Concepts and Approaches Peer Data Management Systems Concepts and Approaches Armin Roth HPI, Potsdam, Germany Nov. 10, 2010 Armin Roth (HPI, Potsdam, Germany) Peer Data Management Systems Nov. 10, 2010 1 / 28 Agenda 1 Large-scale

More information

Comparing Data Integration Algorithms

Comparing Data Integration Algorithms Comparing Data Integration Algorithms Initial Background Report Name: Sebastian Tsierkezos tsierks6@cs.man.ac.uk ID :5859868 Supervisor: Dr Sandra Sampaio School of Computer Science 1 Abstract The problem

More information

Query Management in Data Integration Systems: the MOMIS approach

Query Management in Data Integration Systems: the MOMIS approach Dottorato di Ricerca in Computer Engineering and Science Scuola di Dottorato in Information and Communication Technologies XXI Ciclo Università degli Studi di Modena e Reggio Emilia Dipartimento di Ingegneria

More information

An introduction to data integration. Prof. Letizia Tanca Technologies for Information Systems

An introduction to data integration. Prof. Letizia Tanca Technologies for Information Systems An introduction to data integration Prof. Letizia Tanca Technologies for Information Systems 1 Motivation In modern Information Systems there is a growing quest for achieving integration of SW applications,

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

Data integration general setting

Data integration general setting Data integration general setting A source schema S: relational schema XML Schema (DTD), etc. A global schema G: could be of many different types too A mapping M between S and G: many ways to specify it,

More information

Data Integration: A Theoretical Perspective

Data Integration: A Theoretical Perspective Data Integration: A Theoretical Perspective Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, I 00198 Roma, Italy lenzerini@dis.uniroma1.it ABSTRACT

More information

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems Dottorato di Ricerca in Ingegneria dell Informazione e sua applicazione nell Industria e nei Servizi Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems presenter: (pense@inform.unian.it)

More information

Schema Mediation and Query Processing in Peer Data Management Systems

Schema Mediation and Query Processing in Peer Data Management Systems Schema Mediation and Query Processing in Peer Data Management Systems by Jie Zhao B.Sc., Fudan University, 2003 A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of

More information

Integrating Heterogeneous Data Sources Using XML

Integrating Heterogeneous Data Sources Using XML Integrating Heterogeneous Data Sources Using XML 1 Yogesh R.Rochlani, 2 Prof. A.R. Itkikar 1 Department of Computer Science & Engineering Sipna COET, SGBAU, Amravati (MH), India 2 Department of Computer

More information

Structured and Semi-Structured Data Integration

Structured and Semi-Structured Data Integration UNIVERSITÀ DEGLI STUDI DI ROMA LA SAPIENZA DOTTORATO DI RICERCA IN INGEGNERIA INFORMATICA XIX CICLO 2006 UNIVERSITÉ DE PARIS SUD DOCTORAT DE RECHERCHE EN INFORMATIQUE Structured and Semi-Structured Data

More information

XML Data Integration in OGSA Grids

XML Data Integration in OGSA Grids XML Data Integration in OGSA Grids Carmela Comito and Domenico Talia University of Calabria Italy comito@si.deis.unical.it Outline Introduction Data Integration and Grids The XMAP Data Integration Framework

More information

Integrating and Exchanging XML Data using Ontologies

Integrating and Exchanging XML Data using Ontologies Integrating and Exchanging XML Data using Ontologies Huiyong Xiao and Isabel F. Cruz Department of Computer Science University of Illinois at Chicago {hxiao ifc}@cs.uic.edu Abstract. While providing a

More information

A Hybrid Approach for Ontology Integration

A Hybrid Approach for Ontology Integration A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve

More information

Requirements for Context-dependent Mobile Access to Information Services

Requirements for Context-dependent Mobile Access to Information Services Requirements for Context-dependent Mobile Access to Information Services Augusto Celentano Università Ca Foscari di Venezia Fabio Schreiber, Letizia Tanca Politecnico di Milano MIS 2004, College Park,

More information

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise

More information

A Service-based Approach to Schema Federation of Distributed Databases

A Service-based Approach to Schema Federation of Distributed Databases A Service-based Approach to Federation of Distributed Databases 1 Leena Al-Hussaini 2 Stratis Viglas 1 Malcolm Atkinson 1 National e-science Center, University of Edinburgh, UK 2 School of Informatics,

More information

Data Integration. May 9, 2014. Petr Kremen, Bogdan Kostov (petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz)

Data Integration. May 9, 2014. Petr Kremen, Bogdan Kostov (petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz) Data Integration Petr Kremen, Bogdan Kostov petr.kremen@fel.cvut.cz, bogdan.kostov@fel.cvut.cz May 9, 2014 Data Integration May 9, 2014 1 / 33 Outline 1 Introduction Solution approaches Technologies 2

More information

Grid Data Integration based on Schema-mapping

Grid Data Integration based on Schema-mapping Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

View-based Data Integration

View-based Data Integration View-based Data Integration Yannis Katsis Yannis Papakonstantinou Computer Science and Engineering UC San Diego, USA {ikatsis,yannis}@cs.ucsd.edu DEFINITION Data Integration (or Information Integration)

More information

Data Integration and Network Marketing

Data Integration and Network Marketing CID Name Quarter CSE444 Databases fall CSE541 Operating systems winter Data Integration Alon Halevy Google Inc. University of Aalborg September, 2007 Introduction What is Data Integration and Why is it

More information

XML Interoperability

XML Interoperability XML Interoperability Laks V. S. Lakshmanan Department of Computer Science University of British Columbia Vancouver, BC, Canada laks@cs.ubc.ca Fereidoon Sadri Department of Mathematical Sciences University

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

A Uniform Approach to Workflow and Data Integration

A Uniform Approach to Workflow and Data Integration A Uniform Approach to Workflow and Data Integration Lucas Zamboulis 1, 2, Nigel Martin 1, Alexandra Poulovassilis 1 1 School of Computer Science and Information Systems, Birkbeck, Univ. of London 2 Department

More information

Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures

Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures Yaoling Zhu A dissertation submitted in fulfillment of the requirement for the award of Master

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at http://www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2005 Vol. 4, No.2, March-April 2005 On Metadata Management Technology: Status and Issues

More information

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen

More information

Data Management in Peer-to-Peer Data Integration Systems

Data Management in Peer-to-Peer Data Integration Systems Book Title Book Editors IOS Press, 2003 1 Data Management in Peer-to-Peer Data Integration Systems Diego Calvanese a, Giuseppe De Giacomo b, Domenico Lembo b,1, Maurizio Lenzerini b, and Riccardo Rosati

More information

Data Integration of Bioinformatics and Web-Based Software Development

Data Integration of Bioinformatics and Web-Based Software Development Integration of Biological XML data Ph. D. Lecture Bioinformatics & Software Systems Lab. Woo-Hyuk Jang Information and Communications Univ. Where are we? Client-Side Info. Management Business related Issues

More information

A STATISTICAL DATA FUSION TECHNIQUE IN VIRTUAL DATA INTEGRATION ENVIRONMENT

A STATISTICAL DATA FUSION TECHNIQUE IN VIRTUAL DATA INTEGRATION ENVIRONMENT A STATISTICAL DATA FUSION TECHNIQUE IN VIRTUAL DATA INTEGRATION ENVIRONMENT Mohamed M. Hafez 1, Ali H. El-Bastawissy 1 and Osman M. Hegazy 1 1 Information Systems Dept., Faculty of Computers and Information,

More information

Principles of Distributed Database Systems

Principles of Distributed Database Systems M. Tamer Özsu Patrick Valduriez Principles of Distributed Database Systems Third Edition

More information

Access control for data integration in presence of data dependencies. Mehdi Haddad, Mohand-Saïd Hacid

Access control for data integration in presence of data dependencies. Mehdi Haddad, Mohand-Saïd Hacid Access control for data integration in presence of data dependencies Mehdi Haddad, Mohand-Saïd Hacid 1 Outline Introduction Motivating example Related work Approach Detection phase (Re)configuration phase

More information

DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM

DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM DATA CO-ORDINATION OF SEMANTIC SUPERVISING IN P2P DATABASE SYSTEM 1 GANESAN VEERAPPAN, 2 SURESH GNANA DHAS 1 Asst. Professor, Dept. of MCA, Sri Venkateswara College of Engg. & Tech., Thiruvallur Dt. 2

More information

IJSER Figure1 Wrapper Architecture

IJSER Figure1 Wrapper Architecture International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May-2014 24 ONTOLOGY BASED DATA INTEGRATION WITH USER FEEDBACK Devini.K, M.S. Hema Abstract-Many applications need to access

More information

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609. Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December

More information

FUTURE VIEWS OF FIELD DATA COLLECTION IN STATISTICAL SURVEYS

FUTURE VIEWS OF FIELD DATA COLLECTION IN STATISTICAL SURVEYS FUTURE VIEWS OF FIELD DATA COLLECTION IN STATISTICAL SURVEYS Sarah Nusser Department of Statistics & Statistical Laboratory Iowa State University nusser@iastate.edu Leslie Miller Department of Computer

More information

Filtering the Web to Feed Data Warehouses

Filtering the Web to Feed Data Warehouses Witold Abramowicz, Pawel Kalczynski and Krzysztof We^cel Filtering the Web to Feed Data Warehouses Springer Table of Contents CHAPTER 1 INTRODUCTION 1 1.1 Information Systems 1 1.2 Information Filtering

More information

INTEROPERABILITY IN DATA WAREHOUSES

INTEROPERABILITY IN DATA WAREHOUSES INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content

More information

Piazza: Data Management Infrastructure for Semantic Web Applications

Piazza: Data Management Infrastructure for Semantic Web Applications Piazza: Data Management Infrastructure for Semantic Web Applications Alon Y. Halevy Zachary G. Ives Peter Mork Igor Tatarinov University of Washington Box 352350 Seattle, WA 98195-2350 {alon,zives,pmork,igor}@cs.washington.edu

More information

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer Data Warehousing Overview, Terminology, and Research Issues 1 Heterogeneous Database Integration Integration System World Wide Web Digital Libraries Scientific Databases Personal Databases Collects and

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

Chapter 11 Mining Databases on the Web

Chapter 11 Mining Databases on the Web Chapter 11 Mining bases on the Web INTRODUCTION While Chapters 9 and 10 provided an overview of Web data mining, this chapter discusses aspects of mining the databases on the Web. Essentially, we use the

More information

How To Understand Data Integration

How To Understand Data Integration Data Integration 1 Giuseppe De Giacomo e Antonella Poggi Dipartimento di Informatica e Sistemistica Antonio Ruberti Università di Roma La Sapienza Seminari di Ingegneria Informatica: Integrazione di Dati

More information

6 Data Quality Issues in Data Integration Systems

6 Data Quality Issues in Data Integration Systems 6 Data Quality Issues in Data Integration Systems 6.1 Introduction In distributed environments, data sources are typically characterized by various kinds of heterogeneities that can be generally classified

More information

Raddad Al King, Abdelkader Hameurlain, Franck Morvan

Raddad Al King, Abdelkader Hameurlain, Franck Morvan Raddad Al King, Abdelkader Hameurlain, Franck Morvan Institut de Recherche en Informatique de Toulouse (IRIT), Université Paul Sabatier 118, route de Narbonne, F-31062 Toulouse Cedex 9, France E-mail:

More information

CMDB Federation. DMTF Standards for Federating CMDBs and other Management Data Repositories

CMDB Federation. DMTF Standards for Federating CMDBs and other Management Data Repositories CMDB Federation DMTF Standards for Federating CMDBs and other Management Data Repositories Synopsis Many organizations base IT management on a configuration management system consisting of a configuration

More information

Navigational Plans For Data Integration

Navigational Plans For Data Integration Navigational Plans For Data Integration Marc Friedman University of Washington friedman@cs.washington.edu Alon Levy University of Washington alon@cs.washington.edu Todd Millstein University of Washington

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

XQuery and the E-xml Component suite

XQuery and the E-xml Component suite An Introduction to the e-xml Data Integration Suite Georges Gardarin, Antoine Mensch, Anthony Tomasic e-xmlmedia, 29 Avenue du Général Leclerc, 92340 Bourg La Reine, France georges.gardarin@e-xmlmedia.fr

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server

Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server Chapter 3 Database Architectures and the Web Transparencies Database Environment - Objectives The meaning of the client server architecture and the advantages of this type of architecture for a DBMS. The

More information

Semantic Information Retrieval from Distributed Heterogeneous Data Sources

Semantic Information Retrieval from Distributed Heterogeneous Data Sources Semantic Information Retrieval from Distributed Heterogeneous Sources K. Munir, M. Odeh, R. McClatchey, S. Khan, I. Habib CCS Research Centre, University of West of England, Frenchay, Bristol, UK Email

More information

Topics in basic DBMS course

Topics in basic DBMS course Topics in basic DBMS course Database design Transaction processing Relational query languages (SQL), calculus, and algebra DBMS APIs Database tuning (physical database design) Basic query processing (ch

More information

Constraint-Based XML Query Rewriting for Data Integration

Constraint-Based XML Query Rewriting for Data Integration Constraint-Based XML Query Rewriting for Data Integration Cong Yu Department of EECS, University of Michigan congy@eecs.umich.edu Lucian Popa IBM Almaden Research Center lucian@almaden.ibm.com ABSTRACT

More information

Virtual Data Integration

Virtual Data Integration Virtual Data Integration Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada www.scs.carleton.ca/ bertossi bertossi@scs.carleton.ca Chapter 1: Introduction and Issues 3 Data

More information

EDG Project: Database Management Services

EDG Project: Database Management Services EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 Leanne.Guy@cern.ch http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in

More information

CSE 233. Database System Overview

CSE 233. Database System Overview CSE 233 Database System Overview 1 Data Management An evolving, expanding field: Classical stand-alone databases (Oracle, DB2, SQL Server) Computer science is becoming data-centric: web knowledge harvesting,

More information

XML DATA INTEGRATION SYSTEM

XML DATA INTEGRATION SYSTEM XML DATA INTEGRATION SYSTEM Abdelsalam Almarimi The Higher Institute of Electronics Engineering Baniwalid, Libya Belgasem_2000@Yahoo.com ABSRACT This paper describes a proposal for a system for XML data

More information

XOP: Sharing XML Data Objects through Peer-to-Peer Networks

XOP: Sharing XML Data Objects through Peer-to-Peer Networks 22nd International Conference on Advanced Information Networking and Applications XOP: Sharing XML Data Objects through Peer-to-Peer Networks Itamar de Rezende, Frank Siqueira Department of Informatics

More information

Data Integration in Social Networks A Survey

Data Integration in Social Networks A Survey Data Integration in Social Networks A Survey CSci 5707 Principles of Database Systems Fall 2008 Project Guide: Prof. Jaideep Srivastava by Rasik Phalak [ 3908703 ] Srinivasan Krishnan [ 3942424 ] Table

More information

Enterprise Modeling and Data Warehousing in Telecom Italia

Enterprise Modeling and Data Warehousing in Telecom Italia Enterprise Modeling and Data Warehousing in Telecom Italia Diego Calvanese Faculty of Computer Science Free University of Bolzano/Bozen Piazza Domenicani 3 I-39100 Bolzano-Bozen BZ, Italy Luigi Dragone,

More information

The Role of Ontologies in Data Integration

The Role of Ontologies in Data Integration The Role of Ontologies in Data Integration Isabel F. Cruz Huiyong Xiao ADVIS Lab Department of Computer Science University of Illinois at Chicago, USA {ifc hxiao}@cs.uic.edu Abstract In this paper, we

More information

Data Integration in Multi-sources Information Systems

Data Integration in Multi-sources Information Systems ISSN (e): 2250 3005 Vol, 05 Issue, 01 January 2015 International Journal of Computational Engineering Research (IJCER) Data Integration in Multi-sources Information Systems Adham mohsin saeed Computer

More information

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057 UNIVERSITY OF TRENTO DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM Albena Roshelova

More information

BIOLOGICAL AND CLINICAL DATA INTEGRATION AND ITS APPLICATIONS IN HEALTHCARE

BIOLOGICAL AND CLINICAL DATA INTEGRATION AND ITS APPLICATIONS IN HEALTHCARE BIOLOGICAL AND CLINICAL DATA INTEGRATION AND ITS APPLICATIONS IN HEALTHCARE A Thesis Presented to The Academic Faculty by Matthew S. Hagen In Partial Fulfillment of the Requirements for the Degree Doctor

More information

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

BUSINESS VALUE OF SEMANTIC TECHNOLOGY BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director

More information

Composing Schema Mappings: An Overview

Composing Schema Mappings: An Overview Composing Schema Mappings: An Overview Phokion G. Kolaitis UC Santa Scruz & IBM Almaden Joint work with Ronald Fagin, Lucian Popa, and Wang-Chiew Tan The Data Interoperability Challenge Data may reside

More information

Abstract 1. INTRODUCTION

Abstract 1. INTRODUCTION A Virtual Database Management System For The Internet Alberto Pan, Lucía Ardao, Manuel Álvarez, Juan Raposo and Ángel Viña University of A Coruña. Spain e-mail: {alberto,lucia,mad,jrs,avc}@gris.des.fi.udc.es

More information

Modern Databases. Database Systems Lecture 18 Natasha Alechina

Modern Databases. Database Systems Lecture 18 Natasha Alechina Modern Databases Database Systems Lecture 18 Natasha Alechina In This Lecture Distributed DBs Web-based DBs Object Oriented DBs Semistructured Data and XML Multimedia DBs For more information Connolly

More information

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Diego Calvanese Giuseppe De Giacomo Riccardo Rosati Dipartimento di Informatica e Sistemistica Università

More information

Technical Writing Examples Plus A Few Tips

Technical Writing Examples Plus A Few Tips Technical Writing Examples Plus A Few Tips What is wrong? How to rewrite? Before The volume of information has been rapidly increasing in the past few decades. While computer technology has played a significant

More information

The Piazza Peer Data Management Project

The Piazza Peer Data Management Project The Piazza Peer Management Project Igor Tatarinov 1, Zachary Ives 2, Jayant Madhavan 1, Alon Halevy 1, Dan Suciu 1, Nilesh Dalvi 1, Xin (Luna) Dong 1, Yana Kadiyska 1, Gerome Miklau 1, Peter Mork 1 1 Department

More information

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN):

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Karl Helmer Ph.D. Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital June 4, 2010 BIRN

More information

Integration Technologies Group (ITG) ITIL V3 Service Asset and Configuration Management Assessment Robert R. Vespe Page 1 of 19

Integration Technologies Group (ITG) ITIL V3 Service Asset and Configuration Management Assessment Robert R. Vespe Page 1 of 19 Service Asset and Configuration 1. Does the tool facilitate the registration and management of an organization s logical, physical and virtual Configuration Items (CIs)? For example, services, systems,

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

SERVICE ORIENTED ONTOLOGY BASED DATA FEDERATION FOR HETEROGENEOUS DATA SOURCES

SERVICE ORIENTED ONTOLOGY BASED DATA FEDERATION FOR HETEROGENEOUS DATA SOURCES SERVICE ORIENTED ONTOLOGY BASED DATA FEDERATION FOR HETEROGENEOUS DATA SOURCES HEMA M S 1, CHANDRAMATHI S 2 1 Department of Computer Science and Engineering, Kumaraguru College of Technology 2 Department

More information

Data Integration Hub for a Hybrid Paper Search

Data Integration Hub for a Hybrid Paper Search Data Integration Hub for a Hybrid Paper Search Jungkee Kim 1,2, Geoffrey Fox 2, and Seong-Joon Yoo 3 1 Department of Computer Science, Florida State University, Tallahassee FL 32306, U.S.A., jungkkim@cs.fsu.edu,

More information

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML?

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML? CS2Bh: Current Technologies Introduction to XML and Relational Databases Spring 2005 Introduction to Databases CS2 Spring 2005 (LN5) 1 Why databases? Why not use XML? What is missing from XML: Consistency

More information

Data Integration and Exchange. L. Libkin 1 Data Integration and Exchange

Data Integration and Exchange. L. Libkin 1 Data Integration and Exchange Data Integration and Exchange L. Libkin 1 Data Integration and Exchange Traditional approach to databases A single large repository of data. Database administrator in charge of access to data. Users interact

More information

Accessing Data Integration Systems through Conceptual Schemas (extended abstract)

Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università

More information

Query Reformulation: Data Integration Approach to Multi Domain Query Answering System

Query Reformulation: Data Integration Approach to Multi Domain Query Answering System Niharika Pujari, Debahuti Mishra and Kaberi Das 66 Query Reformulation: Data Integration Approach to Multi Domain Query Answering System Niharika Pujari Department of Information Technology Institute of

More information

Reverse Engineering in Data Integration Software

Reverse Engineering in Data Integration Software Database Systems Journal vol. IV, no. 1/2013 11 Reverse Engineering in Data Integration Software Vlad DIACONITA The Bucharest Academy of Economic Studies diaconita.vlad@ie.ase.ro Integrated applications

More information

Module 1: Getting Started with Databases and Transact-SQL in SQL Server 2008

Module 1: Getting Started with Databases and Transact-SQL in SQL Server 2008 Course 2778A: Writing Queries Using Microsoft SQL Server 2008 Transact-SQL About this Course This 3-day instructor led course provides students with the technical skills required to write basic Transact-

More information

An industry perspective on deployed semantic interoperability solutions

An industry perspective on deployed semantic interoperability solutions An industry perspective on deployed semantic interoperability solutions Ralph Hodgson, CTO, TopQuadrant SEMIC Conference, Athens, April 9, 2014 https://joinup.ec.europa.eu/community/semic/event/se mic-2014-semantic-interoperability-conference

More information

Query Evaluation Techniques for Data Integration Systems

Query Evaluation Techniques for Data Integration Systems Query Evaluation Techniques for Data Integration Systems Der Fakultät für Mathematik und Informatik der Universität Passau vorgelegte Dissertation zur Erlangung des akademischen Grades eines Doktors der

More information

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute

More information

9 Collaborative Business Intelligence

9 Collaborative Business Intelligence 9 Collaborative Business Intelligence Stefano Rizzi Department of Electronics, Computer Sciences and Systems (DEIS) University of Bologna Bologna, Italy stefano.rizzi@unibo.it Summary. The idea of collaborative

More information

DataFoundry Data Warehousing and Integration for Scientific Data Management

DataFoundry Data Warehousing and Integration for Scientific Data Management UCRL-ID-127593 DataFoundry Data Warehousing and Integration for Scientific Data Management R. Musick, T. Critchlow, M. Ganesh, K. Fidelis, A. Zemla and T. Slezak U.S. Department of Energy Livermore National

More information

XML Data Transformation and Integration A Schema Transformation Approach

XML Data Transformation and Integration A Schema Transformation Approach XML Data Transformation and Integration A Schema Transformation Approach Lucas Zamboulis November 2009 A Dissertation Submitted to Birkbeck College, University of London in Partial Fulfillment of the Requirements

More information