XML DATA INTEGRATION SYSTEM
|
|
- Archibald Carroll
- 2 years ago
- Views:
Transcription
1 XML DATA INTEGRATION SYSTEM Abdelsalam Almarimi The Higher Institute of Electronics Engineering Baniwalid, Libya ABSRACT This paper describes a proposal for a system for XML data Integration and Querying via Mediation (XIQM). An XML mediation layer is introduced as a main component of XIQM. It is used as a tool for querying heterogeneous XML data sources associated with XML schemas of diverse formats. Such a tool manages two important tasks: mappings among XML schemas and XML data querying. The former is performed through a semi-automatic process that generates local and global paths. An XML Query Translator for the latter task is developed to translate a global user query into local queries using the mappings that are defined in an XML document. KEYWORDS: Data integration, mediation, XML Schema, XML Query languages. 1. INTRODUCTION XML [10] has become the standard format to exchange information over the internet. The advantages of XML as an exchange model, such as rich expressiveness, clear notation, and extensibility, make it the best candidate for supporting the integrated data model. Tools and infrastructures for data integration are required due to the increasing number of distributed heterogeneous data sources on-line. However, modern business often needs to combine heterogeneous data from different data sources. Therefore, tools are needed to mediate between user queries and heterogeneous data sources to translate such queries into local queries. As the importance of XML has increased, a series of standards has grown up around it, many of which were defined by the World Wide Web Consortium (W3C). For example, XML Schema language [11] provides a notation for defining new types of XML elements and XML documents. Our system prototype called XIQM (XML data Integration and Querying via Mediation) has been built to perform the mappings among XML schemas, producing a mediation layer, which is then used to generate local queries. The mediation layer is proposed as a main component to describe the mappings between global XML schema and local heterogeneous XML schemas. It produces a uniform interface over the local XML data sources and provides the required functionality to query these sources in a uniform way. It involves two important units: the XML Metadata Document (XMD) and the Query Translator. The XMD is an XML document containing metadata, in which the mappings between global and local schemas are defined. In general, the major difficulty of connecting the global XML schema and the local XML schemas comes from the large number of data sources. Therefore, it is absolutely necessary to generate mappings automatically. The designer interaction is necessary; two terms may refer to different concepts and may not have the same meaning. Only a human at the present time is able to guarantee the semantic consistency of such a mapping. Thus, we implement a simple form (GUI) as an assistant tool for mapping generation. In the paper [1], only one unit of our system was introduced. We described the process of the XMD generation in which a semi-automatic process for integrating XML schemas is implemented using GUI. Such a process takes a set of local XML schemas and generates an XML Metadata Document that can be used for query purposes. In this paper, we introduce the whole system including a new method for XML query translation which is the integral part of the system. Currently, we use Quilt [3] as XML query language, but we can move to XQuery language without problems. The rest of the paper is organized as follows. Section 2 introduces the related work. In section 3, we present an overview of the system architecture. Section 4 describes the schema integration process. Section 5 shows the XML Metadata Document generation. Section 6 describes the query translator unit. Examples of query translation are introduced in Section 7. Finally, we conclude the paper and point out the future work. 2. RELATED WORK Data integration has received significant attention since the early days of databases. In the recent years, there have been several works focusing on heterogeneous information integration. Most of them are based on common mediator architecture [9]. In this architecture, mediators provide a uniform user interface to views of heterogeneous data sources. They resolve queries over global concepts into sub-queries over data sources.
2 Figure 1: The system architecture. Mainly, they can be classified into structural approaches and semantic approaches. In structural approaches, local data sources are assumed as crucial. The integration is done by providing or automatically generating a global unified schema that characterizes the underlying data sources. On the other hand, in semantic approaches, integration is obtained by sharing a common ontology among the data sources. According to the mapping direction, the approaches are classified into two categories: global-as-view and local-as-view [5]. In global-as-view approaches, each item in the global schema is defined as a view over the source schemas. In local-as-view approaches, each item in each source schema is defined as a view over the global schema. The local-as-view approach better supports a dynamic environment, where data sources can be added to the integration system without the need to restructure the global schema. Well-known research projects such as Garlic [4], Tsimmis [8], MedMaker [7], and Mix [2] are structural approaches. A common data model is used, e.g., OEM (Object Exchange Model) in Tsimmis and MedMaker. Mix uses XML as the data model; an XML query language XMAS was developed and used as the view definition language there. DDXMI [6] (for Distributed Database XML Metadata Interface) builds on XML Metadata Interchange. DDXMI is a master file including database information, XML path information (a path for each node starting from the root), and semantic information about XML elements and attributes. A system prototype has been built that generates a tool to do the metadata integration, producing a master DDXMI file, which is then used to generate queries to local databases from master queries. In this approach local sources were designed according to DTD definitions. Therefore, the integration process is started from the DTD parsing that is associated to each source. Many efforts are being made to develop semantic approaches, based on RDF (Resource Description Framework) and Knowledge-based Integration [5]. We classify our system as a structural approach and differ from the others by following both the local-asview and the global-as-view approaches. In addition to this, the XML Schema language is adopted in our work instead of DTD grammar language, which has limited applicability. 3. SYSTEM ARCHITECTURE The entire architecture of XIQM is presented above in Figure 1. The data sources that we are interested in are XML documents satisfying different XML schemas. The XML Schema language is adopted in our work instead of DTD (Document Type Definition) grammar language, which has limited applicability. The main component of the system is the mediation layer, which comprises the XML Metadata Document (XMD) and the Query Translator. The XMD is an XML document containing metadata, in which the mappings between global and local schemas are defined. A GUI assistant tool is also involved, which is a simple form used to simplify the mapping process among schemas and to reduce the designer effort.
3 The function of the query translator is rewriting a parsed global query into a local query for each local source. The main idea is that when a global query over the global XML schema is posed, it is automatically translated by the Query Translator unit into subqueries, called local queries, which fit each local database format using the information stored in XMD. 4. AN APPLICATION EXAMPLE To clarify our approach, we introduce an example in which three publishers' database sites are used. Our objective is to create a global view over these heterogeneous sites to be used for query purposes. The publishers are Addison Wesley (AW), Prentice Hall (PH), and Wiley (Wiley). The structure of each site was studied carefully and their XML schemas were defined. Although AW, PH, and Wiley all contain book information, the data structures are different. Let us assume that the author information of the global schema is divided into first name and last name, while in the local sources it is represented as full name. Also the price unit of the local sources is the dollar, while the global schema uses the euro. In addition, the book format of AW is represented as a single element, while in PH is divided into two elements: CoverType and Pages. We present in Figure 2 a part of the tree structures of the schemas that are used in the example. Figure 2: A Part of the tree structure for XSDs. 5. XML SCHEMA PROCESSING The XML schema is itself an XML document, which we denote as XSD (XML schema definition). It is a sequence of components where each component is an attribute, or an element or a simple type or complex type. The JDOM API is used for reading XML schema documents in memory. In fact, JDOM itself does not include a parser. Instead it depends on a SAX parser [12], which can be used to parse documents and build JDOM models from them. 5.1 XSD MODELING We model XSD as a tree structure whose nodes are components of the corresponding local sources. Figure 3: CHILD function table for source AW.
4 Each component corresponds to the occurrence of a tag, to the occurrence of an attribute, to the content of tag, and so on. We formulate an XSD Model (XModel), where an XSD can be described. We consider a set of nodes N that can be represented as: E for element names, A for attribute names, and T for type names. Here, we do not aim at complete formalization of all details of XML Schema, but rather capturing its essential modeling features which meet the basic requirements of our approach. Hence, we consider the below functions as basic requirements in which an XSD can be characterized: extracted and a new tree data structure x is constructed. 3. A unique number is given to each node of x to resolve naming conflicts. 4. A depth-first traversal is performed on x; and the CHILD function is materialized. Figure 3 shows the generated CHILD function (represented by a table) for AW source. It is obvious that, e.g. for node AW 1, we obtain the associated set of its children (here represented as an array) [Discipline 2, Curriculum 3, Course 4, Books 5]. Root: Φ N returns the unique root node of the document, Children: N [N] returns the ordered list of children of a node, or the empty list [ ] in the case of a leaf node, Attr: N A returns an attribute name, Tag: N E returns the unique tag (e.g. element name) of a node, Type: N T returns a type name of a node. Definition (Names) An XModel schema is a tree that holds a set of nodes N which can be disjoint sets T, E, and A of type names, element names, and attribute names, respectively. Definition (Types) Given an XModel for an XML schema definition, then each type t T in the schema is either a simple type Ts, or a complex type Tc, where T = Ts Tc. 5.2 EXTRACING XSD COMPONETS To generate a unique path for each element or attribute of the schemas, we need to search the XModel structure and extract out the components that we are interested in. Mainly, only the values of the elements and attributes are requested. In other words, we omit the names of components which contain a complex type. Therefore, we assume that ELEMENTS and ATTRIBUTES are a set of elements and attributes, respectively, of the XModel. Formally, we introduce a function: CHILD:COMPONENT (COMPONENT), COMPONENT= ELEMENTS ATTRIBUTE which assigns a multi set of child components to each component in an XSD1. Basically, the CHILD function is founded to materialize the XSD components that are needed. The process of extracting XSD components comprises the following steps: 1. After a XModel tree is formed for each XSD. 2. For each XSD object, the value of each components name (exclude the name and type) is 1 In our implementation CHILD is realized by a JAVA 2 hash table assigning to a parent as key and its children as values. Figure 4: A sample of an XMD document. 5.3 XSDs MEDIATION In order to obtain local queries for a query issued against the global XML schema, the system must identify the XML data sources concerning a given query. For this task, the XML Metadata Document (XMD) is utilized as mediation to overcome the heterogeneity of data sources. XMD is proposed to maintain the correspondence between the components of the XSDs. The process of XMD generation comprises the following steps: 1. The generated CHILD table for each XSD is traversed to obtain a unique path for each component of the XSD tree structure starting from the root. 5. A simple form (GUI) is generated for each XSD as an assistant tool for mapping generation. 6. Using the GUI for each XSD, a unique index number is assigned for the equivalent local and global paths. Also, a null value is specified in the
5 case of one-to-one mapping or the required function name is specified in other cases. By gathering the same indices, the equivalent paths are grouped and the XMD document is easily created. A sample of XMD structure with its XSD is shown in Figure 4. Components in the global XSD are called source components <source>, while corresponding components in local XSDs are called destination components <dest>. In our example there are three local sources. Thus, each <source> element is followed by three <dest> elements. Moreover, XMD contains information about the required functions which is represented by the <function> element if it is needed to perform a specific operation for a specific <source> element. 6 QUERY TRANSLATION PROCESS Once the XMD is generated, queries can be posed on the global XML schema and easily evaluated. We developed a method for querying distributed heterogeneous XML data sources. A query translator unit is implemented, which is an integral part of the mediation layer. Its function is to translate global queries into queries that are fit to the data sources. When a global user query is posed against the global schema, first it is parsed, then The XMD document is read, and also parsed by SAX, and the number of local sources is identified. The same idea which we have used for XML schema parsing is also applied here. In this case the CHILD function is of the form CHILD: SOURCE (DESTINATION) (OPERATION) where the SOURCE is a set of the global paths in the XMD, the DESTINATION is denoted to the corresponding local paths, and the OPERATION is a set of the corresponding semantic functions names which are used for resolving heterogeneity conflicts. A CHILD function table t is constructed for the XMD, in which each <source> element value in XMD (global element path) is represented as a key and associated with their <dest> elements values as values (local elements paths). Also <function> elements values in XMD are represented in t as values and its corresponding <path> value as key. The basic idea is that, for each path in the global query (should be a <source> component in XMD), if there is a non-empty value of the corresponding local components (<dest> component in XMD), then by navigating the XMD document, the paths in that query are replaced by paths to the <dest> values to get a local query. Otherwise, an empty query is generated for the corresponding path in the local query, which means this query cannot be applied to such local source. Each (generated) local query is sent to the corresponding local source engine, which will execute the query locally and return the result to the global query. 7 QUERY TRANSLATION EXAMPLES In order to support our analysis, we introduce some examples of XML queries. Two cardinality cases are investigated: many-to-one and one-to- many. 7.1 MANY-TO-ONE QUERY EXAMPLE Q1: FOR $a IN document("global.xml )//Author//Name RETURN <Author> <Name> $a/lname, $a/fname </Name> </Author> In this example, the LName and FName elements in the global schema are mapped to Name element in a local schema. That means we need to provide a specific function to separate the full name into a first name and last name. From the XMD fragment in Figure 4, it is clear that two different <source> element values (in XMD document) have the same <dest> element values. In other words, we need to split the instance value of the destination element value to generate the appropriate local queries. The resulted local query for AW source as following: FUNCTION Fname_fun ($par) {split (, $par) [1]} FUNCTION LName_fun ($par) {document (split (, $par) [2])} FOR $b IN document( source1.xml )//Book RETURN <Book> FName_Fun($a/name), LName_Fun($a/name) </Book> 7.2 ONE-TO-MANY QUERY EXAMPLE Q2: FOR $b IN document("global.xml")//book RETURN <Book>$b/Format</Book> This example illustrates the case that can be happened when there is a node in the global schema mapped to many nodes in a local schema. Figure 2 above describes the mapping of this query. To answer this kind of mapping we need to provide a specific function to perform this task. For example, the global element Format is represented in Wiley by two different local elements: Pages and CoverTypes. Figure 5 shows the execution of Q2.
6 Figure 5: Example of a global query translation. 8 CONCLUSIONS In this paper, we have described our system for resolving structural and semantic conflicts for distributed heterogeneous XML data. We developed a mediation layer for querying heterogeneous distributed XML data sources. This layer holds two main parts: the XMD and the Query Translator. We used XML Schema language for defining the XML data sources. The mediation layer is used for describing the mappings between global and local schemas. XML schemas' trees are generated automatically, each with a GUI. Semantic discrepancies are resolved by using the GUI tool for assigning index numbers to all database elements' paths. Also, we have presented the second part of the mediation layer, the Query Translator. It acts to decompose global queries into a set of sub-queries. A global query from an end-user is translated into local queries for XML data sources by looking up the corresponding paths in the XMD. Java 2, JDOM, JavaCC, and the Java-Servlet server were used as tools for the prototype implementation of this proposal. Our implementation is still early naive prototype; many issues remain to be achieved. In the future, we plan to involve more features of XML Schema. For example, the current prototype does not support paths that contain wildcards. Also, we plan to move to XQuery, instead of Quilt. REFERENCES [1] Almarimi A., and Pokorny J., A Mediation Layer for Heterogeneous XML schemas, The International Journal of Web Information Systems (IJWIS), vol. 1, no. 1, pp , [2] Baru C., Gupta A., Ludascher, B Marciano R., Papakonstantinou Y., Velikhov P., and Chu V., XML-Based Information Mediation with MIX, in Proceedings of the ACM SIGMOD Int. Conf. on Management of Data, pp , [3] Chamberlin D., Robie J., and Florescu D., Quilt: An XML Query Language for Heterogeneous Data Sources, in ACM SIGMOD Associated Workshop on the Web and databases, Dallas, Texas, pp , [4] Haas L., Kossman D., Wimmers E., and Young J., Optimizing Queries across Diverse Data Sources, in: proceedings of 23 rd Int. Conf. On VLDB, Athens, Greece, pp , [5] Lenzerini M., Data Integration: A Theoretical Perspective, in Proceedings of the ACM Symposium on Principles of Database Systems, Madison, Wisconsin, USA, pp , [6] Nam Y., Goguen J., Wang G., A Metadata Integration Assistant Generator for Heterogeneous Distributed Databases, in Proceedings of the Confederated International Conferences DOA, CoopIS and ODBASE, Irvine CA, LNCS 2519, Springer, pp , [7] Papakonstantinou Y., Garcia-Molina H., Ullman J., MedMaker: A Mediation System Based on Declarative Specifications, in Proceedings of
7 the IEEE Int. Conf. on Data Engineering, New Orleans, LA, pp , [8] Ullman J., Information Integration Using Logical Views in Proceedings. of the Int. Conf. on Database Theory, pp , [9] Wiederhold G., Mediators in the Architecture of Future Information System, in IEEE Computer Magazine, vol. 25, no. 3, pp , [10] W3C Consortium: Extensible Markup Language (XML), [11] W3C Consortium: XML Schema, [12] SAX 1.0, The Simple API for XML, ents.htm
A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS
A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS Abdelsalam Almarimi 1, Jaroslav Pokorny 2 Abstract This paper describes an approach for mediation of heterogeneous XML schemas. Such an approach is proposed
Integrating Heterogeneous Data Sources Using XML
Integrating Heterogeneous Data Sources Using XML 1 Yogesh R.Rochlani, 2 Prof. A.R. Itkikar 1 Department of Computer Science & Engineering Sipna COET, SGBAU, Amravati (MH), India 2 Department of Computer
Integration of Heterogeneous Databases based on XML
ISSN:2249-5789 Integration of Heterogeneous Databases based on XML Venciya.A Student, Department Of Computer Science And Engineering, SRM University,Kattankulathur, Venciya.a@gmail.com Abstract As companies
INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS
INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan
An Introduction to the e-xml Data Integration Suite
An Introduction to the e-xml Data Integration Suite Georges Gardarin, Antoine Mensch, Anthony Tomasic e-xmlmedia, 29 Avenue du Général Leclerc, 92340 Bourg La Reine, France georges.gardarin@e-xmlmedia.fr
Application of XML Tools for Enterprise-Wide RBAC Implementation Tasks
Application of XML Tools for Enterprise-Wide RBAC Implementation Tasks Ramaswamy Chandramouli National Institute of Standards and Technology Gaithersburg, MD 20899,USA 001-301-975-5013 chandramouli@nist.gov
Adaptive Data Mediation over XML Data
Adaptive Data Mediation over XML Data Hui Lin, Tore Risch, Timour Katchaounov Hui.Lin, Tore.Risch, Timour.Katchaounov@dis.uu.se Uppsala Database Laboratory, Uppsala University, Sweden To be published in
Accessing Data Integration Systems through Conceptual Schemas (extended abstract)
Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università
OWL based XML Data Integration
OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,
CST6445: Web Services Development with Java and XML Lesson 1 Introduction To Web Services 1995 2008 Skilltop Technology Limited. All rights reserved.
CST6445: Web Services Development with Java and XML Lesson 1 Introduction To Web Services 1995 2008 Skilltop Technology Limited. All rights reserved. Opening Night Course Overview Perspective Business
Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.
Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December
Architecture and Implementation of an XQuery-based Information Integration Platform
Architecture and Implementation of an XQuery-based Information Integration Platform Yannis Papakonstantinou Computer Science and Engineering University of California, San Diego yannis@cs.ucsd.edu Vasilis
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen
The Architectural Design of FRUIT: A Family of Retargetable User Interface Tools
The Architectural Design of : A Family of Retargetable User Interface Tools Yi Liu Computer Science University of Mississippi University, MS 38677 H. Conrad Cunningham Computer Science University of Mississippi
An Open Integrated Environment for Transparent Fuzzy Agents Design
An Open Integrated Environment for Transparent Fuzzy Agents Design Department of Mathematics and Computer Science, University of Salerno via Ponte don Melillo, 84084 Fisciano, Salerno, Italy {gacampora,loia}@unisa.it
Managing large sound databases using Mpeg7
Max Jacob 1 1 Institut de Recherche et Coordination Acoustique/Musique (IRCAM), place Igor Stravinsky 1, 75003, Paris, France Correspondence should be addressed to Max Jacob (max.jacob@ircam.fr) ABSTRACT
An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration
Proceedings of Student-Faculty Research Day, CSIS, Pace University, May 3 rd, 2013 An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration Srinivasan Shanmugam and
A Workbench for Prototyping XML Data Exchange (extended abstract)
A Workbench for Prototyping XML Data Exchange (extended abstract) Renzo Orsini and Augusto Celentano Università Ca Foscari di Venezia, Dipartimento di Informatica via Torino 155, 30172 Mestre (VE), Italy
Integrating Relational Database Schemas using a Standardized Dictionary
Integrating Relational Database Schemas using a Standardized Dictionary Ramon Lawrence Advanced Database Systems Laboratory University of Manitoba Winnipeg, Manitoba, Canada umlawren@cs.umanitoba.ca Ken
Transparency and Efficiency in Grid Computing for Big Data
Transparency and Efficiency in Grid Computing for Big Data Paul L. Bergstein Dept. of Computer and Information Science University of Massachusetts Dartmouth Dartmouth, MA pbergstein@umassd.edu Abstract
Concrete uses of XML in software development and data analysis.
Concrete uses of XML in software development and data analysis. S. Patton LBNL, Berkeley, CA 94720, USA XML is now becoming an industry standard for data description and exchange. Despite this there are
A LANGUAGE INDEPENDENT WEB DATA EXTRACTION USING VISION BASED PAGE SEGMENTATION ALGORITHM
A LANGUAGE INDEPENDENT WEB DATA EXTRACTION USING VISION BASED PAGE SEGMENTATION ALGORITHM 1 P YesuRaju, 2 P KiranSree 1 PG Student, 2 Professorr, Department of Computer Science, B.V.C.E.College, Odalarevu,
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web Corey A Harper University of Oregon Libraries Tel: +1 541 346 1854 Fax:+1 541 346 3485 charper@uoregon.edu
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute
XML-based Data Integration for Semantic Information Portals
XML-based Data Integration for Semantic Information Portals Patrick Lehti, Peter Fankhauser, Silvia von Stackelberg, Nitesh Shrestha Fraunhofer IPSI, Darmstadt, Germany lehti,fankhaus,sstackel@ipsi.fraunhofer.de
XML Interoperability
XML Interoperability Laks V. S. Lakshmanan Department of Computer Science University of British Columbia Vancouver, BC, Canada laks@cs.ubc.ca Fereidoon Sadri Department of Mathematical Sciences University
ONTOLOGY-BASED MULTIMEDIA AUTHORING AND INTERFACING TOOLS 3 rd Hellenic Conference on Artificial Intelligence, Samos, Greece, 5-8 May 2004
ONTOLOGY-BASED MULTIMEDIA AUTHORING AND INTERFACING TOOLS 3 rd Hellenic Conference on Artificial Intelligence, Samos, Greece, 5-8 May 2004 By Aristomenis Macris (e-mail: arism@unipi.gr), University of
II. PREVIOUS RELATED WORK
An extended rule framework for web forms: adding to metadata with custom rules to control appearance Atia M. Albhbah and Mick J. Ridley Abstract This paper proposes the use of rules that involve code to
An Infrastructure for Supporting Spatial Data Integration *
An Infrastructure for Supporting Spatial Data Integration * L.L. Miller and Sarah Nusser Iowa State University Ames, IA 50011 * - Funded in part by a digital government grant from the National Science
Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza
Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure
Automatic Annotation Wrapper Generation and Mining Web Database Search Result
Automatic Annotation Wrapper Generation and Mining Web Database Search Result V.Yogam 1, K.Umamaheswari 2 1 PG student, ME Software Engineering, Anna University (BIT campus), Trichy, Tamil nadu, India
Web Service Mediation Through Multi-level Views
Web Service Mediation Through Multi-level Views Manivasakan Sabesan and Tore Risch Department of Information Technology, Uppsala University, Sweden {msabesan, Tore.Risch}@it.uu.se Abstract. The web Service
Novel Data Extraction Language for Structured Log Analysis
Novel Data Extraction Language for Structured Log Analysis P.W.D.C. Jayathilake 99X Technology, Sri Lanka. ABSTRACT This paper presents the implementation of a new log data extraction language. Theoretical
Knowledge-Based Persistent Archives
SDSC TR-2001-7 Knowledge-Based Persistent Archives Reagan W. Moore San Diego Supercomputer Center Sponsored by NATIONAL ARCHIVES AND RECORDS ADMINISTRATION and ADVANCED RESEARCH PROJECTS AGENCY ITO INTELLIGENT
Data Integration for XML based on Semantic Knowledge
Data Integration for XML based on Semantic Knowledge Kamsuriah Ahmad a, Ali Mamat b, Hamidah Ibrahim c and Shahrul Azman Mohd Noah d a,d Fakulti Teknologi dan Sains Maklumat, Universiti Kebangsaan Malaysia,
An Eclipse Plug-In for Visualizing Java Code Dependencies on Relational Databases
An Eclipse Plug-In for Visualizing Java Code Dependencies on Relational Databases Paul L. Bergstein, Priyanka Gariba, Vaibhavi Pisolkar, and Sheetal Subbanwad Dept. of Computer and Information Science,
Efficient Virtual Data Integration Based on XML
Efficient Virtual Data Integration Based on XML Konstantin Antipin 1, Andrey Fomichev 2, Maxim Grinev 2, Sergey Kuznetsov 1, Leonid Novak 1, Peter Pleshachkov 2, Maria Rekouts 2, and Denis Shiryaev 1 1
Efficient Information Retrieval in Network Management Using Web Services
Efficient Information Retrieval in Network Management Using Web Services Aimilios Chourmouziadis 1, George Pavlou 1 1 Center of Communications and Systems Research, Department of Electronic and Physical
XRecursive: An Efficient Method to Store and Query XML Documents
XRecursive: An Efficient Method to Store and Query XML Documents Mohammed Adam Ibrahim Fakharaldien, Jasni Mohamed Zain, Norrozila Sulaiman Faculty of Computer System and Software Engineering, University
A Hybrid Approach for Ontology Integration
A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve
An XML Schema Extension for Structural Information in Internet and Web-Based Systems
An XML Schema Extension for Structural Information in Internet and Web-Based Systems Jacky C.K. Ma and Michael R. Lyu Department of Computer Science and Engineering The Chinese University of Hong Kong
Testing XPath Queries using Model Checking
Testing XPath Queries using Model Checking Claudio de la Riva, Javier Tuya, José García-Fanjul Computer Science Department, University of Oviedo Phone +34 98 518 26 64, Fax +34 98 518 21 56 [claudio tuya
Extracting Entity Relationship Diagram (ERD) From Relational Database Schema
, pp.15-26 http://dx.doi.org/10.14257/ijdta.2015.8.3.02 Extracting Entity Relationship Diagram (ERD) From Relational Database Schema Hala Khaled Al-Masree Mutah University, Jordan, Alkarak Hala83990@gmail.com
Query Processing in Data Integration Systems
Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152
Use of a Web-Based GIS for Real-Time Traffic Information Fusion and Presentation over the Internet
Use of a Web-Based GIS for Real-Time Traffic Information Fusion and Presentation over the Internet SUMMARY Dimitris Kotzinos 1, Poulicos Prastacos 2 1 Department of Computer Science, University of Crete
Agents and Web Services
Agents and Web Services ------SENG609.22 Tutorial 1 Dong Liu Abstract: The basics of web services are reviewed in this tutorial. Agents are compared to web services in many aspects, and the impacts of
View-based Data Integration
View-based Data Integration Yannis Katsis Yannis Papakonstantinou Computer Science and Engineering UC San Diego, USA {ikatsis,yannis}@cs.ucsd.edu DEFINITION Data Integration (or Information Integration)
Schema Mappings and Agents Actions in P2P Data Integration System 1
Journal of Universal Computer Science, vol. 14, no. 7 (2008), 1048-1060 submitted: 1/10/07, accepted: 21/1/08, appeared: 1/4/08 J.UCS Schema Mappings and Agents Actions in P2P Data Integration System 1
An Ontology-based e-learning System for Network Security
An Ontology-based e-learning System for Network Security Yoshihito Takahashi, Tomomi Abiko, Eriko Negishi Sendai National College of Technology a0432@ccedu.sendai-ct.ac.jp Goichi Itabashi Graduate School
CHAPTER 2 XML PROCESSING
10 CHAPTER 2 XML PROCESSING This chapter describes the XML syntax, XML schema description languages, validating XML, query processing on XML etc. 2.1 XML SYNTAX XML is a technology for creating markup
Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606
Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606 J.A. Maldonado, M. Robles, P. Crespo Bioengineering, Electronics and Telemedicine
A Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing
A Framework for Developing the Web-based Integration Tool for Web-Oriented Warehousing PATRAVADEE VONGSUMEDH School of Science and Technology Bangkok University Rama IV road, Klong-Toey, BKK, 10110, THAILAND
Instructor: Phone: Fax Office Hours: Bio: Table of Contents
Department of Information Technology WEBD341: Enterprise Data Exchange using XML 3 Credit Hours 8 Weeks Prerequisite(s): WEBD121: Web Development Fundamentals Instructor Information Course Description
Using Object And Object-Oriented Technologies for XML-native Database Systems
Using Object And Object-Oriented Technologies for XML-native Database Systems David Toth and Michal Valenta David Toth and Michal Valenta Dept. of Computer Science and Engineering Dept. FEE, of Computer
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic
An XML Based Data Exchange Model for Power System Studies
ARI The Bulletin of the Istanbul Technical University VOLUME 54, NUMBER 2 Communicated by Sondan Durukanoğlu Feyiz An XML Based Data Exchange Model for Power System Studies Hasan Dağ Department of Electrical
Integrating Pattern Mining in Relational Databases
Integrating Pattern Mining in Relational Databases Toon Calders, Bart Goethals, and Adriana Prado University of Antwerp, Belgium {toon.calders, bart.goethals, adriana.prado}@ua.ac.be Abstract. Almost a
Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support
Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Diego Calvanese Giuseppe De Giacomo Riccardo Rosati Dipartimento di Informatica e Sistemistica Università
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System Mohammad Ghulam Ali Academic Post Graduate Studies and Research Indian Institute of Technology, Kharagpur Kharagpur,
Annotation for the Semantic Web during Website Development
Annotation for the Semantic Web during Website Development Peter Plessers, Olga De Troyer Vrije Universiteit Brussel, Department of Computer Science, WISE, Pleinlaan 2, 1050 Brussel, Belgium {Peter.Plessers,
Web Services Based Integration Tool for Heterogeneous Databases
International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 2320-9364, ISSN (Print): 2320-9356 Volume 1 Issue 3 ǁ July. 2013 ǁ PP.16-26 Web Services Based Integration Tool for Heterogeneous
SQLMutation: A tool to generate mutants of SQL database queries
SQLMutation: A tool to generate mutants of SQL database queries Javier Tuya, Mª José Suárez-Cabal, Claudio de la Riva University of Oviedo (SPAIN) {tuya cabal claudio} @ uniovi.es Abstract We present a
Query reformulation for an XML-based Data Integration System
Query reformulation for an XML-based Data Integration System Bernadette Farias Lóscio Ceará State University Av. Paranjana, 1700, Itaperi, 60740-000 - Fortaleza - CE, Brazil +55 85 3101.8600 berna@larces.uece.br
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Example XML. XML extensible Markup Language. XML and Information Exchange XML. Outline XML
COS 597A: Principles of Database and Information Systems XML and Information Exchange 1 XML extensible Markup Language History 1988 SGML: Standard Generalized Markup Language Annotate text with structure
Caching XML Data on Mobile Web Clients
Caching XML Data on Mobile Web Clients Stefan Böttcher, Adelhard Türling University of Paderborn, Faculty 5 (Computer Science, Electrical Engineering & Mathematics) Fürstenallee 11, D-33102 Paderborn,
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management Zhan Liu, Fabian Cretton, Anne Le Calvé, Nicole Glassey, Alexandre Cotting, Fabrice Chapuis
Navigational Plans For Data Integration
Navigational Plans For Data Integration Marc Friedman University of Washington friedman@cs.washington.edu Alon Levy University of Washington alon@cs.washington.edu Todd Millstein University of Washington
Electronic Medical Records
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 1 Electronic Medical Records Mohammed F Audi, Rania A Hodhod, and Abdel-Badeeh M Salem Abstract Electronic Medical
Conceptual Workflow for Complex Data Integration using AXML
Conceptual Workflow for Complex Data Integration using AXML Rashed Salem, Omar Boussaïd and Jérôme Darmont Université de Lyon (ERIC Lyon 2) 5 av. P. Mendès-France, 69676 Bron Cedex, France Email: firstname.lastname@univ-lyon2.fr
Chapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases
Providing Semantically Equivalent, Complete Views for Multilingual Access to Integrated Data
Providing Semantically Equivalent, Complete Views for Multilingual Access to Integrated Data Iryna Kozlova, Norbert Ritter, Martin Husemann University of Hamburg, Germany {kozlova, ritter, husemann}@informatik.uni-hamburg.de
Deferred node-copying scheme for XQuery processors
Deferred node-copying scheme for XQuery processors Jan Kurš and Jan Vraný Software Engineering Group, FIT ČVUT, Kolejn 550/2, 160 00, Prague, Czech Republic kurs.jan@post.cz, jan.vrany@fit.cvut.cz Abstract.
Constraint-based Query Distribution Framework for an Integrated Global Schema
Constraint-based Query Distribution Framework for an Integrated Global Schema Ahmad Kamran Malik 1, Muhammad Abdul Qadir 1, Nadeem Iftikhar 2, and Muhammad Usman 3 1 Muhammad Ali Jinnah University, Islamabad,
Distributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
Data Integration in Multi-sources Information Systems
ISSN (e): 2250 3005 Vol, 05 Issue, 01 January 2015 International Journal of Computational Engineering Research (IJCER) Data Integration in Multi-sources Information Systems Adham mohsin saeed Computer
Data Integration for Capital Projects via Community-Specific Conceptual Representations
Data Integration for Capital Projects via Community-Specific Conceptual Representations Yimin Zhu 1, Mei-Ling Shyu 2, Shu-Ching Chen 3 1,3 Florida International University 2 University of Miami E-mail:
Model-Mapping Approaches for Storing and Querying XML Documents in Relational Database: A Survey
Model-Mapping Approaches for Storing and Querying XML Documents in Relational Database: A Survey 1 Amjad Qtaish, 2 Kamsuriah Ahmad 1 School of Computer Science, Faculty of Information Science and Technology,
XML-Based Software Development
1 XML-Based Software Development Baltasar Fernández-Manjón, Alfredo Fernández-Valmayor, Antonio Navarro, José Luis Sierra Grupo de Investigación en Ingeniería del Software e Inteligencia Artificial. Departamento
Markup Languages SGML, HTML, XML, XHTML. CS 431 February 11, 2008 Carl Lagoze Cornell University
Markup Languages SGML, HTML, XML, XHTML CS 431 February 11, 2008 Carl Lagoze Cornell University My philosophy on teaching technical tools I present the capabilities and underlying concepts And give you
Representing XML Schema in UML A Comparison of Approaches
Representing XML Schema in UML A Comparison of Approaches Martin Bernauer, Gerti Kappel, Gerhard Kramler Business Informatics Group, Vienna University of Technology, Austria {lastname}@big.tuwien.ac.at
Simplifying e Business Collaboration by providing a Semantic Mapping Platform
Simplifying e Business Collaboration by providing a Semantic Mapping Platform Abels, Sven 1 ; Sheikhhasan Hamzeh 1 ; Cranner, Paul 2 1 TIE Nederland BV, 1119 PS Amsterdam, Netherlands 2 University of Sunderland,
Efficient Mapping XML DTD to Relational Database
Efficient Mapping XML DTD to Relational Database Mohammed Adam Ibrahim Fakharaldien 1, Khalid Edris 2, Jasni Mohamed Zain 3, Norrozila Sulaiman 4 Faculty of Computer System and Software Engineering,University
On the Standardization of Semantic Web Services-based Network Monitoring Operations
On the Standardization of Semantic Web Services-based Network Monitoring Operations ChenglingZhao^, ZihengLiu^, YanfengWang^ The Department of Information Techonlogy, HuaZhong Normal University; Wuhan,
Building the Tower of Babel: Converting XML Documents to VoiceXML for Accessibility 1
Building the Tower of Babel: Converting XML Documents to VoiceXML for Accessibility 1 G. Gupta, O. El Khatib, M. F. Noamany, H. Guo Laboratory for Logic, Databases, and Advanced Programming Department
MAPPING RULES FOR ER TO XML USING XML SCHEMA
MAPPING RULES FOR ER TO XML USING XML SCHEMA Abstract Sung Jin Colorado State University Pueblo s.jin@colostate-pueblo.edu Woohyun Kang Colorado State University Pueblo w.kang@colostate-pueblo.edu The
A Mind Map Based Framework for Automated Software Log File Analysis
2011 International Conference on Software and Computer Applications IPCSIT vol.9 (2011) (2011) IACSIT Press, Singapore A Mind Map Based Framework for Automated Software Log File Analysis Dileepa Jayathilake
Logical Design of Data Warehouses from XML
Logical Design of Data Warehouses from XML Marko Banek, Zoran Skočir and Boris Vrdoljak FER University of Zagreb, Zagreb, Croatia {marko.banek, zoran.skocir, boris.vrdoljak}@fer.hr Abstract Data warehouse
DBaaS Using HL7 Based on XMDR-DAI for Medical Information Sharing in Cloud
, pp.111-120 http://dx.doi.org/10.14257/ijmue.2015.10.9.12 DBaaS Using HL7 Based on XMDR-DAI for Medical Information Sharing in Cloud Ho-Kyun Park 1 and Seok-Jae Moon 2 1 School of IT Convergence Engineering,
Grid Data Integration based on Schema-mapping
Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/
City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
EDOS Distribution System: a P2P architecture for open-source content dissemination
EDOS Distribution System: a P2P architecture for open-source content Serge Abiteboul 1, Itay Dar 2, Radu Pop 3, Gabriel Vasile 1 and Dan Vodislav 4 1. INRIA Futurs, France {firstname.lastname}@inria.fr
AN INTELLIGENT TUTORING SYSTEM FOR LEARNING DESIGN PATTERNS
AN INTELLIGENT TUTORING SYSTEM FOR LEARNING DESIGN PATTERNS ZORAN JEREMIĆ, VLADAN DEVEDŽIĆ, DRAGAN GAŠEVIĆ FON School of Business Administration, University of Belgrade Jove Ilića 154, POB 52, 11000 Belgrade,
An Introduction to Using XML for the Management of Laboratory Data
An Introduction to Using XML for the Management of Laboratory Data Mark F. Russo, Ph.D. Bristol-Myers Squibb Pharmaceutical Research Institute Princeton, NJ 08543-4000 USA Email: mark.russo@bms.com A.
Data integration is a central issue within Semantic Web research, especially when
T h e S e m a n t i c W e b Semantic Data Integration in Hierarchical Domains Isabel F. Cruz and Afsheen Rajendran, University of Illinois at Chicago Data integration is a central issue within Semantic
Optimization of ETL Work Flow in Data Warehouse
Optimization of ETL Work Flow in Data Warehouse Kommineni Sivaganesh M.Tech Student, CSE Department, Anil Neerukonda Institute of Technology & Science Visakhapatnam, India. Sivaganesh07@gmail.com P Srinivasu
Semantic Information Retrieval from Distributed Heterogeneous Data Sources
Semantic Information Retrieval from Distributed Heterogeneous Sources K. Munir, M. Odeh, R. McClatchey, S. Khan, I. Habib CCS Research Centre, University of West of England, Frenchay, Bristol, UK Email
XML-BASED INTEGRATION: A CASE STUDY
XML-BASED INTEGRATION: A CASE STUDY Chakib Chraibi, Barry University, cchraibi@mail.barry.edu José Ramirez, Barry University, jramirez@mail.barry.edu Andrew Seaga, Barry University, aseaga@mail.barry.edu
Ontology and automatic code generation on modeling and simulation
Ontology and automatic code generation on modeling and simulation Youcef Gheraibia Computing Department University Md Messadia Souk Ahras, 41000, Algeria youcef.gheraibia@gmail.com Abdelhabib Bourouis
Creating visualizations through ontology mapping
Creating visualizations through ontology mapping Sean M. Falconer R. Ian Bull Lars Grammel Margaret-Anne Storey University of Victoria {seanf,irbull,lgrammel,mstorey}@uvic.ca Abstract We explore how to