Query reformulation for an XML-based Data Integration System

Size: px
Start display at page:

Download "Query reformulation for an XML-based Data Integration System"

Transcription

1 Query reformulation for an XML-based Data Integration System Bernadette Farias Lóscio Ceará State University Av. Paranjana, 1700, Itaperi, Fortaleza - CE, Brazil berna@larces.uece.br Thiago Costa Center for Informatics Federal University of Pernambuco P.O. Box 7851 Cidade Universitária, Recife - PE, Brazil, tac@cin.ufpe.br Ana Carolina Salgado Center for Informatics Federal University of Pernambuco P.O. Box 7851 Cidade Universitária, Recife - PE, Brazil, acs@cin.ufpe.br ABSTRACT The main objective of a mediator-based data integration system is to provide a unified view of several distributed and heterogeneous data sources. This view, called mediation schema, corresponds to a set of elements computed from data available on the local data sources. One of the biggest challenges facing a data integration system consists in answering a user query submitted in terms of the mediation schema given that the data is at the sources. This problem consists mainly in reformulating the query in terms of the source schemas and in integrating the corresponding answers. In this paper we address the problem of query reformulation in the context of an XML-based data integration system. Our solution is based on algorithms that generate a set of source queries considering a given user query and a set of mappings between the mediation schema and the data source schemas. Categories and Subject Descriptors H.2.5 [Heterogeneous Databases]. General Terms Languages. Keywords Data integration, XML, query reformulation, mediation. 1. INTRODUCTION Integrated access to distributed data is an important problem faced in many scientific and commercial domains. One of the main challenging problem in the context of data integration is query processing. In order to answer a query submitted to a data integration system various steps are necessary, including query decomposition, query translation, query optimization and data integration. The complexity of the query execution process depends mainly on the approach adopted to define the mappings between the mediation schema and the source schemas. Two main Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. SAC 06, April, 23-27, 2006, Dijon, France. Copyright 2006 ACM /06/0004 $5.00. approaches have been used to specify such mappings: Global as View (GAV) and Local as View (LAV) [12, 6]. In the GAV approach each object of the global schema is expressed as a view (i.e. a query) on the data sources. On the other hand, in the LAV approach, mediation mappings are defined in an opposite way; each object in a given source is defined as a view on the global schema. In this paper we address the problem of how to process a query submitted to a data integration system based on the GAV approach. The query execution process was developed as part of an XML-based data integration system, called Integra, proposed in [8]. We focus our discussion on the problem of how to decompose a user query into a set of remote data source queries using a set of mediation mappings. The remainder of this paper is organized as follows. Section 2 provides some definitions. Section 3 describes the query execution process. Section 4 explains our approach to query reformulation. Section 5 presents some implementation issues. Section 6 discusses some related works and Section 7 presents our conclusions. 2. SOME DEFINITIONS To facilitate the query execution process and other relevant tasks of the Integra system a conceptual data model, called X-Entity [9], and a declarative internal language called XEQ (X-Entity Query language) [3] were proposed. In this section we give an overview of the main concepts proposed in the X-Entity model and in the XEQ language. We also present the formalism adopted to represent the mappings between the mediation schema and the source schemas. 2.1 X-Entity Model Integra uses XML as the common data model for data exchange and integration. To represent the mediation schema and the source schemas Integra adopts the XML Schema language. To provide a high-level abstraction for information described in an XML schema a conceptual data model, called X-Entity model [9], was proposed. The X-Entity model is not a new formalism for conceptual modeling, rather it is an extension of the Entity Relationship model [2], i.e., it uses some basic features of the ER model and extends it with some additional ones to better represent XML schemas. The main concept of the X-Entity model is the entity type, which represents the structure of XML elements composed by other elements and attributes. In the X-Entity model, relationship types represent element-subelement relationships and 498

2 references between elements. In the following, we present the main X-Entity model constructs used to create X-Entity schemas. An X-Entity schema S is denoted by S = (E, R), where E is a set of entity types and R is a set of relationship types. Entity type: an entity type E, denoted by E({A 1,,A n },{R 1,,R m }), is made up of an entity name E, a set of attributes A 1,,A n and a set of relationships R 1,,R m. An entity type represents a set of elements with a complex structure, composed by attributes and other elements (called subelements). An instance of an entity type is a particular element in the source XML document. Each entity type has attributes {A 1,,A n } that describe it. An attribute A i represents either an attribute or a subelement, which is not composed by other elements or attributes. In X-Entity diagrams, entity types are displayed as rectangles. Containment relationship: a containment relationship between two entity types E and E 1, specifies that each instance of E contains instances of E 1. It is denoted by R(E, E 1,(min,max)), where R is the relationship name and (min, max) defines the minimum and the maximum number of instances of E 1 that can be associated with an instance of E. In X-Entity diagrams, containment relationships are displayed as diamond-shaped boxes labeled with contains. The straight lines connecting the relationship with the participating entities are directed from the entity E to the entity E 1. Reference relationship: a reference relationship, denoted by R(E 1, E 2,{A 11,,A 1n }, {A 21,, A 2n }), where R is the name of the relationship, specifies that the entity type E 1 references the entity type E 2. {A 11,, A 1n } and {A 21,, A 2n } represent the referencing attributes between entities of E 1 and E 2 such that the value of A 1i, 1 i n, in any entity of E 1 must match a value of A 2i, 1 i n, in some entity of E 2. In X-Entity diagrams reference relationships are represented as a diamond-shaped box labeled with refers. The straight lines connecting the relationship with the participating entities are directed to the referenced entity. Figure 2 presents three examples of X-Entity schemas. The entity type book m, for example, has two attributes: title m and publisher m, and one containment relationship, which specifies that an instance of book m has at least one subelement and may have unlimited occurrences of the subelement chapter m. The entity type chapter 1 has a reference relationship, which specifies that an instance of chapter 1 may be associated with an instance of book 1. The attributes book 1 _title 1 and title 1 are the referencing attributes between chapter 1 and book 1, respectively. Lóscio [8] have described and implemented the process of converting an XML Schema to an X-Entity schema. 2.2 XEQ Language Queries submitted to a data integration system are, in most cases, defined using declarative query languages or templates. The great majority of such languages are intended to human comprehension, which leads to difficulties in the query decomposition [4] and translation [7]. In order to minimize the complexity of such problems we proposed XEQ (X-Entity Query language), an XMLbased language, used to provide an internal query representation for the Integra system [3]. Considering that the queries submitted to the Integra system are specified according to a mediation schema defined in the X-Entity model, a XEQ expression uses X-Entity concepts to represent the queried data. The simplest conceptual representation of a XEQ expression involves naming one entity (i.e. the parent entity), providing a list of attributes to be returned, as well as using an expression to constrain this entity. We may also define expressions that return values for subentities of the parent entity. A subentity of an entity type E is an entity type E which is associated with E through a containment relationship. Queries may associate parent entities and their referenced entities. A referenced entity type of an entity type E is an entity type E which is associated with E through a reference relationship. The basic element of a XEQ expression is the SELECT element. The SELECT element has one attribute, called ENTITY, whose value is the name of the entity type E being queried. The SELECT element may be composed by other elements, as: i) a REFERENCE element that defines the join condition used to combine related elements from the entity type E and a referenced entity type E, ii) ATTRIBUTE elements that have in their content the name of an attribute of the entity type E whose value is to be retrieved by the query, iii) a WHERE element that filters the data to be retrieved by the query. To illustrate the use of the XEQ internal language we present, in Figure 1, an XQuery query and its corresponding XEQ expression. User query: Retrieve the title, year, the title of the chapters and the publisher name of all books. XQuery <books> { for $b in //book 2 return <book 2> <title 2>$b/title 2 </title 2> <year 2> $b/year 2 </year 2> <chapter 2> $b/chapter 2/chapter_title 2 </chapter 2> for $p in //publisher 2 where $b/ref_publisher 2 = $p/id_publisher 2 return <publisher 2> $p/publisher_name 2 </publisher 2> </book 2> }</books 2> XEQ <SELECT ENTITY="book 2"> <ATTRIBUTE>title 2</ATTRIBUTE> <ATTRIBUTE>year 2</ATTRIBUTE> <SELECT ENTITY="chapter 2"> <ATTRIBUTE> chapter_title 2 </ATTRIBUTE> <SELECT ENTITY="publisher 2"> <ATTRIBUTE> publisher_name 2 </ATTRIBUTE> <REFERENCE NAME="ref_book_publisher"> <MAP ATTRIBUTE="ref_publisher 2"> id_publisher 2 </MAP> </REFERENCE> </EXP> Figure 1. User query example In the example of Figure 1, consider the X-Entity schema S 2 of Figure 2. During the query translation to XEQ the query is parsed and each one of the entities being queried is identified and represented using a SELECT element. In the example above, book 2 is the entity whose values should be retrieved. Since chapter 2 is a subentity of book 2, to retrieve such information it should be used a nested SELECT element. It is important to observe that in the SELECT element associated with the referenced entity publisher 2, there is a MAP element. This element indicates how the join operation between the entity book 2 and the 499

3 entity publisher 2 should be done in order to retrieve the value of the publisher_name 2 attribute. 2.3 Correspondence assertions Our approach to query reformulation is widely based on correspondence assertions [11] specifying relationships between the mediation schema and the source schemas. In this section, we describe just the correspondence assertions relevant to this paper. Other correspondence assertions may be found in [8]. Definition 1 (Entity correspondence assertion): Let E 1 and E 2 be entity types. The correspondence assertion E 1 E 2 specifies that E 1 and E 2 are semantically equivalent, i.e, they describe the same real world concept and they have the same semantics. Definition 2(Attribute correspondence assertion): Let A 1 be an attribute of the entity type E 1 and A 2 an attribute of the entity type E 2, such that E 1 E 2. The correspondence assertion E 1.A 1 E 2.A 2 specifies that the attributes A 1 and A 2 are semantically equivalent (correspond to the same concept in the real world). Definition 3 (Link): Let X 1 and X 2 be elements of an X-Entity schema (an element can be an entity type, a containment relationship type or an attribute), X 1.X 2 is a link if: i) X 2 is an attribute of the entity type X 1, or ii) X 1 is an entity type participating in the relationship type X 2 (or vice-versa). Definition 4 (Path): If X 1...X n are elements of a schema, such that X i, 1 i n, X i.x i+1 is a link, then X 1.X X n is a path from X 1 (the path is directed from X 1 to X n ). Definition 5 (Inverse path): If there is a path from X 1 denoted by X 1.X X n, such that X n is an entity type, then there is a path denoted by (X 1.X X n ) -1, which is called the inverse path of X 1.X X n. Definition 6 (Path correspondence assertion): Let P 1 and P 2 be two paths: Case 1: P 1 = X 1.X 2....X n and P 2 = Y 1.Y 2...Y m, where X 1 Y 1. The correspondence assertion P 1 P 2 specifies that the entity types X n and Y m are semantically equivalent. Case 2: P 1 = X 1.X 2....X n and P 2 = (Y 1.Y 2...Y n ) -1, where X 1 Y n. The correspondence assertion P 1 P 2 specifies that the entity types X n and Y 1 are semantically equivalent. To better illustrate our ideas, consider the mediation schema and the source schemas presented in Figure 2. Table 1. Correspondence assertions between the mediation schema S med and the source schemas S 1 and S 2 CA 1: book m book 1 CA 2: book m.title m book 1.title 1 CA 3: book m.publisher m book 1.publisher 1 CA 4: chapter m chapter 1 CA 5: chapter m.chapter_title m chapter 1.chapter_title 1 CA 6: book m.book m_chapter m.chapter m (chapter 1.chapter_ref_book 1.book 1) -1 CA 7: book m book 2 CA 8: book m.title m book 2.title 2 CA 9: chapter m chapter 2 CA 10: book m.book m_chapter m.chapter m book 2.book 2_chapter 2.chapter 2 CA 11: chapter m.chapter_title m chapter 2.chapter_title 2 CA 12: book m.publisher m book 2.book 2_ref_publisher 2.publisher 2.publisher_name 2 3. QUERY EXECUTION PROCESS The query execution consists in receiving a user query, expressed in a declarative query language, and in returning a query answer expressed in XML as output. The query execution process [3], described in Figure 3, may be summarized as follows: i) The mediator receives a user query Q and performs the necessary translation and reformulation to produce the set of local XEQ expressions (q 1, q 2,, q n ), ii) Next, the mediator sends these expressions to the corresponding wrappers which translate them to the source query language producing the local queries (q 1, q 2,, q n ), iii) The wrappers receive answers for such queries (r 1, r 2,, r n ) from the data sources, these answers are translated to XML and sent to the mediator and iv) Finally, the mediator integrates the XML answers (r 1, r 2,, r n ) and returns the integrated result R as the answer to the original user query Q. The components responsible for the query execution process are presented in Figure 3 and described next. Mediator Knowledge Base Data Sources Knowledge Base User Query Q Query translator Query reformulator Query interface Data integrator ( q 1, q 2 n ),..., ( r 1, r 2,..., r n ) q Source manager R Mediation schema converter Mediator Query manager Log Cache manager Cache S med title m publisherm S 1 title 1 book m chapter_title m (1,N) contains chapter m chapter_title 1 book 1_title 1 q 1 Wrapper1 Translation Translation XEQ - SQL Relations - XML q 1 r 1 r 1 q 2 r 1 q r n n Wrapper2 Wrappern Translation Translation Translation Translation XEQ - OQL Objects - XML XEQ - XQuery XML - XML q 2 r 2 q n r n publisher1 book 1 refers chapter 1 Relational Database Object Oriented Database XML Documents S 2 title 2 year 2 ref_publisher 2 book 2 (1,N) contains chapter chapter_title 2 2 refers publisher 2 id_publisher 2 publisher_name 2 Figure 2. Mediation schema S med and source schemas S 1 and S 2 Table 1 presents the correspondence assertions (CA) between S med and the source schemas S 1 and S 2. Figure 3. Query execution process The Query Interface takes as input the user query expressed using a declarative query language and it returns an XML document as the answer to this query. The Query Translator receives a query written in a high level query language (XQuery for example) from the query interface and translates this query to the XEQ internal representation. The Query Reformulator takes a XEQ expression as input and produces as output a set of XEQ source expressions. The Source Manager receives as input a set of XEQ source expressions and sends them to the wrapper 500

4 associated with the corresponding data source. It receives from the wrappers the answers for the local queries already converted to XML. The Mediation Schema Converter takes as input the XML data received from the wrappers and converts them to the mediation schema. This translation is necessary to facilitate the further data integration process. Transformation functions are used to convert data format to be conform to the mediation schema. The Data Integrator takes as input a set of XML documents and transforms them in a single integrated document. 4. QUERY REFORMULATION In this section, we describe the basic algorithm that reformulates a XEQ expression over the mediation schema into a set of XEQ expressions over the source schemas. Let Q be the original user query and X Q be the expression obtained after the translation of Q to the XEQ representation. There are three phases in the algorithm: i) Identification of the mediation entity (E m ) being queried: this entity is specified in the most external SELECT element of the X ii) Identification of data sources relevant to answer the user query: a data source DS i is relevant to answer the user query Q if it contains one or more source entities necessary to compute the mediation entity E m, i.e., source entities semantically equivalent to E m, iii) Reformulation phase: this phase consists in generating a XEQ expression for each one of the data sources relevant to answer Q. This is done using the algorithms presented in Figure 4. Next consider: - T/@ENTITY: value of the ENTITY attribute of the SELECT element T - T/ATTRIBUTE: value of an ATTRIBUTE element of the SELECT element T - T/WHERE: value of a WHERE element of the SELECT element T - A/text( ): retrieves the content of the ATTRIBUTE element A - ECA(S i, E m ): retrieves the entity correspondence assertion C E m E j, where E j S i.e (S i.e is the set of entity types of the source achema S i ) - ACA(S i, A m ): retrieves the attribute correspondence assertion C A m A j, where A j E.A and E S i.e (E.A is the set of attributes of the entity type E ) - X 1 Φ X 2 : concatenation operator that joins two XEQ expressions X 1 and X 2 During the third phase of the query reformulation process the ReformulateQuery algorithm is executed for each one of the relevant data sources {DS 1,, DS n }. The first task of this algorithm consists in reformulating the element (X/Exp/SELECT) that defines the mediation entity E m being queried. To do this, the algorithm calls the procedure ReformulateSelect. This procedure recursively analyses the nested SELECT elements of T (T = X/Exp/SELECT) in order to identify the subentities {E 1,, E n } and attributes {A 1,, A k } of the mediation entity E m such that T/@entity = E m. For each mediation entity E j the algorithm ReformulateSelect obtains the correspondence assertion which specifies how to compute E i over the data source DS i. In the same way for each attribute A j, the algorithm obtains the correspondence assertion which specifies how to compute A j over the data source DS i. The algorithm also analyses WHERE elements to transform the constraints applied on the mediation elements into constraints on the source elements. This transformation is done using the algorithm ReformulateWhere. XEQ elements of the resulting source expression are created by the algorithm AddPathToQuery. Due to space limitations, in this paper we present just the ReformulateQuery and ReformulateSelect algorithms. More information about the other algorithms may be found in [3]. Algorithm ReformulateQuery(X Q, S i) {S i is the X-Entity schema of the data source DS i } T := X Q/Exp/SELECT {Exp is root element of a XEQ expression} L Si := new XEQ expression P := empty path ReformulateSelect(L si, T, S i, P) Return L Si End Algorithm ReformulateSelect(X Si, T, S i, P) {X Si is the XEQ expression corresponding to the data source S i } P:= P + T/@ENTITY NS:= AddPathtoQuery(X Si, ECA(S i, P)) For each A = T/ATTRIBUTE do AddPathtoQuery(EXP LS, ACA(S i, P + A/text())) For each T = T/SELECT do ReformulateSelect(X Si, T, S i, P) If (W = T/WHERE) then X Si:= X Si Φ ReformulateWhere(NS, W, S i, P) End Figure 4. Query reformulation Algorithms To illustrate the query reformulation process consider the user query Q 1 over the mediation schema S med and its corresponding XEQ expression X 1 presented in Figure 5. As specified by the most external SELECT element of X 1 we have that the mediation entity being queried is book m (X 1 /Exp/Select/@ENTITY = book m ). Next, based on the set of correspondence assertions presented in Table 1 the algorithm identifies the relevant data sources to compute book m. According to the correspondence assertions CA 1 (CA 1 :book m book 1 ) and CA 7 (CA 7 :book m book 2 ) the data sources relevant to compute book m are DS 1 and DS 2, since book 1 and book 2 are source entities from the schemas S 1 and S 2 respectively. Therefore, the algorithm ReformulateQuery must be executed for these two data sources. User query Q 1: Retrieve the title, publisher and the title of the chapters all books. X1(XEQ) <SELECT ENTITY="book m"> <ATTRIBUTE> title m </ATTRIBUTE> <ATTRIBUTE> publisher m </ATTRIBUTE> <SELECT ENTITY="chapter m"> <ATTRIBUTE> chapter_title m </ATTRIBUTE> XS 1(XEQ) 1. 2.<SELECT ENTITY="book 1"> 3. <ATTRIBUTE>title 1</ATTRIBUTE> 4. <ATTRIBUTE>publisher 1</ATTRIBUTE> 5. <SELECT ENTITY="chapter!"> 6. <ATTRIBUTE>chapter_title 1</ATTRIBUTE> 7. <REFERENCE NAME="ref_chapter 1_book 1"> 8. <MAP ATTRIBUTE="book 1_title 1"> title 1 </MAP> 9. </REFERENCE> Figure 5. User query In the first step, the algorithm generates the source expression XS 1, presented in Figure 5, for the data source DS 1. The SELECT element of line 2 was generated from the correspondence assertion CA 1. Lines 3 and 4 were generated based on the correspondence assertions CA 2 and CA 3. To compute the subentity chapter m of book m the algorithm created a nested select element (line 5) according to the correspondence assertion CA 6. As chapter 1 is 501

5 associated with book 1 through a refers relationship then it was used a reference element to identify this restriction. For the data source DS 2 the algorithm Reformulate generates the XEQ expression XS 2 similar to the XEQ expression XS IMPLEMENTATION ISSUES An Integra prototype was implemented allowing the execution of the whole query process: from the submission of a user query to the answer of the integrated results. Wrappers were implemented to translate queries from XEQ to the data sources native language. In the initial version of the prototype, only wrappers for relational databases were implemented. During the tests, we used two real databases that stores medical information: Healthnet 1 and TELEMED 2, both implemented in MySQL. In the current version of the prototype, the mediation schema and the correspondences between the mediation schema and the source schemas are manually defined and stored in the Mediator Knowledge Base. In the Integra prototype, the query is translated to SQL using a XSLT stylesheet, and it is submitted to the data source through a JDBC driver. 6. RELATED WORK The work presented in [4] states and solves the query decomposition problem for XML publishing in a general setting. It allows mixed (XML and relational) storage of proprietary data and exploits redundancies (materialized views, indexes and caches) to enhance performance. Other problem faced during query processing consists in translating data source queries to the native query language. To improve the query translation process, data integration systems use languages based on scripts or templates as the input format for wrappers [7]. There are also systems that use XML query languages as input format for wrappers. The e-xml [5], MARS [4] and XPERANTO [10] systems provide integration of multiple data sources based on XML schemas.automed [1] adopts a functional language, called IQL, to provide a common query language where queries written in high level query languages can be translated into and out of. In our approach, the user query may be specified in a high level query language. This query is translated to XEQ and it can be easily rewritten into a set of local subqueries. As XEQ is an XML-based language we can take advantage of XML benefits as the capability to easily transform XML data into different formats and to easily navigate through its hierarchical structure. 7. CONCLUSION In this paper, we focused on the problem of how to process a user query submitted to an XML-based data integration system. To solve this problem we proposed X-Entity, an ER-based conceptual data model, and XEQ, an XML-based language, whose specification facilitates both query decomposition and translation. The process of decomposing or translating a XEQ expression to other query languages consists in performing a recursive navigation that visits the expression elements and constructs the query result. In this paper we did not consider the problem of generating a query execution plan nor the problem of Database of a brazilian hospital (Hospital Português Recife) query optimization. As a future work, we intend to study such problems in order to improve our proposed approach. 8. ADDITIONAL AUTHORS Juliano F. de Souza, jsf@cin.ufpe.br, Center for Informatics Federal University of Pernambuco, P.O. Box 7851 Cidade Universitária, Recife - PE, Brazil, , REFERENCES [1] Boyd, M., Kittivoravitkul, S., Lazanitis, C., McBrien, P., Rizopoulos, N. AutoMed: A BAV Data Integration System for Heterogeneous Data Sources. In Proceedings of the 16 th Advanced Information Systems Engineering Conference (CAiSE) (Riga, Latvia, June 7-11, 2004). Lecture Notes in Computer Science, Springer, 2004, [2] Chen, P.P. The Entity-Relationship Model: Toward a unified view of data. ACM Transactions on Database Systems, 1 (Mar. 1976), [3] Costa, T., A. O Gerenciador de Consultas de um Sistema de Integração de Dados. Master Thesis. Universidade Federal de Pernambuco, Recife, Brazil, [4] Deutsch, A. and Tannen, V. Reformulation of XML Queries and Constraints. Proceedings of the 9 th International Conference on Database Theory (ICDT) (Siena, Italy, January 8-10, 2003). Lecture Notes in Computer Science, Springer, 2002, [5] Gardarin, G., Mensch, A., Tomasic, A. An Introduction to the e-xml Data Integration Suite. In Proceedings of the 7th International Conference on Extending Database Technology (EDBT) (Konstanz, Germany, March 27-31, 2000). Lecture Notes in Computer Science, Springer, 2000, [6] Halevy, Y. Theory of answering queries using views. SIGMOD Record, 29 (Dec. 2000), [7] Hammer J., Brennig, M., Garcia-Molina, H., Nesterov, S., Vassalos, V. and Yerneni, R. Template based wrappers in the tsimmis system. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Tucson, Arizona, May 13-15, 1997). ACM Press, 1997, [8] Lóscio, B. F. Managing the Evolution of XML-based Mediation Queries. PhD Thesis. Universidade Federal de Pernambuco, Recife, Brazil, [9] Lóscio, B. F., Salgado, A. C., Galvão, L. R. Conceptual Modeling of XML Schemas. In Proceedings of the Fifth International Workshop on Web Information and Data Management (WIDM) (New Orleans, Louisiana, November 7-8, 2003). ACM, 2003, [10] Shanmugasundaram, J, Kiernan, J., Shekita, E.J., Fan C., Funderburk J. Querying XML Views of Relational Data. In Proceedings of the 27th International Conference on Very Large Data Bases (VLDB) (Roma, Italy, September 11-14, 2001). Morgan Kaufmann, 2001, [11] Spaccapietra, S. and Parent, C. View integration: a step forward in solving structural conflicts. IEEE Transactions on Knowledge and Data Engineering, 6 (Apr. 1994), [12] Ullman, J. D. Information integration using logical views. In Proceedings of the 6 th International Conference on Database Theory (ICDT) (Delphi, Greece, January 8-10, 1997). Lecture Notes in Computer Science, Springer, 1997,

Integrating Heterogeneous Data Sources Using XML

Integrating Heterogeneous Data Sources Using XML Integrating Heterogeneous Data Sources Using XML 1 Yogesh R.Rochlani, 2 Prof. A.R. Itkikar 1 Department of Computer Science & Engineering Sipna COET, SGBAU, Amravati (MH), India 2 Department of Computer

More information

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen

More information

{mcmb, acs}@cin.ufpe.br ABSTRACT 1. INTRODUCTION 2. APPROACHES OF IQ IN DATA INTEGRATION SYSTEMS

{mcmb, acs}@cin.ufpe.br ABSTRACT 1. INTRODUCTION 2. APPROACHES OF IQ IN DATA INTEGRATION SYSTEMS Information Quality Measurement in Data Integration Schemas Maria da Conceição Moraes Batista, Ana Carolina Salgado Centro de Informática, Universidade Federal de Pernambuco Av. Professor Luis Freire s/n,

More information

Translating WFS Query to SQL/XML Query

Translating WFS Query to SQL/XML Query Translating WFS Query to SQL/XML Query Vânia Vidal, Fernando Lemos, Fábio Feitosa Departamento de Computação Universidade Federal do Ceará (UFC) Fortaleza CE Brazil {vvidal, fernandocl, fabiofbf}@lia.ufc.br

More information

XML DATA INTEGRATION SYSTEM

XML DATA INTEGRATION SYSTEM XML DATA INTEGRATION SYSTEM Abdelsalam Almarimi The Higher Institute of Electronics Engineering Baniwalid, Libya Belgasem_2000@Yahoo.com ABSRACT This paper describes a proposal for a system for XML data

More information

Using Ontologies to Enhance Data Management in Distributed Environments

Using Ontologies to Enhance Data Management in Distributed Environments Using Ontologies to Enhance Data Management in Distributed Environments Carlos Eduardo Pires 1, Damires Souza 2, Bernadette Lóscio 3, Rosalie Belian 4, Patricia Tedesco 3 and Ana Carolina Salgado 3 1 Federal

More information

A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS

A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS Abdelsalam Almarimi 1, Jaroslav Pokorny 2 Abstract This paper describes an approach for mediation of heterogeneous XML schemas. Such an approach is proposed

More information

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS

INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan

More information

Integrating Relational Database Schemas using a Standardized Dictionary

Integrating Relational Database Schemas using a Standardized Dictionary Integrating Relational Database Schemas using a Standardized Dictionary Ramon Lawrence Advanced Database Systems Laboratory University of Manitoba Winnipeg, Manitoba, Canada umlawren@cs.umanitoba.ca Ken

More information

XQuery and the E-xml Component suite

XQuery and the E-xml Component suite An Introduction to the e-xml Data Integration Suite Georges Gardarin, Antoine Mensch, Anthony Tomasic e-xmlmedia, 29 Avenue du Général Leclerc, 92340 Bourg La Reine, France georges.gardarin@e-xmlmedia.fr

More information

OWL based XML Data Integration

OWL based XML Data Integration OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,

More information

Integration of Heterogeneous Databases based on XML

Integration of Heterogeneous Databases based on XML ISSN:2249-5789 Integration of Heterogeneous Databases based on XML Venciya.A Student, Department Of Computer Science And Engineering, SRM University,Kattankulathur, Venciya.a@gmail.com Abstract As companies

More information

Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation

Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation Rajasekar Krishnamurthy University of Wisconsin sekar@cs.wisc.edu Raghav Kaushik Microsoft Corporation skaushi@microsoft.com

More information

BInXS: A Process for Integration of XML Schemata

BInXS: A Process for Integration of XML Schemata BInXS: A Process for Integration of XML Schemata Ronaldo dos Santos Mello 1 and Carlos Alberto Heuser 2 1 Universidade Federal de Santa Catarina, Depto. de Informatica e Estatistica, Cx. Postal 476, Florianopolis,

More information

A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System

A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System Mohammad Ghulam Ali Academic Post Graduate Studies and Research Indian Institute of Technology, Kharagpur Kharagpur,

More information

Constraint-based Query Distribution Framework for an Integrated Global Schema

Constraint-based Query Distribution Framework for an Integrated Global Schema Constraint-based Query Distribution Framework for an Integrated Global Schema Ahmad Kamran Malik 1, Muhammad Abdul Qadir 1, Nadeem Iftikhar 2, and Muhammad Usman 3 1 Muhammad Ali Jinnah University, Islamabad,

More information

Technologies for a CERIF XML based CRIS

Technologies for a CERIF XML based CRIS Technologies for a CERIF XML based CRIS Stefan Bärisch GESIS-IZ, Bonn, Germany Abstract The use of XML as a primary storage format as opposed to data exchange raises a number of questions regarding the

More information

Keywords: Regression testing, database applications, and impact analysis. Abstract. 1 Introduction

Keywords: Regression testing, database applications, and impact analysis. Abstract. 1 Introduction Regression Testing of Database Applications Bassel Daou, Ramzi A. Haraty, Nash at Mansour Lebanese American University P.O. Box 13-5053 Beirut, Lebanon Email: rharaty, nmansour@lau.edu.lb Keywords: Regression

More information

Advantages of XML as a data model for a CRIS

Advantages of XML as a data model for a CRIS Advantages of XML as a data model for a CRIS Patrick Lay, Stefan Bärisch GESIS-IZ, Bonn, Germany Summary In this paper, we present advantages of using a hierarchical, XML 1 -based data model as the basis

More information

RETRATOS: Requirement Traceability Tool Support

RETRATOS: Requirement Traceability Tool Support RETRATOS: Requirement Traceability Tool Support Gilberto Cysneiros Filho 1, Maria Lencastre 2, Adriana Rodrigues 2, Carla Schuenemann 3 1 Universidade Federal Rural de Pernambuco, Recife, Brazil g.cysneiros@gmail.com

More information

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT ODM 106.DATABASE CONCEPTS COURSE OUTLINE 1.0 Introduction This introductory

More information

Caching XML Data on Mobile Web Clients

Caching XML Data on Mobile Web Clients Caching XML Data on Mobile Web Clients Stefan Böttcher, Adelhard Türling University of Paderborn, Faculty 5 (Computer Science, Electrical Engineering & Mathematics) Fürstenallee 11, D-33102 Paderborn,

More information

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609. Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December

More information

Unit 2.1. Data Analysis 1 - V2.0 1. Data Analysis 1. Dr Gordon Russell, Copyright @ Napier University

Unit 2.1. Data Analysis 1 - V2.0 1. Data Analysis 1. Dr Gordon Russell, Copyright @ Napier University Data Analysis 1 Unit 2.1 Data Analysis 1 - V2.0 1 Entity Relationship Modelling Overview Database Analysis Life Cycle Components of an Entity Relationship Diagram What is a relationship? Entities, attributes,

More information

REPORTS IN INFORMATICS

REPORTS IN INFORMATICS REPORTS IN INFORMATICS ISSN 0333-3590 Composing Web Presentations using Presentation Patterns Khalid A. Mughal Yngve Espelid Torill Hamre REPORT NO 331 August 2006 Department of Informatics UNIVERSITY

More information

II. PREVIOUS RELATED WORK

II. PREVIOUS RELATED WORK An extended rule framework for web forms: adding to metadata with custom rules to control appearance Atia M. Albhbah and Mick J. Ridley Abstract This paper proposes the use of rules that involve code to

More information

A Hybrid Approach for Ontology Integration

A Hybrid Approach for Ontology Integration A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve

More information

View-based Data Integration

View-based Data Integration View-based Data Integration Yannis Katsis Yannis Papakonstantinou Computer Science and Engineering UC San Diego, USA {ikatsis,yannis}@cs.ucsd.edu DEFINITION Data Integration (or Information Integration)

More information

A View Integration Approach to Dynamic Composition of Web Services

A View Integration Approach to Dynamic Composition of Web Services A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar University of Southern California/ Information Sciences Institute 4676 Admiralty Way, Marina Del Rey, California 90292

More information

Efficient Virtual Data Integration Based on XML

Efficient Virtual Data Integration Based on XML Efficient Virtual Data Integration Based on XML Konstantin Antipin 1, Andrey Fomichev 2, Maxim Grinev 2, Sergey Kuznetsov 1, Leonid Novak 1, Peter Pleshachkov 2, Maria Rekouts 2, and Denis Shiryaev 1 1

More information

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057

UNIVERSITY OF TRENTO A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM. Albena Roshelova. June 2004. Technical Report # DIT-04-057 UNIVERSITY OF TRENTO DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it A PEER-TO-PEER DATABASE MANAGEMENT SYSTEM Albena Roshelova

More information

Novel Data Extraction Language for Structured Log Analysis

Novel Data Extraction Language for Structured Log Analysis Novel Data Extraction Language for Structured Log Analysis P.W.D.C. Jayathilake 99X Technology, Sri Lanka. ABSTRACT This paper presents the implementation of a new log data extraction language. Theoretical

More information

Component Approach to Software Development for Distributed Multi-Database System

Component Approach to Software Development for Distributed Multi-Database System Informatica Economică vol. 14, no. 2/2010 19 Component Approach to Software Development for Distributed Multi-Database System Madiajagan MUTHAIYAN, Vijayakumar BALAKRISHNAN, Sri Hari Haran.SEENIVASAN,

More information

Data Analysis 1. SET08104 Database Systems. Copyright @ Napier University

Data Analysis 1. SET08104 Database Systems. Copyright @ Napier University Data Analysis 1 SET08104 Database Systems Copyright @ Napier University Entity Relationship Modelling Overview Database Analysis Life Cycle Components of an Entity Relationship Diagram What is a relationship?

More information

Chapter 2 Database System Concepts and Architecture

Chapter 2 Database System Concepts and Architecture Chapter 2 Database System Concepts and Architecture Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Outline Data Models, Schemas, and Instances Three-Schema Architecture

More information

Query Processing in Data Integration Systems

Query Processing in Data Integration Systems Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152

More information

A Data Browsing from Various Sources Driven by the User s Data Models

A Data Browsing from Various Sources Driven by the User s Data Models A Data Browsing from Various Sources Driven by the User s Data Models Guntis Arnicans, Girts Karnitis University of Latvia, Raina blvd. 9, Riga, Latvia {Guntis.Arnicans, Girts.Karnitis}@lu.lv Abstract.

More information

Improving Query Performance Using Materialized XML Views: A Learning-Based Approach

Improving Query Performance Using Materialized XML Views: A Learning-Based Approach Improving Query Performance Using Materialized XML Views: A Learning-Based Approach Ashish Shah and Rada Chirkova Department of Computer Science North Carolina State University Campus Box 7535, Raleigh

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases

More information

Data Integration for XML based on Semantic Knowledge

Data Integration for XML based on Semantic Knowledge Data Integration for XML based on Semantic Knowledge Kamsuriah Ahmad a, Ali Mamat b, Hamidah Ibrahim c and Shahrul Azman Mohd Noah d a,d Fakulti Teknologi dan Sains Maklumat, Universiti Kebangsaan Malaysia,

More information

Application of XML Tools for Enterprise-Wide RBAC Implementation Tasks

Application of XML Tools for Enterprise-Wide RBAC Implementation Tasks Application of XML Tools for Enterprise-Wide RBAC Implementation Tasks Ramaswamy Chandramouli National Institute of Standards and Technology Gaithersburg, MD 20899,USA 001-301-975-5013 chandramouli@nist.gov

More information

Managing the Evolution of XML-based Mediation Queries

Managing the Evolution of XML-based Mediation Queries Universidade Federal de Pernambuco Centro de Informática Pós-Graduação em Ciências da Computação Managing the Evolution of XML-based Mediation Queries by Bernadette Farias Lóscio Tese de Doutorado Recife,

More information

Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model

Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Outline Using High-Level Conceptual Data Models for

More information

A Uniform Approach to Workflow and Data Integration

A Uniform Approach to Workflow and Data Integration A Uniform Approach to Workflow and Data Integration Lucas Zamboulis 1, 2, Nigel Martin 1, Alexandra Poulovassilis 1 1 School of Computer Science and Information Systems, Birkbeck, Univ. of London 2 Department

More information

Accessing Data Integration Systems through Conceptual Schemas (extended abstract)

Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Accessing Data Integration Systems through Conceptual Schemas (extended abstract) Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università

More information

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza

Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems

Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems Dottorato di Ricerca in Ingegneria dell Informazione e sua applicazione nell Industria e nei Servizi Integration and Coordination in in both Mediator-Based and Peer-to-Peer Systems presenter: (pense@inform.unian.it)

More information

CSE 132A. Database Systems Principles

CSE 132A. Database Systems Principles CSE 132A Database Systems Principles Prof. Victor Vianu 1 Data Management An evolving, expanding field: Classical stand-alone databases (Oracle, DB2, SQL Server) Computer science is becoming data-centric:

More information

An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams

An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams An XML Framework for Integrating Continuous Queries, Composite Event Detection, and Database Condition Monitoring for Multiple Data Streams Susan D. Urban 1, Suzanne W. Dietrich 1, 2, and Yi Chen 1 Arizona

More information

Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms

Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms Mohammed M. Elsheh and Mick J. Ridley Abstract Automatic and dynamic generation of Web applications is the future

More information

A View Integration Approach to Dynamic Composition of Web Services

A View Integration Approach to Dynamic Composition of Web Services A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar, Craig A. Knoblock, and José Luis Ambite University of Southern California/ Information Sciences Institute 4676 Admiralty

More information

AN AI PLANNING APPROACH FOR GENERATING BIG DATA WORKFLOWS

AN AI PLANNING APPROACH FOR GENERATING BIG DATA WORKFLOWS AN AI PLANNING APPROACH FOR GENERATING BIG DATA WORKFLOWS Wesley Deneke 1, Wing-Ning Li 2, and Craig Thompson 2 1 Computer Science and Industrial Technology Department, Southeastern Louisiana University,

More information

Grid Data Integration based on Schema-mapping

Grid Data Integration based on Schema-mapping Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

Mapping discovery for XML data integration

Mapping discovery for XML data integration Mapping discovery for XML data integration Zoubida Kedad, Xiaohui Xue Laboratoire PRiSM, Université de Versailles 45 avenue des Etats-Unis, 78035 Versailles, France {Zoubida.kedad, Xiaohui.Xue}@prism.uvsq.fr

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

A Workbench for Prototyping XML Data Exchange (extended abstract)

A Workbench for Prototyping XML Data Exchange (extended abstract) A Workbench for Prototyping XML Data Exchange (extended abstract) Renzo Orsini and Augusto Celentano Università Ca Foscari di Venezia, Dipartimento di Informatica via Torino 155, 30172 Mestre (VE), Italy

More information

SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE

SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE Breno Mansur Rabelo Centro EData Universidade do Estado de Minas Gerais, Belo Horizonte, MG, Brazil breno.mansur@uemg.br Clodoveu Augusto Davis

More information

XML Programming with PHP and Ajax

XML Programming with PHP and Ajax http://www.db2mag.com/story/showarticle.jhtml;jsessionid=bgwvbccenyvw2qsndlpskh0cjunn2jvn?articleid=191600027 XML Programming with PHP and Ajax By Hardeep Singh Your knowledge of popular programming languages

More information

Building Geospatial Ontologies from Geographic Database Schemas in Peer Data Management Systems

Building Geospatial Ontologies from Geographic Database Schemas in Peer Data Management Systems Building Geospatial Ontologies from Geographic Database Schemas in Peer Data Management Systems Danúbia Lima 1, Antonio Mendonça 1, Ana Carolina Salgado 2, Damires Souza 1 1 Federal Institute of Education,

More information

UPROM Tool: A Unified Business Process Modeling Tool for Generating Software Life Cycle Artifacts

UPROM Tool: A Unified Business Process Modeling Tool for Generating Software Life Cycle Artifacts UPROM Tool: A Unified Business Process Modeling Tool for Generating Software Life Cycle Artifacts Banu Aysolmaz 1 and Onur Demirörs 2 1, 2 Informatics Institute, Middle East Technical University, Ankara,

More information

DBMS / Business Intelligence, SQL Server

DBMS / Business Intelligence, SQL Server DBMS / Business Intelligence, SQL Server Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the needs of IT professionals.

More information

Lightweight Data Integration using the WebComposition Data Grid Service

Lightweight Data Integration using the WebComposition Data Grid Service Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed

More information

Architecture and Implementation of an XQuery-based Information Integration Platform

Architecture and Implementation of an XQuery-based Information Integration Platform Architecture and Implementation of an XQuery-based Information Integration Platform Yannis Papakonstantinou Computer Science and Engineering University of California, San Diego yannis@cs.ucsd.edu Vasilis

More information

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN INFORMATION AND COMMUNICATION TECHNOLOGY

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN INFORMATION AND COMMUNICATION TECHNOLOGY THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN INFORMATION AND COMMUNICATION TECHNOLOGY OIT 217.DATABASE CONCEPTS AND DESIGN COURSE OUTLINE

More information

A Tool for Generating Relational Database Schema from EER Diagram

A Tool for Generating Relational Database Schema from EER Diagram A Tool for Generating Relational Schema from EER Diagram Lisa Simasatitkul and Taratip Suwannasart Abstract design is an important activity in software development. EER diagram is one of diagrams, which

More information

Linking BPMN, ArchiMate, and BWW: Perfect Match for Complete and Lawful Business Process Models?

Linking BPMN, ArchiMate, and BWW: Perfect Match for Complete and Lawful Business Process Models? Linking BPMN, ArchiMate, and BWW: Perfect Match for Complete and Lawful Business Process Models? Ludmila Penicina Institute of Applied Computer Systems, Riga Technical University, 1 Kalku, Riga, LV-1658,

More information

IT2304: Database Systems 1 (DBS 1)

IT2304: Database Systems 1 (DBS 1) : Database Systems 1 (DBS 1) (Compulsory) 1. OUTLINE OF SYLLABUS Topic Minimum number of hours Introduction to DBMS 07 Relational Data Model 03 Data manipulation using Relational Algebra 06 Data manipulation

More information

Business Process Configuration with NFRs and Context-Awareness

Business Process Configuration with NFRs and Context-Awareness Business Process Configuration with NFRs and Context-Awareness Emanuel Santos 1, João Pimentel 1, Tarcisio Pereira 1, Karolyne Oliveira 1, and Jaelson Castro 1 Universidade Federal de Pernambuco, Centro

More information

Consistent Answers from Integrated Data Sources

Consistent Answers from Integrated Data Sources Consistent Answers from Integrated Data Sources Leopoldo Bertossi 1, Jan Chomicki 2 Alvaro Cortés 3, and Claudio Gutiérrez 4 1 Carleton University, School of Computer Science, Ottawa, Canada. bertossi@scs.carleton.ca

More information

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

More information

SOLVING SEMANTIC CONFLICTS IN AUDIENCE DRIVEN WEB DESIGN

SOLVING SEMANTIC CONFLICTS IN AUDIENCE DRIVEN WEB DESIGN SOLVING SEMANTIC CONFLICTS IN AUDIENCE DRIVEN WEB DESIGN Olga De Troyer Vrije Universiteit Brussel, WISE Pleinlaan 2 B-1050 Brussel Belgium Olga.DeTroyer@vub.ac.be Peter Plessers Vrije Universiteit Brussel,

More information

IT2305 Database Systems I (Compulsory)

IT2305 Database Systems I (Compulsory) Database Systems I (Compulsory) INTRODUCTION This is one of the 4 modules designed for Semester 2 of Bachelor of Information Technology Degree program. CREDITS: 04 LEARNING OUTCOMES On completion of this

More information

Graphical Web based Tool for Generating Query from Star Schema

Graphical Web based Tool for Generating Query from Star Schema Graphical Web based Tool for Generating Query from Star Schema Mohammed Anbar a, Ku Ruhana Ku-Mahamud b a College of Arts and Sciences Universiti Utara Malaysia, 0600 Sintok, Kedah, Malaysia Tel: 604-2449604

More information

Visual Interfaces for the Development of Event-based Web Agents in the IRobot System

Visual Interfaces for the Development of Event-based Web Agents in the IRobot System Visual Interfaces for the Development of Event-based Web Agents in the IRobot System Liangyou Chen ACM Member chen_liangyou@yahoo.com Abstract. Timely integration and analysis of information from the World-Wide

More information

Data Integration by Bi-Directional Schema Transformation Rules

Data Integration by Bi-Directional Schema Transformation Rules Data Integration by Bi-Directional Schema Transformation Rules Peter M c.brien Dept. of Computing, Imperial College, London SW7 2BZ pjm@doc.ic.ac.uk Alexandra Poulovassilis School of Computer Science and

More information

Flexible Semantic B2B Integration Using XML Specifications

Flexible Semantic B2B Integration Using XML Specifications Flexible Semantic B2B Integration Using XML Specifications Ken Barker Ramon Lawrence Computer Science, University of Calgary and Computer Science, University of Iowa Calgary, Alberta, Canada Iowa City,

More information

Chapter 1: Introduction. Database Management System (DBMS) University Database Example

Chapter 1: Introduction. Database Management System (DBMS) University Database Example This image cannot currently be displayed. Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Database Management System (DBMS) DBMS contains information

More information

Optimization of Sequences of XML Schema Modifications - The ROfEL Approach

Optimization of Sequences of XML Schema Modifications - The ROfEL Approach Optimization of Sequences of XML Schema Modifications - The ROfEL Approach Thomas Nösinger, Meike Klettke, Andreas Heuer Database Research Group University of Rostock, Germany (tn, meike, ah)@informatik.uni-rostock.de

More information

SQLMutation: A tool to generate mutants of SQL database queries

SQLMutation: A tool to generate mutants of SQL database queries SQLMutation: A tool to generate mutants of SQL database queries Javier Tuya, Mª José Suárez-Cabal, Claudio de la Riva University of Oviedo (SPAIN) {tuya cabal claudio} @ uniovi.es Abstract We present a

More information

Integrating Bioinformatic Data Sources over the SFSU ER Design Tools XML Databus

Integrating Bioinformatic Data Sources over the SFSU ER Design Tools XML Databus Integrating Bioinformatic Data Sources over the SFSU ER Design Tools XML Databus Yan Liu, M.S. Computer Science Department San Francisco State University 1600 Holloway Avenue San Francisco, CA 94132 USA

More information

Piazza: Data Management Infrastructure for Semantic Web Applications

Piazza: Data Management Infrastructure for Semantic Web Applications Piazza: Data Management Infrastructure for Semantic Web Applications Alon Y. Halevy Zachary G. Ives Peter Mork Igor Tatarinov University of Washington Box 352350 Seattle, WA 98195-2350 {alon,zives,pmork,igor}@cs.washington.edu

More information

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support

Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Data integration and reconciliation in Data Warehousing: Conceptual modeling and reasoning support Diego Calvanese Giuseppe De Giacomo Riccardo Rosati Dipartimento di Informatica e Sistemistica Università

More information

SQL Server. 2012 for developers. murach's TRAINING & REFERENCE. Bryan Syverson. Mike Murach & Associates, Inc. Joel Murach

SQL Server. 2012 for developers. murach's TRAINING & REFERENCE. Bryan Syverson. Mike Murach & Associates, Inc. Joel Murach TRAINING & REFERENCE murach's SQL Server 2012 for developers Bryan Syverson Joel Murach Mike Murach & Associates, Inc. 4340 N. Knoll Ave. Fresno, CA 93722 www.murach.com murachbooks@murach.com Expanded

More information

On Development of Fuzzy Relational Database Applications

On Development of Fuzzy Relational Database Applications On Development of Fuzzy Relational Database Applications Srdjan Skrbic Faculty of Science Trg Dositeja Obradovica 3 21000 Novi Sad Serbia shkrba@uns.ns.ac.yu Aleksandar Takači Faculty of Technology Bulevar

More information

Database Design Overview. Conceptual Design ER Model. Entities and Entity Sets. Entity Set Representation. Keys

Database Design Overview. Conceptual Design ER Model. Entities and Entity Sets. Entity Set Representation. Keys Database Design Overview Conceptual Design. The Entity-Relationship (ER) Model CS430/630 Lecture 12 Conceptual design The Entity-Relationship (ER) Model, UML High-level, close to human thinking Semantic

More information

Introduction to XML Applications

Introduction to XML Applications EMC White Paper Introduction to XML Applications Umair Nauman Abstract: This document provides an overview of XML Applications. This is not a comprehensive guide to XML Applications and is intended for

More information

Xml Mediator and Data Management

Xml Mediator and Data Management Adaptive Data Mediation over XML Data Hui Lin, Tore Risch, Timour Katchaounov Hui.Lin, Tore.Risch, Timour.Katchaounov@dis.uu.se Uppsala Database Laboratory, Uppsala University, Sweden To be published in

More information

Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures

Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures Declarative Rule-based Integration and Mediation for XML Data in Web Service- based Software Architectures Yaoling Zhu A dissertation submitted in fulfillment of the requirement for the award of Master

More information

IJSER Figure1 Wrapper Architecture

IJSER Figure1 Wrapper Architecture International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May-2014 24 ONTOLOGY BASED DATA INTEGRATION WITH USER FEEDBACK Devini.K, M.S. Hema Abstract-Many applications need to access

More information

not necessarily strictly sequential feedback loops exist, i.e. may need to revisit earlier stages during a later stage

not necessarily strictly sequential feedback loops exist, i.e. may need to revisit earlier stages during a later stage Database Design Process there are six stages in the design of a database: 1. requirement analysis 2. conceptual database design 3. choice of the DBMS 4. data model mapping 5. physical design 6. implementation

More information

Grid Data Integration Based on Schema Mapping

Grid Data Integration Based on Schema Mapping Grid Data Integration Based on Schema Mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/

More information

FiskP, DLLP and XML

FiskP, DLLP and XML XML-based Data Integration for Semantic Information Portals Patrick Lehti, Peter Fankhauser, Silvia von Stackelberg, Nitesh Shrestha Fraunhofer IPSI, Darmstadt, Germany lehti,fankhaus,sstackel@ipsi.fraunhofer.de

More information

Modeling the User Interface of Web Applications with UML

Modeling the User Interface of Web Applications with UML Modeling the User Interface of Web Applications with UML Rolf Hennicker,Nora Koch,2 Institute of Computer Science Ludwig-Maximilians-University Munich Oettingenstr. 67 80538 München, Germany {kochn,hennicke}@informatik.uni-muenchen.de

More information

Using Object And Object-Oriented Technologies for XML-native Database Systems

Using Object And Object-Oriented Technologies for XML-native Database Systems Using Object And Object-Oriented Technologies for XML-native Database Systems David Toth and Michal Valenta David Toth and Michal Valenta Dept. of Computer Science and Engineering Dept. FEE, of Computer

More information

Applying Model Management to Classical Meta Data Problems

Applying Model Management to Classical Meta Data Problems Applying Model Management to Classical Meta Data Problems Philip A. Bernstein Microsoft Research One Microsoft Way Redmond, WA 98052-6399 philbe@microsoft.com Abstract Model management is a new approach

More information

SDMX technical standards Data validation and other major enhancements

SDMX technical standards Data validation and other major enhancements SDMX technical standards Data validation and other major enhancements Vincenzo Del Vecchio - Bank of Italy 1 Statistical Data and Metadata exchange Original scope: the exchange Statistical Institutions

More information

A Framework and Architecture for Quality Assessment in Data Integration

A Framework and Architecture for Quality Assessment in Data Integration A Framework and Architecture for Quality Assessment in Data Integration Jianing Wang March 2012 A Dissertation Submitted to Birkbeck College, University of London in Partial Fulfillment of the Requirements

More information

Analyzing Triggers in XML Data Integration Systems

Analyzing Triggers in XML Data Integration Systems Analyzing Triggers in XML Data Integration Systems Jing Lu, Dunlu Peng, Huan Huo, Liping Gao, Xiaodong Zhu Analyzing Triggers in XML Data Integration Systems Jing Lu 1, Dunlu Peng 1, Huan Huo 1, Liping

More information