Logic Programs for Consistently Querying Data Integration Systems
|
|
|
- Phebe Flynn
- 10 years ago
- Views:
Transcription
1 IJCAI 03 1 Acapulco, August 2003 Logic Programs for Consistently Querying Data Integration Systems Loreto Bravo Computer Science Department Pontificia Universidad Católica de Chile [email protected] Joint work with: Leopoldo Bertossi School of Computer Science Carleton University. Ottawa, Canada [email protected]
2 IJCAI 03 2 Acapulco, August 2003 Presentation Outline Preliminaries Virtual Data Integration Systems Minimal Instances Consistency Logic Programming Specification of Minimal Instances Query Answering over Minimal Instances Computing Consistent Answers Logic Programming Specification of Repairs Retrieving Consistent Answers Conclusions
3 IJCAI 03 3 Acapulco, August 2003 Virtual Data Integration Systems Global schema Global schema R Local Sources Schema Type: open, closed. Contents v Mapping: Global-as-View (GAV) Local-as-View (LAV) V ( X) ϕ V ( X)
4 IJCAI 03 4 Acapulco, August 2003 LAV: tables in local sources are described as (conjunctive) views of the global schema V 1 (Brand, Model, Y ear, Color) Car(Brand, Model, Y ear, Color), Brand = Susuki, Y ear 1990 V 2 (Brand, Model, Country) Car(Brand, Model, Y ear, Color), Open BrandOrigin(Brand, Country)
5 IJCAI 03 5 Acapulco, August 2003 Virtual Global Instances? V i ( X) ϕ V ( X) : LAV mapping D : global database instance ϕ V (D) : extension of V obtained by applying ϕ V to D v i : source contents Since the data sources are open, the legal instances of G are: Linst(G) = { D instance over R v i ϕ Vi (D), i = 1,..., n} The certain answers to a global query Q are those that can be obtained as answers from every legal instance: Certain G (Q)
6 IJCAI 03 6 Acapulco, August 2003 Example 1: Global system G 1 V 1 (X, Y ) R(X, Y ) V 2 (X, Y ) R(Y, X) with v 1 = {(a, b), (c, d)} with v 2 = {(c, a), (e, d)} Global instance D = {(a, b), (c, d), (a, c), (d, e)} is legal v 1 ϕ 1 (D) = {(a, b), (c, d), (a, c), (d, e)} v 2 ϕ 2 (D) = {(b, a), (d, c), (c, a), (e, d)} All supersets of D are legal global; but not the proper subsets of D Query Q: R(X, Y )? Certain G (Q) = {(a, b), (c, d), (a, c), (d, e)}
7 IJCAI 03 7 Acapulco, August 2003 Presentation Outline Preliminaries Virtual Data Integration Systems Minimal Instances Consistency Logic Programming Specification of Minimal Instances Computing Consistent Answers Logic Programming Specification of Repairs Retrieving Consistent Answers Conclusions
8 IJCAI 03 8 Acapulco, August 2003 Example 1 (continued) Consistency V 1 (X, Y ) R(X, Y ) V 2 (X, Y ) R(Y, X) with v 1 = {(a, b), (c, d)} with v 2 = {(c, a), (e, d)} Query Q: R(X, Y )? Certain G (Q) = {(a, b), (c, d), (a, c), (d, e)} Local FDs V 1 : X Y, V 2 : X Y are satisfied in the sources But the global FD R: X Y is not satisfied by legal instance D = {(a, b), (c, d), (a, c), (d, e)} Only (c, d), (d, e) should be consistent answers
9 IJCAI 03 9 Acapulco, August 2003 Several questions arise: What does it mean for G to satisfy global ICs? What are the consistent answers to a global query? How to compute them? These questions were addressed in [Bertossi, Chomicki, Cortés and Gutiérrez; FQAS02] First we briefly review some notions introduced there
10 IJCAI Acapulco, August 2003 A minimal global instance is a legal instance that does not properly contain any other legal instance Mininst(G) := set of minimal instances of G The minimal answers to a query are those that can be contained from every minimal instance: Certain G (Q) Minimal G (Q) For monotone queries they coincide; with negation, possibly not G is consistent wrt ICs if every minimal instance satisfies the ICs
11 IJCAI Acapulco, August 2003 Specification of Minimal Instances Example 2: D = {a, b, c,... } G 2 : V 1 (X, Z) P (X, Y ), R(Y, Z) V 2 (X, Y ) P (X, Y ) {V 1 (a, b)}, {V 2 (a, c)} Inverse Rules [Duschka,Genesereth; SAC97]: P (X, f(x, Z)) V 1 (X, Z) R(f(X, Z), Z) V 1 (X, Z) P (X, Y ) V 2 (X, Y ) Try to use something like this to specify minimal instances...
12 IJCAI Acapulco, August 2003 Answer set program Π(G): 1. Fact dom(a) for every constant a D 2. Fact V i (ā) whenever V i (ā) v i for a source extension v i in G 3. For every view (source) predicate V i with definition V i ( X) P 1 ( X 1 ),..., P n ( X n ), the rule P j ( X j ) V ( X), F i ( X, X i ) X i ( X j \ X) 4. For every predicate F i ( X, X i ) introduced in 3., the rule F i ( X, X i ) V i ( X), dom(x i ), choice(( X), (X i ))
13 IJCAI Acapulco, August 2003 choice(( X), (X i )): non-deterministically chooses a unique value for X i for each value of X [Giannotti,Pedreschi,Sacca,Zaniolo; DOOD 91] Models are the choice models, but the program can be transformed into one with answer sets (stable models) M ininst(g) stable models of Π(G) Linst(G) Queries expressed as logic programs can be answered from the query program together with Π(G) under cautious stable model semantics For monotone queries Q, answers obtained using Π(G) coincide with Certain G (Q) and Minimal G (Q)
14 IJCAI Acapulco, August 2003 Example 2 (continued) D = {a, b, c,... } G 2 : V 1 (X, Z) P (X, Y ), R(Y, Z) V 2 (X, Y ) P (X, Y ) v 1 = {V 1 (a, b)}, v 2 = {V 2 (a, c)} Π(G 2 ) : dom(a)., dom(b)., dom(c).,..., V 1 (a, b)., V 2 (a, c). P (X, Z) V 1 (X, Y ), F 1 (X, Y, Z) R(Z, Y ) V 1 (X, Y ), F 1 (X, Y, Z) P (X, Y ) V 2 (X, Y ) F 1 (X, Y, Z) V 1 (X, Y ), dom(z), choice((x, Y ), (Z))
15 IJCAI Acapulco, August 2003 Example 2 (continued) G 2 : V 1 (X, Z) P (X, Y ), R(Y, Z) {V 1 (a, b)}, V 2 (X, Y ) P (X, Y ) {V 2 (a, c)} Mininst(G) = {{P (a, c), P (a, z), R(z, b)} z {a, b, c,...}} The stable models of SV (Π(G 2 )) are: M b = {dom(a),..., V 1 (a, b), V 2 (a, c), P (a, c), diffchoice 1 (a, b, a), chosen 1 (a, b, b), diffchoice 1 (a, b, c), F 1 (a, b, b), R(b, b), P (a, b)} M a = {dom(a),..., V 1 (a, b), V 2 (a, c), P (a, c), chosen 1 (a, b, a), diffchoice 1 (a, b, b), diffchoice 1 (a, b, c), F 1 (a, b, a), R(a, b), P (a, a)} M c = {dom(a),..., V 1 (a, b), V 2 (a, c), P (a, c), diffchoice 1 (a, b, a), diffchoice 1 (a, b, b), chosen 1 (a, b, c), F 1 (a, b, c), R(c, b)} Here: 1-1 correspondence with M ininst(g)
16 IJCAI Acapulco, August 2003 Example 3: G 3 : V 1 (X) P (X, Y ) V 2 (X, Y ) P (X, Y ) {V 1 (a)} {V 2 (a, c)} Mininst(G 3 ) = {{P (a, c)}} However, the legal global instances corresponding to models of Π(G 3 ) are of the form {{P (a, c), P (a, z)} z D} As V 2 is open, it forces P (a, c) to be in all legal instances, and with this, same condition on V 1 is automatically satisfied, and no other values for Y are needed But the choice operator still has freedom to chose other values (the z D)
17 IJCAI Acapulco, August 2003 We want Π(G) to capture only the minimal instances A revised version of Π(G) is able to detect in which cases it is necessary to use the function predicates. F i ( X, X i ) add V i ( X), dom(x i ), choice(( X), (X i )) where add V i ( X) is true only when the openness of source V i is not satisfied through other views. stable models of Π(G) M ininst(g). This program not only specifies the minimal instances, but can be also used to compute certain answers to monotone queries
18 IJCAI Acapulco, August 2003 Presentation Outline Preliminaries Virtual Data Integration Systems Minimal Instances Consistency Logic Programming Specification of Minimal Instances Computing Consistent Answers Logic Programming Specification of Repairs Retrieving Consistent Answers Conclusions
19 IJCAI Acapulco, August 2003 Computing Consistent Answers In [Bertossi, Chomicki, Cortés and Gutiérrez; FQAS02] (inspired by [Arenas, Bertossi, Chomicki; PODS 99]): A repair of a global system G wrt to global ICs IC is: a global instance that satisfies IC, that minimally differs from a minimal instance (wrt to inclusion of sets of tuples) Repairs IC (G) := set of repairs of G wrt IC A tuple t is a consistent answer to query Q wrt I C if for every D Repairs IC (G): D = Q[ t] Intuitively, consistent answers are invariant under minimal restorations of consistency
20 IJCAI Acapulco, August 2003 Example 1 (continued) Since Mininst(G 1 ) = {{(a, b), (c, d), (a, c), (d, e)}}, G 1 is inconsistent wrt FD : X Y Repairs FD (G 1 ) = {D 1, D 2 } D 1 = {(a, b), (c, d), (d, e)} D 2 = {(c, d), (a, c), (d, e)} Queries: Q(X, Y ): R(X, Y )? (c, d), (d, e) are the consistent answers Q 1 (X): Y R(X, Y )? a is a consistent answer
21 IJCAI Acapulco, August 2003 Specification of Repairs So far: specification of minimal instances of an integration system Minimal instances can be inconsistent In consequence, we want to specify their repairs [Barcelo, Bertossi; NMR 02], [Barcelo, Bertossi; PADL 02]: Specification of the repairs of a single, inconsistent relational database using disjunctive logic programs with stable model semantics We can apply those ideas here...
22 IJCAI Acapulco, August 2003 We can combine the programs that specify the minimal instances and the (single DB) repair programs into a new program Π(G, IC ) that specifies the repairs of an integration system G wrt IC Example 3 (continued) G 3 : V 1 (X) P (X, Y ) V 2 (X, Y ) P (X, Y ) {V 1 (a)}, {V 2 (a, c)} IC : x, y(p (x, y) P (y, x)) Mininst(G 3 ) = {{P (a, c)}}... inconsistent system
23 IJCAI Acapulco, August 2003 Repair Program using DLV syntax: dom(a). dom(c). v1(a). v2(a,c). %begin subprogram for minimal instances P(X,Y,td) :- P(X,Y,v1). P(X,Y,td) :- P(X,Y,to). P(X,Y,nv1) :- P(X,Y,to). addv1(x) :- v1(x), not auxv1(x). auxv1(x) :- P(X,Z,nv1). fz(x,z) :- addv1(x), dom(z), chosenv1z(x,z). chosenv1z(x,z) :- addv1(x), dom(z), not diffchoicev1z(x,z). diffchoicev1z(x,z) :- chosenv1z(x,zz), dom(z), ZZ!=Z. P(X,Z,v1) :- addv1(x), fz(x,z). P(X,Y,v2) :- v2(x,y). P(X,Y,ts) :- P(X,Y,ta), dom(x), dom(y). %begin repair subprogram P(X,Y,ts) :- P(X,Y,td), dom(x), dom(y). P(X,Y,fs) :- dom(x), dom(y), not P(X,Y,td). P(X,Y,fs) :- P(X,Y,fa), dom(x), dom(y). P(X,Y,fa) v P(Y,X,ta) :- P(X,Y,ts), P(Y,X,fs), dom(x), dom(y). P(X,Y,tss) :- P(X,Y,ta), dom(x), dom(y). P(X,Y,tss) :- P(X,Y,td), dom(x), dom(y), not P(X,Y,fa). P(X,Y,fss) :- P(X,Y,fa), dom(x), dom(y). P(X,Y,fss) :- dom(x), dom(y), not P(X,Y,td), not P(X,Y,ta). :- p(x,y,ta), p(x,y,fa).
24 IJCAI Acapulco, August 2003 Repair subprogram annotations Annotation Atom The tuple P (ā) is... td P (ā, td) a fact of the database fd P (ā, fd) not a fact in the database ta P (ā, ta) advised to be made true fa P (ā, fa) advised to be made false ts P (ā, ts) true or becomes true fs P (ā, fs) false or becomes false tss P (ā, tss) it is true in the repair fss P (ā, fss) it is false in the repair
25 IJCAI Acapulco, August 2003 Repair subprogram P(X,Y,ts) :- P(X,Y,ta), dom(x), dom(y). P(X,Y,ts) :- P(X,Y,td), dom(x), dom(y). P(X,Y,fs) :- dom(x), dom(y), not P(X,Y,td). P(X,Y,fs) :- P(X,Y,fa), dom(x), dom(y). P(X,Y,fa) v P(Y,X,ta) :- P(X,Y,ts), P(Y,X,fs), dom(x), dom(y P(X,Y,tss) :- P(X,Y,ta), dom(x), dom(y). P(X,Y,tss) :- P(X,Y,td), dom(x), dom(y), not P(X,Y,fa). P(X,Y,fss) :- P(X,Y,fa), dom(x), dom(y). P(X,Y,fss) :- dom(x), dom(y), not P(X,Y,td), not P(X,Y,ta). :- p(x,y,ta), p(x,y,fa).
26 IJCAI Acapulco, August 2003 Stable models obtained with DLV: M r 1 = {dom(a), dom(c), v1(a), v2(a,c), P(a,c,nv1), P(a,c,v2), P(a,c,td), P(a,c,ts), auxv1(a), P(a,a,fs), P(c,a,fs), P(c,c,fs), P(a,a,fss), P(c,a,ta), P(c,c,fss), P(a,c,tss), P(c,a,ts), P(c,a,tss)} {P (a, c), P (c, a)} M r 2 = {dom(a),dom(c), v1(a), v2(a,c), P(a,c,nv1), P(a,c,v2), P(a,c,td), P(a,c,ts), auxv1(a), P(a,a,fs), P(c,a,fs), P(a,c,fs), P(c,c,fs), P(a,a,fss), P(c,a,fss), P(a,c,fss), P(c,c,fss), P(a,c,fa)} Repair programs specify exactly the repairs of an integration system for universal and simple referential ICs.
27 IJCAI Acapulco, August 2003 Computing Consistent Answers Consistent answers t to a query posed to a global integration system Q( x)? Methodology: 1. Q( P (ū) R( v) ) 2. Q ( x) (Π(Q ), Ans( X)) Q := Q( P (ū, tss) R( v, fss) ) - Π(Q ) is a query program - Ans( X) is a query atom defined in Π(Q ) 3. Run Π := Π(Q ) Π(G, IC) 4. Collect ground atoms Ans( t) {S S is a stable model of Π}
28 IJCAI Acapulco, August 2003 Example 3 (continued) Query Q: P (x, y) 1. Q : P (x, y, tss) 2. Π(Q ): Ans(X, Y ) P (X, Y, tss) 3. Π(G 3, IC ) as before; form Π = Π(G 3, IC ) Π(Q ) 4. The repairs corresponding to the stable models of the program Π extended with query atoms are M r 1 = M r 1 {Ans(a, c), Ans(c, a)}; M r 2 = M r 2 5. No Ans atoms in common, then query has no consistent answers
29 IJCAI Acapulco, August 2003 Conclusions A general specification, under the LAV paradigm, of minimal global instances of an open integration system Certain Answers can be retrieved for monotone queries General approach to specifying the database repairs of a mediated integration system with open sources under the LAV approach The methodology works for conjunctive view definitions, arbitrary universal ICs, simple referential ICs, and queries expressed as Datalog not programs Can be extended to the case of views defined using disjunctions of conjunctive queries (open sources)
30 IJCAI Acapulco, August 2003 Our computations are based on the current implementations of stable models. However we are not interested in the repairs per se, we are interested in the answers to queries. Optimizations are being developed: Select the data of the sources that is relevant to answer the query Magic sets to use the query to guide the computation
31 IJCAI Acapulco, August 2003 Logic Programs for Consistently Querying Data Integration Systems Loreto Bravo Computer Science Department Pontificia Universidad Católica de Chile Joint work with: Leopoldo Bertossi School of Computer Science Carleton University. Ottawa, Canada
32 IJCAI Acapulco, August 2003 Choice s Stable Version The choice models can be obtained as the stable models of the associated program SV (Π(G)), the stable version of Π(G) Replace each choice rule r : H B, choice(( X), (Y )) by H B, chosen r ( X, Y ) Introduce the new rules chosen r ( X, Y ) B, not diffchoice r ( X, Y ) diffchoice r ( X, Y ) chosen r ( X, Y ), Y Y stable models of SV (Π(G)) choice models of Π(G)
33 IJCAI Acapulco, August 2003 Revised Version Π(G) revisited, to capture only the minimal instances: Annotation Atom The tuple P (ā) is... t d P (ā, t d ) an atom of the minimal legal instances t o P (ā, t o ) is an obligatory atom in all the minimal legal instances v i P (ā, v i ) an optional atom introduced to satisfy the openness of view v i nv i P (ā, nv i ) an optional atom introduced to satisfy the openness of view that is not v i
34 IJCAI Acapulco, August Fact dom(a) for every constant a D 2. Fact V (ā) whenever V (ā) v i for some source extension v i in G. 3. For every view (source) predicate V i in the system with description V i ( X) P 1 ( X 1 ),..., P n ( X n ): a) For every P k with no existential variables, the rules P k ( X k, t o ) V i ( X) k = 1,... n b) For every set S ij of predicates of the description s body that are related by common existential variables (Z 1,..., Z m ), the rules, add vij ( X ) V i ( X), not aux vij ( X ) where X = X { P k S ij X k }
35 IJCAI Acapulco, August 2003 aux vij ( X ) m i=1 var v 1 Z l ( X Zl ) var v1 Z l ( X Zl ) P k S ij Z l X k P k ( X k ) where X Zl = { P k S ij Z l X k X k } for l = 1,... m P k ( X k, v ij ) add vij ( X ), Z l ( X k \ X ) F l( X, Z l ), for P k S ij. 4. For every predicate F l ( X, Z l ) introduced in 3.b., the rules, F l ( X, Z l ) add vij Z l ( X ), dom(z l ), choice(( X ), (Z l )). add vij Z l ( X ) add vij ( X ), not aux vij Z l ( X ) for l = 1,... m
36 IJCAI Acapulco, August 2003 aux vij Z l ( X ) var v1 Z l ( X Zl ), Z k Z l F k ( X, Z k ) for l = 1,... m 5. For every global relation P ( X) the rules P ( X, nv ij ) P ( X, v hk ) for {(ij, hk) P ( X) S ij and S hk } P ( X, nv ij ) P ( X, t o ) for {(ij) P ( X) S ij } P ( X, t d ) P ( X, v ij ) for {(ij) P ( X) S ij } P ( X, t d ) P ( X, t o ) stable models of SV (Π(G)) choice models of Π(G) Mininst(G).
37 IJCAI Acapulco, August 2003 Logic Specification of Repairs Annotation Atom The tuple P (ā) is... t d P (ā, t d ) a fact of the database f d P (ā, f d ) not a fact in the database t a P (ā, t a ) adviced to be made true f a P (ā, f a ) adviced to be made false t P (ā, t ) true or becomes true f P (ā, f ) false or becomes false t P (ā, t ) it is true in the repair f P (ā, f ) it is false in the repair
38 IJCAI Acapulco, August 2003 Example 6: A full inclusion dependency IC : x( P (x) Q(x)) An inconsistent database instance DB = {P (a)} Database atoms become facts of the repair program: 1. P (a, t d ) Whatever was true (false) or becomes true (false), gets annotated with t (f ): 2. P (x, t ) P (x, t a ). P (x, t ) P (x, t d ). P (x, f ) P (x, f a ). P (x, f ) not P (x, t d ).... the same for Q...
39 IJCAI Acapulco, August P (x, f a ) Q(x, t a ) P (x, t ), Q(x, f ). One rule per IC; it says how to repair the IC Having passed to annotations t and f keeps repairing the IC if it becomes violated due to the repair of a different IC The repair process must be coherent, this is captured by the denial program constraints 4. P ( x, t a ), P ( x, f a ). Q( x, t a ), Q( x, f a ).
40 IJCAI Acapulco, August 2003 Annotations constants t and f are used to read off the literals that are inside (outside) a repair 5. P (x, t ) P (x, t a ). P (x, t ) P (x, t d ), not P (x, f a ). P (x, f ) P (x, f a ). P (x, f ) not P (x, t d ), not P (x, t a ).... the same for Q... These rules are used to interpret the models as database repairs
41 IJCAI Acapulco, August 2003 The program has two stable models: - {P (a, t d ), P (a, t ), Q(a, f ), Q(a, t a ), P (a, t ), Q(a, t ), Q(a, t )} That corresponds to DB 1 = {P (a), Q(a)} - {P (a, t d ), P (a, t ), P (a, f ), Q(a, f ), P (a, f ), Q(a, f ), P (a, f a )} That corresponds to DB 2 = Theorem: For universal IC, there is a one to one correspondence between the stable models of the repair program and the repairs of DB wrt IC
42 IJCAI Acapulco, August 2003 Computing Minimal Answers Minimal Answers t to a query posed to a global integration system Q( x)? Methodology: 1. Q( P (ū) R( v) ) 2. Q ( x) (Π(Q ), Ans( X)) Q := Q( P (ū, t d ) R( v, t d ) ) - Π(Q ) is a query program - Ans( X) is a query atom defined in Π(Q ) 3. Run Π := Π(Q ) Π(G) 4. Collect ground atoms Ans( t) {S S is a stable model of Π}
43 IJCAI Acapulco, August 2003 Example 3 (continued) Query Q: P (x, y) 1. Q : P (x, y, t d ) 2. Π(Q ): Ans(X, Y ) P (X, Y, t d ) 3. Π = Π(G 3 ) Π(Q ) 4. The stable model of the program Π extended with query atoms is: M 1 = M 1 {Ans(a, c) }; 5. Ans(a, c) is the minimal answer to the query.
Query Answering in Peer-to-Peer Data Exchange Systems
Query Answering in Peer-to-Peer Data Exchange Systems Leopoldo Bertossi and Loreto Bravo Carleton University, School of Computer Science, Ottawa, Canada {bertossi,lbravo}@scs.carleton.ca Abstract. The
Data Quality in Information Integration and Business Intelligence
Data Quality in Information Integration and Business Intelligence Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies
Virtual Data Integration
Virtual Data Integration Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada www.scs.carleton.ca/ bertossi [email protected] Chapter 1: Introduction and Issues 3 Data
Consistent Query Answering in Databases Under Cardinality-Based Repair Semantics
Consistent Query Answering in Databases Under Cardinality-Based Repair Semantics Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada Joint work with: Andrei Lopatenko (Google,
HANDLING INCONSISTENCY IN DATABASES AND DATA INTEGRATION SYSTEMS
HANDLING INCONSISTENCY IN DATABASES AND DATA INTEGRATION SYSTEMS by Loreto Bravo A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of the requirements for the degree
Repair Checking in Inconsistent Databases: Algorithms and Complexity
Repair Checking in Inconsistent Databases: Algorithms and Complexity Foto Afrati 1 Phokion G. Kolaitis 2 1 National Technical University of Athens 2 UC Santa Cruz and IBM Almaden Research Center Oxford,
Data Integration: A Theoretical Perspective
Data Integration: A Theoretical Perspective Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, I 00198 Roma, Italy [email protected] ABSTRACT
Computational Methods for Database Repair by Signed Formulae
Computational Methods for Database Repair by Signed Formulae Ofer Arieli ([email protected]) Department of Computer Science, The Academic College of Tel-Aviv, 4 Antokolski street, Tel-Aviv 61161, Israel.
XML Data Integration
XML Data Integration Lucja Kot Cornell University 11 November 2010 Lucja Kot (Cornell University) XML Data Integration 11 November 2010 1 / 42 Introduction Data Integration and Query Answering A data integration
Query Processing in Data Integration Systems
Query Processing in Data Integration Systems Diego Calvanese Free University of Bozen-Bolzano BIT PhD Summer School Bressanone July 3 7, 2006 D. Calvanese Data Integration BIT PhD Summer School 1 / 152
Data Integration. Maurizio Lenzerini. Universitá di Roma La Sapienza
Data Integration Maurizio Lenzerini Universitá di Roma La Sapienza DASI 06: Phd School on Data and Service Integration Bertinoro, December 11 15, 2006 M. Lenzerini Data Integration DASI 06 1 / 213 Structure
A Tutorial on Data Integration
A Tutorial on Data Integration Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Antonio Ruberti, Sapienza Università di Roma DEIS 10 - Data Exchange, Integration, and Streaming November 7-12,
Schema Mediation in Peer Data Management Systems
Schema Mediation in Peer Data Management Systems Alon Y. Halevy Zachary G. Ives Dan Suciu Igor Tatarinov University of Washington Seattle, WA, USA 98195-2350 {alon,zives,suciu,igor}@cs.washington.edu Abstract
Big Data Cleaning. Nan Tang. Qatar Computing Research Institute, Doha, Qatar [email protected]
Big Data Cleaning Nan Tang Qatar Computing Research Institute, Doha, Qatar [email protected] Abstract. Data cleaning is, in fact, a lively subject that has played an important part in the history of data
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen
RELATIONAL DATABASE DESIGN
RELATIONAL DATABASE DESIGN g Good database design - Avoiding anomalies g Functional Dependencies g Normalization & Decomposition Using Functional Dependencies g 1NF - Atomic Domains and First Normal Form
Data Integration Using ID-Logic
Data Integration Using ID-Logic Bert Van Nuffelen 1, Alvaro Cortés-Calabuig 1, Marc Denecker 1, Ofer Arieli 2, and Maurice Bruynooghe 1 1 Department of Computer Science, Katholieke Universiteit Leuven,
A Fast and Efficient Method to Find the Conditional Functional Dependencies in Databases
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 3, Issue 5 (August 2012), PP. 56-61 A Fast and Efficient Method to Find the Conditional
On the Decidability and Complexity of Query Answering over Inconsistent and Incomplete Databases
On the Decidability and Complexity of Query Answering over Inconsistent and Incomplete Databases Andrea Calì Domenico Lembo Riccardo Rosati Dipartimento di Informatica e Sistemistica Università di Roma
Relational Database Design
Relational Database Design To generate a set of relation schemas that allows - to store information without unnecessary redundancy - to retrieve desired information easily Approach - design schema in appropriate
Peer Data Exchange. ACM Transactions on Database Systems, Vol. V, No. N, Month 20YY, Pages 1 0??.
Peer Data Exchange Ariel Fuxman 1 University of Toronto Phokion G. Kolaitis 2 IBM Almaden Research Center Renée J. Miller 1 University of Toronto and Wang-Chiew Tan 3 University of California, Santa Cruz
A View Integration Approach to Dynamic Composition of Web Services
A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar, Craig A. Knoblock, and José Luis Ambite University of Southern California/ Information Sciences Institute 4676 Admiralty
Theory behind Normalization & DB Design. Satisfiability: Does an FD hold? Lecture 12
Theory behind Normalization & DB Design Lecture 12 Satisfiability: Does an FD hold? Satisfiability of FDs Given: FD X Y and relation R Output: Does R satisfy X Y? Algorithm: a.sort R on X b.do all the
SPARQL: Un Lenguaje de Consulta para la Web
SPARQL: Un Lenguaje de Consulta para la Web Semántica Marcelo Arenas Pontificia Universidad Católica de Chile y Centro de Investigación de la Web M. Arenas SPARQL: Un Lenguaje de Consulta para la Web Semántica
Integrating Data from Possibly Inconsistent Databases
Integrating Data from Possibly Inconsistent Databases Phan Minh Dung Department of Computer Science Asian Institute of Technology PO Box 2754, Bangkok 10501, Thailand [email protected] Abstract We address
Chapter 17 Using OWL in Data Integration
Chapter 17 Using OWL in Data Integration Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Riccardo Rosati, and Marco Ruzzi Abstract One of the outcomes of the research work carried
Robust Module-based Data Management
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. V, NO. N, MONTH YEAR 1 Robust Module-based Data Management François Goasdoué, LRI, Univ. Paris-Sud, and Marie-Christine Rousset, LIG, Univ. Grenoble
(LMCS, p. 317) V.1. First Order Logic. This is the most powerful, most expressive logic that we will examine.
(LMCS, p. 317) V.1 First Order Logic This is the most powerful, most expressive logic that we will examine. Our version of first-order logic will use the following symbols: variables connectives (,,,,
Business Intelligence: Recent Experiences in Canada
Business Intelligence: Recent Experiences in Canada Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies 2 Business Intelligence
! " # The Logic of Descriptions. Logics for Data and Knowledge Representation. Terminology. Overview. Three Basic Features. Some History on DLs
,!0((,.+#$),%$(-&.& *,2(-$)%&2.'3&%!&, Logics for Data and Knowledge Representation Alessandro Agostini [email protected] University of Trento Fausto Giunchiglia [email protected] The Logic of Descriptions!$%&'()*$#)
Integrating and Exchanging XML Data using Ontologies
Integrating and Exchanging XML Data using Ontologies Huiyong Xiao and Isabel F. Cruz Department of Computer Science University of Illinois at Chicago {hxiao ifc}@cs.uic.edu Abstract. While providing a
Cassandra. References:
Cassandra References: Becker, Moritz; Sewell, Peter. Cassandra: Flexible Trust Management, Applied to Electronic Health Records. 2004. Li, Ninghui; Mitchell, John. Datalog with Constraints: A Foundation
A View Integration Approach to Dynamic Composition of Web Services
A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar University of Southern California/ Information Sciences Institute 4676 Admiralty Way, Marina Del Rey, California 90292
Reasoning Under Uncertainty: Variable Elimination Example
Reasoning Under Uncertainty: Variable Elimination Example CPSC 322 Lecture 29 March 26, 2007 Textbook 9.5 Reasoning Under Uncertainty: Variable Elimination Example CPSC 322 Lecture 29, Slide 1 Lecture
CS143 Notes: Normalization Theory
CS143 Notes: Normalization Theory Book Chapters (4th) Chapters 7.1-6, 7.8, 7.10 (5th) Chapters 7.1-6, 7.8 (6th) Chapters 8.1-6, 8.8 INTRODUCTION Main question How do we design good tables for a relational
Completing Description Logic Knowledge Bases using Formal Concept Analysis
Completing Description Logic Knowledge Bases using Formal Concept Analysis Franz Baader, 1 Bernhard Ganter, 1 Barış Sertkaya, 1 and Ulrike Sattler 2 1 TU Dresden, Germany and 2 The University of Manchester,
CHAPTER 7 GENERAL PROOF SYSTEMS
CHAPTER 7 GENERAL PROOF SYSTEMS 1 Introduction Proof systems are built to prove statements. They can be thought as an inference machine with special statements, called provable statements, or sometimes
Structured and Semi-Structured Data Integration
UNIVERSITÀ DEGLI STUDI DI ROMA LA SAPIENZA DOTTORATO DI RICERCA IN INGEGNERIA INFORMATICA XIX CICLO 2006 UNIVERSITÉ DE PARIS SUD DOCTORAT DE RECHERCHE EN INFORMATIQUE Structured and Semi-Structured Data
Data exchange. L. Libkin 1 Data Integration and Exchange
Data exchange Source schema, target schema; need to transfer data between them. A typical scenario: Two organizations have their legacy databases, schemas cannot be changed. Data from one organization
Enterprise Modeling and Data Warehousing in Telecom Italia
Enterprise Modeling and Data Warehousing in Telecom Italia Diego Calvanese Faculty of Computer Science Free University of Bolzano/Bozen Piazza Domenicani 3 I-39100 Bolzano-Bozen BZ, Italy Luigi Dragone,
CS2Bh: Current Technologies. Introduction to XML and Relational Databases. The Relational Model. The relational model
CS2Bh: Current Technologies Introduction to XML and Relational Databases Spring 2005 The Relational Model CS2 Spring 2005 (LN6) 1 The relational model Proposed by Codd in 1970. It is the dominant data
Logic in general. Inference rules and theorem proving
Logical Agents Knowledge-based agents Logic in general Propositional logic Inference rules and theorem proving First order logic Knowledge-based agents Inference engine Knowledge base Domain-independent
Schema Mappings and Data Exchange
Schema Mappings and Data Exchange Phokion G. Kolaitis University of California, Santa Cruz & IBM Research-Almaden EASSLC 2012 Southwest University August 2012 1 Logic and Databases Extensive interaction
Designing and Refining Schema Mappings via Data Examples
Designing and Refining Schema Mappings via Data Examples Bogdan Alexe UCSC [email protected] Balder ten Cate UCSC [email protected] Phokion G. Kolaitis UCSC & IBM Research - Almaden [email protected]
Computational Logic and Cognitive Science: An Overview
Computational Logic and Cognitive Science: An Overview Session 1: Logical Foundations Technical University of Dresden 25th of August, 2008 University of Osnabrück Who we are Helmar Gust Interests: Analogical
Relational Database Design: FD s & BCNF
CS145 Lecture Notes #5 Relational Database Design: FD s & BCNF Motivation Automatic translation from E/R or ODL may not produce the best relational design possible Sometimes database designers like to
First-Order Stable Model Semantics and First-Order Loop Formulas
Journal of Artificial Intelligence Research 42 (2011) 125-180 Submitted 03/11; published 10/11 First-Order Stable Model Semantics and First-Order Loop Formulas Joohyung Lee Yunsong Meng School of Computing,
Relational Databases
Relational Databases Jan Chomicki University at Buffalo Jan Chomicki () Relational databases 1 / 18 Relational data model Domain domain: predefined set of atomic values: integers, strings,... every attribute
The Relational Model. Why Study the Relational Model?
The Relational Model Chapter 3 Instructor: Vladimir Zadorozhny [email protected] Information Science Program School of Information Sciences, University of Pittsburgh 1 Why Study the Relational Model?
Normalization in Database Design
in Database Design Marek Rychly [email protected] Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems 14
Enforcing Data Quality Rules for a Synchronized VM Log Audit Environment Using Transformation Mapping Techniques
Enforcing Data Quality Rules for a Synchronized VM Log Audit Environment Using Transformation Mapping Techniques Sean Thorpe 1, Indrajit Ray 2, and Tyrone Grandison 3 1 Faculty of Engineering and Computing,
Schema Refinement, Functional Dependencies, Normalization
Schema Refinement, Functional Dependencies, Normalization MSCI 346: Database Systems Güneş Aluç, University of Waterloo Spring 2015 MSCI 346: Database Systems Chapter 19 1 / 42 Outline 1 Introduction Design
A first step towards modeling semistructured data in hybrid multimodal logic
A first step towards modeling semistructured data in hybrid multimodal logic Nicole Bidoit * Serenella Cerrito ** Virginie Thion * * LRI UMR CNRS 8623, Université Paris 11, Centre d Orsay. ** LaMI UMR
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification
Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Outline More Complex SQL Retrieval Queries
Cooperating Answer Set Programming
Cooperating Answer Set Programming Davy Van Nieuwenborgh 1,, Stijn Heymans 2, and Dirk Vermeir 1 1 Dept. of Computer Science Vrije Universiteit Brussel, VUB Pleinlaan 2, B1050 Brussels, Belgium {dvnieuwe,
Certamen 1 de Representación del Conocimiento
Certamen 1 de Representación del Conocimiento Segundo Semestre 2012 Question: 1 2 3 4 5 6 7 8 9 Total Points: 2 2 1 1 / 2 1 / 2 3 1 1 / 2 1 1 / 2 12 Here we show one way to solve each question, but there
Monitoring Metric First-order Temporal Properties
Monitoring Metric First-order Temporal Properties DAVID BASIN, FELIX KLAEDTKE, SAMUEL MÜLLER, and EUGEN ZĂLINESCU, ETH Zurich Runtime monitoring is a general approach to verifying system properties at
The core theory of relational databases. Bibliography
The core theory of relational databases Slide 1 La meilleure pratique... c est une bonne théorie Bibliography M.Levene, G.Loizou, Guided Tour of Relational Databases and Beyond, Springer, 625 pages,1999.
