A Tale of Two Approaches: Query Performance Study of XML Storage Strategies in Relational Databases
|
|
|
- Edith Dickerson
- 9 years ago
- Views:
Transcription
1 A Tale of Two Approaches: Performance Study of XML Storage Strategies in Relational Databases Sandeep Prakash Sourav S. Bhowmick School of Computer Engineering, Nanyang Technological University, Singapore Abstract. Several recent papers have investigated a relational approach to store XML data and there is a growing evidence that schema-conscious approaches are a better option than schema-oblivious techniques as far as query performance is concerned. This paper studies three strategies for storing XML document including one representing schema-conscious approach (Shared- Inlining ) and two representing schema-oblivious approach (XParent and Sucxent++). We implement and evaluate each approach using benchmark non-recursive XQueries. Our analysis shows an interesting fact that schemaconscious approaches are not always a better option than schema-oblivious approaches! In fact, it is possible for a schema-oblivious approach (Sucxent++) to be faster than a schema-conscious approach (Shared-Inlining ) for 55% of the benchmark queries (the highest observed factor being 87.8 times). Sucxent++ also outperforms XParent by up to 17 times. 1 Introduction Recently, there has been a substantial research effort in storing and processing XML data using relational backends. This approach either shred the XML documents into relational tables using some sort of encoding scheme [1, 5, 7, 8, 12, 15, 18, 19] or use a BLOB column to store the XML document [6, 2, 21]. BLOB-based approaches use stored procedures to invoke an external XPath/X processor. The main advantages of this approach are fast retrieval of full XML document and ability of storing any XML document irrespective of the availability of the schema. However, in this paper, we focus on shredding-based approaches as in the BLOB-based approaches, usually the entire XML document must be brought into memory before processing, severely limiting the size of the data and optimization possibilities. The shredding-based approaches can be classified into two major categories: the schema-oblivious and the schema-conscious approaches. In brief, the schemaoblivious method consists of a fixed schema which is used to store XML documents. This approach does not require existence of an XML schema/dtd. Some examples of schema-oblivious approaches are Edge approach [7], XRel [18], XParent [8], Sucxent++[12]. The schema-conscious method, on the other hand, derives a relational schema based on the DTD/XML schema of the XML documents. Examples of such approaches are Shared-Inlining [15] and LegoDB [1]. Once XML data is stored using either schema-conscious or schema-oblivious approach, an X/XPath is
2 translated into SQL for evaluation. A comprehensive review of methods for XML-to- SQL query translation and their limitations is beyond the scope of this paper and can be found in [9]. In this paper, we study the performance of schema-conscious and schema-oblivious approaches and compare these alternatives. In particular, we compare the performance of one schema-conscious approach (Shared-Inlining [15]) with two representative schema-oblivious approaches (XParent[8] and Sucxent++[12]). At this point, one would question the justification of this work as a growing body of research suggests that schema-conscious approaches perform better than schema-oblivious approaches. After all, it has been demonstrated in [16] that schema-conscious approaches generally perform substantially better in terms of query processing and storage size. However, we believe that the superiority of schema-conscious approaches (such as Shared-Inlining) as demonstrated in [16] may not hold anymore! This is primarily due to the following reasons. The Edge approach[7] was used in [16] as the representative schema-oblivious approach for comparison. Although the Edge approach is a pioneering relational approach, we argue that it is not a good representation of the schema-oblivious approach as far as query processing is concerned. In fact, XParent [8] and XRel [18] have been shown to outperform the Edge approach by up to 2 times, with XParent outperforming XRel [8, 11]. This does not necessarily mean that XParent outperforms schema-conscious approaches. However, it does raise the question whether there exists a schema-oblivious approach in the literature that can outperform XParent significantly and consequently outperform a schema-conscious approach? In [12], we provided answer to this question by presenting a novel schema-oblivious approach called Sucxent++ (Schema Unconscious XML Enabled System) that outperforms XParent by up to 15 times and a schema-conscious approach (Shared-Inlining) by up to 8 times for certain types of recursive XML queries. A recursive XML query is an XML query which contains the descendant axis (//) [1]. Although our effort in [12] asserts that it is indeed possible for a schema-oblivious approach to outperform the schema-conscious approach, it was demonstrated only for recursive XML queries. Naturally, we would like to know whether this result holds for non-recursive XML queries. Note that a non-recursive XML query does not contain any descendant axis. Such queries are prevalent in GUI-based XML query formulation framework in the presence of DTDs/XML schemas. In other words, is it possible for Sucxent++ to outperform XParent and Shared-Inlining for non-recursive XML queries as well? In this paper, we provide the answer to this question. 2 Related Work One of the earliest work on performance evaluation of XML storage strategies was carried out by Tian et al. [16]. In this paper five strategies for storing XML documents were studied including one that stores documents in the file system, three that uses a relational database system (Edge and Attribute approaches [7], Shared- Inlining [15]), and one that uses an object manager. Recently, Lu et al. [11] reported results on benchmarking six relational approaches on two commercial RDBMS and two native XML database systems using the XMark[14] and XMach[2] benchmarks.
3 Both these efforts showed that schema-conscious approaches perform better than schema-oblivious approaches. Our study differs from the above efforts in the following ways. First, we consider a relatively more efficient schema-oblivious storage strategy (Sucxent++) compared to the approaches used in [16, 11] for our study. Second, we evaluate the performance of our representative systems on much larger data sets (1GB) having richer variety (data and text-centric single and multiple documents). Note that the maximum size of the data set was 14 MB and 15MB in [16] and [11], respectively. This gives us a better insight on the scalability of the representative approaches. Third, we experimented with wider and richer variety of XML queries. Finally, contrary to the insight gained by Tian et al. and Lu et al., we show that it is indeed possible for a schema-oblivious approach to outperform a schema-conscious approach for certain types of non-recursive XML queries. XMach-1 [2] is a multi-user benchmark that is based on a Web application and considers text documents and catalog data. The goal of the benchmark is to test how many queries per second a database can process at what cost. Additional measures include response times, bulk load times and database or index sizes. XMark [14] is a single-user benchmark. The database model is based on the Internet auction site and therefore, its database contains one large XML document with text and non-text data. XOO7 [3] was derived from OO7 [4], which was designed to test the efficiency of object-oriented DBMS. Besides mapping the original queries of OO7 into XML, XOO7 adds some specific XML queries. Workloads of the above benchmarks, cover different functionalities, but leave out a number of X features [17]. XBench [17] was recently proposed to cover all of X functionalities as captured by XML Use Cases. As our work is based on XBench data set and queries, we also cover all these functionalities. In summary, all of these are application benchmarks and measure the overall performance of a DBMS. That is, they compare the performance of different RDBMSs (e.g., IBM DB2, SQL Server in XBench) and native XML DBMS (e.g., X-Hive) for processing XML data. On the contrary, we focus on evaluating performance of schema-oblivious and schema-conscious approaches on a specific commercial RDBMS. 3 Background In this section, we present the framework for our performance study. We begin by identifying the representative schema-oblivious and schema-conscious systems chosen for our performance study and justify their inclusion. Then, we present the experimental setup and data sets used for our study. 3.1 Representative Systems We chose XParent[8], Sucxent++[13], and Shared-Inlining [15] as representative shredding-based approaches for performance study. The reasons are as follows. First, we intend to ensure that the implementation of our selected storage scheme does not require modification of the relational engine. This is primarily because such approach enhances portability and ease of implementation of the storage approach on top of an off-the-shelf commercial RDBMS. Consequently, we did not choose the dynamic intervals approach [5] and the approach in [19] as these approaches enhance
4 TC (Text-centric) DC (Data-centric) SD MD (Single Document) (Multiple Document) Online dictionaries E-commerce catalogs (a) Data set Digital libraries, news corpus Transactional data Data set DC/MD DC/SD TC/MD TC/SD No of Nodes 1MB 1MB 1GB 219,382 2,183,331 23,821, ,26 2,394,886 24,81, ,258 2,335,18 23,74, ,4 2,765,29 28,419,13 (b) Data set of XBench Fig. 1. Data set of XBench. the relational engine with XML-specific primitives for efficient execution. Note that in the absence of such XML-specific primitives, the query processing cost can be expensive in these approaches. For instance, without the special relational operators defined in [5], the query performance is likely to be inferior even for simple path expressions [9]. Second, the selected approach must have good query performance. Jiang et al. [8] showed that XParent outperforms Edge [7] (up to 2 times) and XRel [18] approaches significantly. In [12], we have shown that Sucxent++ outperforms XParent (up to 15 times) and Shared-Inlining [15] (up to 8 times) for certain types of recursive XML queries. Moreover, XParent takes 2.5 times more storage space compared to the Sucxent++ approach. Hence, XParent and Sucxent++ are chosen as representatives of the schema-oblivious approach. Finally, we chose Shared-Inlining over LegoDB [1] as the representative of schema-conscious approaches for the following reasons. First, Shared-Inlining is widely used in the literature as a representative of schema-conscious approaches. Second, unlike Shared-Inlining, LegoDB is application and query workload-dependent. Third, despite our best efforts (including contacting the authors), we could not get the source code of LegoDB. 3.2 Experimental Setup Prototypes for Sucxent++, XParent and Shared-Inlining were developed using Java JDK 1.5 and a commercial RDBMS 1. The experiments were conducted on a P4 1.4GHz machine with 256MB of RAM and a 4GB (72rpm) IDE hard disk. The operating system was Windows 2 Professional. The XBench [17] data set was used for comparison of storage size, insertion and extraction times, as it provides a comprehensive range of XML document types. We studied both data-centric and text-centric applications consisting of single as well as multiple XML documents. We also test for scalability (small (1MB), normal (1 MB) and large (1 GB) dataset) of the schema-conscious and schema-oblivious approaches. Figures 1(a) and 1(b) summarize the characteristics of the data sets used. In our experiments, we create separate database instances for all the scenarios. For example, we create three database instances for TC/SD, called TC/SD-small, TC/SD-normal, and TC/SD-large. A total of 22 queries as described in [17] were tested on this data set. The list of queries and their characteristics is shown in Figures 2 and 3. Observe that we focus on non-recursive XML queries. The queries were executed in the reconstruct mode where not only the internal nodes are selected, but also all descendants of that node. Several steps were taken to 1 Our licensing agreement disallows us from naming the product.
5 # for $order in input()/order[@id="1"] $order/customer_id for $a in input()/order[@id="2"] $a/order_lines/order_line[1] for $a in input()/order[@id="4"] $a//item_id Characteristics, ordered access - One // axis 4 for $a in input()/order where $a/total gt Sort order by $a/ship_type - Return multiple elements {$a/@id}{$a/order_date}{$a/ship_type} 5 6 for $a in input()/order[@id="6"] $a for $order in input()/order, $cust in input()/customers/customer where $order/customer_id = $cust/@id and $order/@id = "7" {$order/@id}{$order/ order_status}{$cust/first_name}{$cust/ last_name}{$cust/phone_number} - Document reconstruction - Join - Multiple elements # 1 Fig. 2. XBench queries. for $item in input()/catalog/ item[@id="i1"] $item for $item in input()/catalog/:item where every $add in $item/authors/author/ contact_information/mailing_address satisfies $add/name_of_country = "Canada" $item for $a in input()/catalog/item[@id="i6"] {$a/authors/author[1]/ contact_information/mailing_address} for $a in input()/catalog/:item where $a/date_of_release gt " " and $a/date_of_release lt " " and empty($a/publisher/ contact_information/fax_number) {$a/publisher/name} Characteristics - DCSD - DCSD - Quantification - DCSD - Document construction - DCSD - Data type casting - Irregular data ensure the consistency of the test results. Prior to our experiments, we ensure that statistics had been collected, allowing well-informed plan selection. Each test query was executed 6 times while the performance results of the first run discarded. A 95% confidence interval was computed. In our experiments, the estimated error was small (< 1%). The bufferpool of the DBMS was cleared before each run to ensure times reported are from cold runs. Also, appropriate indexes were constructed for all the three approaches through a careful analysis on the benchmark queries. Note that the RDBMS we have used can only create an index for varchar, which is less than 255. Hence, in schema-oblivious approaches, if a single text in the value attribute exceeds the limit, which is highly possible for text-centric XML documents, then we cannot create an index on this attribute to facilitate XML query processing. Similar to [11], we use a two-table approach to handle short/long data values separately. 4 Data-Centric Processing In this section we study the performance of the representative approaches (Shared- Inlining, Sucxent++, and XParent) for XML queries on data-centric XML documents. The reader may refer to [15], [12, 13], and [8] for algorithmic details related to XML query translation and evaluation. Note that we do not discuss storage size, document insertion and extraction performance here as we have already presented this in [13]. In general, Sucxent++ is times faster than XParent and marginally better than Shared-Inlining as far as insertion time is concerned. Sucxent++ is also the most efficient among the three approaches as far as document extraction is concerned. On the other hand, Shared-Inlining requires the least amount of storage. XParent takes 2.5 times more storage space compared to Sucxent++. We use the queries Q1 to Q11 in Figures 2 and 3 for our performance study. The results are shown in Figure 4. The maximum time we allowed a query to run for all
6 # for $size in input()/catalog/:item/ attributes/size_of_book where $size/length*$size/width*$size/ height > 5 {$size/../../title} for $prolog in input()/article/prolog where $prolog/authors/author/name="ben Yang" $prolog/title for $a in input()/article[@id="8"]/body/ section[@heading="introduction"] <HeadingOfSection> {$a/@heading} </HeadingOfSection> for $a in input()/article where some $b in $a/body/abstract/p satisfies (contains($b, "the") and contains($b, "hockey")) $a/prolog/title for $a in input()/article[@id="5"] {$a/prolog/title} {$a/prolog/authors/author/name} {$a/prolog/dateline/date} {$a/body/abstract} Characteristics - DCSD - Data type casting - Exact match - Two conditions -Quantification - Multiple paths # for $a in input()/article/prolog/ authors/author where empty($a/contact/text()) <NoContact> {$a/name} </NoContact> for $a in input()/article where contains ($a//p, "the hockey") {$a/prolog/title} {$a/body/abstract} for $ent in input()/dictionary/e where $ent/hwg/hw="the" $ent for $ent in input()/dictionary/e where $ent/*/hw = "and" $ent/ss/s/qp/*/qt for $a in input()/dictionary/e [hwg/hw="the"]/ss/s/qp/q order by $a/qd {$a/a}{$a/qd} for $a in input()/dictionary/e where contains ($a, "hockey") Return $a/hwg/hw for $ent in input()/dictionary/e where $ent/ss/s/qp/q/qd="19" $ent/hwg/hw Fig. 3. XBench queries (contd.). Characteristics - Irregular data - Text search - One // axis - TCSD - Reconstruction - TCSD - Path expressions - TCSD - Sorting - TCSD - Text search - TCSD - Deep path Execution Time (s) Shared Inlining XParent SUCXENT++ Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q1 Q11 Execution Time (s) Shared Inlining XParent SUCXENT++ Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q1 Q11 Execution Time (s) Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q1 Q11 Shared Inlining XParent SUCXENT++ (a) 1 MB (b) 1 MB Fig. 4. performance (data-centric). (c) 1 GB experiments was 6 minutes; if the execution did not finish in that period, we do not show it in the figures. Note that we use the optimized version of Sucxent++ for our performance study. In non-optimized Sucxent++, the join between the Path and PathValue tables took a significant portion of the query processing time. The optimized version of Sucxent++ reduces this overhead by altering the translated SQL queries. The join expression v1.pathid = p1.id and p1.pathexp = path in a translated SQL query is replaced with v1.pathid = n where n is the PathId value corresponding to path in the table Path. Similarly, v1.pathid = p1.id and p1.pathexp LIKE path % is replaced with v1.pathid >= n and v1.pathid <= m. For the second case PathIds are assigned in lexicographic order and (n, m) correspond to the first and last occurrences of expressions that have the prefix path. Furthermore, the optimized version also incorporates strategies to optimize recursive path expressions. The reader may refer to [12, 13] for further details. We now elaborate on the performance results. Q1 [Exact match (DCMD)]: For all three data sizes Shared-Inlining performs the best. This can be explained based on the relational schema generated for
7 Shared-Inlining and the SQL query corresponding to query Q1. The Shared- Inlining version involves a single predicate on a single table. All other approaches involve several joins and predicates. Observe that XParent performs marginally better than Sucxent++ for the 1MB data set. This is due to two reasons. First, Sucxent++ involves θ-joins whereas XParent use equi-joins. Second, the data set has several small documents instead of one large document. The translations of both schema-oblivious approaches involve a join of the DocId attribute to filter nodes in the same document. This generates much smaller join sizes and the results are particularly obvious for equi-join based queries. However, for the 1MB and 1GB data sets Sucxent++ performs better than XParent by up to 41.7 times. This is due to the following reasons. First, XParent incurs a greater number of joins on a much larger data set. Second, the optimization strategies in Sucxent++ reduce the number of path joins significantly. This reduces the advantage of equi-joins to a great extent as the data size increases. Queries Q2, Q3 [Exact match and ordered access (DCMD)]: Sucxent++ outperforms both Shared-Inlining and XParent for these queries as data size increases (see Figure 6 also). This is because for Shared-Inlining as data size increases the cost of join operation to extract order line and item id (especially with descendant axis) from other tables increases. Sucxent++ performs better than XParent due to its more optimal storage strategy (as discussed above). Q4 [Sort and multiple elements (DCMD)]: Shared-Inlining performs better than the other two approaches. The main observation here is that Sucxent++ significantly outperforms XParent (up to 97 times) with the increase in data size. This is because the number of joins in the translated SQL statement increases with the number of predicates or clause elements in the X query. Observe that this query has quite a few elements (in addition to a sort operation). As a result, XParent requires a greater number of joins (and on larger data) than Sucxent++. Q5 [Document reconstruction (DCMD)]: Shared-Inlining performs better as evaluation is faster due to smaller (fragmented) tables. But the cost of join becomes higher with increasing data size and therefore the gap between Shared-Inlining and Sucxent++ decreases. Q6 [Join and multiple elements (DCMD)]: Sucxent++ now performs better than both Shared-Inlining and XParent (up to 1629 times better than XParent). This can be attributed to the fact that even Shared-Inlining has to execute more joins for this query as it involves an X join. XParent is outperformed significantly by other two approaches due to larger number of joins. Q7 [Exact match (DCSD)]: Shared-Inlining performs significantly better for smaller data size than the other two approaches. However, the gap between Sucxent++ and Shared-Inlining decreases with the increase in data size (reasons are similar to the above discussion). Observe that XParent outperforms Sucxent++ for the small data set. However, Sucxent++ performs better than XParent for the 1 MB and 1GB data sets. This is because, unlike the DCMD version of this query, the join on the DocId attribute no longer generates small join sizes and the advantages of equi-joins are negated by the large data size in XParent.
8 Execution Time (s) Shared Inlining XParent SUCXENT Execution Time (s) Shared Inlining XParent SUCXENT Execution Time (s) Shared Inlining XParent SUCXENT++ Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q2 Q21 Q22 Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q2 Q21 Q22 Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q2 Q21 Q22 (a) 1 MB (b) 1 MB Fig. 5. performance (text-centric). (c) 1 GB Q8 [Quantification (DCSD)]: This query is quite complex as it involves quantification in the form of the every clause. The Shared-Inlining approach performs the best even though it has to execute quite a few joins as well. This is because the execution of the quantification clause is much better in the Shared-Inlining approach. Sucxent++ performs significantly better than XParent. In fact, XParent failed to results in 6 minutes for the 1GB data set and are, therefore, not included in the figures. Q9 [Document Construction (DCSD)]: Shared-Inlining outperforms Sucxent++ for 1MB and 1MB data sets. With its inherent greater data fragmentation one would expect Shared-Inlining to perform the worst. However, the query involves a predicate and the construction is only for a small part of the document - with the entire part available in a single table. As a result Shared- Inlining performs better. Particularly, for smaller data size the join cost to extract child elements is not significant enough for Shared-Inlining. Cost of is lower for Shared-Inlining due to smaller tables. However, as the data size increases the cost of the join increases. As a result, the overall performance difference is not as significant with the increase in data size. In fact, for 1GB data set Sucxent++ marginally outperforms Shared-Inlining. Note that the mapping strategies in [15] do not allow mapped documents to be faithfully reconstructed [9]. Hence, the resulting mapping may be lossy. Q1 [Irregular Data (DCSD)]: For Shared-Inlining, all the data required for this query could be found in just one table. Therefore, Shared-Inlining performs the best. Also, the empty clause is quite easily implemented by using = null in the corresponding SQL statement. The translation for schema-oblivious approaches is quite complicated as there is no notion of "null" in these two approaches. Both approaches implement this query using the SQL not in clause. As a result the performance is much worse than that of Shared-Inlining. Sucxent++ as expected outperforms XParent. In fact, XParent failed to complete execution in 6 minutes for the 1 MB and 1 GB data sets. Q11 [Datatype Casting (DCSD)]: Shared-Inlining can implement datatype casting in a much cleaner fashion than schema-oblivious approaches. Schemaoblivious approaches (in this case at least) store all data as strings in a single column. Shared-Inlining uses several different columns and each can be assigned a specific datatype. As a result, the query performance in Shared-Inlining is much better. In the schema-oblivious approach, data has to be first filtered based on the path
9 expression and only then can it be typecast. Note that this query has five path expressions. As a result the number of joins in XParent is significantly higher than in Sucxent++. Therefore, Sucxent++ significantly outperforms XParent by up to 17 times. 5 Text-centric Processing Text search plays a very important role in XML document systems. In this section we study the performance of the representative approaches for XML queries on text-centric XML documents. We use the queries Q12 to Q22 in Figure 3 for our performance study. The results are shown in Figure 5. Note that for fair comparison we use the default text processing support of the RDBMS and do not build any additional full text indexes on the underlying RDBMS. Q12 [Exact match (TCMD)]: The performance characteristics for this query are quite similar to those of Q1 except that the difference between Shared- Inlining and the schema-oblivious approaches is not a significant as in the datacentric case (Q1 to Q3). This is because the query in the Shared-Inlining approach requires two tables (compared to one in Q1). Queries [Q13 to Q15 (TCMD)]: For Q13, the evaluation "introduction" is faster for Shared-Inlining. The join operation to extract child elements is not very expensive for this query. For larger data sizes Shared- Inlining outperforms Sucxent++ mainly due to the smaller table that needs to be queried for For Sucxent++ the Path- Value table becomes quite large for larger data sets and hence it takes longer time. Q14 has the contains clause which requires all child elements to be queried. This involves a large number of joins for Shared-Inlining. Consequently, Sucxent++ performs better. Shared-Inlining performs better for Q15 for the same reasons as discussed for Q13. Q16 [Irregular Data (TCMD):] Unlike Q1, this query has only two path expressions. The join cost for Shared-Inlining increases significantly with the increase in data size (especially in the absence of an exact predicate) due to several steps in the path expressions. As a result, even though both the schemaoblivious approaches perform worse than Shared-Inlining for smaller data size, Sucxent++ performs better for 1GB data set. Q17 [Text search (TCMD)]: Shared-Inlining performs significantly worse than the schema-oblivious approaches. This is due to the recursive nature of the query. In Shared-Inlining, the resolution of the path expression /article//p requires the use of the UNION operator on three sub-queries. In addition, one of the sub-queries requires a join across several tables. Due to reasons discussed earlier, Sucxent++ performs better than XParent. Q18 [Exact match/reconstruction (TCSD)]: Sucxent++ again performs the best. Unlike in Q9, Shared-Inlining has to join four tables to reconstruct the element e. Queries Q19, Q2 [Path expressions (TCSD)]: Sucxent++ performs better than all the other approaches. XParent performs the worse due to the multiple path expressions in the query. Shared-Inlining performs slightly worse due to
10 Shared-Inlining / SUCXENT++ XParent / SUCXENT++ 1MB 1MB 1GB 1MB 1MB 1GB Q Q Q Q Q Q Q Q DNF Q Q DNF DNF Q Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q2 Q21 Q22 Shared-Inlining / SUCXENT++ XParent / SUCXENT++ 1MB 1MB 1GB 1MB 1MB 1GB DNF DNF Fig. 6. Performance summary. multiple joins in the resulting SQL query - although the performance is still comparable to Sucxent++. Specifically, Shared-Inlining involves both join and the union operators due to the wild card path expressions /dictionary/e/*/hw and /dictionary/e/ss/s/qp/*/qt. Q21 [Text search (TCSD)]: Sucxent++ performs the best for this query again. The query for the Shared-Inlining approach has several joins in small subqueries that are finally combined with a UNION clause. This is due to the contains clause in the X query. In addition, Shared-Inlining must execute multiple joins to reconstruct the hw element. However, with increasing size querying for "hockey" in the PathValue table becomes more and more expensive compared to searching for it in smaller tables in Shared-Inlining. Therefore, the gap reduces with increasing data size. Q22 [Exact match (TCSD)]: This query presents a deep path expression resulting in several joins for the Shared-Inlining approach. As a result Sucxent++ significantly outperforms Shared-Inlining by up to 88 times. 6 Summary In this paper, we compared the performance of one schema-conscious approach (Shared-Inlining [15]) with two representative schema-oblivious approaches (XParent[8] and Sucxent++[12]). We show that it is indeed possible for a schemaoblivious approach to outperform a schema-conscious approach for several types of non-recursive XML queries. Figure 6 provides summary of the performance results for the compared approaches with respect to Sucxent++. These figures show the ratio of time taken for a given approach to the time taken in Sucxent++. If a query fails to execute in 6 minutes then we show it as DNF in the corresponding column. Observe that Shared-Inlining outperforms XParent for all queries except Q22. However, Sucxent++ can be several times faster than the schema-conscious approach (Shared-Inlining ) for text-centric documents; the highest observed factor being On the other hand, Shared-Inlining fares better for large data-centric XML documents as it outperforms Sucxent++ for 73% of the benchmark queries (Queries Q1 - Q11). Sucxent++ significantly outperforms XParent for all non-recursive queries that we have experimented with. In particular, Sucxent++ is 7-17 times (excluding
11 DNF queries in XParent) faster than XParent for queries (Q1-Q11) on large datacentric XML documents (1GB). For large text-centric documents, it is still times faster. Note that the optimization techniques in Sucxent++ as described in [12] can also be applied to XParent. A preliminary study of the query plans generated by the RDBMS for XParent suggests that XParent can also benefit from these optimizations. However, a considerable performance difference shall still exists between Sucxent++ and XParent, especially for large data set, primarily due to XParent s large storage requirements and consequently greater I/O-cost. References 1. P. Bohannon, J. Freire, P. Roy, J. Simeon. From XML Schema to Relations: A Cost-based Approach to XML Storage. In IEEE ICDE, T. Böhme, E. Rahm. XMach-1: A Benchmark for XML Data Management. In German Database Conference, S. Bressan, M-L. Lee, Y. G. Li, Z. Lacroix, U. Nambiar. The XOO7 Benchmark.In EEXTT, M. Carey, D. DeWitt, J. Naughton. The OO7 Benchmark. In ACM SIGMOD, D. DeHaan, D. Toman, M. P. Consens, M. T. Ozsu. A Comprehensive X to SQL Translation Using Dynamic Interval Coding. In ACM SIGMOD, L. Ennser, C. Delporte, M. Oba, K. Sunil. Integrating XML and DB2 XML Extender and DB2 Text Extender. IBM Redbooks, D. Florescu, D. Kossman. Storing and ing XML Data using an RDBMS. IEEE Data Engineering Bulletin. 22(3), H. Jiang, H. Lu, W. Wang and J. Xu Yu. Path Materialization Revisited: An Efficient Storage Model for XML Data. 13th Australasian Database Conference (ADC 2), R. Krishnamurthy, R. Kaushik, J. F. Naughton. XML to SQL Translation Literature: The State of the Art and Open Problem.In XSym, R. Krishnamurthy, V. T. Chakaravarthy, R. Kaushik, J. F. Naughton. Recursive XML Schemas, Recursive XML Queries, and Relational Storage: XML-to-SQL Translation.In IEEE ICDE, H. Lu, H. Jiang, J. X. Xu, G. Yu et al. What Makes the Differences: Benchmarking XML Database Implementations. In ACM Trans. on Internet Technology, 5(1), S. Prakash, S. S. Bhowmick, S. K. Madria. Efficient Recursive XML Processing Using Relational Databases. In ER, S. Prakash, S. S. Bhowmick, S. K. Madria. Efficient Recursive XML Processing Using Relational Databases. To appear in Data and Knowledge Engineering Journal, Special Issue on Best Papers of ER 24, Elsevier Science, A. Schmidt, F. Waas, M. Kersten, M. J. Carey, I. Manolescu and R. Busse. XMark: A Benchmark for XML Data Management. In VLDB, J. Shanmugasundaram, K. Tufte et al. Relational Databases for ing XML Documents: Limitations and Opportunities. In VLDB F. Tian, D. DeWitt, J. Chen and C. Zhang. The Design and Performance Evaluation of Alternative XML Storage Strategies. ACM Sigmod Record, Vol. 31(1), B. Yao, M. Tamer Özsu, N. Khandelwal. XBench: Benchmark and Performance Testing of XML DBMSs. In ICDE, Boston, M. Yoshikawa, T. Amagasa, T. Shimura, and S. Uemura. XRel: A Path-based Approach to Storage and Retrieval of XML Documents Using Relational Databases. ACM TOIT 1(1):11-141, C. Zhang, J. Naughton, D. Dewitt, Q. Luo and G. Lohmann. On Supporting Containment Queries in Relational Database Systems. In ACM SIGMOD, Microsoft SQL Server 2 SDK Documentation, Microsoft 2, Oracle XML DB.
XBench Benchmark and Performance Testing of XML DBMSs
XBench Benchmark and Performance Testing of XML DBMSs Benjamin Bin Yao M. Tamer Özsu University of Waterloo School of Computer Science {bbyao, tozsu}@uwaterloo.ca Nitin Khandelwal University of Pennsylvania
ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH
ON ANALYZING THE DATABASE PERFORMANCE FOR DIFFERENT CLASSES OF XML DOCUMENTS BASED ON THE USED STORAGE APPROACH Hagen Höpfner and Jörg Schad and Essam Mansour International University Bruchsal, Campus
Efficient Mapping XML DTD to Relational Database
Efficient Mapping XML DTD to Relational Database Mohammed Adam Ibrahim Fakharaldien 1, Khalid Edris 2, Jasni Mohamed Zain 3, Norrozila Sulaiman 4 Faculty of Computer System and Software Engineering,University
Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation
Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation Rajasekar Krishnamurthy University of Wisconsin [email protected] Raghav Kaushik Microsoft Corporation [email protected]
Model-Mapping Approaches for Storing and Querying XML Documents in Relational Database: A Survey
Model-Mapping Approaches for Storing and Querying XML Documents in Relational Database: A Survey 1 Amjad Qtaish, 2 Kamsuriah Ahmad 1 School of Computer Science, Faculty of Information Science and Technology,
XML Data Integration Based on Content and Structure Similarity Using Keys
XML Data Integration Based on Content and Structure Similarity Using Keys Waraporn Viyanon 1, Sanjay K. Madria 1, and Sourav S. Bhowmick 2 1 Department of Computer Science, Missouri University of Science
XML-to-SQL Query Mapping in the Presence of Multi-valued Schema Mappings and Recursive XML Schemas
XML-to-SQL Query Mapping in the Presence of Multi-valued Schema Mappings and Recursive XML Schemas Mustafa Atay, Artem Chebotko, Shiyong Lu, and Farshad Fotouhi Department of Computer Science Wayne State
ABSTRACT 1. INTRODUCTION. Kamil Bajda-Pawlikowski [email protected]
Kamil Bajda-Pawlikowski [email protected] Querying RDF data stored in DBMS: SPARQL to SQL Conversion Yale University technical report #1409 ABSTRACT This paper discusses the design and implementation
Storing and Querying Ordered XML Using a Relational Database System
Kevin Beyer IBM Almaden Research Center Storing and Querying Ordered XML Using a Relational Database System Igor Tatarinov* University of Washington Jayavel Shanmugasundaram* Cornell University Stratis
Translating XQuery expressions to Functional Queries in a Mediator Database System
Uppsala Student Thesis Computing Science No. 268 2004-01-30 ISSN 1100-1836 Translating XQuery expressions to Functional Queries in a Mediator Database System A student project paper by Tobias Hilka Advisor
Telemetry Database Query Performance Review
Telemetry Database Query Performance Review SophosLabs Network Security Group Michael Shannon Vulnerability Research Manager, SophosLabs [email protected] Christopher Benninger Linux Deep Packet
Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce
Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce Huayu Wu Institute for Infocomm Research, A*STAR, Singapore [email protected] Abstract. Processing XML queries over
Unified XML/relational storage March 2005. The IBM approach to unified XML/relational databases
March 2005 The IBM approach to unified XML/relational databases Page 2 Contents 2 What is native XML storage? 3 What options are available today? 3 Shred 5 CLOB 5 BLOB (pseudo native) 6 True native 7 The
Technologies for a CERIF XML based CRIS
Technologies for a CERIF XML based CRIS Stefan Bärisch GESIS-IZ, Bonn, Germany Abstract The use of XML as a primary storage format as opposed to data exchange raises a number of questions regarding the
QuickDB Yet YetAnother Database Management System?
QuickDB Yet YetAnother Database Management System? Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Department of Computer Science, FEECS,
DBMS / Business Intelligence, SQL Server
DBMS / Business Intelligence, SQL Server Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the needs of IT professionals.
Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices
Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil
Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications
Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications Ahmed Abdulhakim Al-Absi, Dae-Ki Kang and Myong-Jong Kim Abstract In Hadoop MapReduce distributed file system, as the input
Deferred node-copying scheme for XQuery processors
Deferred node-copying scheme for XQuery processors Jan Kurš and Jan Vraný Software Engineering Group, FIT ČVUT, Kolejn 550/2, 160 00, Prague, Czech Republic [email protected], [email protected] Abstract.
MTCache: Mid-Tier Database Caching for SQL Server
MTCache: Mid-Tier Database Caching for SQL Server Per-Åke Larson Jonathan Goldstein Microsoft {palarson,jongold}@microsoft.com Hongfei Guo University of Wisconsin [email protected] Jingren Zhou Columbia
An Oracle White Paper October 2013. Oracle XML DB: Choosing the Best XMLType Storage Option for Your Use Case
An Oracle White Paper October 2013 Oracle XML DB: Choosing the Best XMLType Storage Option for Your Use Case Introduction XMLType is an abstract data type that provides different storage and indexing models
Translating WFS Query to SQL/XML Query
Translating WFS Query to SQL/XML Query Vânia Vidal, Fernando Lemos, Fábio Feitosa Departamento de Computação Universidade Federal do Ceará (UFC) Fortaleza CE Brazil {vvidal, fernandocl, fabiofbf}@lia.ufc.br
An Efficient and Scalable Management of Ontology
An Efficient and Scalable Management of Ontology Myung-Jae Park 1, Jihyun Lee 1, Chun-Hee Lee 1, Jiexi Lin 1, Olivier Serres 2, and Chin-Wan Chung 1 1 Korea Advanced Institute of Science and Technology,
Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification
Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Outline More Complex SQL Retrieval Queries
A Study on Workload Imbalance Issues in Data Intensive Distributed Computing
A Study on Workload Imbalance Issues in Data Intensive Distributed Computing Sven Groot 1, Kazuo Goda 1, and Masaru Kitsuregawa 1 University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505, Japan Abstract.
A Comparison of Approaches to Large-Scale Data Analysis
A Comparison of Approaches to Large-Scale Data Analysis Sam Madden MIT CSAIL with Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel Abadi, David DeWitt, and Michael Stonebraker In SIGMOD 2009 MapReduce
KEYWORD SEARCH IN RELATIONAL DATABASES
KEYWORD SEARCH IN RELATIONAL DATABASES N.Divya Bharathi 1 1 PG Scholar, Department of Computer Science and Engineering, ABSTRACT Adhiyamaan College of Engineering, Hosur, (India). Data mining refers to
Seema Sundara, Timothy Chorma, Ying Hu, Jagannathan Srinivasan Oracle Corporation New England Development Center Nashua, New Hampshire
Seema Sundara, Timothy Chorma, Ying Hu, Jagannathan Srinivasan Oracle Corporation New England Development Center Nashua, New Hampshire {Seema.Sundara, Timothy.Chorma, Ying.Hu, Jagannathan.Srinivasan}@oracle.com
Use a Native XML Database for Your XML Data
Use a Native XML Database for Your XML Data You already know it s time to switch. Gregory Burd Product Manager [email protected] Agenda Quick Technical Overview Features API Performance Clear Up Some
Dependency Free Distributed Database Caching for Web Applications and Web Services
Dependency Free Distributed Database Caching for Web Applications and Web Services Hemant Kumar Mehta School of Computer Science and IT, Devi Ahilya University Indore, India Priyesh Kanungo Patel College
A Comparison of Database Query Languages: SQL, SPARQL, CQL, DMX
ISSN: 2393-8528 Contents lists available at www.ijicse.in International Journal of Innovative Computer Science & Engineering Volume 3 Issue 2; March-April-2016; Page No. 09-13 A Comparison of Database
Efficient Integration of Data Mining Techniques in Database Management Systems
Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France
Planning the Installation and Installing SQL Server
Chapter 2 Planning the Installation and Installing SQL Server In This Chapter c SQL Server Editions c Planning Phase c Installing SQL Server 22 Microsoft SQL Server 2012: A Beginner s Guide This chapter
PartJoin: An Efficient Storage and Query Execution for Data Warehouses
PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE [email protected] 2
A Framework for Data Migration between Various Types of Relational Database Management Systems
A Framework for Data Migration between Various Types of Relational Database Management Systems Ahlam Mohammad Al Balushi Sultanate of Oman, International Maritime College Oman ABSTRACT Data Migration is
SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE
SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE Breno Mansur Rabelo Centro EData Universidade do Estado de Minas Gerais, Belo Horizonte, MG, Brazil [email protected] Clodoveu Augusto Davis
DLDB: Extending Relational Databases to Support Semantic Web Queries
DLDB: Extending Relational Databases to Support Semantic Web Queries Zhengxiang Pan (Lehigh University, USA [email protected]) Jeff Heflin (Lehigh University, USA [email protected]) Abstract: We
Keywords: Regression testing, database applications, and impact analysis. Abstract. 1 Introduction
Regression Testing of Database Applications Bassel Daou, Ramzi A. Haraty, Nash at Mansour Lebanese American University P.O. Box 13-5053 Beirut, Lebanon Email: rharaty, [email protected] Keywords: Regression
Enhancing Traditional Databases to Support Broader Data Management Applications. Yi Chen Computer Science & Engineering Arizona State University
Enhancing Traditional Databases to Support Broader Data Management Applications Yi Chen Computer Science & Engineering Arizona State University What Is a Database System? Of course, there are traditional
Discovering SQL. Wiley Publishing, Inc. A HANDS-ON GUIDE FOR BEGINNERS. Alex Kriegel WILEY
Discovering SQL A HANDS-ON GUIDE FOR BEGINNERS Alex Kriegel WILEY Wiley Publishing, Inc. INTRODUCTION xxv CHAPTER 1: DROWNING IN DATA, DYING OF THIRST FOR KNOWLEDGE 1 Data Deluge and Informational Overload
Caching XML Data on Mobile Web Clients
Caching XML Data on Mobile Web Clients Stefan Böttcher, Adelhard Türling University of Paderborn, Faculty 5 (Computer Science, Electrical Engineering & Mathematics) Fürstenallee 11, D-33102 Paderborn,
A Comparative Study Between Two Types of Database Management Systems: XML-Enabled Relational and Native XML
World Applied Sciences Journal 19 (7): 972-985, 2012 ISSN 1818-4952 IDOSI Publications, 2012 DOI: 10.5829/idosi.wasj.2012.19.07.71012 A Comparative Study Between Two Types of Database Management Systems:
low-level storage structures e.g. partitions underpinning the warehouse logical table structures
DATA WAREHOUSE PHYSICAL DESIGN The physical design of a data warehouse specifies the: low-level storage structures e.g. partitions underpinning the warehouse logical table structures low-level structures
Querying Combined Cloud-Based and Relational Databases
Querying Combined Cloud-Based and Relational Databases Minpeng Zhu and Tore Risch Department of Information Technology, Uppsala University, Sweden [email protected] [email protected] Abstract An increasing
Geodatabase Programming with SQL
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold
Database Management Systems Comparative Study: Performances of Microsoft SQL Server Versus Oracle
Database Management Systems Comparative Study: Performances of Microsoft SQL Server Versus Oracle Cătălin Tudose*, Carmen Odubăşteanu** * - ITC Networks, Bucharest, Romania, e-mail: [email protected]
Converting Relational Database Into Xml Document
www.ijcsi.org 127 Converting Relational Database Into Xml Document Kanagaraj.S 1 and Dr.Sunitha Abburu 2 1 Master of Computer Application, Adhiyamaan College of Engineering Hosur, Tamilnadu-635109, India.
An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
FileMaker 12. ODBC and JDBC Guide
FileMaker 12 ODBC and JDBC Guide 2004 2012 FileMaker, Inc. All Rights Reserved. FileMaker, Inc. 5201 Patrick Henry Drive Santa Clara, California 95054 FileMaker and Bento are trademarks of FileMaker, Inc.
Documentum Business Process Analyzer and Business Activity Monitor Installation Guide for JBoss
Documentum Business Process Analyzer and Business Activity Monitor Installation Guide for JBoss Version 5.3 SP5 May, 2007 Copyright 1994-2007 EMC Corporation. All rights reserved. Table of Contents Preface...
Efficient database auditing
Topicus Fincare Efficient database auditing And entity reversion Dennis Windhouwer Supervised by: Pim van den Broek, Jasper Laagland and Johan te Winkel 9 April 2014 SUMMARY Topicus wants their current
A Tool for Generating Relational Database Schema from EER Diagram
A Tool for Generating Relational Schema from EER Diagram Lisa Simasatitkul and Taratip Suwannasart Abstract design is an important activity in software development. EER diagram is one of diagrams, which
Database-driven library system
Database-driven library system Key-Benefits of CADSTAR 12.1 Characteristics of database-driven library system KEY-BENEFITS Increased speed when searching for parts You can edit/save a single part (instead
Graph Database Proof of Concept Report
Objectivity, Inc. Graph Database Proof of Concept Report Managing The Internet of Things Table of Contents Executive Summary 3 Background 3 Proof of Concept 4 Dataset 4 Process 4 Query Catalog 4 Environment
An Overview of Distributed Databases
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 2 (2014), pp. 207-214 International Research Publications House http://www. irphouse.com /ijict.htm An Overview
Efficiently Identifying Inclusion Dependencies in RDBMS
Efficiently Identifying Inclusion Dependencies in RDBMS Jana Bauckmann Department for Computer Science, Humboldt-Universität zu Berlin Rudower Chaussee 25, 12489 Berlin, Germany [email protected]
Database Design Patterns. Winter 2006-2007 Lecture 24
Database Design Patterns Winter 2006-2007 Lecture 24 Trees and Hierarchies Many schemas need to represent trees or hierarchies of some sort Common way of representing trees: An adjacency list model Each
Advanced SQL Query To Flink Translator
Advanced SQL Query To Flink Translator Yasien Ghallab Gouda Full Professor Mathematics and Computer Science Department Aswan University, Aswan, Egypt Hager Saleh Mohammed Researcher Computer Science Department
Translating between XML and Relational Databases using XML Schema and Automed
Imperial College of Science, Technology and Medicine (University of London) Department of Computing Translating between XML and Relational Databases using XML Schema and Automed Andrew Charles Smith acs203
What is a database? COSC 304 Introduction to Database Systems. Database Introduction. Example Problem. Databases in the Real-World
COSC 304 Introduction to Systems Introduction Dr. Ramon Lawrence University of British Columbia Okanagan [email protected] What is a database? A database is a collection of logically related data for
Indexing XML Data in RDBMS using ORDPATH
Indexing XML Data in RDBMS using ORDPATH Microsoft SQL Server 2005 Concepts developed by: Patrick O Neil,, Elizabeth O Neil, (University of Massachusetts Boston) Shankar Pal,, Istvan Cseri,, Oliver Seeliger,,
SQL. Short introduction
SQL Short introduction 1 Overview SQL, which stands for Structured Query Language, is used to communicate with a database. Through SQL one can create, manipulate, query and delete tables and contents.
PCVITA Express Migrator for SharePoint (File System) 2011. Table of Contents
Table of Contents Chapter-1 ---------------------------------------------------------------------------- Page No (2) What is PCVITA Express Migrator for SharePoint (File System)? Migration Supported The
Unit 4.3 - Storage Structures 1. Storage Structures. Unit 4.3
Storage Structures Unit 4.3 Unit 4.3 - Storage Structures 1 The Physical Store Storage Capacity Medium Transfer Rate Seek Time Main Memory 800 MB/s 500 MB Instant Hard Drive 10 MB/s 120 GB 10 ms CD-ROM
Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications
Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications White Paper Table of Contents Overview...3 Replication Types Supported...3 Set-up &
Preparing a SQL Server for EmpowerID installation
Preparing a SQL Server for EmpowerID installation By: Jamis Eichenauer Last Updated: October 7, 2014 Contents Hardware preparation... 3 Software preparation... 3 SQL Server preparation... 4 Full-Text Search
database abstraction layer database abstraction layers in PHP Lukas Smith BackendMedia [email protected]
Lukas Smith database abstraction layers in PHP BackendMedia 1 Overview Introduction Motivation PDO extension PEAR::MDB2 Client API SQL syntax SQL concepts Result sets Error handling High level features
SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package 7 2015-11-24. Data Federation Administration Tool Guide
SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package 7 2015-11-24 Data Federation Administration Tool Guide Content 1 What's new in the.... 5 2 Introduction to administration
Physical Design. Meeting the needs of the users is the gold standard against which we measure our success in creating a database.
Physical Design Physical Database Design (Defined): Process of producing a description of the implementation of the database on secondary storage; it describes the base relations, file organizations, and
Report Paper: MatLab/Database Connectivity
Report Paper: MatLab/Database Connectivity Samuel Moyle March 2003 Experiment Introduction This experiment was run following a visit to the University of Queensland, where a simulation engine has been
Database Systems. Lecture 1: Introduction
Database Systems Lecture 1: Introduction General Information Professor: Leonid Libkin Contact: [email protected] Lectures: Tuesday, 11:10am 1 pm, AT LT4 Website: http://homepages.inf.ed.ac.uk/libkin/teach/dbs09/index.html
6. SQL/XML. 6.1 Introduction. 6.1 Introduction. 6.1 Introduction. 6.1 Introduction. XML Databases 6. SQL/XML. Creating XML documents from a database
XML Databases Silke Eckstein Andreas Kupfer Institut für Informationssysteme Technische Universität http://www.ifis.cs.tu-bs.de in XML XML Databases SilkeEckstein Institut fürinformationssysteme TU 2 Creating
Database Auditing Design on Historical Data
ISBN 978-952-5726-09-1 (Print) Proceedings of the Second International Symposium on Networking and Network Security (ISNNS 10) Jinggangshan, P. R. China, 2-4, April. 2010, pp. 275-281 Database Auditing
Creating Synthetic Temporal Document Collections for Web Archive Benchmarking
Creating Synthetic Temporal Document Collections for Web Archive Benchmarking Kjetil Nørvåg and Albert Overskeid Nybø Norwegian University of Science and Technology 7491 Trondheim, Norway Abstract. In
