
Hybrid Storage Scheme for RDF Data Management in Semantic Web

Sung Wan Kim
Department of Computer Information, Sahmyook College
Chungryang P.O. Box118, Seoul, Korea

ABSTRACT: With the advent of the Semantic Web as the next generation of Web technology, large volumes of Semantic Web data described in RDF will appear in the near future. Most previous approaches treat RDF data as a set of triples and store them in a single large relational table. Because processing a query then requires the whole table to be scanned, retrieval performance may degrade. In addition, this approach does not scale well. In this paper, we propose a hybrid storage approach for RDF data management. The proposed approach aims to provide good query performance, scalability, manageability, and flexibility. To achieve these goals, we distinguish some frequently appearing properties in RDF data. The RDF data with a distinguished property is treated independently and stored together in a corresponding property-based table. To process a query with a specific property, we avoid scanning the whole data and only have to access the corresponding table. For queries with specific properties, the proposed scheme achieves better performance than the previous approach.

Categories and Subject Descriptors: B.4 [Input/Output and Data Communications]; D.2.12 [Interoperability Web-based Services]; E.2 [Data Storage Representations]; H.2 [Database Management]

General Terms: Hybrid data storage, W3C, Web data management

Keywords: RDF Data management, Semantic web, Data storage scheme

Received 28 Oct. 2005; Revised 15 Dec. 2005; Accepted 16 Jan.

1. Introduction

The W3C has established the Semantic Web as the next-generation Web. The Semantic Web extends the current Web by giving Web information a well-defined meaning, its so-called semantics, so that the information becomes meaningful to computers. The semantic data attached to Web information is the foundation of the Semantic Web.
The W3C therefore released the Resource Description Framework (RDF) to represent and exchange semantic data about resources on the Web [1]. In this paper we call these data Semantic Web data or, more concisely, RDF data. As the scope of Semantic Web applications is expected to expand, an enormous amount of Semantic Web data will appear in the near future. For example, MusicBrainz is one of the first of what might be called Semantic Web services [12]. It provides information about musical artists, song titles, and so on using metadata described in RDF. Thus, we strongly believe that storing and managing Semantic Web data efficiently plays a key role in realizing the vision of the Semantic Web.

To manage RDF data, most previous approaches use traditional database management systems such as RDBMS and ORDBMS [2][3][4][14]. In these approaches, RDF data is represented by a set of triples and then stored in a single large relational table (a so-called triple table). From a data management viewpoint, this has the advantage of directly using the full power of database management systems. However, since processing a query requires the whole table to be scanned, retrieval performance may degrade. In addition, maintaining a single large triple table is not good for scalability.

1 This is a revised version of the paper presented at the International Conference on Next Generation Web Services Practices (NWeSP), August, 2005, Seoul, Korea.

Recently, Ding et al. [8] reported an analysis of the empirical usage of properties over FOAF (Friend-of-a-Friend) data and revealed the most frequently used properties. We focused on the fact that, among all properties in the FOAF vocabulary, a few properties (about 5) together account for over 50% of the total usage.
We believe that, since the most frequently used properties will continue to be widely used both in future FOAF documents and in user queries, it is more efficient to manage them in a special manner. To enhance query performance, in this paper we propose a novel storage scheme for managing RDF data. We also aim to provide scalability, manageability, and flexibility. We maintain RDF data not in a single large table but in several independent tables. We group RDF data according to some distinguished properties and store each group independently in a corresponding table. Thus, we can avoid fully scanning a single large table and obtain good retrieval performance.

The rest of the paper is organized as follows. Section 2 briefly describes the RDF data model and the previous approaches for managing RDF data. The proposed storage scheme for RDF data management is explained in Section 3. Section 4 covers our experiment and the results of our performance test. We finally conclude this paper with future works in Section 5.

2. Data Model and RDF Data Management

In this section, we briefly overview the RDF specifications and define the RDF data model. We then compare various representation schemes for a data object in database systems and describe how to apply them to RDF data management.

2.1. RDF Data Model

RDF is a language for describing semantic metadata about resources on the Web [1]. RDF is based on the idea of identifying things using Web identifiers (URIs) and describing resources in terms of simple properties and property values. This enables RDF to represent a simple statement about a resource. An RDF statement consists of a subject, a predicate, and an object. Such a ternary is often referred to as a triple. The subject, or S, is a resource URI. The object, or O, is the value of the property and may be either a resource URI or a literal value. The predicate, or P, is a property and is also a resource URI.
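The roles of subject, predicate, and object described above can be sketched as a small data type. This is only an illustrative sketch: the class names `URI`, `Literal`, and `Triple` and the example resource are assumptions made here, not part of the paper or of any RDF library.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class URI:
    """A resource identified by a URI reference."""
    value: str


@dataclass(frozen=True)
class Literal:
    """A literal value such as a name string."""
    value: str


@dataclass(frozen=True)
class Triple:
    subject: URI                  # S: a resource URI
    predicate: URI                # P: a property, also a resource URI
    obj: Union[URI, Literal]      # O: a resource URI or a literal value

    def __post_init__(self):
        # Reject statements that break the role constraints described above.
        if not isinstance(self.subject, URI):
            raise TypeError("subject must be a URI")
        if not isinstance(self.predicate, URI):
            raise TypeError("predicate must be a URI")
        if not isinstance(self.obj, (URI, Literal)):
            raise TypeError("object must be a URI or a literal")


# Hypothetical statement: the resource genid:mkim has the name "M. Kim".
stmt = Triple(URI("genid:mkim"), URI("foaf:name"), Literal("M. Kim"))
```

A literal in the subject position, for example `Triple(Literal("x"), URI("foaf:name"), Literal("y"))`, raises a `TypeError` in this sketch.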
Definition 1. A triple $\langle S, P, O \rangle \in R \times U \times (R \cup L)$ is called an RDF triple, where $U$ is a set of URI references (URIs), $L$ is a set of literals, $B$ is a set of blank nodes, and $R = U \cup B$.

An RDF statement can also be modeled as a directed graph of nodes and arcs representing the resource, its property, and the property value. Both the subject and the object are represented by nodes. A resource without a URI is represented by a blank node. The predicate is represented by a directed arc from the subject node to the object node and expresses the relationship between the two nodes. Figure 1 shows an example RDF description and its graph representation using the FOAF ontology vocabulary. FOAF (Friend-of-a-Friend) is an ontology that provides vocabulary for describing personal information such as name, mailbox, and homepage URI. Figure 2 is a captured image of the RDF triples extracted by the CARA RDF parser [10] from the RDF graph in Figure 1.

Thus, RDF is a directed graph-based data model that consists of a set of RDF triples. Another characteristic of RDF is that it is a property-centered model. A property can be defined independently of a specific class definition and applied to all classes unless domain specifications are explicitly described. Thus we can assert an RDF statement that associates a resource with any property. Consequently, an RDF storage scheme should remain flexible when properties are added or deleted.

Journal of Digital Information Management, Volume 4 Number 1, March

Figure 1. An Example RDF Description and RDF Graph

Figure 2. Extracted RDF Triples

2.2. RDF Data Management

In traditional database systems a data object is represented as a row in a relational table. In this approach, since the different attributes of an object are grouped together, only one record per object is required. Depending on the characteristics of the data objects, however, many null values may appear. This approach is also not flexible with respect to schema evolution such as attribute insertion and deletion. It is referred to as a horizontal storage approach.

The decomposition storage model was introduced in [6]. It divides a table of the horizontal storage approach into as many binary tables as there are attributes, so that all data objects are physically grouped by attribute. We call this scheme a binary storage approach. Since each table name is the same as the attribute name in the horizontal table, there is no need to explicitly maintain an attribute field in each binary table. Null values do not appear either.

A vertical storage approach stores a data object in a different manner: a single 3-ary table consisting of an object identifier, an attribute name, and an attribute value is generated [7]. In the vertical scheme, since the table contains records only for those attributes that are present in an object, there is no null value. The different attributes of an object are tied together by the same OID. Schema evolution is easier than in the horizontal scheme.

Among these approaches, the binary approach has shown better performance in query processing, such as projection, selection, join, and aggregation, than the other two schemes. The poor performance of the horizontal scheme is mainly caused by I/O operations. For further information about the performance comparisons, refer to [7]. Figure 3 compares a horizontal table with its corresponding representations in the vertical and binary approaches.
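As a rough illustration of the three layouts compared above, the following sketch converts a small horizontal table into its vertical and binary representations. The sample objects, attribute names, and the function names `to_vertical` and `to_binary` are assumptions made here for illustration only.

```python
# A horizontal table: one row per object, one column per attribute.
# None marks an attribute that is absent for that object (a null value).
horizontal = {
    "o1": {"name": "Kim", "mbox": "kim@example.org", "nick": None},
    "o2": {"name": "Lee", "mbox": None, "nick": "lee"},
}


def to_vertical(table):
    """One 3-ary table of (oid, attribute, value); absent attributes are skipped."""
    return [
        (oid, attr, value)
        for oid, row in table.items()
        for attr, value in row.items()
        if value is not None
    ]


def to_binary(table):
    """One binary table of (oid, value) per attribute, keyed by attribute name."""
    binary = {}
    for oid, attr, value in to_vertical(table):
        binary.setdefault(attr, []).append((oid, value))
    return binary


vertical = to_vertical(horizontal)
binary = to_binary(horizontal)
```

In this sketch the two nulls of the horizontal table disappear from both derived layouts, and the attribute name moves from a column (vertical) to the table key (binary).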
Most approaches for managing Semantic Web data described in RDF regard an RDF graph as a set of RDF triples and store them in a relational table [3][4][14]. The basic storage schema consists of a single large triple table for storing the set of RDF triples. To reduce disk space, additional tables such as a resource table and a literal table are generated. The resource table maintains all resources, including properties. The literal table maintains all literal values. The triple table consists of the subject, predicate, and object fields; it stores the set of RDF triples and references the other tables. A resource description with several different properties yields as many records as it has RDF triples. Thus, the triple table can be regarded as an application of the vertical approach mentioned above. Since only one table is maintained apart from the additional tables, it is easy to manage data objects (RDF triples). For example, an RDF statement with a new property for a resource is easily inserted into the table. However, since processing a query requires the whole table to be scanned, retrieval performance may degrade. In addition, the approach does not scale well.

Jena 2 [5] uses, besides a triple table, a horizontal table-based approach in which several properties of a resource are clustered. Thus, the related properties and values of a resource can be accessed together. However, many null values may appear in the table. Adding a property to the table or deleting one from it is also very expensive.

Figure 3. Comparison of Storage Schema

3. Hybrid Storage Scheme

3.1. Hybrid Approach

As in the previous approaches, we fundamentally interpret an RDF graph as a set of RDF triples and store it in relational tables. The proposed RDF storage scheme aims to provide:

Query performance: In general, most RDF queries are given with a specific property. Thus, a query on a specific property, with or without a value, should be processed efficiently.

Scalability and manageability: The proposed scheme should be both scalable and manageable. The single large triple table of the previous scheme is good for manageability but not for scalability.

Flexibility: RDF is not a resource-centric (or object-centric) but a property-centric data model. It should therefore be easy to insert an RDF statement that describes a resource with a new property.

Our basic philosophy is that the properties that appear most commonly in RDF data will also be used frequently in the future, both in new data and in user queries. Thus, we manage and handle them specially. To achieve the above goals, we distinguish the properties that appear frequently in RDF data and are used frequently in user queries. RDF statements described with these distinguished properties are grouped by property and maintained independently. For this, we adopt the binary storage scheme described in the previous section and maintain an independent table for each property. However, maintaining as many tables as there are properties may cause management overhead if the number of properties is large. Thus, we divide the RDF statements into two categories and manage them differently, which we call a hybrid approach. The first category is the set of RDF statements described with the distinguished properties. These RDF statements are maintained in independent binary tables according to their properties. The remaining RDF statements are maintained in the conventional way using a single triple table.
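The division into two categories described above can be sketched as a small routing step at insertion time. This is a minimal sketch under stated assumptions: it uses Python's built-in `sqlite3` instead of the MySQL system used in the paper, the table names `arc_foaf_name` and `arc_foaf_knows`, the choice of two distinguished properties, and the sample triples are hypothetical, and identifier columns are omitted for brevity.

```python
import sqlite3

# Assumed for illustration: two distinguished properties mapped to table names.
DISTINGUISHED = {"foaf:name": "arc_foaf_name", "foaf:knows": "arc_foaf_knows"}


def create_schema(conn):
    for table in DISTINGUISHED.values():
        # Binary table: the predicate is implied by the table name.
        conn.execute(f"CREATE TABLE {table} (subject TEXT, object TEXT)")
    # Triple table for all remaining properties; the predicate is stored explicitly.
    conn.execute(
        "CREATE TABLE arc_others (subject TEXT, predicate TEXT, object TEXT)"
    )


def insert_triple(conn, s, p, o):
    """Route a triple to its property table, or to arc_others otherwise."""
    table = DISTINGUISHED.get(p)
    if table is not None:
        conn.execute(f"INSERT INTO {table} VALUES (?, ?)", (s, o))
    else:
        conn.execute("INSERT INTO arc_others VALUES (?, ?, ?)", (s, p, o))


conn = sqlite3.connect(":memory:")
create_schema(conn)
for triple in [
    ("genid:mkim", "foaf:name", "M. Kim"),
    ("genid:mkim", "foaf:knows", "genid:pldms"),
    ("genid:mkim", "foaf:givenname", "M."),
]:
    insert_triple(conn, *triple)

names = conn.execute("SELECT subject, object FROM arc_foaf_name").fetchall()
others = conn.execute("SELECT * FROM arc_others").fetchall()
```

Each triple lands in exactly one table, so a query on `foaf:name` only reads `arc_foaf_name`, while a statement with a new property needs no schema change because it falls through to `arc_others`.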
Figure 4 shows a brief schema diagram of the proposed storage structure. As a result, the set of RDF statements with a frequently used property is physically grouped in a single independent table. RDF statements with less important properties, on the other hand, are grouped together in a common triple table. There is no duplication among these tables. Finally, since a query with a specific, distinguished property is evaluated by accessing only the corresponding property-based arc table, we can achieve high performance in query processing.

3.2. Managing the Property-based Arc Tables

Property-based arc tables may be generated selectively according to the characteristics of the property. To do so, we first have to select the distinguished properties after analyzing the RDF data and the RDF query log. For example, properties that appear frequently in the RDF data of an application domain and are used frequently in user queries are candidates. Generally, the set of property types (the vocabulary) used in a specific domain is already defined and limited. In particular, the set of frequently used properties is limited as well. An analysis of the usage of FOAF (Friend-of-a-Friend) documents in the real Web environment was presented in [8]. It analyzed the empirical usage of properties over the FOAF data and revealed the most frequently used properties. Among all the properties in the FOAF vocabulary, a few properties such as foaf:name, foaf:mbox_sha1sum, foaf:homepage, foaf:knows, and foaf:nick together account for over 50% of the total usage. Since the most frequently used properties will continue to be used in future FOAF documents and take up a large portion of the entire data, it is efficient to manage them independently. The hybrid approach scales well since RDF data is physically distributed over several tables instead of being stored in a single table.
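The data-side part of the selection step described above can be sketched as a simple frequency count over the predicates of the parsed triples. This is only a sketch: the function name `select_distinguished`, the `max_tables` parameter, and the toy data are assumptions made here, and the query-log analysis that the text also mentions is not modeled.

```python
from collections import Counter


def select_distinguished(triples, max_tables=5):
    """Return the most frequent predicates and the share of triples they cover."""
    counts = Counter(p for _, p, _ in triples)
    top = counts.most_common(max_tables)
    covered = sum(n for _, n in top)
    share = covered / len(triples) if triples else 0.0
    return [p for p, _ in top], share


# Toy data; real input would be the parsed triples of the data set.
triples = [
    ("a", "foaf:name", "A"), ("b", "foaf:name", "B"), ("c", "foaf:name", "C"),
    ("a", "foaf:knows", "b"), ("b", "foaf:knows", "c"),
    ("a", "foaf:nick", "aa"),
    ("a", "foaf:givenname", "A"),
]
props, share = select_distinguished(triples, max_tables=2)
```

Here `props` holds the candidate properties for property-based arc tables, and `share` indicates how much of the data would leave the common triple table if they were chosen.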
Maintaining the property-based arc tables offers a chance to enhance performance, especially for retrieval queries with specific properties. Although we have not yet seen other literature that analyzes query logs for frequently used properties, we expect that the properties used frequently in the data will also be used often in queries over RDF data. Thus, maintaining property-based arc tables for several important properties has a strong advantage in terms of managing and querying RDF data.

Maintaining only a single triple table, as in the previous storage scheme, gives good manageability and flexibility. Inserting an RDF triple described with a newly appearing property and deleting RDF triples are easily handled. In the proposed hybrid scheme, since all remaining RDF triples described with non-distinguished properties are stored together in a single table like the triple table of the previous scheme, they can be handled with the same flexibility. The proposed scheme also provides good manageability by maintaining independent tables only as necessary.

4. Performance Experiments

We now describe the results of an experiment to evaluate the performance of the proposed scheme. The experiment includes a performance comparison with the previous scheme described in Section 2.

Figure 4. Database Schema for the Proposed Storage Scheme

The resource table maintains all resources identifiable via a URI reference and consists of the resource identifier, namespace, resource name, and resource type fields. The literal table maintains all literal values and their related information. Additional fields that carry more information about a literal value may also be included. These two tables are referenced by the other tables. As mentioned above, we generate independent binary tables only for the distinguished properties. Only RDF statements described with distinguished properties are stored in these tables, according to their properties.
We call these tables property-based arc tables. Each of them basically consists of the arc identifier, subject, and object fields. Since the table name is the same as the property name, we don't need to explicitly maintain a predicate field. The arc_others table holds the remaining RDF statements. It plays the same role as the triple table in the previous approaches and contains the predicate field explicitly.

4.1 Experimental Conditions

To compare the previous triple table-based scheme with the proposed scheme, we implemented both schemes directly. As in Jena and Sesame/MySQL, only a single triple table-based approach was implemented for the previous scheme. As mentioned in Section 2, it can be regarded as an application of the vertical storage scheme. Thus, we call this previous approach a vertical scheme in this section. Since the experiment focuses only on how RDF data is managed in the two storage structures, we implemented the core storage structure and retrieval module of the vertical approach instead of installing comparative systems such as Jena or Sesame/MySQL. The proposed hybrid scheme maintains 5 property-based arc tables for 5 distinguished properties and the arc_others table for the others. Since the resource URIs and literal values in the test data set are not long, an in-lining approach is adopted in both schemes. That is, resource URIs and literal values are stored directly in the triple table and the property-based arc tables. Thus, a resource table and a literal table are not implemented in either scheme. The in-lining approach reduces the number of join operations and improves performance, although the space overhead increases. Refer to [13] for more on the in-lining approach.

Table 1. Test Queries for Performance Evaluation
No. | Description | Query in RDQL format
Q1 | Return all statements with a specific property (most frequently used property) | SELECT ?x, ?z WHERE (?x <foaf:name> ?z)
Q2 | Return all statements with a specific property (least frequently used property) | SELECT ?x, ?z WHERE (?x <foaf:givenname> ?z)
Q3 | Find all properties and their values of a specified resource | SELECT ?y, ?z WHERE ( pics.rdf#photolists ?y ?z)
Q4 | Find all value pairs for two related properties of a specified resource | SELECT ?x, ?y WHERE (<genid:mkim>, <foaf:name>, ?x), (<genid:mkim>, <foaf:knows>, ?y)
Q5 | Find all object pairs for two related properties grouped by 1st object and having the number of 2nd objects less than 200 | Not described in RDQL format
Q6 | Return all instances known by a resource whose name is given (graph pattern query) | SELECT ?z WHERE (?x <foaf:name> "Dr. Steven R. Newcomb"), (?x <foaf:knows> ?z)
Q7 | Find all name values a specified resource knows (path query) | SELECT ?z WHERE (<genid:pldms> <foaf:knows> ?y),
* USING clause (e.g. USING foaf FOR is omitted in Q1 to Q7

For the triple table in the vertical approach and the arc_others table in the hybrid approach, we indexed the subject, property, and object columns independently. We also indexed the subject and object fields of each property-based arc table in the hybrid approach. To implement the storage and retrieval modules of both schemes we used APM_Setup 5 for Win 32, which consists of the MySQL database management system, the PHP language and interpreter, and the Apache web server. We used the open CARA parser [10] as the RDF parser to extract RDF triples. The test was performed on a machine with a Pentium III 866MHz processor, 256 MB of main memory, and a 20GB hard disk running Windows 2000 Professional server.
As test data, we use a single FOAF ontology-based RDF file generated by the FOAF developer site [9]. An analysis of the test RDF data showed that the number of different properties is about 50. Similar to the result in [8], some properties such as foaf:name, foaf:mbox_sha1sum, foaf:thumbnail, foaf:knows, and rdfs:seeAlso are the most frequently used. Together they account for over 50% of the total usage frequency. The number of extracted RDF triples is about 100,

4.2 Performance Results

We first measured storage requirements. The required database sizes for the vertical and hybrid schemes are about 58MB and 44MB respectively. The index spaces are 3.2MB and 2.4MB respectively. The vertical scheme uses more space than the hybrid scheme because it explicitly maintains the property field. In the proposed scheme, on the other hand, each property-based arc table is named after the corresponding property. Thus, except in the arc_others table, the property field is not stored explicitly.

Next we measured retrieval times. As mentioned in an earlier section, RDF is a directed graph-based model, often represented as a set of RDF triples. In most RDF query languages, such as RDQL, the basic query form is founded on the triple pattern. A triple pattern is composed of named variables, URIs, or literals. We used the seven triple pattern-based queries shown in Table 1 to perform the test. We express these queries in RDQL [11], which is one of the RDF query languages. Queries Q1-Q3 are based on a single triple pattern. Q4 and Q5 are more complex than Q1-Q3. Q4, Q6, and Q7 are based on a graph pattern. In particular, Q7 is a path-based query. Each query was issued several times after a cold boot to flush the buffer cache completely.

In real applications, the number of extracted RDF triples may be very large. Therefore, factors in the design of the database storage schema, such as indices, affect query performance. How to index each table in the storage schema efficiently is important.
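The triple patterns on which the test queries are built can be illustrated with a small in-memory matcher. This is a sketch for illustration only, not how RDQL engines or the experiment's retrieval module are implemented; the function names and sample triples are assumptions, and variables are recognized by a leading `?` as in RDQL.

```python
def is_var(term):
    """Variables are written with a leading '?', as in RDQL."""
    return term.startswith("?")


def match(pattern, triples):
    """Yield one {variable: value} binding per triple that fits the pattern."""
    for triple in triples:
        binding = {}
        for term, value in zip(pattern, triple):
            if is_var(term):
                # A variable repeated in the pattern must bind to one value.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            yield binding


triples = [
    ("genid:mkim", "foaf:name", "M. Kim"),
    ("genid:mkim", "foaf:knows", "genid:pldms"),
    ("genid:pldms", "foaf:name", "P. Ldms"),
]
# Same shape as Q1 in Table 1: all statements with the property foaf:name.
q1 = list(match(("?x", "foaf:name", "?z"), triples))
```

The matcher scans every triple, which mirrors the full-scan cost of an unindexed triple table; a property-based table would restrict the scan to the triples of one property.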
To observe and analyze the influence of indices, we first experimented with both approaches without indices and then with proper indices. Table 2 shows the average retrieval times of the initial executions without indices. Due to cache effects, the response times of subsequent executions were much shorter than those of the initial execution. Since observing cache effects is not a goal of this experiment, we report only the retrieval times of initial executions in this section.

[Table 2. A Comparison of Retrieval Times (without indices). Columns: Query, # of results, Vertical (sec.), Hybrid (sec.). Rows: Q1 to Q7.]

As a whole, the vertical scheme shows lower performance than the proposed scheme. As expected, for a query with a specific and frequently appearing property (Q1, Q2, Q4, Q6, Q7), the proposed scheme achieves about 4 to 8 times better retrieval performance than the vertical scheme. For a query issued without a specific property (Q3), the proposed hybrid approach is also faster than the vertical scheme. The larger number of disk I/O operations in the vertical scheme is one of the reasons for its lower performance. The graph pattern-based queries Q4 - Q7 require a self-join operation in the vertical scheme and a 2-way join operation in the proposed scheme. Because of this, graph pattern-based queries take more time than the simple triple pattern queries in the vertical scheme. Q5 is a very exhaustive query. It requires two full scans of the triple table in the vertical scheme and full scans of the two related tables in the proposed one. In the vertical approach, Q5 exceeds the maximum execution time.

Table 3 shows the average retrieval times of the initial executions with indices. In [14] it was demonstrated that building independent indices on each column of the triple table is superior to other index combinations.
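The difference between the self-join of the vertical scheme and the 2-way join of the hybrid scheme, discussed above for graph pattern queries, can be sketched in SQL for a query shaped like Q6. This is a hedged sketch: it uses Python's built-in `sqlite3` rather than MySQL, the table names, the `genid:a`, `genid:b`, `genid:c` resources, and the SQL translations are assumptions made here and are not taken from the paper's implementation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Vertical scheme: one triple table.
    CREATE TABLE triples (subject TEXT, predicate TEXT, object TEXT);
    -- Hybrid scheme: one binary table per distinguished property.
    CREATE TABLE arc_foaf_name (subject TEXT, object TEXT);
    CREATE TABLE arc_foaf_knows (subject TEXT, object TEXT);
""")
data = [
    ("genid:a", "foaf:name", "Dr. Steven R. Newcomb"),
    ("genid:a", "foaf:knows", "genid:b"),
    ("genid:a", "foaf:knows", "genid:c"),
    ("genid:b", "foaf:name", "B"),
]
for s, p, o in data:
    conn.execute("INSERT INTO triples VALUES (?, ?, ?)", (s, p, o))
    table = "arc_foaf_name" if p == "foaf:name" else "arc_foaf_knows"
    conn.execute(f"INSERT INTO {table} VALUES (?, ?)", (s, o))

NAME = "Dr. Steven R. Newcomb"

# Vertical: the triple table is joined with itself on the shared subject ?x.
vertical_sql = """
    SELECT k.object FROM triples AS n JOIN triples AS k ON n.subject = k.subject
    WHERE n.predicate = 'foaf:name' AND n.object = ?
      AND k.predicate = 'foaf:knows'
    ORDER BY k.object
"""
# Hybrid: two smaller property tables are joined; no predicate filter is needed.
hybrid_sql = """
    SELECT k.object FROM arc_foaf_name AS n
    JOIN arc_foaf_knows AS k ON n.subject = k.subject
    WHERE n.object = ?
    ORDER BY k.object
"""
vertical_rows = conn.execute(vertical_sql, (NAME,)).fetchall()
hybrid_rows = conn.execute(hybrid_sql, (NAME,)).fetchall()
```

Both statements return the same rows in this sketch; the hybrid version simply joins two tables that each hold the triples of one property, instead of joining the full triple table with itself.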
We therefore adopted this index scheme for the triple table in the vertical approach and the arc_others table in the hybrid approach. For each property-based arc table, the subject and object fields are indexed independently.

[Table 3. A Comparison of Retrieval Times (with indices). Columns: Query, # of results, Vertical (sec.), Hybrid (sec.). Rows: Q1 to Q7.]

As shown in Table 3, adopting indices improved performance considerably in both approaches. Although proper indexing reduces the performance differences between the two schemes, the hybrid approach shows an overall performance improvement over the vertical one. Even though Q1 is executed with an index only in the vertical approach, while the hybrid approach uses table scanning, the performance of the two approaches differs by a factor of more than 6. This comes from the fact that the selectivity is somewhat low. Low selectivity means that the percentage of returned rows is high. For Q3 (given no specific property), the vertical approach is slightly faster because the hybrid approach has to access more tables and the selectivity is high (about 1%). For Q5, since MySQL uses only one of the indices created on a table to execute a query (the index on the subject field is chosen in this case), the tables have to be scanned fully in both approaches. It takes about 19 seconds to execute the query in the hybrid approach. In the vertical approach, on the other hand, the query exceeds the maximum execution time.

5. Conclusion and Future Works

A large quantity of Semantic Web data described in RDF will appear in the near future. In most previous approaches, RDF data is stored in a single large relational table called a triple table. Since processing a query requires the whole table to be scanned, retrieval performance may degrade. In addition, the approach does not scale well. We propose a hybrid approach in this paper. First, we distinguish some important properties according to their appearance and usage frequency. Then, based on the binary storage scheme, we generate several property-based tables for the distinguished properties so that each property is treated independently. RDF statements described with a specific property are grouped and stored in a corresponding table. Thus we can avoid fully scanning the whole data and achieve better retrieval performance. The other RDF statements, described with non-distinguished properties, are managed in the same manner as in the previous approach.
The proposed scheme also provides good manageability by maintaining several independent tables only as necessary. In addition, it scales well since RDF data is physically distributed over several tables instead of being stored in a single table. Finally, we implemented and evaluated the proposed scheme. It shows better performance especially for retrieval queries with specific properties.

How to analyze the usage frequency of properties remains future work. In the near future, we plan to analyze and evaluate the optimal number of property-based tables to maintain. In this paper we don't consider ontology languages such as RDF Schema and OWL. Since RDF Schema and OWL documents can fundamentally be described in RDF syntax, we can apply our proposed storage scheme to manage them. However, ontological data described in RDF Schema or OWL has different characteristics from RDF data. Thus, it may be more efficient to treat it in a different manner. We are currently investigating the design of a management scheme for ontological data and its connection with the hybrid storage scheme proposed in this paper.

Sung Wan Kim is an assistant professor in the Department of Computer Information at Sahmyook College, Korea. He received the B.Eng. degree with First Class Honors in Computer Science from Myongji University, Korea in 1996, and the M.S. and Ph.D. degrees in Computer Science from Hongik University, Korea in 1998 and 2003, respectively. His current research interests are in the areas of XML and the semantic web from the viewpoint of database systems.

Acknowledgements

This work was supported by the Sahmyook College Research Fund in […]. I wish to thank Brenda Yoon for her valuable proofreading efforts on this manuscript.

References

[1] W3C (2004). RDF Primer.
[2] Melnik, S. (2004). Storing RDF in a Relational Database. (www-db.stanford.edu/~melnik/rdf/db.html)
[3] McBride, B. (2001). Jena: Implementing the RDF Model and Syntax. Proc. of the Second International Workshop on the Semantic Web (SemWeb 2001).
[4] Broekstra, J. et al. (2002). Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema. Proc. of the 1st Int'l Semantic Web Conference.
[5] Wilkinson, K. et al. (2003). Efficient RDF Storage and Retrieval in Jena2. Proc. of the 1st International Workshop on Semantic Web and Databases.
[6] Copeland, G., Khoshafian, S. (1985). A Decomposition Storage Model. Proc. of the ACM SIGMOD Int'l Conf. on Management of Data.
[7] Agrawal, R., Somani, A., Xu, Y. (2001). Storage and Querying of E-Commerce Data. Proc. of the 27th Int'l Conf. on Very Large Data Bases (VLDB).
[8] Ding, L. et al. (2005). How the Semantic Web is Being Used: An Analysis of FOAF. Proc. of the 38th Hawaii Int'l Conf. on System Sciences.
[9] FOAF project.
[10] CARA RDF Parser.
[11] Seaborne, A. (2004). Jena Tutorial: A Programmer's Introduction - RDQL.
[12] Swartz, A. (2002). MusicBrainz: A Semantic Web Service. IEEE Intelligent Systems, 17(1).
[13] Florescu, D., Kossmann, D. (1999). Storing and Querying XML Data using an RDBMS. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 22(3).
[14] Ma, L. et al. (2004). RStar: an RDF Storage and Query System for Enterprise Resource Management. Proc. of the 13th ACM Conf. on Information and Knowledge Management.


More information

Perspectives of Semantic Web in E- Commerce

Perspectives of Semantic Web in E- Commerce Perspectives of Semantic Web in E- Commerce B. VijayaLakshmi M.Tech (CSE), KIET, A.GauthamiLatha Dept. of CSE, VIIT, Dr. Y. Srinivas Dept. of IT, GITAM University, Mr. K.Rajesh Dept. of MCA, KIET, ABSTRACT

More information

Semantic Stored Procedures Programming Environment and performance analysis

Semantic Stored Procedures Programming Environment and performance analysis Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski

More information

A generic approach for data integration using RDF, OWL and XML

A generic approach for data integration using RDF, OWL and XML A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6

More information

Grids, Logs, and the Resource Description Framework

Grids, Logs, and the Resource Description Framework Grids, Logs, and the Resource Description Framework Mark A. Holliday Department of Mathematics and Computer Science Western Carolina University Cullowhee, NC 28723, USA holliday@cs.wcu.edu Mark A. Baker,

More information

Semantically Enhanced Web Personalization Approaches and Techniques

Semantically Enhanced Web Personalization Approaches and Techniques Semantically Enhanced Web Personalization Approaches and Techniques Dario Vuljani, Lidia Rovan, Mirta Baranovi Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, HR-10000 Zagreb,

More information

MS SQL Performance (Tuning) Best Practices:

MS SQL Performance (Tuning) Best Practices: MS SQL Performance (Tuning) Best Practices: 1. Don t share the SQL server hardware with other services If other workloads are running on the same server where SQL Server is running, memory and other hardware

More information

Application of ontologies for the integration of network monitoring platforms

Application of ontologies for the integration of network monitoring platforms Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,

More information

K@ A collaborative platform for knowledge management

K@ A collaborative platform for knowledge management White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index

More information

Databases in Organizations

Databases in Organizations The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron

More information

Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis

Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis Rajesh Reddy Muley 1, Sravani Achanta 2, Prof.S.V.Achutha Rao 3 1 pursuing M.Tech(CSE), Vikas College of Engineering and

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

Detection and Elimination of Duplicate Data from Semantic Web Queries

Detection and Elimination of Duplicate Data from Semantic Web Queries Detection and Elimination of Duplicate Data from Semantic Web Queries Zakia S. Faisalabad Institute of Cardiology, Faisalabad-Pakistan Abstract Semantic Web adds semantics to World Wide Web by exploiting

More information

RDF Support in Oracle Oracle USA Inc.

RDF Support in Oracle Oracle USA Inc. RDF Support in Oracle Oracle USA Inc. 1. Introduction Resource Description Framework (RDF) is a standard for representing information that can be identified using a Universal Resource Identifier (URI).

More information

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Ram Soma 2, Amol Bakshi 1, Kanwal Gupta 3, Will Da Sie 2, Viktor Prasanna 1 1 University of Southern California,

More information

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil

More information

Storing and Querying XML Data using an RDMBS

Storing and Querying XML Data using an RDMBS Storing and Querying XML Data using an RDMBS Daniela Florescu INRIA, Roquencourt daniela.florescu@inria.fr Donald Kossmann University of Passau kossmann@db.fmi.uni-passau.de 1 Introduction XML is rapidly

More information

Towards a Semantic Wiki Wiki Web

Towards a Semantic Wiki Wiki Web Towards a Semantic Wiki Wiki Web Roberto Tazzoli, Paolo Castagna, and Stefano Emilio Campanini Abstract. This article describes PlatypusWiki, an enhanced Wiki Wiki Web using technologies from the Semantic

More information

ARC: appmosphere RDF Classes for PHP Developers

ARC: appmosphere RDF Classes for PHP Developers ARC: appmosphere RDF Classes for PHP Developers Benjamin Nowack appmosphere web applications, Kruppstr. 100, 45145 Essen, Germany bnowack@appmosphere.com Abstract. ARC is an open source collection of lightweight

More information

Powl A Web Based Platform for Collaborative Semantic Web Development

Powl A Web Based Platform for Collaborative Semantic Web Development Powl A Web Based Platform for Collaborative Semantic Web Development Sören Auer University of Leipzig auer@informatik.uni-leipzig.de Abstract: We outline Powl, an opensource, web-based semantic web development

More information

www.gr8ambitionz.com

www.gr8ambitionz.com Data Base Management Systems (DBMS) Study Material (Objective Type questions with Answers) Shared by Akhil Arora Powered by www. your A to Z competitive exam guide Database Objective type questions Q.1

More information

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12

More information

OntoDBench: Ontology-based Database Benchmark

OntoDBench: Ontology-based Database Benchmark OntoDBench: Ontology-based Database Benchmark Stéphane Jean, Ladjel Bellatreche, Géraud Fokou, Mickaël Baron, and Selma Khouri LIAS/ISAE-ENSMA and University of Poitiers BP 40109, 86961 Futuroscope Cedex,

More information

Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints

Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints Christian Bizer 1 and Andreas Schultz 1 1 Freie Universität Berlin, Web-based Systems Group, Garystr. 21, 14195 Berlin, Germany

More information

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING Exploration on Service Matching Methodology Based On Description Logic using Similarity Performance Parameters K.Jayasri Final Year Student IFET College of engineering nishajayasri@gmail.com R.Rajmohan

More information

Semantic Web Standard in Cloud Computing

Semantic Web Standard in Cloud Computing ETIC DEC 15-16, 2011 Chennai India International Journal of Soft Computing and Engineering (IJSCE) Semantic Web Standard in Cloud Computing Malini Siva, A. Poobalan Abstract - CLOUD computing is an emerging

More information

SWARD: Semantic Web Abridged Relational Databases

SWARD: Semantic Web Abridged Relational Databases SWARD: Semantic Web Abridged Relational Databases Johan Petrini and Tore Risch Department of Information Technology Uppsala University Sweden {Johan.Petrini,Tore.Risch}@it.uu.se Abstract The semantic web

More information

A Semantic web approach for e-learning platforms

A Semantic web approach for e-learning platforms A Semantic web approach for e-learning platforms Miguel B. Alves 1 1 Laboratório de Sistemas de Informação, ESTG-IPVC 4900-348 Viana do Castelo. mba@estg.ipvc.pt Abstract. When lecturers publish contents

More information

THE SEMANTIC WEB AND IT`S APPLICATIONS

THE SEMANTIC WEB AND IT`S APPLICATIONS 15-16 September 2011, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2011) 15-16 September 2011, Bulgaria THE SEMANTIC WEB AND IT`S APPLICATIONS Dimitar Vuldzhev

More information

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner 24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India

More information

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

High-performance XML Storage/Retrieval System

High-performance XML Storage/Retrieval System UDC 00.5:68.3 High-performance XML Storage/Retrieval System VYasuo Yamane VNobuyuki Igata VIsao Namba (Manuscript received August 8, 000) This paper describes a system that integrates full-text searching

More information

Security Issues for the Semantic Web

Security Issues for the Semantic Web Security Issues for the Semantic Web Dr. Bhavani Thuraisingham Program Director Data and Applications Security The National Science Foundation Arlington, VA On leave from The MITRE Corporation Bedford,

More information

Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015

Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015 E6893 Big Data Analytics Lecture 8: Spark Streams and Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing

More information

D5.3.2b Automatic Rigorous Testing Components

D5.3.2b Automatic Rigorous Testing Components ICT Seventh Framework Programme (ICT FP7) Grant Agreement No: 318497 Data Intensive Techniques to Boost the Real Time Performance of Global Agricultural Data Infrastructures D5.3.2b Automatic Rigorous

More information

Creating an RDF Graph from a Relational Database Using SPARQL

Creating an RDF Graph from a Relational Database Using SPARQL Creating an RDF Graph from a Relational Database Using SPARQL Ayoub Oudani, Mohamed Bahaj*, Ilias Cherti Department of Mathematics and Informatics, University Hassan I, FSTS, Settat, Morocco. * Corresponding

More information

An Ontology-based e-learning System for Network Security

An Ontology-based e-learning System for Network Security An Ontology-based e-learning System for Network Security Yoshihito Takahashi, Tomomi Abiko, Eriko Negishi Sendai National College of Technology a0432@ccedu.sendai-ct.ac.jp Goichi Itabashi Graduate School

More information

Business Intelligence Extensions for SPARQL

Business Intelligence Extensions for SPARQL Business Intelligence Extensions for SPARQL Orri Erling (Program Manager, OpenLink Virtuoso) and Ivan Mikhailov (Lead Developer, OpenLink Virtuoso). OpenLink Software, 10 Burlington Mall Road Suite 265

More information

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 ICOM 6005 Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 Readings Read Chapter 1 of text book ICOM 6005 Dr. Manuel

More information

Integrating Open Sources and Relational Data with SPARQL

Integrating Open Sources and Relational Data with SPARQL Integrating Open Sources and Relational Data with SPARQL Orri Erling and Ivan Mikhailov OpenLink Software, 10 Burlington Mall Road Suite 265 Burlington, MA 01803 U.S.A, {oerling,imikhailov}@openlinksw.com,

More information

A Semantic Marketplace of Peers Hosting Negotiating Intelligent Agents

A Semantic Marketplace of Peers Hosting Negotiating Intelligent Agents A Semantic Marketplace of Peers Hosting Negotiating Intelligent Agents Theodore Patkos and Dimitris Plexousakis Institute of Computer Science, FO.R.T.H. Vassilika Vouton, P.O. Box 1385, GR 71110 Heraklion,

More information

An Approach to Implement Map Reduce with NoSQL Databases

An Approach to Implement Map Reduce with NoSQL Databases www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 8 Aug 2015, Page No. 13635-13639 An Approach to Implement Map Reduce with NoSQL Databases Ashutosh

More information

Multi-level Metadata Management Scheme for Cloud Storage System

Multi-level Metadata Management Scheme for Cloud Storage System , pp.231-240 http://dx.doi.org/10.14257/ijmue.2014.9.1.22 Multi-level Metadata Management Scheme for Cloud Storage System Jin San Kong 1, Min Ja Kim 2, Wan Yeon Lee 3, Chuck Yoo 2 and Young Woong Ko 1

More information

E6895 Advanced Big Data Analytics Lecture 4:! Data Store

E6895 Advanced Big Data Analytics Lecture 4:! Data Store E6895 Advanced Big Data Analytics Lecture 4:! Data Store Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and Big Data Analytics,

More information

High-performance metadata indexing and search in petascale data storage systems

High-performance metadata indexing and search in petascale data storage systems High-performance metadata indexing and search in petascale data storage systems A W Leung, M Shao, T Bisson, S Pasupathy and E L Miller Storage Systems Research Center, University of California, Santa

More information

Personalization of Web Search With Protected Privacy

Personalization of Web Search With Protected Privacy Personalization of Web Search With Protected Privacy S.S DIVYA, R.RUBINI,P.EZHIL Final year, Information Technology,KarpagaVinayaga College Engineering and Technology, Kanchipuram [D.t] Final year, Information

More information

DATABASE MANAGEMENT SYSTEM PERFORMANCE ANALYSIS AND COMPARISON. Margesh Naik B.E, Veer Narmad South Gujarat University, India, 2008 PROJECT

DATABASE MANAGEMENT SYSTEM PERFORMANCE ANALYSIS AND COMPARISON. Margesh Naik B.E, Veer Narmad South Gujarat University, India, 2008 PROJECT DATABASE MANAGEMENT SYSTEM PERFORMANCE ANALYSIS AND COMPARISON Margesh Naik B.E, Veer Narmad South Gujarat University, India, 2008 PROJECT Submitted in partial satisfaction of the requirements for the

More information

RDF Resource Description Framework

RDF Resource Description Framework RDF Resource Description Framework Fulvio Corno, Laura Farinetti Politecnico di Torino Dipartimento di Automatica e Informatica e-lite Research Group http://elite.polito.it Outline RDF Design objectives

More information

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

More information

Toad for Oracle 8.6 SQL Tuning

Toad for Oracle 8.6 SQL Tuning Quick User Guide for Toad for Oracle 8.6 SQL Tuning SQL Tuning Version 6.1.1 SQL Tuning definitively solves SQL bottlenecks through a unique methodology that scans code, without executing programs, to

More information

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Hong-Linh Truong Institute for Software Science, University of Vienna, Austria truong@par.univie.ac.at Thomas Fahringer

More information

GRAPH DATABASE SYSTEMS. h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1

GRAPH DATABASE SYSTEMS. h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1 GRAPH DATABASE SYSTEMS h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1 Use Case: Route Finding Source: Neo Technology, Inc. h_da Prof. Dr. Uta Störl Big Data Technologies:

More information

12 The Semantic Web and RDF

12 The Semantic Web and RDF MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication Davide Eynard nternet Technology 12 The Semantic Web and RDF 2 n the previous episodes... A (video) summary: Michael

More information

Mobile Storage and Search Engine of Information Oriented to Food Cloud

Mobile Storage and Search Engine of Information Oriented to Food Cloud Advance Journal of Food Science and Technology 5(10): 1331-1336, 2013 ISSN: 2042-4868; e-issn: 2042-4876 Maxwell Scientific Organization, 2013 Submitted: May 29, 2013 Accepted: July 04, 2013 Published:

More information

Oracle8i Spatial: Experiences with Extensible Databases

Oracle8i Spatial: Experiences with Extensible Databases Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction

More information

Using RDF Metadata To Enable Access Control on the Social Semantic Web

Using RDF Metadata To Enable Access Control on the Social Semantic Web Using RDF Metadata To Enable Access Control on the Social Semantic Web James Hollenbach, Joe Presbrey, and Tim Berners-Lee Decentralized Information Group, MIT CSAIL, 32 Vassar Street, Cambridge, MA, USA,

More information

Semantic Web Applications

Semantic Web Applications Semantic Web Applications Graham Klyne Nine by Nine http://www.ninebynine.net/ 26 February 2004 Nine by Nine Who am I? Scientific, engineering and networked software systems architecture Motion capture,

More information

bla bla OPEN-XCHANGE Open-Xchange Hardware Needs

bla bla OPEN-XCHANGE Open-Xchange Hardware Needs bla bla OPEN-XCHANGE Open-Xchange Hardware Needs OPEN-XCHANGE: Open-Xchange Hardware Needs Publication date Wednesday, 8 January version. . Hardware Needs with Open-Xchange.. Overview The purpose of this

More information

ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU

ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU Computer Science 14 (2) 2013 http://dx.doi.org/10.7494/csci.2013.14.2.243 Marcin Pietroń Pawe l Russek Kazimierz Wiatr ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU Abstract This paper presents

More information

powl Features and Usage Overview

powl Features and Usage Overview powl Features and Usage Overview Live demonstrations and further information is available from: http://powl.sourceforge.net/swc Sören Auer University of Leipzig auer@informatik.uni-leipzig.de Norman Beck

More information

String-Based Semantic Web Data Management Using Ternary B-Trees PhD Seminar, April 29, 2010

String-Based Semantic Web Data Management Using Ternary B-Trees PhD Seminar, April 29, 2010 String-Based Semantic Web Data Management Using Ternary B-Trees PhD Seminar, April 29, 2010 Jürg Senn Department of Computer Science, University of Basel RDF Resource Description Framework (RDF) basis

More information

HybIdx: Indexes for Processing Hybrid Graph Patterns Over Text-Rich Data Graphs Technical Report

HybIdx: Indexes for Processing Hybrid Graph Patterns Over Text-Rich Data Graphs Technical Report HybIdx: Indexes for Processing Hybrid Graph Patterns Over Text-Rich Data Graphs Technical Report Günter Ladwig Thanh Tran Institute AIFB, Karlsruhe Institute of Technology, Germany {guenter.ladwig,ducthanh.tran}@kit.edu

More information

Physical Database Design and Tuning

Physical Database Design and Tuning Chapter 20 Physical Database Design and Tuning Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley 1. Physical Database Design in Relational Databases (1) Factors that Influence

More information

Big Data Analytics. Rasoul Karimi

Big Data Analytics. Rasoul Karimi Big Data Analytics Rasoul Karimi Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 1 Introduction

More information

1. Physical Database Design in Relational Databases (1)

1. Physical Database Design in Relational Databases (1) Chapter 20 Physical Database Design and Tuning Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley 1. Physical Database Design in Relational Databases (1) Factors that Influence

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

RDF graph Model and Data Retrival

RDF graph Model and Data Retrival Distributed RDF Graph Keyword Search 15 2 Linked Data, Non-relational Databases and Cloud Computing 2.1.Linked Data The World Wide Web has allowed an unprecedented amount of information to be published

More information

How To Simplify Building Semantic Web Applications

How To Simplify Building Semantic Web Applications How To Simplify Building Semantic Web Applications Matthias Quasthoff, Harald Sack, Christoph Meinel Hasso Plattner Institute, University of Potsdam {matthias.quasthoff, harald.sack, meinel}@hpi.uni-potsdam.de

More information

OWL based XML Data Integration

OWL based XML Data Integration OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,

More information

Intelligent Data Integration Middleware Based on Updateable Views

Intelligent Data Integration Middleware Based on Updateable Views Intelligent Data Integration Middleware Based on Updateable Views Hanna Kozankiewicz 1, Krzysztof Stencel 2, Kazimierz Subieta 1,3 1 Institute of Computer Sciences of the Polish Academy of Sciences, Warsaw,

More information

Query Optimization in Teradata Warehouse

Query Optimization in Teradata Warehouse Paper Query Optimization in Teradata Warehouse Agnieszka Gosk Abstract The time necessary for data processing is becoming shorter and shorter nowadays. This thesis presents a definition of the active data

More information

SQLMutation: A tool to generate mutants of SQL database queries

SQLMutation: A tool to generate mutants of SQL database queries SQLMutation: A tool to generate mutants of SQL database queries Javier Tuya, Mª José Suárez-Cabal, Claudio de la Riva University of Oviedo (SPAIN) {tuya cabal claudio} @ uniovi.es Abstract We present a

More information

Index Selection Techniques in Data Warehouse Systems

Index Selection Techniques in Data Warehouse Systems Index Selection Techniques in Data Warehouse Systems Aliaksei Holubeu as a part of a Seminar Databases and Data Warehouses. Implementation and usage. Konstanz, June 3, 2005 2 Contents 1 DATA WAREHOUSES

More information

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan

More information

Data Integration Hub for a Hybrid Paper Search

Data Integration Hub for a Hybrid Paper Search Data Integration Hub for a Hybrid Paper Search Jungkee Kim 1,2, Geoffrey Fox 2, and Seong-Joon Yoo 3 1 Department of Computer Science, Florida State University, Tallahassee FL 32306, U.S.A., jungkkim@cs.fsu.edu,

More information

Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1

Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1 Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots

More information

Semantic Web Technologies and Data Management

Semantic Web Technologies and Data Management Semantic Web Technologies and Data Management Li Ma, Jing Mei, Yue Pan Krishna Kulkarni Achille Fokoue, Anand Ranganathan IBM China Research Laboratory IBM Software Group IBM Watson Research Center Bei

More information

International Journal of Computer & Organization Trends Volume20 Number1 May 2015

International Journal of Computer & Organization Trends Volume20 Number1 May 2015 Performance Analysis of Various Guest Operating Systems on Ubuntu 14.04 Prof. (Dr.) Viabhakar Pathak 1, Pramod Kumar Ram 2 1 Computer Science and Engineering, Arya College of Engineering, Jaipur, India.

More information

A Job Recruitment System Using Semantic Web Technology

A Job Recruitment System Using Semantic Web Technology A Job Recruitment System Using Semantic Web Technology P. Niaphruek Department of Computer Science, Faculty of Science, Rajamangala University of Technology Thanyaburi, Klong 6, Thanyaburi, Pathumthani

More information

Proceedings of the SPDECE-2012. Ninth nultidisciplinary symposium on the design and evaluation of digital content for education

Proceedings of the SPDECE-2012. Ninth nultidisciplinary symposium on the design and evaluation of digital content for education Proceedings of the SPDECE-2012. Ninth nultidisciplinary symposium on the design and evaluation of digital content for education 13 15 June 2011 Universidad de Alicante Alicante, Spain Edited by Manuel

More information

Dependency Free Distributed Database Caching for Web Applications and Web Services

Dependency Free Distributed Database Caching for Web Applications and Web Services Dependency Free Distributed Database Caching for Web Applications and Web Services Hemant Kumar Mehta School of Computer Science and IT, Devi Ahilya University Indore, India Priyesh Kanungo Patel College

More information

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3

More information

Data Integration for XML based on Semantic Knowledge

Data Integration for XML based on Semantic Knowledge Data Integration for XML based on Semantic Knowledge Kamsuriah Ahmad a, Ali Mamat b, Hamidah Ibrahim c and Shahrul Azman Mohd Noah d a,d Fakulti Teknologi dan Sains Maklumat, Universiti Kebangsaan Malaysia,

More information

Reputation Network Analysis for Email Filtering

Reputation Network Analysis for Email Filtering Reputation Network Analysis for Email Filtering Jennifer Golbeck, James Hendler University of Maryland, College Park MINDSWAP 8400 Baltimore Avenue College Park, MD 20742 {golbeck, hendler}@cs.umd.edu

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering Volume, Issue, March 201 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Efficient Approach

More information

On the Standardization of Semantic Web Services-based Network Monitoring Operations

On the Standardization of Semantic Web Services-based Network Monitoring Operations On the Standardization of Semantic Web Services-based Network Monitoring Operations ChenglingZhao^, ZihengLiu^, YanfengWang^ The Department of Information Techonlogy, HuaZhong Normal University; Wuhan,

More information

InfiniteGraph: The Distributed Graph Database

InfiniteGraph: The Distributed Graph Database A Performance and Distributed Performance Benchmark of InfiniteGraph and a Leading Open Source Graph Database Using Synthetic Data Objectivity, Inc. 640 West California Ave. Suite 240 Sunnyvale, CA 94086

More information

Evaluating SPARQL-to-SQL translation in ontop

Evaluating SPARQL-to-SQL translation in ontop Evaluating SPARQL-to-SQL translation in ontop Mariano Rodriguez-Muro, Martin Rezk, Josef Hardi, Mindaugas Slusnys Timea Bagosi and Diego Calvanese KRDB Research Centre, Free University of Bozen-Bolzano

More information