Combining Unstructured, Fully Structured and Semi-Structured Information in Semantic Wikis
|
|
|
- Steven Baldwin
- 10 years ago
- Views:
Transcription
1 Combining Unstructured, Fully Structured and Semi-Structured Information in Semantic Wikis Rolf Sint 1, Sebastian Schaffert 1, Stephanie Stroka 1 and Roland Ferstl 2 1 {firstname.surname}@salzburgresearch.at Salzburg Research Jakob Haringer Str. 5/ Salzburg Austria 2 [email protected] Siemens AG (Siemens IT Solutions and Services) Werner von Siemens-Platz Salzburg Austria Abstract. The growing impact of Semantic Wikis deduces the importance of finding a strategy to store textual articles, semantic metadata and management data. Due to their different characteristics, each data type requires a specialized storing system, as inappropriate storing reduces performance, robustness, flexibility and scalability. Hence, it is important to identify a sophisticated strategy for storing and synchronizing different types of data structures in a way they provide the best mix of the previously mentioned properties. In this paper we compare fully structured, semi-structured and unstructured data and present their typical appliance. Moreover, we discuss how all data structures can be combined and stored for one application and consider three synchronization design alternatives to keep the distributed data storages consistent. Furthermore, we present the semantic wiki KiWi, which uses an RDF triplestore in combination with a relational database as basis for the persistence of data, and discuss its concrete implementation and design decisions. 1 Introduction Although the promise of effective knowledge management has had the industry abuzz for well over a decade, the reality of available systems fails to meet the expectations. The EU-funded project KiWi - Knowledge in a Wiki project sets out to combine the wiki method of collaborative content creation with the technologies of the Semantic Web to bring knowledge management to the next level. Combining a Wiki with Semantic Web technologies results in three types of content: Wiki Articles, which are basically unstructured textual content, Management Data, like authors, creation dates and revisions, and
2 2 Semantic Metadata, which provide flexibility and spreading of the data about KiWi s contents. In this paper we explain the differences of data storage for these data types. We describe our choice of design and illustrate its usefulness for Semantic Social Software Applications. Furthermore, we explain how these three different approaches can be integrated in a single application, which is build with the Java Enterprise Edition (Java EE) 1 platform. We present and discuss three different kinds of data: structured, unstructured and semi-structured. We discuss the weaknesses and strengths of each of them and describe that for a semantic social software application a combination of them brings advantages in form of an improved flexibility and performance. Furthermore, we describe several ways and designs how an application can implement the different ways of persisting data. The main challenge by developing an application which uses different data storages is to define a common interface for the access of data and to guarantee the synchronization of the different data storages. Chapter 2 discusses the benefits, techniques and differences of structured, unstructured and semi-structured data. For this discussion examples for each paradigm are compared: An Apache Lucene full-text index for unstructured data, a relational database for fully structured data and an RDF triplestore for semi-structured data. Chapter 3 describes several design patterns, which were used within KiWi to combine the three different approaches, focusing on the combination of relational databases and RDF triplestores. This chapter tries to answer the question which data should be stored where and discusses the decisions taken by the KiWi project. Chapter 4 gives an overview of related work and chapter 5 summarizes the practical relevance of this approach. 2 Structured, Unstructured and Semi-Structured Data In the semantic wiki KiWi we need all three kinds of data: structured, unstructured and semi-structured. This chapter presents and compares the different forms of data and gives examples and state-of-the-art techniques. Finally, a tabular overview of the different kinds of data structures is given. 2.1 Unstructured Data According to[1], the term unstructured refers to the fact that no identifiable structure within this kind of data is available. Unstrucured data is also described as data, that cannot be stored in rows and columns in a relational database. Storing data in an unstructured form without any defined data schema is a common way of filing information. An example for unstructured data is a document that is archived in a file folder. Other examples are videos and images. 1
3 3 The advantage of unstructured data is, that no additional effort on its classification is necessary. A limitation of this kind of data is, that no controlled navigation within unstructured content is possible. A common technology to search in unstructured text documents is full-text search. The advantage of full-text search is, that it is completely decoupled from the data. This makes it very flexible, because it can be used on every kind of textual data, even if no schema or structure is defined. One limitation of full-text search is that it cannot be used to search for pictures or videos. Full-Text Search can be optimized by generating a full-text index, that increases the performance of a full-text search query. A famous full-text search engine library is Apache Lucene 2. Other examples are MySql 3 and Postgres indixes Fully Structured Data Fig. 1: Sample Table in a Relational Database System Fully structured data follows a predefined schema. An instance of such a schema is some data that conforms to this specification, [2]. A typical example for fully structured data is a relational database system. Designing a database schema is an elaborate process, because a schema has to be defined before the content is created and the database is populated. The schema defines the type and structure of data and its relations. Figure 1 illustrates an Entity Relationship diagram (ER-diagram) and its concrete tables within a RDBMS (relational database management system). The well-defined schema of fully structured data enables efficient data processing and an improved storage and navigation of content, [2, page 122]. The
4 4 cost for high performance and navigation is flexibility and scalability. It is difficult to subsequently extend a previously defined database schema that already contains content. For example, it is not possible to extend a single table row with a new attribute without creating another table column. This is unprofitable for tables that contain thousands of other rows that do not need another attribute. An advantage of relational database applications are the existing tools and web frameworks, which support the development of database-focused applications. For instance, Hibernate 5 and Oracle TopLink 6 are Object/Relational (O/R) Mapping frameworks, which map classes and objects to relational database tables and rows. Moreover, there exist several practical tools for maintenance, management and administration of relational database systems. 2.3 Semi-Structured Data Fig. 2: Sample RDF Graph Semi-structured data is often explained as...schemaless or self-describing, terms that indicate that there is no separate description of the type or structure of the data [2, page 11]. Semi-structured data does not require a schema definition. This does not mean that the definition of a schema is not possible, it is rather optional. The instances do also exist in the case that the schema changes. Furthermore, a schema can also be defined according to already existing instances (posteriori). The types of semi-structured data instances may be defined for a part of the data and it is also possible that a data instance has more than one type[2]. One of the strengths of semi-structured data is... the ability to accommodate variations in structure [2, page 12]. This means that data may be created according to a specification or close to a type. For instance, fields can be duplicated, data can be lacking or there may exist minor changes[2]. Figure 2 illustrates a graph representation of semistructured data. Figure 4 illustrates the same schema as in Figure 3, with the difference that the instance model has an additional property, which is not defined in the schema model
5 5 Fig. 3: RDF Schema (RDFS) and two instances Fig. 4: RDFS and a flexible instance
6 6 A typical example of semi-structured data is XML, which is a language for data representation and exchange on the web. In XML data can be directly encoded and a Document Type Definition (DTD) or XML Schema (XMLS) may define the structure of the XML document[2]. In the research fields of the Semantic Web, knowledge is encoded in Resource Description Framework (RDF) triples[3], which store data in the form of subject, predicate and object nodes. The RDF Schema (RDFS)[4] vocabulary definition language allows the definition of classes and properties. In the World Wide Web RDF is used as a language that provides metadata to web resources. 2.4 Transformation of Data In KiWi, data sometimes needs to be transformed from one structure into another. For instance, fully structured data is converted into unstructured data when a user generates a PDF out of a wiki article and its management data like author, creation date and so forth. It is also possible to convert data from a database into semi-structured data, like an RDF graph. Several modern web applications use RSS feeds, which are generated by reading data of a relational database and provide it in RDF format. On the contrary, it is more complex to transform unstructured information into semi- or fully structured information. KiWi structures textual content with techniques of information extraction and natural language processing. Tags, which describe the content of a text, are automatically extracted out of a wiki article. In this way the unstructured data can be converted into semi-structured data. 2.5 Comparison and relevance for an application It can be summarized, that the high degree of typing enables a better performance and less flexibility. Serge Abiteboul, Peter Buneman and Dan Suciu define several reasons why defining a structure is good for[2]: to optimize query evaluation, to improve storage, to construct indexes, to describe the database content to the user and facilitate query formulation, to proscribe certain updates, and to support strongly typed languages. Table 1 gives an overview over the strengths and weaknesses of the different storing structures in technology fields that may be important in practice. 2.6 Conceptual Federation of Relational Databases and Triplestores To know how to combine a relational database and a triplestore we have to consider what data is stored where. Therefore, we review the strengths and
7 7 Unstructured Fully Structured Semi-Structured Technology Character and Relational XML/RDF binary data database tables Transaction No transaction Matured transaction Transaction management Management management, management, various adapted from RDBMS, no concurrency concurrency techniques not matured Version Versioned as Versioning over Not very common, Management a whole tuples, rows, versioning over tables, etc. triples or graphs is possible Flexibility Very flexible, Schema-dependent, Flexible, tolerant absence of schema rigorous schema schema Scalability Very scalable Scaling DB Schema scaling schema is difficult is simple Robustness - Very robust, New technology, enhancements since not widely spread 30 years Query- Only textual Structured Query Queries over Performance queries possible allows complex joins anonymous nodes are possible Table 1: Comparison of unstructured, fully structured and semi-structured content weaknesses of different data structures and discuss the demand of structure characteristics for specific data sets. A relational database stores fully structured data, which necessarily have a predefined schema. Relational databases provide the application with a high query-performance and fast joins. Vulnerabilities are rare since more than 30 years of research, development and improvement eliminated most of them and increased the robustness. Semi-structured data like RDF data does not have to predefine a schema and is very scalable and flexible. Furthermore, RDF and OWL 7 allow the definition of logical rules and many applications implement an inference layer that infers new triples by reasoning over the existing data set. Thus, data that has a predefined schema, that is sensitive and that is often queried should be stored in a relational database. Data that is added to the application lately (e.g. data for extensions or plug-ins) and data that might be important for reasoning should be stored in the triplestore. Figure 5 provides a quick overview over the division into relational database data and triplestore data. As one can see, the data sets are partially overlapping. 7
8 8 Relational Database Triplestore Sensitive Management Data Data with a predefined schema Versioned Data Core Component Management Data Non-sensitive Data Data that can be access from other applications or agents Automatically or manually generated data during runtime Plug-In & Extension Data Fig. 5: Overlapping data sets stored in the triplestore and in the relational database 3 Data representation in KiWi Combining structured and unstructured data is an often applied strategy in web applications to achieve the advantages of both persistence types. The employment of all three alternatives, however, is uncommon. KiWi is a platform for Semantic Social Software applications, implemented with Java EE technologies. We decided to store data in a semi-structured form, because we wanted to attain a better flexibility and scalability than provided by the structured form. We also wanted to store data in a robust database with good query and join performance. We have to control a big amount of textual content, which needs to be queried for keywords. Hence, we decided to combine unstructured, structured and semi-structured data storage and segmented the data into long textual content (unstructured), core component data (fully structured) and flexible data (semi-structured). For a better clarity, Table 2 visualizes the segmentation. The sets of fully structured and semi-structured data are overlapping, because we represent the non-sensitive core data additionally in the triplestore to get a complete data set that can be provided to other Semantic Web Applications (e.g. Linked Data 8 ). 3.1 Three possible Levels of Synchronization Applications that store data in a triplestore as well as in a relational database have to implement a synchronization mechanism to keep information consistent. Such a synchronization mechanism can be implemented on different layers of an application. 8
9 9 Content Type Example Unstructured Textual Content Wiki Articles, Blog Pages Fully Structured Sensitive Content & ContentItem, System Maintenance Data Core Component Data User data Semi-Structured Non-sensitive Core ContentItem-extending Component Data, Flexible Data, Use Case Data Content & Individual Data Table 2: Persistence alternatives and apportioned content Database Layer Synchronization on the database layer is implemented by forcing a data storage (e.g. database) to update another data storage (e.g. triplestore) when a data item changed. For instance, every time an application writes on a database, the according operation could be executed on the triplestore, which might be hold in the database. This could be implemented using database triggers or Java EE persistence interceptors. Another possibility is that the triplestore is generated automatically from the entries within the database. Hence, the triplestore could be updated regularly. In both variants the database is defined as master and the triplestore is defined as slave. This design is illustrated in Figure 6. This design benefits from high performance and good integration of relational databases into existing software technology stacks (e.g. Java EE). Furthermore, functions provided by a triplestore, like reasoning, are possible, because the data also exists in a semi-structured form. The disadvantage is that this design does not offer the flexibility of semi-structured data, and that the application has read only access to one data storage. An alternative design is a bi-directional trigger synchronisation between relational database and triplestore. The triplestore, as well as the relational database can update each other with database triggers. The advantage of this design is that it allows writing access to both data storages. This design is illustrated in Figure 7. The limitation is, that some updates on the triplestore cannot be processed on the database and must be forbidden to keep consistency. Therefore, this design does not support the full flexibility of semi-structured data, too. O/R Mapping Tool O/R mapping tools provide another layer for synchronisation. This design is illustrated in Figure 8. For instance, the Java Persistence API (JPA) 9 could be extended to persist Java objects in the database as well as in the triplestore. This encloses the translation of JpaQL (JavaPersistenceApi- QueryLanguage) 10 queries into triplestore queries. This approach decouples the
10 10 Fig. 6: Database defined as master and triplestore defined as slave Fig. 7: Triplestore and database update each other persistence layer from the application layer, and, therefore, provides the flexibility of semi-structured data. Thus, additional attributes of an object may be defined during the runtime of an application and persisted in a triplestore. This may be realized using Aspect Oriented Programming (AOP) 11 techniques or dynamic languages like Groovy 12. With this approach, distributed queries over several datasources could be realized. Fig. 8: Extension of the JPA with a triplestore module to guarantee consistency
11 11 Application Layer / Middleware Layer Another alternative to guarantee the synchronisation of data is to implement it in the middleware or application layer. This layer could use normal JpaQL queries for the database as well as SPARQL commands to query the triplestore. This design is illustrated in Figure 9. A different alternative is to provide a general purpose query language for both data stores. In this way, distributed reasoning over the triplestore, as well as over the data in the relational database system could be enabled. This design is illustrated in Figure 10. Fig. 9: Middleware layer which handles the persistence of data Fig. 10: General purpose query language 3.2 Integration of a triplestore in the Java EE stack We decided to choose the Application Layer for synchronization, because it grants us flexibility to improve weaknesses and to enforce the strengths of each data structure type. First, we will give you an overview over the triplestore position inside of KiWi. Figure 11 illustrates the overall structure of KiWi. The combination of triplestore and relational database can be found in the Persistence and Data Model layers. As an RDF triplestore KiWi currently uses Sesame2 13. The relational database connection is enabled through Hibernate with JPA. Storage configurations for relational database and triplestore can be applied with Java annotations. 13
12 12 Fig. 11: KiWi s overall structure, adapted from[5] Transactional synchronization As Table 1 illustrated, transaction management for unstructured and semi-structured data is not very common or matured. Though, storing data in those federated, heterogeneous databases needs to be controlled to avoid states of inconsistency. A global transaction management is the easiest way to administer all three data structure types in terms of their transactions. JBoss Seam[6], Hibernate/JPA[7], and Enterprise Java Beans (EJB)[8] provide us with diverse techniques to control transactions programmatically and declaratively, for example: Java Transaction API, also called JTA14 specifies standard Java interfaces for Java Enterprise Applications implemented by the application server[9]. Seam Transactions extend JTA UserTransactions with useful functionality, for example the registration of a synchronization implementation[6]. EntityManager Transactions are provided by Hibernate/JPA for programmatic transaction management to start and stop transactions explicitly[10]. Programmatic transaction processing requires the definition of a start and end time for the transaction. It allows flexible pre- and post-treatment of the application when the transaction ends. Declarative transaction processing, on the other hand, is simpler than programmatic transaction management, because the transaction start and end time is managed by the container[11]. To control the behaviour before and after a transaction ends in applications using declarative 14
13 13 transaction processing, a synchronization implementation can be registered[9]. In KiWi we use the before-completion phase to synchronize the relational database state with the triplestore state. Thus, updates to both databases will be executed simultaneously at the end of a transaction. Figure 12 illustrates the process. If an update fails, the whole transaction including changes on both databases will be rolled back. A more detailed description of the transaction models in Java Enterprise Applications, the concurrency problems that triplestores must consider and the database synchronization is given in [12]. Tx start read from rel. database or/and triplestore make local changes to the data items reach beforecompletion phase store changes in triplestore triplestore update fails triplestore update succeeds rollback Tx Tx end rel DB update fails rel DB update succeeds commit Tx Tx end Fig. 12: Transactional synchronization process Data Versioning Versioning of unstructured, semi-structured and fully structured data is an important core functionality of KiWi. RDF triple versioning is uncommon and few well-established RDF repositories allow versioning. Sesame2 puts RDF triples internally under version control, but it does not enable undo or redo functions. With the chosen transaction strategy we can easily implement version-control of unstructured, semi-structured and fully structured data. At the end of a transaction, updates for all kinds of data are creates and stored as revisioning and update tables in the relational database. This design was chosen to collect all versioning data in a robust database, to enable easy querying, and, consequently,
14 14 to allow fast undo and redo functionality for all kinds of data. Versioning data has a pre-defined schema that will not be changed in the future. Query & Reasoning With the chosen level of synchronization it is possible to create a query language for all kinds of data. KiWi enables this global querying that interprets to SQL and SPARQL 15 queries. Furthermore, reasoning is not limited to the RDF repository anymore. The interested reader is referred to [13] for a more detailed discussion about this issue. 4 Related Work In the following we provide an overview over implementations of semi-structured data into existing application stacks. Elmo[14] is a Java library for Semantic Web applications that maps Java classes to RDFS/OWL classes. Another implementation of a server which offers access to different representations of data is Virtuoso,... which is a database engine hybrid that combines the functionality of a traditional RDBMS, ORDBMS, virtual database, RDF, XML, free-text, Web Application Server and File Server functionality in a single server product [15]. 5 Conclusion The main advantage of fully structured data is the strong typing which enables high performance and efficiency. On the other hand, unstructured and semistructured data allow a higher degree of flexibility. In this paper we compared unstructured, semi-structured and fully structured information and discussed an application design which combines all three types of data, based on a relational database system combined with an RDF triplestore. We illustrated this design on the concrete implementation of the semantic wiki KiWi. We saw that a challenge for such an application is to avoid states of inconsistency and present three different layers where a synchronisation of data within an application could be implemented: 1 On a low level database layer, 2 On the he O/R mapping layer, and 3 On the application layer. In KiWi the synchronisation of data is implemented on the application layer because it offers database independence and enables the implementation of a common query language for all different data stores. 15
15 15 References 1. Blumberg, R., Atre, S.: The Problem with Unstructured Data. dmreview.com/issues/ / html ( ) (2003) 2. Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web: from relations to semistructured data and XML. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA (1999) 3. Manola, F., Miller, E.: Resource Description Framework (RDF):Concepts and Abstract Syntax. ( ) (2004) 4. Brickley, D., Guha, R.: RDF Vocabulary Description Language 1.0: RDF Schema. ( ) (2004) 5. Schaffert, S., Sint, R., Grünwald, S., Stroka, S.: The KiWi Architecture. (2008) 6. Allen, D.: Seam in Action. Manning Publications Co. Greenwich, CT, USA (2008) 7. Bauer, C.: Java Persistence with Hibernate. Manning Publications Co. Greenwich, CT, USA (2006) 8. DeMichiel, L., Keith, M.: JSR 220: Enterprise JavaBeansTM,Version 3.0. http: //java.sun.com/products/ejb/docs.html ( ) (2006) 9. Cheung, S., Matena, V.: Java Transaction API (JTA). com/javaee/technologies/jta/index.jsphttp://java.sun.com/javaee/ technologies/jta/index.jsp ( ) (2002) 10. : javax.persistence.entitytransaction Interface JavaDoc. javaee/5/docs/api/javax/persistence/entitytransaction.html ( ) (unknown) 11. Connolly, T., Begg, C.: Database Systems: A Practical Approach to Design, Implementation, and Management. Addison Wesley Publishing Company (2005) 12. Stroka, S.: Transaction Management in Federated, Heterogeneous Database Systems for Semantic Social Software Applications. (2009) 13. Francois Bry, Michael Eckert, J.K., Weiand, K.: What the User interacts with: Reflections On Conceptual Models For Semantic Wikis. (2009) 14. Leigh, J.: Elmo User Guide. user-guide/index.html ( ) (2008) 15. Virtuoso: Virtuoso Universal Server. (2009)
K@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
Semantic Stored Procedures Programming Environment and performance analysis
Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski
Performance Evaluation of Java Object Relational Mapping Tools
Performance Evaluation of Java Object Relational Mapping Tools Jayasree Dasari Student(M.Tech), CSE, Gokul Institue of Technology and Science, Visakhapatnam, India. Abstract: In the modern era of enterprise
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint Christian Fillies 1 and Frauke Weichhardt 1 1 Semtation GmbH, Geschw.-Scholl-Str. 38, 14771 Potsdam, Germany {cfillies,
Flattening Enterprise Knowledge
Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it
The Ontological Approach for SIEM Data Repository
The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian
Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related
Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related Summary Xiangzhe Li Nowadays, there are more and more data everyday about everything. For instance, here are some of the astonishing
Relational Database Basics Review
Relational Database Basics Review IT 4153 Advanced Database J.G. Zheng Spring 2012 Overview Database approach Database system Relational model Database development 2 File Processing Approaches Based on
1. Introduction... 1 1.1. What is Slice?... 1 1.2. Background... 1 1.3. Why Slice?... 1 1.4. Purpose of this Document... 1 1.5. Intended Audience...
Slice Documentation Slice Documentation 1. Introduction... 1 1.1. What is Slice?... 1 1.2. Background... 1 1.3. Why Slice?... 1 1.4. Purpose of this Document... 1 1.5. Intended Audience... 1 2. Features
Oracle Application Development Framework Overview
An Oracle White Paper June 2011 Oracle Application Development Framework Overview Introduction... 1 Oracle ADF Making Java EE Development Simpler... 2 THE ORACLE ADF ARCHITECTURE... 3 The Business Services
Data Integration Hub for a Hybrid Paper Search
Data Integration Hub for a Hybrid Paper Search Jungkee Kim 1,2, Geoffrey Fox 2, and Seong-Joon Yoo 3 1 Department of Computer Science, Florida State University, Tallahassee FL 32306, U.S.A., [email protected],
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo
DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF
2. Background on Data Management. Aspects of Data Management and an Overview of Solutions used in Engineering Applications
2. Background on Data Management Aspects of Data Management and an Overview of Solutions used in Engineering Applications Overview Basic Terms What is data, information, data management, a data model,
Big Data Analytics. Rasoul Karimi
Big Data Analytics Rasoul Karimi Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 1 Introduction
<Insert Picture Here> Java EE 7. Linda DeMichiel Java EE Platform Lead
1 Java EE 7 Linda DeMichiel Java EE Platform Lead The following is intended to outline our general product direction. It is intended for information purposes only, and may not be
Java EE 7: Back-End Server Application Development
Oracle University Contact Us: 01-800-913-0322 Java EE 7: Back-End Server Application Development Duration: 5 Days What you will learn The Java EE 7: Back-End Server Application Development training teaches
Business Process Management with @enterprise
Business Process Management with @enterprise March 2014 Groiss Informatics GmbH 1 Introduction Process orientation enables modern organizations to focus on the valueadding core processes and increase
Case Studies of Running the Platform. NetBeans UML Servlet JSP GlassFish EJB
September Case Studies of Running the Platform NetBeans UML Servlet JSP GlassFish EJB In this project we display in the browser the Hello World, Everyone! message created in the session bean with servlets
AllegroGraph. a graph database. Gary King [email protected]
AllegroGraph a graph database Gary King [email protected] Overview What we store How we store it the possibilities Using AllegroGraph Databases Put stuff in Get stuff out quickly safely Stuff things with
HadoopRDF : A Scalable RDF Data Analysis System
HadoopRDF : A Scalable RDF Data Analysis System Yuan Tian 1, Jinhang DU 1, Haofen Wang 1, Yuan Ni 2, and Yong Yu 1 1 Shanghai Jiao Tong University, Shanghai, China {tian,dujh,whfcarter}@apex.sjtu.edu.cn
1 What Are Web Services?
Oracle Fusion Middleware Introducing Web Services 11g Release 1 (11.1.1) E14294-04 January 2011 This document provides an overview of Web services in Oracle Fusion Middleware 11g. Sections include: What
Graph Database Performance: An Oracle Perspective
Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective
1 File Processing Systems
COMP 378 Database Systems Notes for Chapter 1 of Database System Concepts Introduction A database management system (DBMS) is a collection of data and an integrated set of programs that access that data.
Using EMC Documentum with Adobe LiveCycle ES
Technical Guide Using EMC Documentum with Adobe LiveCycle ES Table of contents 1 Deployment 3 Managing LiveCycle ES development assets in Documentum 5 Developing LiveCycle applications with contents in
Framework Adoption for Java Enterprise Application Development
Framework Adoption for Java Enterprise Application Development Clarence Ho Independent Consultant, Author, Java EE Architect http://www.skywidesoft.com [email protected] Presentation can be downloaded
Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies
Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative
1 What Are Web Services?
Oracle Fusion Middleware Introducing Web Services 11g Release 1 (11.1.1.6) E14294-06 November 2011 This document provides an overview of Web services in Oracle Fusion Middleware 11g. Sections include:
Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.
Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December
JBoss Enterprise Middleware
JBoss Enterprise Middleware The foundation of your open source middleware reference architecture Presented By : Sukanta Basak Red Hat -- Vital Statistics Headquarters in Raleigh, NC Founded in 1993 Over
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications Gaël de Chalendar CEA LIST F-92265 Fontenay aux Roses [email protected] 1 Introduction The main data sources
2012 LABVANTAGE Solutions, Inc. All Rights Reserved.
LABVANTAGE Architecture 2012 LABVANTAGE Solutions, Inc. All Rights Reserved. DOCUMENT PURPOSE AND SCOPE This document provides an overview of the LABVANTAGE hardware and software architecture. It is written
Service Oriented Architecture
Service Oriented Architecture Charlie Abela Department of Artificial Intelligence [email protected] Last Lecture Web Ontology Language Problems? CSA 3210 Service Oriented Architecture 2 Lecture Outline
Modern Databases. Database Systems Lecture 18 Natasha Alechina
Modern Databases Database Systems Lecture 18 Natasha Alechina In This Lecture Distributed DBs Web-based DBs Object Oriented DBs Semistructured Data and XML Multimedia DBs For more information Connolly
WHITE PAPER DATA GOVERNANCE ENTERPRISE MODEL MANAGEMENT
WHITE PAPER DATA GOVERNANCE ENTERPRISE MODEL MANAGEMENT CONTENTS 1. THE NEED FOR DATA GOVERNANCE... 2 2. DATA GOVERNANCE... 2 2.1. Definition... 2 2.2. Responsibilities... 3 3. ACTIVITIES... 6 4. THE
XpoLog Competitive Comparison Sheet
XpoLog Competitive Comparison Sheet New frontier in big log data analysis and application intelligence Technical white paper May 2015 XpoLog, a data analysis and management platform for applications' IT
Data Modeling for Big Data
Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes
A generic approach for data integration using RDF, OWL and XML
A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6
Developing Web 3.0. Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/
Developing Web 3.0 Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/ Henry Story http://blogs.sun.com/bblfish 2007 JavaOne SM Conference Session
City Data Pipeline. A System for Making Open Data Useful for Cities. [email protected]
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
OWB Users, Enter The New ODI World
OWB Users, Enter The New ODI World Kulvinder Hari Oracle Introduction Oracle Data Integrator (ODI) is a best-of-breed data integration platform focused on fast bulk data movement and handling complex data
White Paper: 1) Architecture Objectives: The primary objective of this architecture is to meet the. 2) Architecture Explanation
White Paper: 1) Architecture Objectives: The primary objective of this architecture is to meet the following requirements (SLAs). Scalability and High Availability Modularity and Maintainability Extensibility
BUSINESS VALUE OF SEMANTIC TECHNOLOGY
BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director
Oracle WebLogic Foundation of Oracle Fusion Middleware. Lawrence Manickam Toyork Systems Inc www.toyork.com http://ca.linkedin.
Oracle WebLogic Foundation of Oracle Fusion Middleware Lawrence Manickam Toyork Systems Inc www.toyork.com http://ca.linkedin.com/in/lawrence143 History of WebLogic WebLogic Inc started in 1995 was a company
GIS Databases With focused on ArcSDE
Linköpings universitet / IDA / Div. for human-centered systems GIS Databases With focused on ArcSDE Imad Abugessaisa [email protected] 20071004 1 GIS and SDBMS Geographical data is spatial data whose
Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.
Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE
The Open Source CMS. Open Source Java & XML
The Open Source CMS Store and retrieve Classify and organize Version and archive management content Edit and review Browse and find Access control collaboration publishing Navigate and show Notify Aggregate
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
Oracle Data Integrator: Administration and Development
Oracle Data Integrator: Administration and Development What you will learn: In this course you will get an overview of the Active Integration Platform Architecture, and a complete-walk through of the steps
Agile Business Suite: a 4GL environment for.net developers DEVELOPMENT, MAINTENANCE AND DEPLOYMENT OF LARGE, COMPLEX BACK-OFFICE APPLICATIONS
Agile Business Suite: a 4GL environment for.net developers DEVELOPMENT, MAINTENANCE AND DEPLOYMENT OF LARGE, COMPLEX BACK-OFFICE APPLICATIONS In order to ease the burden of application lifecycle management,
From Databases to Natural Language: The Unusual Direction
From Databases to Natural Language: The Unusual Direction Yannis Ioannidis Dept. of Informatics & Telecommunications, MaDgIK Lab University of Athens, Hellas (Greece) [email protected] http://www.di.uoa.gr/
<Insert Picture Here> Building a Complex Web Application Using ADF and Siebel
Building a Complex Web Application Using ADF and Siebel Nishit Rao Group Product Manager Fusion Middleware Oracle Dhiraj Soni Technical Architect GIT Apps Engineering Oracle The following
Portals, Portlets & Liferay Platform
Portals, Portlets & Liferay Platform Repetition: Web Applications and Model View Controller (MVC) Design Pattern Web Applications Frameworks in J2EE world Struts Spring Hibernate Data Service Java Server
Explorer's Guide to the Semantic Web
Explorer's Guide to the Semantic Web THOMAS B. PASSIN 11 MANNING Greenwich (74 w. long.) contents preface xiii acknowledgments xv about this booh xvii The Semantic Web 1 1.1 What is the Semantic Web? 3
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
A Near Real-Time Personalization for ecommerce Platform Amit Rustagi [email protected]
A Near Real-Time Personalization for ecommerce Platform Amit Rustagi [email protected] Abstract. In today's competitive environment, you only have a few seconds to help site visitors understand that you
Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform
Communiqué 4 Standardized Communiqué 4 - fully implementing the JCR (JSR 170) Content Repository Standard, managing digital business information, applications and processes through the web. Communiqué
Customer Bank Account Management System Technical Specification Document
Customer Bank Account Management System Technical Specification Document Technical Specification Document Page 1 of 15 Table of Contents Contents 1 Introduction 3 2 Design Overview 4 3 Topology Diagram.6
Detection and Elimination of Duplicate Data from Semantic Web Queries
Detection and Elimination of Duplicate Data from Semantic Web Queries Zakia S. Faisalabad Institute of Cardiology, Faisalabad-Pakistan Abstract Semantic Web adds semantics to World Wide Web by exploiting
Introduction: Database management system
Introduction Databases vs. files Basic concepts Brief history of databases Architectures & languages Introduction: Database management system User / Programmer Database System Application program Software
Advantages of XML as a data model for a CRIS
Advantages of XML as a data model for a CRIS Patrick Lay, Stefan Bärisch GESIS-IZ, Bonn, Germany Summary In this paper, we present advantages of using a hierarchical, XML 1 -based data model as the basis
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
EDG Project: Database Management Services
EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 [email protected] http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in
Ontology and automatic code generation on modeling and simulation
Ontology and automatic code generation on modeling and simulation Youcef Gheraibia Computing Department University Md Messadia Souk Ahras, 41000, Algeria [email protected] Abdelhabib Bourouis
Introduction to Database Systems. Chapter 1 Introduction. Chapter 1 Introduction
Introduction to Database Systems Winter term 2013/2014 Melanie Herschel [email protected] Université Paris Sud, LRI 1 Chapter 1 Introduction After completing this chapter, you should be able to:
The Sierra Clustered Database Engine, the technology at the heart of
A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel
Requirements Design Implementation. Software Architectures. Components Software Component Architecture. DSSA: Domain-Specific Software Architectures
Frameworks 1. Objectives... 2 2. Frameworks... 3 3. Classification... 3 4. Example: Components for Java (BC4J)... 6 5. Existing Frameworks... 9 6. Presistence Frameworks... 11 7. Content Management System
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web Corey A Harper University of Oregon Libraries Tel: +1 541 346 1854 Fax:+1 541 346 3485 [email protected]
Taking Subversion to a Higher Level. Branching/Merging Support. Component Management Support. And More
Taking Subversion to a Higher Level Branching/Merging Support Component Management Support And More About Impact CM Impact CM is a Service AddOn that facilitates software configuration management (CM)
Database Management. Chapter Objectives
3 Database Management Chapter Objectives When actually using a database, administrative processes maintaining data integrity and security, recovery from failures, etc. are required. A database management
Research Article. ISSN 2347-9523 (Print) *Corresponding author Lili Wang Email: [email protected]
Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2015; 3(4B):424-428 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)
Fast and Easy Delivery of Data Mining Insights to Reporting Systems
Fast and Easy Delivery of Data Mining Insights to Reporting Systems Ruben Pulido, Christoph Sieb [email protected], [email protected] Abstract: During the last decade data mining and predictive
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA J.RAVI RAJESH PG Scholar Rajalakshmi engineering college Thandalam, Chennai. [email protected] Mrs.
Big Systems, Big Data
Big Systems, Big Data When considering Big Distributed Systems, it can be noted that a major concern is dealing with data, and in particular, Big Data Have general data issues (such as latency, availability,
Oracle Identity Analytics Architecture. An Oracle White Paper July 2010
Oracle Identity Analytics Architecture An Oracle White Paper July 2010 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may
Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute
Integration Platforms Problems and Possibilities *
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 8, No 2 Sofia 2008 Integration Platforms Problems and Possibilities * Hristina Daskalova, Tatiana Atanassova Institute of Information
Course 6232A: Implementing a Microsoft SQL Server 2008 Database
Course 6232A: Implementing a Microsoft SQL Server 2008 Database About this Course This five-day instructor-led course provides students with the knowledge and skills to implement a Microsoft SQL Server
- a Humanities Asset Management System. Georg Vogeler & Martina Semlak
- a Humanities Asset Management System Georg Vogeler & Martina Semlak Infrastructure to store and publish digital data from the humanities (e.g. digital scholarly editions): Technically: FEDORA repository
The EMSX Platform. A Modular, Scalable, Efficient, Adaptable Platform to Manage Multi-technology Networks. A White Paper.
The EMSX Platform A Modular, Scalable, Efficient, Adaptable Platform to Manage Multi-technology Networks A White Paper November 2002 Abstract: The EMSX Platform is a set of components that together provide
XpoLog Center Suite Data Sheet
XpoLog Center Suite Data Sheet General XpoLog is a data analysis and management platform for Applications IT data. Business applications rely on a dynamic heterogeneous applications infrastructure, such
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
The Oracle Fusion Development Platform
The Oracle Fusion Development Platform Juan Camilo Ruiz Senior Product Manager Development Tools 1 The preceding is intended to outline our general product direction. It is intended for information purposes
Publishing Linked Data Requires More than Just Using a Tool
Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,
Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software
SAP Technology Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software Table of Contents 4 Seeing the Big Picture with a 360-Degree View Gaining Efficiencies
Document management and exchange system supporting education process
Document management and exchange system supporting education process Emil Egredzija, Bozidar Kovacic Information system development department, Information Technology Institute City of Rijeka Korzo 16,
Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system
Introduction: management system Introduction s vs. files Basic concepts Brief history of databases Architectures & languages System User / Programmer Application program Software to process queries Software
Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology
Semantic Knowledge Management System Paripati Lohith Kumar School of Information Technology Vellore Institute of Technology University, Vellore, India. [email protected] Abstract The scholarly activities
This training is targeted at System Administrators and developers wanting to understand more about administering a WebLogic instance.
This course teaches system/application administrators to setup, configure and manage an Oracle WebLogic Application Server, its resources and environment and the Java EE Applications running on it. This
