Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD

Size: px
Start display at page:

Download "Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD"

Transcription

1 Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD Boris Mocialov (H ) MSc Software Engineering Heriot-Watt University, Edinburgh April 5,

2 1 Introduction The purpose of this essay is to give an overview consisting of objectives of each field and technologies being of the two scientific areas, namely, the Big Data Management and Semantic Web Technologies. A short example will follow after overview of each area that would provide a context associated with that particular area. After an introduction, the essay will try to convince the reader that both areas are actually related by presenting some of the recent applications of the techniques of each field and that this relationship is exploited in the same way every time the combination of the two is used in practice. An additional section at the end will provide more detailed overview over one particular application of the both fields. 2 Introduction to Big Data Management Big Data is a term that describes possibly inconsistent uncertain data that resides in large volumes, different forms and is being produced at high speed. Given such description, tools that operate upon and manage Big Data should capture, process, and analyse the data accordingly to overcome mentioned difficulties. Big Data Management incorporates such tools and techniques to overcome these difficulties. Data Science Series (2012) gives an extended list of possible benefits for both businesses and customers of turning to the Big Data resources. As it can be seen from the list that Big Data can be advantageous to any company independent of the sector or niche it occupies as new opportunities in data-utilisation can be discovered and exploited. 2.1 Objectives As it had been said, Big Data Management is supposed to utilise appropriate tools and techniques to make it possible to capture, process, and analyse data that is fast, large, uncertain, and heterogeneous. Chen and Zhang (2014) present an exhaustive list of challenges posed by the Big Data for computing. The list includes storage problems, I/O speed, network throughput, data 2

3 curation, and processing power as an umbrella over more detailed challenges. All the listed challenges are, indeed, the objectives for the Big Data Management to reach overall aim to be able to store, process, and analyse large amounts of different uncertain data. 2.2 Technologies Given objectives and current challenges for the Big Data Management, Chen and Zhang (2014) discuss possible improvement approaches to allow for better handling of the Big Data. For instance, to improve upon inconsistent, incomplete, and/or noisy data, cleaning, integration, and transformation can be considered. The challenge becomes to perform all these tasks life - as data becomes available. One of the solutions for the fast processing is of course the parallel handling of the data. The current solution to Big Data Management that possibly comes from distributed sources is NoSQL databases. NoSQL databases are more of a philosophy rather than a technique or a tool. It describes a set of approaches the Big Data Management can be accomplished. For instance, some NoSQL databases may or may not use relation, some do not use SQL management language, and some may employ schema-free, schemaless, or flexible schema policies. In addition, different approaches to store data are being used. For example, some systems use key-value storage system, some variation is keydocument system, some turn to column-families type or even graph systems. What all the NoSQL databases have in common is their ability and devotion to dynamic schema as an underlying feature that serves as an advantage when dealing with different data. Another common factor is the separation between the storage and management of the data. While storage happens in one of the previously mentioned fashions, the management is implemented in the application layer, which means that when some dirty data is being extracted from the system, it is then dumped onto the application layer that is supposed to deal with what should be processed further and what is not needed for this particular extraction. Some of the state of the art approaches to Big Data Management that Chen and Zhang (2014) discusses include statistical analysis of the data at hand, data mining approaches and the use of neural networks together with machine learning algorithms to discover patterns in different data and cluster discovered items together to create classes. 3

4 2.3 Big Data Example As it can become apparent from description above, Big Data can provide additional revenues to any company that deals with data. Apart from monetary interest, Big Data can provide new knowledge to science as there is potential value hidden inside of any data. To present a simple, but powerful example, it is worth to mention the notion of smart cities. Data Science Series (2012) provides this as an example of Big Data as well, but smart cities can also be viewed as an encapsulation of services, such as health service, public service, transport service, and more. In the case of health services, patients can have their personalised doctor on their wrist that sends data to an actual doctor or even an AI that records data every moment of patients life and provides clues directly to the person on how to improve upon his/her life. In case of public services, for example, can monitor traffic developments, people gatherings, forums, etc. and act upon this data for the good of the citizens. As for the transportation services, public transport can cooperate and provide services only to the places where it is needed. 3 Introduction to Semantic Web Technologies Semantic web is an idea of adding meaning to the things that are found on the World Wide Web. The purpose of the added meaning is to allow machines to reason about these things. 3.1 Objectives Shadbolt et al. (2006) writes that e-science - the source of the need for the technology, is a major driver for the semantic web for reasons of data integration between heterogeneous data sets that come from different scientific communities. Such integration can be achieved through the use of ontologies - standard for formal namings/definitions/properties/relations of entities within one particular domain. Rationale behind integration of data from wide ranges of fields is inspired by the movement towards interdisciplinary aspects of the science - fusion of different disciplines for the pursuit of acquiring new shared knowledge. Therefore, certain standards should be enforced to allow for distributed and heteroge- 4

5 neous data to merge into meaningful unambiguous knowledge in any domain. 3.2 Technologies The key technologies (rather techniques) in semantic web are URIs that identify various resources. Given a URI to a resource, anyone can tap onto it. URIs is a building block of RDFs that describes every part of a subject-predicate-object triple that, in turn, relates subject to an object. When building an application, RDF vocabulary can be used to specify domain of predicates used within that application. RDF vocabulary serves as an abstraction over distinct RDFs and provides one-point entry for the vocabularies to be linked. RDF Schema (RDFS) is even further abstraction of RDF that provides description of groups of related resources. While RDF Vocabulary is optional, RDF Schema is mandatory. Triple stores, further, extend individual RDFS to provide facilities for richer RDF content. To provide a standardised access to triple stores, SPARQL language had been developed to query the underlying RDFs. OWL languages provide means for adding extra information into RDFS to make the knowledge more representative. In addition, OWL languages support ontology consistency checking (Shadbolt et al., 2006). Switching to tools, it is worth mention Protege, an ontology editor and validator. 3.3 Semantic Web Application Example A commonly cited example of semantic web applications is, perhaps, e-science. As ontologies can be distributed and combined by such technologies as, for example, OWL languages, e-sciences can work in distributed fashion by synchronising their findings and build common knowledge while maintaining a common ontology that would define the domain and range of the research both parties are engaged in. As long as common ontology is defined and obeyed during synchronisation, both parties can make changes to their underlying models, terms and definition as they wish (as local requirements/laws may enforce such differences). 4 Relationship between Big Data and Semantic Web Areas, identified by Data Science Series (2012) had been considered to identify relationship between the two fields. 5

6 4.1 Semantic Link Network for Big Data in Multimedia Paper by Liu et al. (2014) uses a particular approach to organise multimedia resources with the use of texts and surrounding texts. The aim of the project is to give meaning to different multimedia resources and allow users to search related resources and to be able to gain a more comprehensive meaning of a particular resource given its relationships. Authors main assumption is that the manual annotations can be considered as a reliable source of semantics. Also, it is mentioned that ontologies can describe multimedia semantics. The aim of the paper becomes to bridge a gap between ontologies and manually given annotations. Motivation for the paper is to provide reasoning to be able to derive the implicit knowledge from information. Common applications for the derivation of implicit knowledge can be found useful in many areas, such as surveillance, sports, or Internet of Things. Semantic Link Network method is employed to associate relationships between resources. Since every aspect in the Semantic Web is a triple, as it had been pointed out earlier, mapping can be accomplished without any considerable modifications. During the presentation of the results, certain heuristics were applied to filter the underlying assumptions of the model even further. As a result, with the use of ontologies and tags along with textual descriptions, semantic relatedness had been achieved between multimedia items accurately and robustly. 4.2 Personalised Medicine with Big Data and Semantic Web Technologies Panahiazar et al. (2014) considers a patient, who requires personalising health care plan. To accomplish this requirement, a health care system has to implement a new infrastructure that would allow live delivery of patient data directly into the hands of a professional. The other side of the equation would allow health care systems to make better decisions about their patients based on the data from all the patients. The paper discusses an approach towards personalised health care using big data and Semantic Web technologies. 6

7 Smart data notion is introduced into the context of health care as a fusion between the Big Data and Semantic Web. The Big Data part of the smart data deals with accessing and processing large volumes of homogeneous and heterogeneous data about every single patient. Since the data is not structured most of the time, Semantic Web technologies come into play and are used to annotate various concepts. 4.3 Information and Data Sharing in Chemical Sciences Bird and Frey (2013) provide in-detail rationale behind the importance of data and knowledge sharing in the chemical sciences. e-reasearch is a direct consequence of the expansion of available to researchers data. As more work power is required to process the available data, the more need emerges in use of distributed collaborations, so that collaborative bodies can tackle problem of Big Data in sciences. In addition to workforce, scientists depend upon each other s work more than ever. Single-entry database solutions are not feasible to accommodate for all the research centres and universities. Therefore, a distributed approach must be taken. Although the distributed approach is feasible serving as a boilerplate for all the research happening in one field, additional infrastructure should be in place to allow discovery, browsing, documentation, etc. This would in turn allow for the provenance of the data, so that the initial baseline can be frozen and not changed any more after it had been shared. For such a system it would be important to use a controlled vocabulary that would ensure that all the parties belonging to the system use that vocabulary when describing certain aspects of the research. 4.4 Linking Smart Cities Data Yet another example comes from Celino et al. (2012), who report on the implementation of an application that engages users to provide information about a city to fix inconsistencies in automatic inferences made by reasoning software regarding a specific ontology. It had been noted from similar applications that users are willing to provide information if the application supports GWAP paradigm. In other words it can be said that the crowd can foster the connection between the Big Data and the Semantic Web Technologies given appropriate infrastructure. Author also notes that similar works had been done that covered the whole Semantic Web life-cycle rather than the fine-tuning part. 7

8 5 Conclusion Although both the Big Data and Semantic Web Technologies can be seen as two different areas of research, both are applied to real-life certain problems as it had been described in Section 4. Applications converge to a similar aim, namely, process and give meaning to the Big Data generated by the means of embedded technology. In addition, it can be seen that the main focus of applications of the both technologies is knowledge, may it be for profit or for the discovery of more knowledge. Therefore, it is worth to say that both areas should progress further by giving meaning to the unstructured, fast, and uncertain data around us. 8

9 6 Semantic Web technologies for the big data in life sciences Wu and Yamaguchi (2014) present a survey of big data in life sciences with semantic extension. The paper s aim is to enable investigation of effects chemicals on biological systems. Additional data sets are required to be able to accomplish that. The problem emerges when data sources contain different or new unseen data types and different formats of underlying data. To be able to use such data sources, they must be integrated, eliminating thus inconsistencies. To accomplish the task of data integration, considerable knowledge about that data is necessary to find what can be integrated and what cannot or should not. The author points out that the main problems in this context, as was also pointed out in the Section 2, are the volume and the rate of the generated data. The paper, thus, discusses the issue of how Semantic Web Technologies can solve the general problems of the Big Data Management that were outlined in Section 2. The paper later describes the technologies of the Semantic Web that were also listed previously in Section 3.2 along with examples for better visualisation of each technology. In addition to the previously described technologies, the paper presents some additional ones, for instance, linked data, triple stores, and triple stores in the cloud. Linked data tries to incorporate all data from World Wide Web into a single database and to make all the data semantically related in some way. Linked data uses the same basic technologies that were described previously for the Semantic Web. The basic idea is to allow connectionist approach to world. A simple example of that would be to give relevant related recommendations to users that are viewing some certain part of the web or searching for some particular information. Triple store is simply a database for all the triples. The triple store must allow for fast query execution, be scalable, and have a low load cost to be highly-operational. Triple store in the cloud is yet another paradigm that would allow users to connect to a cloud and, from there, use data or applications that are available on that cloud. Cloud computing can provide such services as: Data as a Service that would give access to the current data, Software as a Service that would allow users to use software instance from the cloud, Platforms as a Service that would allow users to exploit dedicated to 9

10 them area on the cloud to execute and test their software, and Infrastructure as a Service that would allow users to utilise execution power of the servers that host the cloud. Author then presents some of the examples of the technologies that offer triple stores in the cloud for scientific purposes. One of the major challenges for the field of the Semantic Web is that it was not designed to serve the Big Data requirements. To turn things around, additional concepts were introduced into the Semantic Web, such as: RDF and RDF Schema and/or OWL on top of RDF. The current issues that still persist in the field of Semantic Web is that the techniques cannot deal with the fast data and large data. Therefore, external solutions are sometimes employed to accommodate for the unpredictability of the Big Data. The author concludes saying that effective data processing platforms are needed to be able to process and share the data, especially in the research, as it had been pointed in Section 4.3. In addition, shared data must remain secure at all times. To increase the processing performance of the Big Data processing, parallel computing seem to provide some solutions in this area with such tools as, for example, Hadoop. In conclusion, the paper does extend the previously discussed chapter of relationships between two fields and is therefore another proof of that the both fields do cooperate when dealing with real-life problems concerning Big Data processing and analysis. 10

11 References Colin L Bird and Jeremy G Frey. Chemical information matters: an e-research perspective on information and data sharing in the chemical sciences. Chemical Society reviews, 42(16): , Aug ISSN doi: /C3CS60050E. I Celino, S Contessa, M Corubolo, and D Dell Aglio. Urbanmatch-linking and improving smart cities data. LDOW, URL files/publications/ldow2012-paper-10.pdf. CLP Chen and CY Zhang. Data-intensive applications, challenges, techniques and technologies: A survey on big data. Information Sciences, doi: /j. ins URL S Data Science Series. Ten practical big data benefits URL Y Liu, L Chen, X Luo, L Mei, C Hu, and Z Xu. Semantic link network based model for organizing multimedia big data. IEEE Transactions on..., URL computer.org/csdl/trans/ec/preprint/ pdf. Maryam Panahiazar, Vahid Taslimitehrani, Ashutosh Jadhav, and Jyotishman Pathak. Empowering personalized medicine with big data and semantic web technology: Promises, challenges, and use cases. Proceedings:... IEEE International Conference on Big Data. IEEE International Conference on Big Data, 2014: , Oct doi: /BigData N Shadbolt, W Hall, and T Berners-Lee. The semantic web revisited. Intelligent Systems, URL Hongyan Wu and Atsuko Yamaguchi. Semantic web technologies for the big data in life sciences. BioScience Trends, 8(4): , doi: /bst

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

Standards for Big Data in the Cloud

Standards for Big Data in the Cloud Standards for Big Data in the Cloud International Cloud Symposium 15/10/2013 Carola Carstens (Project Officer) DG CONNECT, Unit G3 Data Value Chain European Commission Outline 1) Data Value Chain Unit

More information

Industry 4.0 and Big Data

Industry 4.0 and Big Data Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and

More information

Primary Key Associates Limited

Primary Key Associates Limited is at the core of Primary Key Associates work Our approach to analytics In this paper Andrew Lea, our Technical Director in charge of, describes some of the paradigms, models, and techniques we have developed

More information

Ampersand and the Semantic Web

Ampersand and the Semantic Web Ampersand and the Semantic Web The Ampersand Conference 2015 Lloyd Rutledge The Semantic Web Billions and billions of data units Triples (subject-predicate-object) of URI s Your data readily integrated

More information

Service Road Map for ANDS Core Infrastructure and Applications Programs

Service Road Map for ANDS Core Infrastructure and Applications Programs Service Road Map for ANDS Core and Applications Programs Version 1.0 public exposure draft 31-March 2010 Document Target Audience This is a high level reference guide designed to communicate to ANDS external

More information

DISIT Lab, competence and project idea on bigdata. reasoning

DISIT Lab, competence and project idea on bigdata. reasoning DISIT Lab, competence and project idea on bigdata knowledge modeling, OD/LD and reasoning Paolo Nesi Dipartimento di Ingegneria dell Informazione, DINFO Università degli Studi di Firenze Via S. Marta 3,

More information

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING Exploration on Service Matching Methodology Based On Description Logic using Similarity Performance Parameters K.Jayasri Final Year Student IFET College of engineering nishajayasri@gmail.com R.Rajmohan

More information

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12

More information

bigdata Managing Scale in Ontological Systems

bigdata Managing Scale in Ontological Systems Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural

More information

Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics

Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics BY FRANÇOYS LABONTÉ GENERAL MANAGER JUNE 16, 2015 Principal partenaire financier WWW.CRIM.CA ABOUT CRIM Applied research

More information

Towards a reference architecture for Semantic Web applications

Towards a reference architecture for Semantic Web applications Towards a reference architecture for Semantic Web applications Benjamin Heitmann 1, Conor Hayes 1, and Eyal Oren 2 1 firstname.lastname@deri.org Digital Enterprise Research Institute National University

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

Chapter 11 Mining Databases on the Web

Chapter 11 Mining Databases on the Web Chapter 11 Mining bases on the Web INTRODUCTION While Chapters 9 and 10 provided an overview of Web data mining, this chapter discusses aspects of mining the databases on the Web. Essentially, we use the

More information

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Big Data and Semantic Web in Manufacturing Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Outline Big data in Manufacturing Big data Analytics Semantic web technologies Case

More information

Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392. Research Article. E-commerce recommendation system on cloud computing

Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392. Research Article. E-commerce recommendation system on cloud computing Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2015, 7(3):1388-1392 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 E-commerce recommendation system on cloud computing

More information

SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA

SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA J.RAVI RAJESH PG Scholar Rajalakshmi engineering college Thandalam, Chennai. ravirajesh.j.2013.mecse@rajalakshmi.edu.in Mrs.

More information

Big Data with Rough Set Using Map- Reduce

Big Data with Rough Set Using Map- Reduce Big Data with Rough Set Using Map- Reduce Mr.G.Lenin 1, Mr. A. Raj Ganesh 2, Mr. S. Vanarasan 3 Assistant Professor, Department of CSE, Podhigai College of Engineering & Technology, Tirupattur, Tamilnadu,

More information

Annotea and Semantic Web Supported Collaboration

Annotea and Semantic Web Supported Collaboration Annotea and Semantic Web Supported Collaboration Marja-Riitta Koivunen, Ph.D. Annotea project Abstract Like any other technology, the Semantic Web cannot succeed if the applications using it do not serve

More information

DISCOVERING RESUME INFORMATION USING LINKED DATA

DISCOVERING RESUME INFORMATION USING LINKED DATA DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India sic@klyuniv.ac.in 2 Department of Computer

More information

Big Data Integration: A Buyer's Guide

Big Data Integration: A Buyer's Guide SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology

More information

Semantically Enhanced Web Personalization Approaches and Techniques

Semantically Enhanced Web Personalization Approaches and Techniques Semantically Enhanced Web Personalization Approaches and Techniques Dario Vuljani, Lidia Rovan, Mirta Baranovi Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, HR-10000 Zagreb,

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

RDF graph Model and Data Retrival

RDF graph Model and Data Retrival Distributed RDF Graph Keyword Search 15 2 Linked Data, Non-relational Databases and Cloud Computing 2.1.Linked Data The World Wide Web has allowed an unprecedented amount of information to be published

More information

Semantic Web Success Story

Semantic Web Success Story Semantic Web Success Story Practical Integration of Semantic Web Technology Chris Chaulk, Software Architect EMC Corporation 1 Who is this guy? Software Architect at EMC 12 years, Storage Management Software

More information

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning

More information

Semantic Web Development in China

Semantic Web Development in China Semantic Web Development in China Outline Web development in China Semantic Web communities in China Semantic Web projects in China IODT from IBM Research China Falcon from Southeast University APEX from

More information

a Data Science initiative @ Univ. Piraeus [GR]

a Data Science initiative @ Univ. Piraeus [GR] a Data Science initiative @ Univ. Piraeus [GR] The Data Science Lab members June 2015 What is Data Science source: quora.com! Looking at data! Tools and methods used to analyze large amounts of data! Anything

More information

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors... Cloud-based Spatial Data Infrastructures for Smart Cities Geospatial World Forum 2015 Hans Viehmann Product Manager EMEA ORACLE Corporation Smart Cities require Geospatial Data Providing services to citizens,

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

AppSymphony White Paper

AppSymphony White Paper AppSymphony White Paper Secure Self-Service Analytics for Curated Digital Collections Introduction Optensity, Inc. offers a self-service analytic app composition platform, AppSymphony, which enables data

More information

Mining the Web of Linked Data with RapidMiner

Mining the Web of Linked Data with RapidMiner Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

Semantic Stored Procedures Programming Environment and performance analysis

Semantic Stored Procedures Programming Environment and performance analysis Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski

More information

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum Big Data Analysis John Domingue (STI International and The Open University) Project co-funded by the European Commission within the 7th Framework Program (Grant Agreement No. 257943) 1 The Data landscape

More information

Workprogramme 2014-15

Workprogramme 2014-15 Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES

More information

Green Big Data. A Green IT / Green IS Perspective on Big Data

Green Big Data. A Green IT / Green IS Perspective on Big Data Green Big Data A Green IT / Green IS Perspective on Big Data Agenda 1. Starting Point and Research Question 2. Subject of Analysis 3. Research Methodology 4. Results 5. Conclusion Green Big Data 2 Starting

More information

Linked Open Data A Way to Extract Knowledge from Global Datastores

Linked Open Data A Way to Extract Knowledge from Global Datastores Linked Open Data A Way to Extract Knowledge from Global Datastores Bebo White SLAC National Accelerator Laboratory HKU Expert Address 18 September 2014 Developments in science and information processing

More information

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks Melike Şah, Wendy Hall and David C De Roure Intelligence, Agents and Multimedia Group,

More information

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies Zhe Wu Ramesh Vasudevan Eric S. Chan Oracle Deirdre Lee, Laura Dragan DERI A Presentation

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Supporting Change-Aware Semantic Web Services

Supporting Change-Aware Semantic Web Services Supporting Change-Aware Semantic Web Services Annika Hinze Department of Computer Science, University of Waikato, New Zealand a.hinze@cs.waikato.ac.nz Abstract. The Semantic Web is not only evolving into

More information

Secure Semantic Web Service Using SAML

Secure Semantic Web Service Using SAML Secure Semantic Web Service Using SAML JOO-YOUNG LEE and KI-YOUNG MOON Information Security Department Electronics and Telecommunications Research Institute 161 Gajeong-dong, Yuseong-gu, Daejeon KOREA

More information

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach

Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach Rustem Dautov Iraklis Paraskakis Dimitrios Kourtesis South-East European Research Centre International Faculty, The University

More information

Reference Architecture, Requirements, Gaps, Roles

Reference Architecture, Requirements, Gaps, Roles Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture

More information

Computational Science and Informatics (Data Science) Programs at GMU

Computational Science and Informatics (Data Science) Programs at GMU Computational Science and Informatics (Data Science) Programs at GMU Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ Outline Graduate Program

More information

Semantic Interoperability

Semantic Interoperability Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from

More information

Smart Cyber Infrastructure for Big Data processing

Smart Cyber Infrastructure for Big Data processing Smart Cyber Infrastructure for Big Data processing Dr. Paola Grosso Email: p.grosso@uva.nl URL: http://staff.science.uva.nl/~grosso Addressing Big Data Issues in Scientific Data Infrastructure Y. Demchenko,

More information

Business rules and science

Business rules and science Business rules and science Science is a distributed, heterogeneous, rapidly evolving complex of activities, like an enterprise Business processes in science are largely ad hoc and undocumented, like very

More information

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

LinkZoo: A linked data platform for collaborative management of heterogeneous resources LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research

More information

Presente e futuro del Web Semantico

Presente e futuro del Web Semantico Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu

More information

Data Warehousing in the Age of Big Data

Data Warehousing in the Age of Big Data Data Warehousing in the Age of Big Data Krish Krishnan AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD * PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Morgan Kaufmann is an imprint of Elsevier

More information

The Open University s repository of research publications and other research outputs

The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs Survey of tools for collaborative knowledge construction and sharing Conference Item How to cite:

More information

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.

More information

Big Data Mining: Challenges and Opportunities to Forecast Future Scenario

Big Data Mining: Challenges and Opportunities to Forecast Future Scenario Big Data Mining: Challenges and Opportunities to Forecast Future Scenario Poonam G. Sawant, Dr. B.L.Desai Assist. Professor, Dept. of MCA, SIMCA, Savitribai Phule Pune University, Pune, Maharashtra, India

More information

Data Quality in Information Integration and Business Intelligence

Data Quality in Information Integration and Business Intelligence Data Quality in Information Integration and Business Intelligence Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies

More information

Model Driven Interoperability through Semantic Annotations using SoaML and ODM

Model Driven Interoperability through Semantic Annotations using SoaML and ODM Model Driven Interoperability through Semantic Annotations using SoaML and ODM JiuCheng Xu*, ZhaoYang Bai*, Arne J.Berre*, Odd Christer Brovig** *SINTEF, Pb. 124 Blindern, NO-0314 Oslo, Norway (e-mail:

More information

Building Semantic Content Management Framework

Building Semantic Content Management Framework Building Semantic Content Management Framework Eric Yen Computing Centre, Academia Sinica Outline What is CMS Related Work CMS Evaluation, Selection, and Metrics CMS Applications in Academia Sinica Concluding

More information

Implement Hadoop jobs to extract business value from large and varied data sets

Implement Hadoop jobs to extract business value from large and varied data sets Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to

More information

How To Find Influence Between Two Concepts In A Network

How To Find Influence Between Two Concepts In A Network 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation Influence Discovery in Semantic Networks: An Initial Approach Marcello Trovati and Ovidiu Bagdasar School of Computing

More information

Context Capture in Software Development

Context Capture in Software Development Context Capture in Software Development Bruno Antunes, Francisco Correia and Paulo Gomes Knowledge and Intelligent Systems Laboratory Cognitive and Media Systems Group Centre for Informatics and Systems

More information

Application of ontologies for the integration of network monitoring platforms

Application of ontologies for the integration of network monitoring platforms Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,

More information

SURVEY REPORT DATA SCIENCE SOCIETY 2014

SURVEY REPORT DATA SCIENCE SOCIETY 2014 SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses

More information

De la Business Intelligence aux Big Data. Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris. 22/01/14 Séminaire Big Data

De la Business Intelligence aux Big Data. Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris. 22/01/14 Séminaire Big Data De la Business Intelligence aux Big Data Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris 22/01/14 Séminaire Big Data 1 Agenda EvoluHon of Business Intelligence SemanHc Technologies

More information

Security Issues for the Semantic Web

Security Issues for the Semantic Web Security Issues for the Semantic Web Dr. Bhavani Thuraisingham Program Director Data and Applications Security The National Science Foundation Arlington, VA On leave from The MITRE Corporation Bedford,

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

A generic approach for data integration using RDF, OWL and XML

A generic approach for data integration using RDF, OWL and XML A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition Vasant Honavar 1 and Doina Caragea 2 1 Artificial Intelligence Research Laboratory, Department of Computer Science, Iowa State

More information

Navigating Big Data business analytics

Navigating Big Data business analytics mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what

More information

Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data

Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data Insurance Data and Analytics Summit 2013 18 April 2013 David Saul, Senior Vice President & Chief Scientist State Street

More information

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

Understanding Web personalization with Web Usage Mining and its Application: Recommender System Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,

More information

LiDDM: A Data Mining System for Linked Data

LiDDM: A Data Mining System for Linked Data LiDDM: A Data Mining System for Linked Data Venkata Narasimha Pavan Kappara Indian Institute of Information Technology Allahabad Allahabad, India kvnpavan@gmail.com Ryutaro Ichise National Institute of

More information

Data Virtualization A Potential Antidote for Big Data Growing Pains

Data Virtualization A Potential Antidote for Big Data Growing Pains perspective Data Virtualization A Potential Antidote for Big Data Growing Pains Atul Shrivastava Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and

More information

Improving EHR Semantic Interoperability Future Vision and Challenges

Improving EHR Semantic Interoperability Future Vision and Challenges Improving EHR Semantic Interoperability Future Vision and Challenges Catalina MARTÍNEZ-COSTA a,1 Dipak KALRA b, Stefan SCHULZ a a IMI,Medical University of Graz, Austria b CHIME, University College London,

More information

An Ontology-based e-learning System for Network Security

An Ontology-based e-learning System for Network Security An Ontology-based e-learning System for Network Security Yoshihito Takahashi, Tomomi Abiko, Eriko Negishi Sendai National College of Technology a0432@ccedu.sendai-ct.ac.jp Goichi Itabashi Graduate School

More information

ABSTRACT The World MINING 1.2.1 1.2.2. R. Vasudevan. Trichy. Page 9. usage mining. basic. processing. Web usage mining. Web. useful information

ABSTRACT The World MINING 1.2.1 1.2.2. R. Vasudevan. Trichy. Page 9. usage mining. basic. processing. Web usage mining. Web. useful information SSRG International Journal of Electronics and Communication Engineering (SSRG IJECE) volume 1 Issue 1 Feb Neural Networks and Web Mining R. Vasudevan Dept of ECE, M. A.M Engineering College Trichy. ABSTRACT

More information

A Semantic Web of Know-How: Linked Data for Community-Centric Tasks

A Semantic Web of Know-How: Linked Data for Community-Centric Tasks A Semantic Web of Know-How: Linked Data for Community-Centric Tasks Paolo Pareti Edinburgh University p.pareti@sms.ed.ac.uk Ewan Klein Edinburgh University ewan@inf.ed.ac.uk Adam Barker University of St

More information

Bigdata : Enabling the Semantic Web at Web Scale

Bigdata : Enabling the Semantic Web at Web Scale Bigdata : Enabling the Semantic Web at Web Scale Presentation outline What is big data? Bigdata Architecture Bigdata RDF Database Performance Roadmap What is big data? Big data is a new way of thinking

More information

BUSINESS RULES AND GAP ANALYSIS

BUSINESS RULES AND GAP ANALYSIS Leading the Evolution WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Discovery and management of business rules avoids business disruptions WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Business Situation More

More information

How To Build A Cloud Based Intelligence System

How To Build A Cloud Based Intelligence System Semantic Technology and Cloud Computing Applied to Tactical Intelligence Domain Steve Hamby Chief Technology Officer Orbis Technologies, Inc. shamby@orbistechnologies.com 678.346.6386 1 Abstract The tactical

More information

HOW TO DO A SMART DATA PROJECT

HOW TO DO A SMART DATA PROJECT April 2014 Smart Data Strategies HOW TO DO A SMART DATA PROJECT Guideline www.altiliagroup.com Summary ALTILIA s approach to Smart Data PROJECTS 3 1. BUSINESS USE CASE DEFINITION 4 2. PROJECT PLANNING

More information

SEMANTIC VIDEO ANNOTATION IN E-LEARNING FRAMEWORK

SEMANTIC VIDEO ANNOTATION IN E-LEARNING FRAMEWORK SEMANTIC VIDEO ANNOTATION IN E-LEARNING FRAMEWORK Antonella Carbonaro, Rodolfo Ferrini Department of Computer Science University of Bologna Mura Anteo Zamboni 7, I-40127 Bologna, Italy Tel.: +39 0547 338830

More information

SmartLink: a Web-based editor and search environment for Linked Services

SmartLink: a Web-based editor and search environment for Linked Services SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

New Design Principles for Effective Knowledge Discovery from Big Data

New Design Principles for Effective Knowledge Discovery from Big Data New Design Principles for Effective Knowledge Discovery from Big Data Anjana Gosain USICT Guru Gobind Singh Indraprastha University Delhi, India Nikita Chugh USICT Guru Gobind Singh Indraprastha University

More information

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD I N T E L L I G E N T S O L U T I O N S, I N C. OILFIELD DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD 5 5 T A R A P L A C E M O R G A N T O W N, W V 2 6 0 5 0 USA

More information

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Ontology based ranking of documents using Graph Databases: a Big Data Approach

Ontology based ranking of documents using Graph Databases: a Big Data Approach Ontology based ranking of documents using Graph Databases: a Big Data Approach A.M.Abirami Dept. of Information Technology Thiagarajar College of Engineering Madurai, Tamil Nadu, India Dr.A.Askarunisa

More information

How To Create A Web Of Knowledge From Data And Content In A Web Browser (Web)

How To Create A Web Of Knowledge From Data And Content In A Web Browser (Web) Semantic Technologies: Linked Data and OER Opening and linking the data and content silos, to leverage the knowledge capital represented by our OER repositories Edmundo Tovar (UPM etovar@fi.upm.es ) Nelson

More information

Position Paper: Validation of Distributed Enterprise Data is Necessary, and RIF can Help

Position Paper: Validation of Distributed Enterprise Data is Necessary, and RIF can Help Position Paper: Validation of Distributed Enterprise Data is Necessary, and RIF can Help David Schaengold Director of Business Solutions Revelytix, Inc Sept 19, 2011, Revised Oct 17, 2011 Overview Revelytix

More information

We have big data, but we need big knowledge

We have big data, but we need big knowledge We have big data, but we need big knowledge Weaving surveys into the semantic web ASC Big Data Conference September 26 th 2014 So much knowledge, so little time 1 3 takeaways What are linked data and the

More information

Data Modeling in the Age of Big Data

Data Modeling in the Age of Big Data Data Modeling in the Age of Big Data Pete Stiglich Pete Stiglich is a principal at Clarity Solution Group. pstiglich@clarity-us.com Abstract With big data adoption accelerating and strong interest in NoSQL

More information

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,

More information

AllegroGraph. a graph database. Gary King gwking@franz.com

AllegroGraph. a graph database. Gary King gwking@franz.com AllegroGraph a graph database Gary King gwking@franz.com Overview What we store How we store it the possibilities Using AllegroGraph Databases Put stuff in Get stuff out quickly safely Stuff things with

More information