Semantic Technologies for Big Data. Marin Dimitrov (Ontotext)

Size: px
Start display at page:

Download "Semantic Technologies for Big Data. Marin Dimitrov (Ontotext)"

Transcription

1 Semantic Technologies for Big Data Marin Dimitrov (Ontotext) XML Amsterdam 2012

2 XML Amsterdam 2012 #2

3 About Ontotext Provides products and services for creating, managing and exploiting semantic data Founded in 2000 Offices in Bulgaria, USA and UK Major clients and industries Media & Publishing (BBC, Press Association) HCLS (AstraZeneca, UCB) Cultural Heritage (The British Museum, The National Archives, Polish National Museum, Dutch Public Library) Defense and Homeland Security #3

4 Outline Semantic Technologies for the Enterprise Semantic Technologies for Big Data Success stories #4

5 SEMANTIC TECHNOLOGIES FOR THE ENTERPRISE #5

6 The need for a smarter Web "The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation. (Tim Berners-Lee, 2001) PricewaterhouseCoopers believes a Web of data will develop that fully augments the document Web of today. You ll be able to find pieces of data sets from different places, aggregate them without warehousing, and analyze them in a more straightforward, powerful way than you can now. (PWC, May 2009) #6

7 Linked Data Linked Data is a set of principles that allows publishing, querying and consumption of RDF data, distributed across different servers Design principles Use unambiguous identifiers for resources (URIs) Use HTTP URIs (dereference-able) Provide useful information for URI lookups Interlink resources #7

8 The Semantic Web timeline RDF RDF 2 DAML+OIL OWL OWL 2 SPARQL SPARQL 1.1 RIF RDFa SAWSDL LOD SKOS HCLS SSN RDB2RDF PIL GLD LDP #8

9 Enterprise Information Management Challenges Many disparate data sources and data silos Many point-to-point interfaces Data sources with similar/inconsistent information Complex data integration processes inadequate for changing business requirements Most of the knowledge is hidden in texts Difficult to integrate & analyse structured data and text #9

10 Semantic Web and Linked Data Opportunities for the Enterprise Simplify the information integration processes Flexible, easy to evolve data model Bottom-up / incremental integration Efficiently integrate structured and unstructured data Provide an enterprise metadata layer Unified metadata vocabulary for the enterprise Align the legacy data silos Improve the information sharing and reuse #10

11 Semantic Web and Linked Data Opportunities for the Enterprise (2) Discovery and enrichment of information Interlink people, organisations, events, etc. Enrich enterprise content with structured annotations Discover implicit links and relationships Unified access to information within the enterprise Simplified infrastructure based on open web standards Information interchange across a value chain Easy publishing and consumption of Linked Data Augments existing IT assets and technologies No need for disruptive replacement #11

12 XML and RDF: friends or foes Complement each other XML best for content, structure and interchange format RDF for metadata layer and semantics Typical use case Many XML content data sources Content stored in an XML store (XQuery and XSLT) Structured data sources & external Linked Data RDF-ized and stored in an RDF store (SPARQL) Metadata extracted from content stored in an RDF store (SPARQL) semantic search and metadata driven content delivery #12

13 BBC Sports (c) BBC #13

14 Added value of RDF Explicit semantics Intended meaning of entities and relations Global identifiers (URIs) Simple and flexible graph-based data model Easier data mapping & integration Bottom-up / incremental data integration with owl:sameas Inference of implicit information Working with distributed information Linked Data, federated SPARQL #14

15 Added value of RDF Descriptive / agile schema Open World Assumption, don t restrict predicates Generated dynamically from data Queries based on meaning Not depending on structure / order of statements Data and queries may use different vocabularies Exploratory queries Choice of OWL2 profiles Tradeoff features vs performance New profiles may emerge in the future #15

16 SEMANTIC TECHNOLOGIES FOR BIG DATA #16

17 The three V s of Big Data Velocity Streaming, sensor, real-time data Solution: distributed processing & storage Semantic challenge: stream reasoning Volume Petabytes of data Solution: distributed processing & storage Semantic challenge: distributed reasoning & querying Variety Structured, semi-structured and unstructured data Semantic Technologies (RDF) are a good fit #17

18 Types of Big Data (NIST) Type 1 Velocity (-), Volume (-), Variety (+) Perfect fit for Semantic Technologies Type 2 Velocity and/or Volume, Variety (-) Only horizontal scalability required, traditional approaches are a good enough fit Type 3 All V s Semantic Technologies not a good fit yet, but moving in that direction #18

19 Semantic Technologies for Volume and Velocity Promising ongoing research Distributed inference with Hadoop/Storm Stream reasoning Continuous queries Continuous (dynamic) semantics SPARQL to Pig translation Distributed RDF stores on top of NoSQL C-SPARQL, EP-SPARQL, CQELS #19

20 Linked Open Data Cloud (Sep 2011) (c) Cyganiak & Jentzsch #20

21 From Big Linked Data to Linked Big Data Big Linked Data Big Data approach adopted by the Linked Data community In particular handling Volume and Velocity Exponential growth of Linked Data in the last 5 years Linked Big Data Linked Data approach adopted by the Big Data community RDF data model for Variety Enrich Big Data with metadata and semantics more powerful analytics on top of it Interlink Big Data sets Simplify data access and data integration #21

22 SUCCESS STORIES #22

23 Typical Use Cases for Linked Data and Semantic Technologies Publish / consume Linked Data across enterprises Linked Data is not necessarily free data Facilitate data interchange within the value chain Information integration within the enterprise Integrated asset management / align data silos Master Data Management Knowledge discovery and semantic search Integrate structured and unstructured data Enrich and interlink information Semantic search and exploration of information #23

24 Semantic Information Integration (Ontotext) #24

25 The National Archives (Ontotext) Challenge Large archive of various UK Government websites since 1997 Lots of duplicated information & documents Inefficient search & navigation Semantic Knowledge Base project goals Integrate multiple data sources Extract information & metadata from archived documents Interlink the web archive with data.gov.uk and LOD data Advanced search & navigation of the archive #25

26 The National Archives (Ontotext) Front Ends: Semantic Search Semantic Annotation O 1 O 2 O 3 3 rd party Ontology Editors SPARQL graph exploration Semantic Repository SKB Ontologies A B C D Data Transformation and Integration Factual Knowledge (TNA data, LOD, data.gov.uk) Annotation Process (GATE Teamware) Semantic annotations Semantic Index Identity Resolution #26

27 The National Archives (Ontotext) The numbers 2.5 billion input files 40TB compressed archive data 10 billion RDF triples stored in OWLIM 33,000 EC2 hours used on AWS Dynamic EC2 cluster (180 instances average, 500 max) Major challenges Complex pre-processing of documents De-duplication of information & documents EC2/RRS performance & reliability #27

28 Dutch Public Library (Ontotext + Dayon) Challenge Many disparate data sources, inefficient search Goals Data integration Automated metadata generation Open search platform Numbers 500 heterogeneous data sources 40 million cultural heritage artifacts to be describes 6-8 billion triples to be stored into the knowledge base #28

29 Linked Life Data (Ontotext) Challenge Disparate, heterogeneous and unaligned data silos lock valuable biomedical information Goals Semantic warehouse integrating and interlinking public biomedical data sources Interactive discovery and exploration Numbers 25+ heterogeneous biomedical data sources integrated 1 billion entities described 5.5 billion RDF triples #29

30 Linked Life Data (Ontotext) #30

31 Linked Life Data-as-a-Service (Ontotext) More data sources Large scale text mining over the LOD cloud Adapted for specific use cases UCB use case 2 billion entities described 11 billion RDF triples #31

32 Dynamic Semantic Publishing (Ontotext) Challenge Difficult & slow to aggregate content from various sources Goals Metadata generation for news (semantic annotation) Interlink & categorize content Metadata driven web pages Numbers Nearly real-time processing & annotation required Tens of millions (SPARQL) queries to the knowledge base per day #32

33 Trillion RDF triples (Franz Inc.) Use case Use RDF for the customer management database of a telecom Challenge 4,000 triples per customer, more than a trillion for the whole customer base Numbers 1 trillion triples stored in AllegroGraph by Franz Inc Hardware requirements undisclosed The 310 billion triple result used 8-CPU system with 2TB RAM #33

34 urika (Cray/YarcData) Big Data appliance for graph analytics Based on the Threadstorm tm architecture Up to 8K processors, 512TB RAM, 350TB/hr IO throughput In-memory RDF database SPARQL 1.0 engine (c) YarcData #34

35 TAKEAWAYS #35

36 Semantic Technologies for Big Data Rich ecosystem of Semantic Technologies since 1999 Strong Enterprise focus in the last 5 years Semantic Technologies provide opportunity for reducing the cost and complexity of data integration Common metadata layer for the enterprise More powerful ways to find and explore information RDF complements XML within the enterprise Semantic Technologies are a good fit for Big Data s Variety #36

37 Semantic Technologies for Big Data Velocity and Volume still challenging for Semantic Technologies, but lots of progress in that direction Linked Data will grow into Big Linked Data, but Big Data will also benefit from evolving into Linked Big Data Interesting success stories for Semantic Technologies in Big Data scenarios #37

38 THANK YOU! #38

Linked Data for the Enterprise: Opportunities and Challenges

Linked Data for the Enterprise: Opportunities and Challenges Linked Data for the Enterprise: Opportunities and Challenges Marin Dimitrov (Ontotext) Semantic Days 2012, Stavanger About Ontotext Provides software and expertise for creating, managing and exploiting

More information

From Big Data to Smart Data. Marin Dimitrov - CTO

From Big Data to Smart Data. Marin Dimitrov - CTO From Big Data to Smart Data Marin Dimitrov - CTO May 2013 About Ontotext Provides products and services for creating, managing and exploiting semantic data Founded in 2000 Offices in Bulgaria, USA and

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

Steve Hamby Chief Technology Officer Orbis Technologies, Inc. shamby@orbistechnologies.com 678.346.6386

Steve Hamby Chief Technology Officer Orbis Technologies, Inc. shamby@orbistechnologies.com 678.346.6386 Semantic Technology and Cloud Computing Applied to Tactical Intelligence Domain Steve Hamby Chief Technology Officer Orbis Technologies, Inc. shamby@orbistechnologies.com 678.346.6386 1 Abstract The tactical

More information

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014 Increase Agility and Reduce Costs with a Logical Data Warehouse February 2014 Table of Contents Summary... 3 Data Virtualization & the Logical Data Warehouse... 4 What is a Logical Data Warehouse?... 4

More information

BIG Big Data Public Private Forum

BIG Big Data Public Private Forum DATA STORAGE Martin Strohbach, AGT International (R&D) THE DATA VALUE CHAIN Value Chain Data Acquisition Data Analysis Data Curation Data Storage Data Usage Structured data Unstructured data Event processing

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,

More information

Cray: Enabling Real-Time Discovery in Big Data

Cray: Enabling Real-Time Discovery in Big Data Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

Reason-able View of Linked Data for Cultural Heritage

Reason-able View of Linked Data for Cultural Heritage Reason-able View of Linked Data for Cultural Heritage Mariana Damova 1, Dana Dannells 2 1 Ontotext, Tsarigradsko Chausse 135, Sofia 1784, Bulgaria 2 University of Gothenburg, Lennart Torstenssonsgatan

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

Linked Open Data A Way to Extract Knowledge from Global Datastores

Linked Open Data A Way to Extract Knowledge from Global Datastores Linked Open Data A Way to Extract Knowledge from Global Datastores Bebo White SLAC National Accelerator Laboratory HKU Expert Address 18 September 2014 Developments in science and information processing

More information

HOW TO DO A SMART DATA PROJECT

HOW TO DO A SMART DATA PROJECT April 2014 Smart Data Strategies HOW TO DO A SMART DATA PROJECT Guideline www.altiliagroup.com Summary ALTILIA s approach to Smart Data PROJECTS 3 1. BUSINESS USE CASE DEFINITION 4 2. PROJECT PLANNING

More information

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

BUSINESS VALUE OF SEMANTIC TECHNOLOGY BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director

More information

How semantic technology can help you do more with production data. Doing more with production data

How semantic technology can help you do more with production data. Doing more with production data How semantic technology can help you do more with production data Doing more with production data EPIM and Digital Energy Journal 2013-04-18 David Price, TopQuadrant London, UK dprice at topquadrant dot

More information

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.

More information

Module 16. Semantic Search

Module 16. Semantic Search Module 16 Semantic Search Module 16 schedule 9.45-11.00 xxx Xxx 11.00-11.15 Coffee break 11.15-12.30 xxx Xxx 12.30-14.00 14.00-16.00 Lunch Break xxx xxx Module 16 outline Traditional approaches to search

More information

Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data

Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data Smart Financial Data: Semantic Web technology transforms Big Data into Smart Data Insurance Data and Analytics Summit 2013 18 April 2013 David Saul, Senior Vice President & Chief Scientist State Street

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

TopBraid Insight for Life Sciences

TopBraid Insight for Life Sciences TopBraid Insight for Life Sciences In the Life Sciences industries, making critical business decisions depends on having relevant information. However, queries often have to span multiple sources of information.

More information

DISCOVERING RESUME INFORMATION USING LINKED DATA

DISCOVERING RESUME INFORMATION USING LINKED DATA DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India sic@klyuniv.ac.in 2 Department of Computer

More information

What s New in Semantic Enrichment

What s New in Semantic Enrichment What s New in Semantic Enrichment 4 Million Content Items, 120 Disciplines, and 1 Metadata Repository Jess Lawson Head of Content Architecture, GAB-IT It s all in the Title Why semantic enrichment: 4 million

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER OCTOBER 2014

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER OCTOBER 2014 Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER OCTOBER 2014 Introduction One trillion is a really big number. What could you store with one trillion facts?» 1000 tweets

More information

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister

More information

Semantic Interoperability

Semantic Interoperability Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

www.sryas.com Analance Data Integration Technical Whitepaper

www.sryas.com Analance Data Integration Technical Whitepaper Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring

More information

DRUM Distributed Transactional Building Information Management

DRUM Distributed Transactional Building Information Management DRUM Distributed Transactional Building Information Management Seppo Törmä, Jyrki Oraskari, Nam Vu Hoang Distributed Systems Group Department of Computer Science and Engineering School of Science, Aalto

More information

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)

More information

Survey of Big Data Architecture and Framework from the Industry

Survey of Big Data Architecture and Framework from the Industry Survey of Big Data Architecture and Framework from the Industry NIST Big Data Public Working Group Sanjay Mishra May13, 2014 3/19/2014 NIST Big Data Public Working Group 1 NIST BD PWG Survey of Big Data

More information

Semantic Web Mining: Using Association Rules for Learning an Ontology. Presented By : Amgad Madkour

Semantic Web Mining: Using Association Rules for Learning an Ontology. Presented By : Amgad Madkour Semantic Web Mining: Using Association Rules for Learning an Ontology Presented By : Amgad Madkour Agenda Semantic Web Mining aim Web Mining overview Semantic Web overview Ontology Building Learning an

More information

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES Peter Haase fluid Operations AG fluid Operations(fluidOps) Linked Data& Semantic Technologies Enterprise Cloud Computing Software company founded

More information

Joshua Phillips Alejandra Gonzalez-Beltran Jyoti Pathak October 22, 2009

Joshua Phillips Alejandra Gonzalez-Beltran Jyoti Pathak October 22, 2009 Exposing cagrid Data Services as Linked Data Joshua Phillips Alejandra Gonzalez-Beltran Jyoti Pathak October 22, 2009 Basic Premise It is both useful and practical to expose cabig data sets as Linked Data.

More information

bigdata Managing Scale in Ontological Systems

bigdata Managing Scale in Ontological Systems Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural

More information

Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach

Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach Addressing Self-Management in Cloud Platforms: a Semantic Sensor Web Approach Rustem Dautov Iraklis Paraskakis Dimitrios Kourtesis South-East European Research Centre International Faculty, The University

More information

Service Oriented Architecture

Service Oriented Architecture Service Oriented Architecture Charlie Abela Department of Artificial Intelligence charlie.abela@um.edu.mt Last Lecture Web Ontology Language Problems? CSA 3210 Service Oriented Architecture 2 Lecture Outline

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle Growth in Data Diversity and Usage 1.8 Zettabytes of Data in 2011, 20x Growth by 2020

More information

Data Management in SAP Environments

Data Management in SAP Environments Data Management in SAP Environments the Big Data Impact Berlin, June 2012 Dr. Wolfgang Martin Analyst, ibond Partner und Ventana Research Advisor Data Management in SAP Environments Big Data What it is

More information

www.ducenit.com Analance Data Integration Technical Whitepaper

www.ducenit.com Analance Data Integration Technical Whitepaper Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring

More information

Deploying a Geospatial Cloud

Deploying a Geospatial Cloud Deploying a Geospatial Cloud Traditional Public Sector Computing Environment Traditional Computing Infrastructure Silos of dedicated hardware and software Single application per silo Expensive to size

More information

Introduction to Ontologies

Introduction to Ontologies Technological challenges Introduction to Ontologies Combining relational databases and ontologies Author : Marc Lieber Date : 21-Jan-2014 BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR.

More information

Offload Historical Data to Big Data Lake. Ample White Paper

Offload Historical Data to Big Data Lake. Ample White Paper Offload Historical Data to Big Data Lake The Need to Offload Historical Data for Compliance Queries How often have heard that the legal or compliance department group needs to have access to your company

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Architecting an Industrial Sensor Data Platform for Big Data Analytics

Architecting an Industrial Sensor Data Platform for Big Data Analytics Architecting an Industrial Sensor Data Platform for Big Data Analytics 1 Welcome For decades, organizations have been evolving best practices for IT (Information Technology) and OT (Operation Technology).

More information

Data Virtualization and ETL. Denodo Technologies Architecture Brief

Data Virtualization and ETL. Denodo Technologies Architecture Brief Data Virtualization and ETL Denodo Technologies Architecture Brief Contents Data Virtualization and ETL... 3 Summary... 3 Data Virtualization... 7 What is Data Virtualization good for?... 8 Applications

More information

The Next Wave of Data Management. Is Big Data The New Normal?

The Next Wave of Data Management. Is Big Data The New Normal? The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management

More information

Qlik Sense scalability

Qlik Sense scalability Qlik Sense scalability Visual analytics platform Qlik Sense is a visual analytics platform powered by an associative, in-memory data indexing engine. Based on users selections, calculations are computed

More information

Are You Big Data Ready?

Are You Big Data Ready? ACS 2015 Annual Canberra Conference Are You Big Data Ready? Vladimir Videnovic Business Solutions Director Oracle Big Data and Analytics Introduction Introduction What is Big Data? If you can't explain

More information

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate

More information

Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction

Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Nelson Piedra, Jorge López, Janneth Chicaiza, Universidad Técnica Particular de Loja, Ecuador nopiedra@utpl.edu.ec,

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

Integrating Cloudera and SAP HANA

Integrating Cloudera and SAP HANA Integrating Cloudera and SAP HANA Version: 103 Table of Contents Introduction/Executive Summary 4 Overview of Cloudera Enterprise 4 Data Access 5 Apache Hive 5 Data Processing 5 Data Integration 5 Partner

More information

Urika: Enabling Real-Time Discovery in Big Data

Urika: Enabling Real-Time Discovery in Big Data Urika: TM Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

Informatica PowerCenter Data Virtualization Edition

Informatica PowerCenter Data Virtualization Edition Data Sheet Informatica PowerCenter Data Virtualization Edition Benefits Rapidly deliver new critical data and reports across applications and warehouses Access, merge, profile, transform, cleanse data

More information

VIEWPOINT. High Performance Analytics. Industry Context and Trends

VIEWPOINT. High Performance Analytics. Industry Context and Trends VIEWPOINT High Performance Analytics Industry Context and Trends In the digital age of social media and connected devices, enterprises have a plethora of data that they can mine, to discover hidden correlations

More information

MarkLogic Enterprise Data Layer

MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer September 2011 September 2011 September 2011 Table of Contents Executive Summary... 3 An Enterprise Data

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Where is... How do I get to...

Where is... How do I get to... Big Data, Fast Data, Spatial Data Making Sense of Location Data in a Smart City Hans Viehmann Product Manager EMEA ORACLE Corporation August 19, 2015 Copyright 2014, Oracle and/or its affiliates. All rights

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Using Master Data in Business Intelligence

Using Master Data in Business Intelligence helping build the smart business Using Master Data in Business Intelligence Colin White BI Research March 2007 Sponsored by SAP TABLE OF CONTENTS THE IMPORTANCE OF MASTER DATA MANAGEMENT 1 What is Master

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Global Data Integration with Autonomous Mobile Agents. White Paper

Global Data Integration with Autonomous Mobile Agents. White Paper Global Data Integration with Autonomous Mobile Agents White Paper June 2002 Contents Executive Summary... 1 The Business Problem... 2 The Global IDs Solution... 5 Global IDs Technology... 8 Company Overview...

More information

Chapter 11 Mining Databases on the Web

Chapter 11 Mining Databases on the Web Chapter 11 Mining bases on the Web INTRODUCTION While Chapters 9 and 10 provided an overview of Web data mining, this chapter discusses aspects of mining the databases on the Web. Essentially, we use the

More information

A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel

A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel A Next-Generation Analytics Ecosystem for Big Data Colin White, BI Research September 2012 Sponsored by ParAccel BIG DATA IS BIG NEWS The value of big data lies in the business analytics that can be generated

More information

Navigating the Big Data infrastructure layer Helena Schwenk

Navigating the Big Data infrastructure layer Helena Schwenk mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining

More information

Sustainable Development with Geospatial Information Leveraging the Data and Technology Revolution

Sustainable Development with Geospatial Information Leveraging the Data and Technology Revolution Sustainable Development with Geospatial Information Leveraging the Data and Technology Revolution Steven Hagan, Vice President, Server Technologies 1 Copyright 2011, Oracle and/or its affiliates. All rights

More information

HTML5 based Facet Browser for SPARQL Endpoints

HTML5 based Facet Browser for SPARQL Endpoints HTML5 based Facet Browser for SPARQL Endpoints Martina Janevska, Milos Jovanovik, Dimitar Trajanov Faculty of Computer Science and Engineering Ss. Cyril and Methodius University Skopje, Republic of Macedonia

More information

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies 1 Copyright 2011, Oracle and/or its affiliates. All rights Big Data, Advanced Analytics:

More information

Emerging Requirements and DBMS Technologies:

Emerging Requirements and DBMS Technologies: Emerging Requirements and DBMS Technologies: When Is Relational the Right Choice? Carl Olofson Research Vice President, IDC April 1, 2014 Agenda 2 Why Relational in the First Place? Evolution of Databases

More information

The Ontological Approach for SIEM Data Repository

The Ontological Approach for SIEM Data Repository The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian

More information

Standardizing for Open Data

Standardizing for Open Data (1) Standardizing for Open Data Ivan Herman, W3C Open Data Week Marseille, France, June 26 2013 Slides at: http://www.w3.org/2013/talks/0626-marseille-ih/ (2) Data is everywhere on the Web! l Public, private,

More information

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research & BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research & Innovation 04-08-2011 to the EC 8 th February, Luxembourg Your Atos business Research technologists. and Innovation

More information

FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS. Summary

FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS. Summary UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Working paper 27 February 2015 Workshop on the Modernisation of Statistical Production Meeting, 15-17 April 2015 Topic

More information

Secure Semantic Web Service Using SAML

Secure Semantic Web Service Using SAML Secure Semantic Web Service Using SAML JOO-YOUNG LEE and KI-YOUNG MOON Information Security Department Electronics and Telecommunications Research Institute 161 Gajeong-dong, Yuseong-gu, Daejeon KOREA

More information

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan

More information

Geospatial Platforms For Enabling Workflows

Geospatial Platforms For Enabling Workflows Geospatial Platforms For Enabling Workflows Steven Hagan Vice President Oracle Database Server Technologies November, 2015 Evolution of Enabling Workflows HENRY FORD 100 YEARS AGO Industrialized the Manufacturing

More information

Extending Hyperion BI with the Oracle BI Server

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server Extending Hyperion BI with the Oracle BI Server Mark Ostroff Sr. BI Solutions Consultant Agenda Hyperion BI versus Hyperion BI with OBI Server Benefits of using Hyperion BI with the

More information

Oracle Big Data Building A Big Data Management System

Oracle Big Data Building A Big Data Management System Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following

More information

Turning Big Data into Big Insights

Turning Big Data into Big Insights mwd a d v i s o r s Turning Big Data into Big Insights Helena Schwenk A special report prepared for Actuate May 2013 This report is the fourth in a series and focuses principally on explaining what s needed

More information

Industry 4.0 and Big Data

Industry 4.0 and Big Data Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and

More information

Towards a reference architecture for Semantic Web applications

Towards a reference architecture for Semantic Web applications Towards a reference architecture for Semantic Web applications Benjamin Heitmann 1, Conor Hayes 1, and Eyal Oren 2 1 firstname.lastname@deri.org Digital Enterprise Research Institute National University

More information

Complex, true real-time analytics on massive, changing datasets.

Complex, true real-time analytics on massive, changing datasets. Complex, true real-time analytics on massive, changing datasets. A NoSQL, all in-memory enabling platform technology from: Better Questions Come Before Better Answers FinchDB is a NoSQL, all in-memory

More information

Architecting an Industrial Sensor Data Platform for Big Data Analytics: Continued

Architecting an Industrial Sensor Data Platform for Big Data Analytics: Continued Architecting an Industrial Sensor Data Platform for Big Data Analytics: Continued 2 8 10 Issue 1 Welcome From the Gartner Files: Blueprint for Architecting Sensor Data for Big Data Analytics About OSIsoft,

More information

Effective Data Integration - where to begin. Bryte Systems

Effective Data Integration - where to begin. Bryte Systems Effective Data Integration - where to begin Bryte Systems making data work Bryte Systems specialises is providing innovative and cutting-edge data integration and data access solutions and products to

More information

Integrate and Deliver Trusted Data and Enable Deep Insights

Integrate and Deliver Trusted Data and Enable Deep Insights SAP Technical Brief SAP s for Enterprise Information Management SAP Data Services Objectives Integrate and Deliver Trusted Data and Enable Deep Insights Provide a wide-ranging view of enterprise information

More information

What to Look for When Selecting a Master Data Management Solution

What to Look for When Selecting a Master Data Management Solution What to Look for When Selecting a Master Data Management Solution What to Look for When Selecting a Master Data Management Solution Table of Contents Business Drivers of MDM... 3 Next-Generation MDM...

More information

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014 Copyright 2014, Oracle and/or its affiliates. All

More information

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.

More information

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum Big Data Analysis John Domingue (STI International and The Open University) Project co-funded by the European Commission within the 7th Framework Program (Grant Agreement No. 257943) 1 The Data landscape

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Traditional BI vs. Business Data Lake A comparison

Traditional BI vs. Business Data Lake A comparison Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses

More information

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy Much higher Volumes. Processed with more Velocity. With much more Variety. Is Big Data so big? Big Data Smart Data Project HAVEn: Adaptive Intelligence

More information

Knowledge-Based Data Mining Using Semantic Web

Knowledge-Based Data Mining Using Semantic Web Available online at www.sciencedirect.com ScienceDirect IERI Procedia 7 (2014 ) 113 119 2013 International Conference on Applied Computing, Computer Science, and Computer Engineering Knowledge-Based Data

More information

MicroStrategy PRIME High Performance In-memory Analytics

MicroStrategy PRIME High Performance In-memory Analytics MicroStrategy PRIME High Performance In-memory Analytics 1 Speaker Introduction Bala Chandran Dir. Enterprise BI, MicroStrategy 15 years of experience implementing and designing Big Data and Analytics

More information

THE TRUTH ABOUT TRIPLESTORES The Top 8 Things You Need to Know When Considering a Triplestore

THE TRUTH ABOUT TRIPLESTORES The Top 8 Things You Need to Know When Considering a Triplestore TABLE OF CONTENTS Introduction... 3 The Importance of Triplestores... 4 Why Triplestores... 5 The Top 8 Things You Should Know When Considering a Triplestore... 9 Inferencing... 9 Integration with Text

More information