Data Big and Small: How Publisher gain Value out of Data in the Future

Similar documents
Making Sense of Big Data in Insurance

A 360 Degree View of Anything

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

Data Governance for Regulated Industries

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Getting Started Practical Input For Your Roadmap

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

REAL-TIME BIG DATA ANALYTICS

Data Models and Database Management Systems (DBMSs) Dr. Philip Cannata

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

2010 Oracle Corporation 1

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle

The Principles of the Business Data Lake

Architecting for the Internet of Things & Big Data

Top 3 Ways Big Data Impacts Financial Services

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

The 4 Pillars of Technosoft s Big Data Practice

Endeca Introduction to Big Data Analytics

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

MarkLogic Enterprise Data Layer

By Makesh Kannaiyan 8/27/2011 1

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Evolution to Revolution: Big Data 2.0

Big Data and Healthcare Payers WHITE PAPER

Database Design. Marta Jakubowska-Sobczak IT/ADC based on slides prepared by Paula Figueiredo, IT/DB

Luncheon Webinar Series May 13, 2013

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Next-Generation Cloud Analytics with Amazon Redshift

Session 805 -End-to-End SAP Lumira: Desktop to On-Premise, Cloud, and Mobile

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Creating an Enterprise Reporting Bus with SAP BusinessObjects

Sterling Business Intelligence

SAP Business One and SAP HANA

NoSQL for SQL Professionals William McKnight

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Simplifying Data Governance and Accelerating Real-time Big Data Analysis in Financial Services with MarkLogic Server and Intel

Data Virtualization and ETL. Denodo Technologies Architecture Brief

Game On: How Information is Changing the Rules of Insurance

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

Microsoft Business Intelligence

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation

Customer Insight Appliance. Enabling retailers to understand and serve their customer

Data Warehousing in the Age of Big Data

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

"The performance driven Enterprise" Emerging trends in Enterprise BI Platforms

How To Use Big Data For Telco (For A Telco)

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop

Getting Value from Big Data with Analytics

Big Data and Analytics in Government

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

MySQL and Hadoop: Big Data Integration. Shubhangi Garg & Neha Kumari MySQL Engineering

Accelerate BI Initiatives With Self-Service Data Discovery And Integration

Mission-Critical Database with Real-Time Search for Big Data

How To Manage Data In Real Time

Trustworthiness of Big Data

How To Create A Business Intelligence (Bi)

How To Use Big Data For Business

Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

Overcoming Obstacles to Retail Supply Chain Efficiency and Vendor Compliance

XML enabled databases. Non relational databases. Guido Rotondi

Simplifying Data Governance and Accelerating Real-time Big Data Analysis for Healthcare with MarkLogic Server and Intel

Big Data & Its Importance

Architected Blended Big Data with Pentaho

Traditional BI vs. Business Data Lake A comparison

Transcription:

Data Big and Small: How Publisher gain Value out of Data in the Future WS 4: Frank Föge MarkLogic, Oliver Zmorek De Gruyter, Stefan Schwedt Newbooks Solutions

Agenda Introduction De Gruyter, Newbooks & MarkLogic Big Data Definition Data Big and Small Publishers perspective: Challenge, Approach & Solution DeGruyter, Use Case: Internal Data Analysis Newbooks, Use Case: VLB-TIX SLIDE: 2

De Gruyter - Innovative, agile responses to the challenges of tomorrow SLIDE: 3

Only a holistic view on publishing and a subsequent strategy will satisfy customers new demands SLIDE: 4

The only constant is change SLIDE: 5

BIG DATA DEFINITION

The very early days of Big Data 1854 Cholera in London from assumption to knowledge SLIDE: 7

Big Data = One C and Three V s Complexity Volume Variety Velocity SLIDE: 9

How did we get here? The early 80 s SLIDE: 10

The world as it was Data: Regular & Tabular Compute & Storage: Slow & Expensive Pace of Change: Glacial SLIDE: 11

Fast We had forward an elegant to today model that met our needs EMP EMPNO ENAME DEPTNO 7782 CLARK 10 7934 MILLER 10 7876 ADAMS 20 7902 FORD 20 7900 SMITH 30 DEPT DEPTNO DNAME 10 ACCOUNTING 20 RESEARCH 30 SALES 40 SHIPPING SLIDE: 12

Fast forward to today IT Budgets 33% on Innovation & Growth 67% on Maintenance (keeping the lights on) SLIDE: 13

We end up with the wrong technology for the job DATA When all you have is a hammer, everything looks like a nail SLIDE: 14

The Three V s of Big Data VOLUME VARIETY VELOCITY

IT faces the challenge leveraging both: Heterogeneous and Unstructured Data 12% Structured 88% Unstructured Reference Data OLTP Warehouse Archives Data Marts? SLIDE: 16

The Three V s of Big Data VOLUME VARIETY VELOCITY

Variety More of the same things Lots of different things SHAPE SOURCES FORMATS QUESTIONS SLIDE: 18

The result High Costs Crushing Complexity Strangled Innovation SLIDE: 19

DATA BIG AND SMALL

The Big and Small Data Opportunity: It is possible to utilize all your data in a cost effective way and realign for the future? Image Source: http://www.20x200.com/blog/blogimages/get_excited_and_make_things_with_creative_commons/1206_artworkimage.jpg

CHALLENGE, APPROACH & SOLUTION

Today s Data Landscape Massive growth of information with various data types RDBMS Search Engine Volume of Information Information Continuum XML Metadata / Onix Geospatial Graph Relational PDF Content Emails Documents Free text Hierarchical Semi-structured Structured Unstructured

Challenge: Things publishers need to deal with Different file formats and schemas (XML, CSV, Excel, Binaries) Different information transfer technologies (REST/SOAP, MBS, file ex-/ imports, File transfer protocols) Growing data amounts = scalibility Current situation: acquire knowledge of your data streams desired situation: data driven decision making SLIDE: 24

Information and System Flow Chart SLIDE: 25

Approach: What do publishers need? more systems to cover all our requirements = increasing costs, maintenance, support = further interfaces, mid-/long term integration = inflexible, not agile OR one target system that allows interdisciplinarily reporting and offers NoSQL technologies? SLIDE: 26

Source: http://www.langsonenergy.com/what-is-disruptive-technology

MarkLogic Enterprise NoSQL Database Application Services DATABASE SEARCH Semantics SLIDE: 28

Solution: NoSQL Database - don t worry about your data - different applications for different target groups - XML-oriented searches, queries and indexes - manage large volumes De Gruyter uses MarkLogic for: - De Gruyter Online platform - Content Management Systems SLIDE: 29

Use Case Correlation of usage and sales (1/2) 2 exemplary questions Find out what is being used but not sold? What is being sold but not used? => early insight into customers behavior; allows business to react accordingly and address customers Increasing the attractiveness of a publications requires a better understanding of the connection between sales and usage SLIDE: 30

Use Case Correlation of usage and sales (2/2) Source 1: Usage statistics of De Gruyter online gives an overview of database, book (chapter) and journal (article) usages Source 2: Sales Figures From Data warehouse contains sales overall statistics: webshop, mail/telephone order (customer service) DEMO SLIDE: 31

Group Question Think about a Use Case which will allow you to break up data silos to do ad-hoc analysis of heterogenous datatypes coming from various sources that can be usefull for your business - What kind of data? - What data sources? - Questions that take a lot of effort to find answers to? - Which decision making process can this data / answer support? - What new insights can be derived from this? SLIDE: 32

Q&A SLIDE: 33