INTELLIGENT BUSINESS STRATEGIES WHITE PAPER




Improving Access to Data for Successful Business Intelligence
Part 2: Supporting Multiple Analytical Workloads in a Changing Analytical Landscape

By Mike Ferguson, Intelligent Business Strategies
March 2014

Prepared for:

Table of Contents

Introduction
The Changing Analytical Landscape
    From data warehouse to analytical ecosystem
The Growth of Self-Service BI Tools
    What Is Self-Service BI?
    The Importance of Multi-Source Data Access In A Self-Service BI Environment
Conclusion

Copyright Intelligent Business Strategies Limited, 2014, All Rights Reserved

INTRODUCTION

Business is demanding insight from data acquired from more data sources.

Business demand to analyse data from new data sources is resulting in a need to support multiple analytical workloads that go beyond the capabilities of a data warehouse.

There is also growth in the deployment of self-service BI tools for visual discovery.

This series of three white papers on Improving Access to Data for Successful Business Intelligence looks at the challenges companies face in delivering successful business intelligence (BI) in an era where the data landscape is growing in complexity and business demand for more data from more sources to deepen business insight is increasing. In addition, business expects BI to be delivered more quickly and in an agile way, regardless of an increasingly distributed data environment.

In Part 1 of this series, we looked at how data is becoming more distributed across more systems. We discussed how multiple instances of transaction processing systems exist in many organisations and how some of these are being deployed in the cloud. We also explored the growth in analytical systems and the emergence of big data platforms and NoSQL databases. The paper examined why the number of data sources is steadily climbing, with much more data coming into the enterprise from external and unfamiliar data sources, such as social media sites and external bodies.

In Part 2, we look at the impact that the growing number of data sources has had on analytical systems, and at the changing analytical landscape that has emerged from the need to support multiple analytical workloads requiring platforms beyond the data warehouse. We also look at the growing adoption of self-service BI tools in business areas. Both trends are driving up demand for good data access in an increasingly complex analytical environment.

In the final Part 3, we will look at how one vendor, Progress Software, is addressing these challenges in order to help organisations be successful with their business intelligence initiatives.

TODAY'S CHANGING ANALYTICAL LANDSCAPE

New data sources are required to deepen customer insight.

Clickstream, email and social media data are all in demand.

The need for deeper insight into customers and business operations has meant that companies now want to analyse new data sources both inside and outside the enterprise, over and above the transaction processing systems already providing data to data warehouses and data marts. Popular big data sources now in demand to get deeper insight into customers include:

- Web log clickstream data to understand what customers and prospects are doing online
- Customer interactions recorded in internal or cloud-based CRM systems, such as in-bound emails
- Social media profile and interaction data (e.g. Twitter, Facebook)
- Review web sites
- Blogs

Note that some of these data sources are unstructured and can include text from social media sites, emails, review web sites and others.

Sensor networks are being deployed to get data to improve insight into operations.

There are many vertical industry use cases where analysis of sensor data can help optimise operations.

With respect to operations, sensor data is now in demand as companies start to deploy sensor networks to instrument their operations. Sensors can measure temperature, light, vibration, GPS location, pressure and more. This kind of data makes the following possible:

- Supply/distribution chain optimisation
- Asset management and field service optimisation
- Manufacturing production line optimisation
- Location-based advertising (mobile phones)
- Grid health monitoring (e.g. electricity, water, cell network)
- Smart metering

Sensors are also being deployed inside products and equipment, opening up a whole new set of data about the performance and location of a product or asset.

Both structured and multi-structured data are needed to help deepen business insight.

The issue with big data is that it is increasing in complexity in terms of data volume, data variety and data velocity (i.e. the rate at which data is being generated). Today, both structured and multi-structured data are needed for analysis. Most structured data from OLTP systems and personal data stores (e.g. Excel, Access) is well understood, and a schema is often defined. Multi-structured data, on the other hand, is often un-modelled and not well understood. Such data may also need new analytics to be able to analyse it.
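The difference between modelled and un-modelled data can be illustrated with a minimal sketch. It contrasts a structured record whose fields are known up front with a raw web log line that only gains structure when it is parsed. The sample values, the function name parse_clickstream and the choice of the Common Log Format layout are assumptions made for illustration; the paper does not prescribe any log format or parsing method.

```python
import re
from typing import Optional

# Structured record: field names and types are known before the data arrives.
order_row = {"customer_id": 1001, "product": "laptop", "amount": 899.0}

# Multi-structured record: a raw web log line with no schema attached.
# Assumption: the line follows the Common Log Format used by many web servers.
raw_log_line = (
    '203.0.113.7 - - [10/Mar/2014:13:55:36 +0000] '
    '"GET /products/laptop HTTP/1.1" 200 2326'
)

LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<timestamp>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) \S+" (?P<status>\d{3}) (?P<size>\d+|-)'
)


def parse_clickstream(line: str) -> Optional[dict]:
    """Impose a structure on a raw log line at read time; None if it does not fit."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["status"] = int(record["status"])
    return record


click = parse_clickstream(raw_log_line)
print(click["path"], click["status"])
```

The structured record can be queried immediately, whereas the log line has to be interpreted before any analysis can start, which is one reason un-modelled data tends to need different tooling.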

The need to analyse these new kinds of data has taken us beyond the traditional data warehouse into a world where multiple analytical workloads now exist that warrant the use of new platforms beyond that of the data warehouse.

Several new analytical workloads have emerged to analyse big data.

Popular new big data analytical workloads include:

1. Analysis of data in motion
2. Complex analysis of structured data
3. Exploratory analysis of un-modelled multi-structured data
4. Graph analysis (e.g. social networks, fraud)
5. Accelerating extract, transform, and load (ETL) and analytical processing of un-modelled data to enrich data in a data warehouse or analytical appliance
6. The storage and re-processing of archived data

New big data analytical platforms are also needed to support these workloads.

To support these new analytical workloads, companies are starting to extend existing data warehouse architectures to accommodate new analytical platforms like Hadoop, NoSQL Graph DBMSs, streaming analytics systems and analytical relational DBMSs, in addition to traditional data warehouses. These are shown in Figure 1.

Figure 1

FROM DATA WAREHOUSE TO ANALYTICAL ECOSYSTEM

The demand for access to new data sources together with new analytical workloads has resulted in the need for an enterprise analytical ecosystem.

Different big data analytical workloads are suited to different big data platforms.

Many new data sources now need to be accessed to support traditional data warehouse and big data analytical workloads.

What this means is that we are extending beyond the traditional data warehouse into a new, more complex analytical ecosystem, as shown in Figure 2.

Figure 2: Big data workloads result in multiple platforms now being needed.

As a general guideline, the six new analytical workloads outlined above would likely run on the following platforms.

Analytical Workload | Big Data Platform
1. Analysis of data in motion | Streaming analytics
2. Complex analysis of structured data | Analytical appliance
3. Exploratory analysis of un-modelled multi-structured data | Hadoop
4. Graph analysis (e.g. social networks, fraud) | NoSQL Graph DBMS
5. Accelerating ETL and analytical processing of un-modelled data to enrich data in a data warehouse or analytical appliance | Hadoop
6. The storage and re-processing of archived data | Hadoop

This new analytical ecosystem consists of multiple analytical platforms, not just a data warehouse. Note the sheer number and variety of data sources shown at the bottom of Figure 2 that business now needs to access. Both structured and multi-structured data have to be captured, cleansed, integrated and provisioned across the entire analytical ecosystem. Succeeding in this more complex analytical environment is now even more dependent on the ability to access data from a much larger set of data sources both inside and outside the enterprise.
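The workload-to-platform guideline above can be captured as a simple lookup, sketched here as one possible way to record it. The names Platform, WORKLOAD_PLATFORM, platform_for and workloads_on are hypothetical and chosen only for this illustration; the pairings themselves come from the guideline, which the paper presents as a likely fit rather than a fixed rule.

```python
from enum import Enum


class Platform(Enum):
    STREAMING_ANALYTICS = "Streaming analytics"
    ANALYTICAL_APPLIANCE = "Analytical appliance"
    HADOOP = "Hadoop"
    GRAPH_DBMS = "NoSQL Graph DBMS"


# Guideline pairing of each analytical workload with its likely platform.
WORKLOAD_PLATFORM = {
    "analysis of data in motion": Platform.STREAMING_ANALYTICS,
    "complex analysis of structured data": Platform.ANALYTICAL_APPLIANCE,
    "exploratory analysis of un-modelled multi-structured data": Platform.HADOOP,
    "graph analysis": Platform.GRAPH_DBMS,
    "accelerating etl and analytical processing of un-modelled data": Platform.HADOOP,
    "storage and re-processing of archived data": Platform.HADOOP,
}


def platform_for(workload: str) -> Platform:
    """Return the guideline platform for a workload name (case-insensitive)."""
    return WORKLOAD_PLATFORM[workload.strip().lower()]


def workloads_on(platform: Platform) -> list:
    """List every workload that the guideline places on the given platform."""
    return [w for w, p in WORKLOAD_PLATFORM.items() if p is platform]


print(platform_for("Graph analysis").value)
print(workloads_on(Platform.HADOOP))
```

Reading the mapping in reverse, as workloads_on does, shows that three of the six workloads land on Hadoop, which helps explain why it features so prominently in the extended architecture.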

THE GROWTH OF SELF-SERVICE BI TOOLS

WHAT IS SELF-SERVICE BI?

Self-service BI is growing rapidly.

In addition to the emergence of a multi-platform analytical ecosystem, one area of business intelligence has experienced unrivalled growth over the last few years: self-service BI. Self-service BI can be defined as the creation of a BI environment in which business users can create and access BI reports, queries and analytics without the need for IT.

Self-service data discovery and visualisation tools have been widely adopted.

Self-service BI tools allow business analysts to access and analyse data in multiple data sources.

A new type of BI tool has emerged to support self-service BI: data discovery and visualisation tools. These include tools such as Tableau, QlikView and Tibco Spotfire. Business analysts use self-service data discovery and visualisation tools to explore data, detect patterns and produce insight without the need for IT. In particular, they use these types of tools to:

- Analyse personal data (e.g. Excel data) with more visual tools
- Quickly mash, explore and investigate personal and corporate data to determine if there is any value in it
- Explore, analyse and visualise big data
- Produce insight quickly that they can share with others

They are highly interactive and support automatic charting and visualisations that make complex data easy to understand.

There are many reasons why self-service data discovery and visualisation tools are popular with business units and business analysts. They speed time to value and allow exploration of personal and corporate data. They are highly interactive and improve business agility. Business can therefore clear the IT backlog. In addition, smarter visualisations (for example, graph matrices, bullet graphs, tree maps, heat maps) mean that information that is hard to understand in tabular form can be easily visualised.

Business analysts using these tools can produce dashboards to present information and guide people to take the correct action. They can also quickly share insights via user-assembled dashboards by publishing them to the web and to mobile devices.

THE IMPORTANCE OF MULTI-SOURCE DATA ACCESS IN A SELF-SERVICE BI ENVIRONMENT

One of the greatest strengths of self-service data discovery and visualisation tools is their ability to access multiple data sources and blend data. Business analysts can therefore answer questions quickly, even when the data they need is not all in a single database. A good example of this is shown in Figure 3.

Having access to multiple data sources is critical to allowing business analysts to produce the insights they need.

It means that business does not have to wait for IT to design a database and consolidate all the data required.

Self-service BI speeds up time to value.

Figure 3: This scenario shows how a non-IT business analyst in insurance is able to calculate net premiums and claims even when reinsurance data is not in the data warehouse.

Figure 3 shows an insurance example of how a business analyst can use a self-service data discovery and visualisation tool to access multiple data sources, such as an underwriting system, a data warehouse, ultimate loss data and reinsurance data, and so calculate net premiums and claims even when reinsurance data is not in the data warehouse.
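The kind of lightweight data blending described in the insurance scenario can be sketched in a few lines. Everything in the sketch is an illustrative assumption: the policy IDs, field names and figures are invented, and net premium and net claims are taken to be the gross amounts minus the reinsured share, which is a common simplification rather than a definition given in the paper.

```python
# Hypothetical extracts, one per source system, keyed by policy ID.
# All field names and figures are illustrative assumptions.
underwriting = {            # underwriting system: gross written premium
    "POL-1": {"gross_premium": 1000.0},
    "POL-2": {"gross_premium": 2500.0},
}
data_warehouse = {          # data warehouse: policy attributes
    "POL-1": {"line_of_business": "Property"},
    "POL-2": {"line_of_business": "Motor"},
}
ultimate_loss = {           # ultimate loss data: estimated final gross claims
    "POL-1": {"gross_claims": 400.0},
    "POL-2": {"gross_claims": 0.0},
}
reinsurance = {             # reinsurance data held outside the warehouse
    "POL-1": {"ceded_premium": 200.0, "recovered_claims": 150.0},
}

NO_REINSURANCE = {"ceded_premium": 0.0, "recovered_claims": 0.0}


def blend(policy_ids):
    """Join the four sources on policy ID and derive net premium and net claims."""
    rows = []
    for pid in policy_ids:
        ceded = reinsurance.get(pid, NO_REINSURANCE)  # a policy may have no reinsurance
        rows.append({
            "policy_id": pid,
            "line_of_business": data_warehouse[pid]["line_of_business"],
            # Assumed definitions: net = gross minus the reinsured share.
            "net_premium": underwriting[pid]["gross_premium"] - ceded["ceded_premium"],
            "net_claims": ultimate_loss[pid]["gross_claims"] - ceded["recovered_claims"],
        })
    return rows


blended = blend(sorted(underwriting))
for row in blended:
    print(row)
```

The join happens in the analyst's tool at query time, so nothing has to be loaded into the data warehouse first. That is the point the scenario makes about not waiting for IT to consolidate the data.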

CONCLUSION

The business need to understand more about customers and business operations is forcing a transition from a data warehouse to an analytical ecosystem.

The demand for big data has caused an explosion in the number of new data sources.

Self-service BI tool support for multiple data sources allows business analysts to continue to produce insights quickly, even though the number of data sources is increasing.

In this paper we have seen that the commercial business drivers outlined in the first paper in this series are driving the need to analyse structured, semi-structured and unstructured data from multiple sources inside and outside the enterprise. These business drivers are fuelling the arrival of big data in the enterprise, which in turn is forcing a transition of the analytical landscape to accommodate new analytical workloads beyond those that a data warehouse can support. The demand for big data has caused an explosion in the number of data sources, many of which require external connectivity to cloud, data marketplaces and other hosted data stores. In addition, the arrival of self-service data discovery and visualisation tools is allowing business users to quickly analyse data and produce their own actionable insights that can be easily shared with decision makers to accelerate time to action.

Given the increasing distribution of data across more and more systems and the steady increase in the number of data sources, the job of accessing data is getting harder for business analysts. The ability of self-service BI tools to analyse data from multiple data sources and to support lightweight data blending is a key reason why they are so successful.

Part 3 in this series will look at the key requirements for improving data access in an analytical environment and then explore how technology from one vendor, Progress Software, helps companies meet those requirements so that they can be successful with business intelligence.

About Intelligent Business Strategies

Intelligent Business Strategies is a research and consulting company whose goal is to help companies understand and exploit new developments in business intelligence, analytical processing, data management and enterprise business integration. Together, these technologies help an organisation become an intelligent business.

Author

Mike Ferguson is Managing Director of Intelligent Business Strategies Limited. As an analyst and consultant he specialises in business intelligence and enterprise business integration. With over 32 years of IT experience, Mike has consulted for dozens of companies on BI/Analytics, big data, data governance, master data management and enterprise architecture. He has spoken at events all over the world and written numerous articles and blogs providing insights on the industry. Formerly he was a principal and co-founder of Codd and Date Europe Limited, the inventors of the Relational Model, a Chief Architect at Teradata on the Teradata DBMS, and European Managing Director of Database Associates, an independent analyst organisation. He teaches popular master classes in Big Data Analytics, New Technologies for Business Intelligence and Data Warehousing, Enterprise Data Governance, Master Data Management, and Enterprise Business Integration.

Water Lane, Wilmslow
Cheshire, SK9 5BG
England
Telephone: (+44)1625 520700
Internet URL: www.intelligentbusiness.biz
E-mail: info@intelligentbusiness.biz

Improving Access to Data for Successful Business Intelligence Part 2
Copyright 2014 by Intelligent Business Strategies
All rights reserved
