Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006



Similar documents
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

HROUG. The future of Business Intelligence & Enterprise Performance Management. Rovinj October 18, 2007

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Data Warehousing and Data Mining in Business Applications

Data Warehouse: Introduction

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Integrating Netezza into your existing IT landscape

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

MDM and Data Warehousing Complement Each Other

Retail POS Data Analytics Using MS Bi Tools. Business Intelligence White Paper

QAD Business Intelligence

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Five Technology Trends for Improved Business Intelligence Performance

<Insert Picture Here> Real-Time Data Integration for BI and Data Warehousing

Business Intelligence in Oracle Fusion Applications

Data Warehousing Systems: Foundations and Architectures

Introduction to Business Intelligence

Chapter 4 Getting Started with Business Intelligence

Driving Peak Performance IBM Corporation

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Master Data Management and Data Warehousing. Zahra Mansoori

IST722 Data Warehousing

Outline. BI and Enterprise-wide decisions BI in different Business Areas BI Strategy, Architecture, and Perspectives

10 Biggest Causes of Data Management Overlooked by an Overload

BI, Analytics and Big Data A Modern-Day Perspective

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

Oracle Business Intelligence 11g Business Dashboard Management

IT FUSION CONFERENCE. Build a Better Foundation for Business

Oracle Business Intelligence Applications The Value of Cross-Functional BI. Darryn Hinett Business Solutions Consultant

Escape from Data Jail: Getting business value out of your data warehouse

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Building a Data Warehouse

Business Intelligence Solution for Small and Midsize Enterprises (BI4SME)

Deploying Governed Data Discovery to Centralized and Decentralized Teams. Why Tableau and QlikView fall short

Establish and maintain Center of Excellence (CoE) around Data Architecture

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

Business Intelligence: Effective Decision Making

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Innovative technology for big data analytics

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

Data Integration Alternatives & Best Practices

Performance Management for Enterprise Applications

Business Usage Monitoring for Teradata

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

SQL Server 2012 End-to-End Business Intelligence Workshop

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative

Executive Dashboards: Putting a Face on Business Service Management

Understanding the Value of In-Memory in the IT Landscape

Syslog Analyzer ABOUT US. Member of the TeleManagement Forum

Introducing Oracle Exalytics In-Memory Machine

Self-Service Business Intelligence: The hunt for real insights in hidden knowledge Whitepaper

Unlock the business value of enterprise data with in-database analytics

Lection 3-4 WAREHOUSING

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Oracle Daily Business Intelligence. PDF created with pdffactory trial version

CDCR EA Data Warehouse / Strategy Overview. February 12, 2010

DATA MINING AND WAREHOUSING CONCEPTS

2009 Oracle Corporation 1

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

<Insert Picture Here> The Age of the Pure Play BI Vendor is Over

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

Ten Things You Need to Know About Data Virtualization

Master Data Management. Zahra Mansoori

Oracle Business Intelligence Suite Enterprise Edition

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Data Warehouse (DW) Maturity Assessment Questionnaire

Business Intelligence at the University of Minnesota

Safe Harbor Statement

Welcome to online seminar on. Oracle Agile PLM BI. Presented by: Rapidflow Apps Inc. January, 2011

A Service-oriented Architecture for Business Intelligence

Oracle BI Suite Enterprise Edition

Enterprise Information Management and Business Intelligence Initiatives at the Federal Reserve. XXXIV Meeting on Central Bank Systematization

Presented by: Jose Chinchilla, MCITP

Application of Predictive Analytics for Better Alignment of Business and IT

Ramesh Bhashyam Teradata Fellow Teradata Corporation

The difference between. BI and CPM. A white paper prepared by Prophix Software

Inge Os Sales Consulting Manager Oracle Norway

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

When to consider OLAP?

Implementing Oracle BI Applications during an ERP Upgrade

EII - ETL - EAI What, Why, and How!

Oracle Database In-Memory The Next Big Thing

Transcription:

Practical Considerations for Real-Time Business Intelligence Donovan Schneider Yahoo! September 11, 2006

Outline Business Intelligence (BI) Background Real-Time Business Intelligence Examples Two Requirements of Real-Time BI Architecture Alternatives Conclusions and Open Research Challenges

Business Intelligence Market business analytics software market comprises tools and applications for tracking, storing, analyzing, modeling, and presenting data in support of automating decision- making and reporting processes, IDC 20 15 10 5 Revenue (billions) 0 2001 2003 2004 2008

Progression of BI Transactional reporting give me my reports Data warehousing/olap explore data to find interesting patterns/details Business Performance Management (BPM) how am I tracking to business goals? Guided Analytics/Business Activity Monitoring where should I look next? Tactical decisions what do I do right now?

Selected Business Intelligence Vendors

Examples of Real-Time Business Intelligence

Real-Time Enterprise BI Applications Recommendations Collaborative filtering, e.g., people who like X also like Y Timeliness (freshness of data): hours Fraud Detection Detect anomalies in credit card usage Timeliness: minutes Call Center Provide next best offer or action (cross-sell, up-sell) Timeliness: minutes Close of books Track deals at quarter close to grant/refuse contract concessions Timeliness: minutes Defect/Incident Tracking Track open/closed incidents Timeliness: minutes

Real-Time Web Analytics Examples Web Page Usage Analyze web page usage (page views, ad views, link views, clicks) by property, geography, user demographics, referrer, etc. Timeliness: hours/next day Ad Campaign Effectiveness Bid for search terms on Yahoo!, Google, MSN. Analyze click-thru and conversion rates Timeliness: minutes

Real-Time Web Analytics Examples Targeting Display an ad or content based on demographic profile, geographic location or behavior Timeliness: minutes Experimentation Run A/B or multivariate test on page content/layout. Analyze user engagement, clickthru rate, etc. Timeliness: hours Verify Experiment Planning Search Term Analytics Find most popular search terms by geography, gender, age range, etc. Timeliness: hours Conclude Run, Collect & Monitor Experiment Design

Business Activity Monitoring and Operational Performance Management Measure and monitor real-time business events within the enterprise to improve business performance More than real-time alerts Integrate, aggregate, correlate to improve business processes Examples Real-time inventory analysis Timeliness: minutes

Requirements for Real-Time Business Intelligence

Requirement #1: Time is Money Nothing is free; it costs money to reduce data latency Specialized hardware (clusters, large memories, high bandwidth networks, fault tolerant) Specialized software highly-available, fault-tolerant, high performance Integrated systems The business decisions to be made with reduced latency must justify the investment

Requirement #2: Actionable Data Context Must provide contextual information to aid decision making Typically requires access to detailed data Typically requires access to trending/historical data All silo d solutions will ultimately fail this Audience Provide role-specific views of data (e.g., sales rep, sales manager, district manager, executive), task-specific views, etc. Data Data must be clean (normalized, conformed) Time Timely presentation of data to decision maker

Challenges in real-time BI Data Scale Performance, Performance, Performance Low latency data delivery Consistent response times Caching often used Cost Performance/low-latency costs money High Availability Servers, network, databases, middleware, applications Integration

Architecture Alternatives

Architecture Alternatives Custom Solutions Enterprise Data Warehouse (EDW) Federated system (virtual EDW) Streaming

Custom Solutions Build specialized systems to meet latency needs Pros Optimize for specific needs Initial development cost is low for data marts Cheap enough that department VP can purchase Can adapt quickly to meet changing business needs Cons Lack of integration with contextual data Lack of integration with detailed data Multiple, competing sources of truth Scalability (#of users, amount of data) Lack of shared services ETL processes, security, reporting, DBAs Tend to proliferate across an enterprise; overall cost to company is high

Enterprise data warehouse (EDW) Consolidate data marts into a central data warehouse Pros All non-production OLTP data sources in a single system No multi-database joins One system to administer and operate Single source of truth Cons Many EDWs fail for technical and organizational reasons Departments lose control of their data Departments lose agility Difficult to deliver incrementally due to conforming dimensions Difficult to support high volume, low latency ETL along with complex, ad-hoc decision support Not real-time Costs up to $50 million for a large organization Examples Oracle/Teradata/DB2 EDW, SAP/BW

Virtual EDW Provide federated/virtual view(s) of enterprise data Pros Each source system is optimized for a specific need or workload Cubes, data warehousing, OLTP Departments retain some control over their systems/data Incremental build out (unlike an EDW), modulo conforming dimensions Cons Requires conformed dimensions across systems Some source systems restrict query access (e.g., OLTP systems will not allow large queries) Security unification across disparate systems Updates must be coordinated (between OLTP, DW, caches, etc.) Sophistical SQL generation to optimally access data sources Sophistical execution engine to compensate for data source limitations High availability and problem diagnosis hampered by multiple systems Examples Oracle Business Intelligence EE (formerly known as Siebel Analytics) SAP/BW

Streaming, Business Activity Monitoring, Operational BI Make rapid decisions based on large volumes of data Pros Optimized for low latency (few disk accesses) Cons Data inconsistency (late, missing data) Requires very high availability Extend SQL for streaming operations Integration with other sources can be a challenge Examples Hedge funds processing ticker feeds for arbitrage Fraud detection Revenue alerting

Conclusions and Open Problems

Conclusions Some applications are demanding, and willing to pay for, very low latency access Most applications do not require latency in the seconds granularity Delivery may be real-time (seconds), e.g., targeting, alerting, recommendations but underlying data can be less fresh Common evolution strategy is to increase frequency of ETL operations Mini-batch ETL, e.g., load every 10 minutes Requires fast, scalable, and high availability load, clean, transform and aggregate

Research Challenges in real-time BI Data scalability User scalability Decision making is reaching deeper in organizations Breadth of data access Providing context for decision making requires accessing diverse set of data sources Query Language Cost Performance Efficient, incremental algorithms Very large dimensions (millions to hundreds of millions members) Ad-hoc vs. Production How to support dynamic, mixed workloads High Availability/Fault Tolerance Production decision making systems cannot fail Backup/Recovery