Open Source Business Intelligence in the Cloud

Similar documents
Mike Boyarski Jaspersoft Product Marketing Business Intelligence in the Cloud

College of Engineering, Technology, and Computer Science

System Requirements Table of contents

Cost-Effective Business Intelligence with Red Hat and Open Source

Editions Comparison Chart

Sugar Professional. Approvals Competitor tracking Territory management Third-party sales methodologies

How To Choose Cloud Computing

Cloud Computing. Adam Barker

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Sugar Professional. Approvals Competitor tracking Territory management Third-party sales methodologies

OTM in the Cloud. Ryan Haney

Corso di Reti di Calcolatori L-A. Cloud Computing

RRF Reply Reporting Framework

Definitions. Hardware Full virtualization Para virtualization Hosted hypervisor Type I hypervisor. Native (bare metal) hypervisor Type II hypervisor

MicroStrategy Cloud Reduces the Barriers to Enterprise BI...

Open Source Business Intelligence Intro

ECE6130 Grid and Cloud Computing

BI in the Cloud Sky is the limit

Business Intelligence for the Modern Utility

Big Data & Cloud Computing. Faysal Shaarani

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Building Success on Acquia Cloud:

Hexaware E-book on Q & A for Cloud BI Hexaware Business Intelligence & Analytics Actionable Intelligence Enabled

Navigating The World of Cloud Computing

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

Cloud Computing and Amazon Web Services. CJUG March, 2009 Tom Malaher

Using Cloud Services for Test Environments A case study of the use of Amazon EC2

Big data blue print for cloud architecture

System Requirements. SAS Regular Price Optimization 4.2. Server Tier. SAS Regular Price Optimization Long Jobs Server

Outline. What is cloud computing? History Cloud service models Cloud deployment forms Advantages/disadvantages

The Definitive Guide. Monitoring the Data Center, Virtual Environments, and the Cloud. Don Jones

Leveraging Public Clouds to Ensure Data Availability

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

View Point. Performance Monitoring in Cloud. Abstract. - Vineetha V

ur skills.com

Business Intelligence as a Service for the G-Cloud

White Paper on CLOUD COMPUTING

Denver based Business Intelligence Consulting Company

Build Your Own Performance Test Lab in the Cloud. Leslie Segal Testware Associate, Inc.

An Introduction to Cloud Computing Concepts

ArcGIS for Server: In the Cloud


Virtual Server and Storage Provisioning Service. Service Description

Cloud Computing. What is Cloud Computing? Computing As A Utility. Resource Provisioning Problem 4/29/2014. Kai Shen

Data Centers and Cloud Computing

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Last time. Today. IaaS Providers. Amazon Web Services, overview

How To Understand Cloud Computing

The Cloud as a Computing Platform: Options for the Enterprise

Oracle Applications and Cloud Computing - Future Direction

Pentaho Reporting Overview

DEMAND SMARTER, FASTER, EASIER BUSINESS INTELLIGENCE

Storage and Disaster Recovery

Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study

9/26/2011. What is Virtualization? What are the different types of virtualization.

Demystifying the Cloud Computing

QlikView Business Discovery Platform. Algol Consulting Srl

Open Source on the Trading Desk

Cloud Vendor Benchmark Price & Performance Comparison Among 15 Top IaaS Providers Part 1: Pricing. April 2015 (UPDATED)

The Future of Data Management

Using WebSphere Application Server on Amazon EC2. Speaker(s): Ed McCabe, Arthur Meloy

Disrupting The Market: Predictive Analytics As A Service

Open Source and Cloud. Ramesh Kumar

Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework

AQUA Private Registry Cloud Technical Overview

The Cisco Powered Network Cloud: An Exciting Managed Services Opportunity

When talking about hosting

Oracle Database Cloud Service Rick Greenwald, Director, Product Management, Database Cloud

Cloud Computing. Chapter 1 Introducing Cloud Computing

CLOUD COMPUTING: THE EMERGING COMPUTING TECHNOLOGY. Feng-Tse Lin and Teng-San Shih. Received May 2010; accepted July 2010

Patrick Beaucamp Founder of the Vanilla Project Mail : Patrick.beaucamp@bpm-conseil.com

Desktop Activity Intelligence

Dimension Data Enabling the Journey to the Cloud

Introduction What is the cloud

Comparing major cloud-service providers: virtual processor performance. A Cloud Report by Danny Gee, and Kenny Li

Running Oracle Applications on AWS

Pentaho Update and Discussion with Curt Monash. Jared Cornelius, Product Marketing Manager Lance Walter, VP of Marketing

Cloud computing - Architecting in the cloud

Business Intelligence on a Budget: Open Source BI. Paul O Rorke

Cloud Computing. Figure 1

Cloud Models and Platforms

Cloud Computing Technology

Business applications:

Introducing AI. A new, free, comprehensive way to manage servers. For press inquiries please contact:

How AWS Pricing Works

RED HAT ENTERPRISE VIRTUALIZATION

Pentaho Data Mining Last Modified on January 22, 2007

Model-driven Performance Estimation, Deployment, and Resource Management for Cloud-hosted Services

Cloud Courses Description

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

CRM s Dirty Little Secret: How to Avoid CRM Sticker Shock

IBM Impact 2012 Conference. Don t Underestimate Monitoring in the Cloud! Rodney Morrison VP Products SL Corporation

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.

Cloud Essentials for Architects using OpenStack

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Product Information. Sugar vs Zoho. Features Comparison

Cloud Computing Workload Benchmark Report

SpagoBI: the 100% open source, complete and flexible Business Intelligence suite

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper October 2010

Transcription:

Open Source Business Intelligence in the Cloud July 2009 OpenBI, LLC 2009 1

Business Intelligence in the Age of Collaboration Billions of connected individuals can now actively participate in innovation, wealth creation, and social development in ways we once only dreamed of. And when these masses of people collaborate they collectively can advance the arts, culture, science, education, government, and the economy in surprising but ultimately profitable ways. Don Tapscott and Anthony D. Williams OpenBI, LLC 2009 2

Why Open Source Software? Commercial Open Source is changing the rules Customer Control Free, global access to software Removal of license fee amortization Annual proof-of-value for vendors Lower Costs < 50% of the software cost of proprietary alternatives Better Technology Modern, open architectures Global innovation engine OpenBI, LLC 2009 3

Sampling of OSBI Related Technologies Open Source Business Intelligence Technologies Business Intelligence Platforms & Technologies BIRT Project Database Platforms Statistical Analysis/Data Mining Software * * WEKA is part of the Pentaho BI Suite OpenBI, LLC 2009 4

Free Open Source vs. Commercial Open Source Free Open Source differs from Commercial Open Source in a number of ways. Free Open Source Informal support and broad services providers Uneven velocity of change Community-directed roadmap Functional gaps Challenging licensing provisions Source: Pentaho Commercial Open Source Formal support with service level agreements (SLAs) Indemnification Professional services and partnerships Product management and roadmaps, and advisory boards Business-friendly subscription models Reference accounts, cases studies, and user groups OpenBI, LLC 2009 5

Differing Philosophies on Open Source I think it addresses a niche market for high-end data analysts that want free, readily available code. We have customers who build engines for aircraft. I am happy they are not using freeware when I get on a jet. Anne H. Milley, director of technology product marketing at SAS It s interesting that SAS Institute feels that non-peerreviewed software with hidden implementations of analytic methods that cannot be reproduced by others should be trusted when building aircraft engines. Dr. Frank Harrell, Professor of Biostatistics and Department Chair at Vanderbilt University and R Community Member OpenBI, LLC 2009 6

What Is Cloud Computing? Cloud computing is ondemand access to virtualized IT resources that are housed outside of your own data center, shared by others, simple to use, paid for via subscription, and accessed over the Web. John Foley, Information Week, September 2008 Seven Principles: Off-site Virtual On-demand Subscription Shared Simple Web-Based or, use how Larry Ellison described it: "idiocy," "crap," "gibberish," "crazy," and "stupidest" OpenBI, LLC 2009 7

What isn t Cloud Computing (our def) Google Apps Maybe a Cloud Application End User cannot determine WHAT they run IBM Computing on Demand Not truly on demand Activate physical processors already within a Box Not utility based (yet) Microsoft Azure Limited to the Azure technology Etc. OpenBI, LLC 2009 8

The Amazon Elastic Computing Cloud Powered By Open Source If an economic downturn cools IT capital spending, some business technology managers may turn to rent-by-the-hour cloud computing resources If they turn to Amazon EC2, they're tapping into open source Linux, Apache, and a tweaked Xen open source hypervisor that powers much of the company's cloud's operation. Information Week, November 2008 OpenBI, LLC 2009 9

Amazon Elastic Computing Cloud (EC2) Most well known Cloud Computer Allows customization of Amazon Machine Images that can be started and run on demand Different instance sizes from small 32-bit (1 CPU, 1.7GB RAM equivalent) to extra large 64 bit (8 CPU, 15GB RAM) or extra large 64 bit, high CPU (20 CPU, 7GB RAM) Runs varied operating systems (Linux, Windows) and charged on an hourly basis (Windows is 25-50% more expensive) Can attach persistent storage to an instance, charged by the GB Accessed via command line or web interface Some data charges apply for transfer in and out of Amazon Competitors: IBM (Computing On Demand), Google (App Engine), AT&T (Synaptic), Microsoft (Azure), Sun (OpenCloud) Rackspace, Flexiscale, GoGrid OpenBI, LLC 2009 10

Open BI in the Cloud: Danone/Nutricia Client Profile Nutricia, a division of Danone, specializes in Baby and Medical Nutrition products. They provide medical nutrition for the management of conditions such as milk protein allergy, inborn errors of metabolism (e.g., PKU), pediatric epilepsy, Alzheimer s & more. Nutricia markets its products across 19 countries. Project Background Internal order management system was limited in providing analytical insights on products, product groupings, time, customer, or geographic analysis in the aggregate. Scope Build a pilot analytical database and web-based business intelligence application to allow business users to see a high level snapshot of business performance, and be able to drill into detailed order and invoice activity to reveal performance trends. OpenBI, LLC 2009 11

Cloud BI In Action: Danone/Nutricia Web-based sales performance dashboard provides quick analysis on sales activity for products, customers, and sales regions Allows drilling from dashboard into interactive OLAP analysis sessions for a deeper look at sales activity OpenBI, LLC 2009 12

Community Dashboard Framework Demonstrates open source community development in action Started by Ingo Klose and Pedro Alvares, first community releases were in 2008 Provides a framework and templates for simple dashboard building Includes basic selection/filtering objects, including text boxes, multi-select pick lists, calendar date selections, check boxes, etc. Uses the Pentaho platform s guts to provide data from databases, transforms, etc. Allows Pentaho reports, charts, OLAP sessions and other objects to be embedded in the dashboard. Version 3.0 released Jan 2009 OpenBI, LLC 2009 13

Environment Overview The pilot environment is hosted on an Amazon EC2 Large (Approx 8GB, 4CPU) Instance. This instance contains: JBoss Web Server Pentaho BI Suite Includes custom web page templates, charts, and OLAP views Community Dashboard Framework Pentaho Data Integration Includes custom ETL Routines to extract, transform and load data from the operational systems. MySQL Database Stores BI database, Pentaho Repository, and User Database OpenBI, LLC 2009 14

Environment Overview Amazon EC2 Large Instance JBoss Application Server Pentaho BI Suite Data Integration (ETL) Community Dashboard Analysis (OLAP) Elastic Block Store MySQL 5.0 userdb hibernate bi Encrypted, Secure VPN US Order Management Order Entry Canada Order Management Order Entry Web Browser OpenBI, LLC 2009 15

Case Study - Conclusion Project took approx 6 weeks, including requirements, design, build and deploy to the cloud Has been operating since July 2008 Users within and outside of the client s walls have secure access to performance metrics OpenBI, LLC 2009 16

Five Key Cloud Considerations 1. Everything is the same, and everything is different OS is the same Software installation and configuration is the same Differences: Connectivity Secure connections over the internet Persistence Native file system is transient across reboots» Internal Instance Storage vs. Elastic Block Store Determine what is included in the instance image» Isolate software versus data» Use the Elastic Block Store for persistence OpenBI, LLC 2009 17

Five Key Cloud Considerations 2. Plan your sizing (at least a little) Pick your bits Upfront decision between 32bit and 64bit Starting with small Reduces upfront price but requires new AMI image creation for upsize Consider your end platform need Pick your up time 8-5, 6-6, WeekDays Only Reserved Instances (16hr/day) Remember data persistence 3. Plan your ETL extraction technique (pay by the byte) JDBC Requires direct connection to source DB File transfer S/FTP Requires extracts OpenBI, LLC 2009 18

Five Key Cloud Considerations 4. Plan your security MSAD utilized Required VPN 24 x 7 (availability) Mirror RDBMS on Cloud Required ETL/Process 5. Your end-user functionality can be identical Use DNS server aliasing Mercy of internet Has not been an issue, for us, but could for data dumps OpenBI, LLC 2009 19

Thank You! Kevin Haas kevin.haas@openbi.com OpenBI, LLC 2009 20