The REAL Big Data Actually using smart sensors and other time sensitive data



Similar documents
Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Unprecedented Performance and Scalability Demonstrated For Meter Data Management:

Tableau Server 7.0 scalability

Informix The Intelligent Database for IoT

FAQ: HPA-SQL FOR DB2 MAY

Oracle Database In-Memory The Next Big Thing

Oracle Exadata Database Machine Aké jednoznačné výhody prináša pre finančné inštitúcie

Managing big data for smart grids and smart meters

FOR A FEW TERABYTES MORE THE GOOD, THE BAD and THE BIG DATA. Cenk Kiral Senior Director of BI&EPM solutions ECEMEA region

Inge Os Sales Consulting Manager Oracle Norway

Introducing Oracle Exalytics In-Memory Machine

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Big Data Use Cases Update

Using the HP Vertica Analytics Platform to Manage Massive Volumes of Smart Meter Data

USING ANALYTICS TO IMPROVE SUPPLY CHAIN PERFORMANCE

2009 Oracle Corporation 1

Big Data and Its Impact on the Data Warehousing Architecture

TUT NoSQL Seminar (Oracle) Big Data

Optimizing Your Database Performance the Easy Way

The Internet of Things (IoT)

Oracle Big Data SQL Technical Update

Understanding the Value of In-Memory in the IT Landscape

Tableau Server Scalability Explained

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

Big Data Are You Ready? Thomas Kyte

Systems of Discovery The Perfect Storm of Big Data, Cloud and Internet-of-Things

One of the database administrators

Agility Database Scalability Testing

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Big Data: Are You Ready? Kevin Lancaster

Maximum performance, minimal risk for data warehousing

Big Data Analytics Nokia

Understanding the impact of the connected revolution. Vodafone Power to you

WHITEPAPER BEST PRACTICES

Fleet Health Monitoring and Machine Learning Technology for CBM+

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Scalability and Performance Report - Analyzer 2007

The Analytics Value Chain Key to Delivering Value in IoT

Intelligent Government From Data to Decision. Robert Lindsley Oracle, Public Sector Technology Group

Holistic Performance Analysis of J2EE Applications

Creating business outcomes for Building Operators with the Internet of Things

VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here

DIALSOURCE. Sales Acceleration Software

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

Business opportunities from IOT and Big Data. Joachim Aertebjerg Director Enterprise Solution Sales Intel EMEA

Cloud Computing Capacity Planning. Maximizing Cloud Value. Authors: Jose Vargas, Clint Sherwood. Organization: IBM Cloud Labs

Oracle Database Public Cloud Services

Big Data & the LAMP Stack: How to Boost Performance

Introduction to Database as a Service

The Internet of Things

ORACLE DATABASE 10G ENTERPRISE EDITION

Smarter Buildings & Management of Buildings

White Paper. Making Sense of the Data-Oriented Tools Available to Facility Managers. Find What Matters. Version 1.1 Oct 2013

Scaling Your Data to the Cloud

Overview: X5 Generation Database Machines

BYOzzzz: Focusing on the Unsolved Challenges of Mobility, An Industry Perspective

INTRODUCTION. IoT AND IP STRATEGIES

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

Dynamic M2M Event Processing Complex Event Processing and OSGi on Java Embedded

High Velocity Analytics Take the Customer Experience to the Next Level

Exadata Database Machine

2012 AnnuAl MArket OutlOOk & FOrecAst SUMMARY REPORT

<Insert Picture Here> Oracle and/or Hadoop And what you need to know

Crystal Reports Server 2008

Big Data. Fast Forward. Putting data to productive use

ORACLE DATABASE 12C IN-MEMORY OPTION

Virtuoso and Database Scalability

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

Lab Validation Report

The Internet of Things. Giles Norman MobileFirst Consulting Manager, IBM. Daniel Dombach Director EMEA, Industry Solutions, Zebra Technologies

The Impact of IoT for Building Service Contractors

Innovation Session BIG DATA. HP EMEA Software Performance Tour 2014

Go with the flow: Transportation information management solutions from IBM.

OECD Paris 26 October 2011

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture

EVERYTHING THAT MATTERS IN ADVANCED ANALYTICS

Zero-in on business decisions through innovation solutions for smart big data management. How to turn volume, variety and velocity into value

White paper: Unlocking the potential of load testing to maximise ROI and reduce risk.

The NexTraq Difference

Safe Harbor Statement

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

Towards Smart and Intelligent SDN Controller

Data Analytics The New Growth Opportunity for Software Developers

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

Oracle on Oracle. Hans Peter Kipfer Vice President, Engineered Systems EMEA

Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell

Remote Support Competitive Overview

Driving Business Value Through Big Data Analytics Creating Winners And Disruptors

What are the top new features of DB2 10?

ELECTRIC VEHICLE CHARGING SOLUTIONS FOR FLEETS

SAP HANA. SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence

Key Attributes for Analytics in an IBM i environment

The Internet of Things

Maximum Availability Architecture

See the wood for the trees

Transcription:

The REAL Big Data Actually using smart sensors and other time sensitive data Tom Rieger IBM tomrieger@us.ibm.com 952-221-6514

A CASE STUDY: Smart Sensor data: The next BIG DATA challenge with always-on data generators GPS devices on moving vehicles (buses, trains, trucks, cars), smart meters for utilities and all kinds of remote sensing equipment. This is not just an incremental increase; this is 1000x of times increase. How do you load it, process it, query it and gain the insight? And how do you do it economically? THIS IS NOT A SOLUTION TRADITIONAL RELATIONAL MODELS SOLVE

City of Seattle moving to real-time route, passenger and fleet mgmt

Traffic sensing

Traffic sensing

So let s do some math In normal relational models, each one is a database row No matter what you throw at this, it will not scale if you just keep adding rowafter-row BTW 100,000 utility smart meters is equal to a city with population about ~200,000

The CASE Study The primary electric company for North Texas Completing a roll out of ~3.5 million smart meters 3 million residential and 500,000 business customers As of right now, adding 40,000 a week until completed this fall Currently doing reads every 15 minutes Planning to go to 5 minute for commercial customers They discovered early in the project when they benchmarked 1 million meters of data for 90 days they were hitting a wall.

The problem Traditional Database was barely keeping up with the data Original database taking about 7 hours to read in the data for 1 million meters Reports taking between 2 and 7 hours to run Problems with reading and writing data simultaneously What happens when they hit 3.5 million meters? 73 billion records stored will grow to 220 billion records Even with linear scaling performance inadequate not be enough time in the day to read and process all this data Client is looking for ways to cut their costs 90 days of data for 1 million meters was ~1.3TB. What happens when they hit full roll out and look to store it for 3 years? They would like to do more with the hardware they have Process the data more quickly Allow other applications to run against the data

The Solution - Informix with TimeSeries INFORMIX Oracle Time to Load 1,000,000 Meters 18 Minutes 7 Hours Reports Seconds to 11 min 2-7 HOURS Storage Requirements 1,000,000 Meters 350GB 1.3 TB Informix performance and storage comparison is linear with more meter data Informix gives even better results if you increase CPUs and storage

What Informix does different Traditional relational answer ROWS!!! INSERT INSERT INSERT And this would create new records in a table For our case study, that would be 15 minute read rates for 3.5 million meters = ~122,640,000,000 rows PER YEAR Informix s answer There is a column datatype type time series You still insert, but the datatype grows to the right (like an array) Another way to look at it is the datatype grows like a columnar table - vertically For our case study, that would be 15 minute read rates for 3.5 million meters = 3,500,000 rows

The solution Informix with time series A relational database that is still one of the top used in corporations today You make a reservation at Hyatt, you touch Informix You do online banking with Bank of America, you touch Informix Wal-Mart has over 18,000 machines running Informix..with 2 DBAs CISCO embeds Informix in 25 different products they sell every day It is Scalable Starting small and scaling up can happen on the fly AND autonomic/self healing/self running/self tuning/secure All tuning and maintenance happens while the machine is running AND extensible Pre-packaged extensions like time series, spatial, text search, etc and the ability to code your own using SPL, C and JAVA extensions AND highly concurrent 10s of users to 10,000s concurrent users AND mixed workloads Balance transactional work against those ugly queries AND all transparent to your application, tools and code

What is next? Better analytics If you can ask comparative questions with a few second response time across meter data, think of the opportunity Roll trucks smarter, better predict power needs, when and where to buy power. Ask what if questions with greater granularity Enable spatial analysis Informix also allows them to ask mapping questions with spatial extension NOW you can compare WHEN and WHERE components like never before.

So everyone loves a good comparison.. Oracle published a benchmark around their ability to process smart meter data in September 2011 IBM published a benchmark around Informix s ability to process smart meter data in October, 2011 The solution did this using a meter data management (MDM) solution over the respective databases: Loading of smart meter data from millions of meters Ran the data thru a VEE (Validation, Estimation and Editing) phase Processed a set of meters thru a billing cycle So I felt compelled to show the results

Meter Data Management benchmark Metric Oracle Informix Difference Date published Sept, 2011 Oct, 2011 CPU cores for database 96 16 6x fewer Number of meters 5,500,000 100,000,000 18x more load and validate per sec 277,777 421,000 1.5x more per processor performance 2,894 26,313 9x more Power Needs Max 14 kw Max 1.6 kw 8.75x less IBM P770/Informix Oracle/Sun Exadata

So what is my point? It is possible for any sized organization to capture and analyze sensor data It does not take a refrigerator to process it and scary storage subsystems to manage it. You can go from automation to anticipation Based on X, you get Y Greater amps pulled by a motor, means a filter needs cleaning Lower MPG means tune up is needed After X miles, time to get tires checked If tomorrow s weather is 92F and sunny, this city will consume 200MW/hr of power through this substation If across a product line, a consistent product failure takes place, readjust Y (maint, usage, replace bad part, some combination).

Key Takeaways Traditional relational database structures do not manage sensor data well Load Query Cost to store Non-scalable IBM Informix with time series solves this problem Open relational database Extends to store, access and secure other data Like time series, spatial and others. 66% less storage this pays for it by itself Load data 100x times faster Over 100 functions in the database for time series

Next steps and questions Download and try it anytime www.ibm.com/informix and click trials and demos and download the Developer Edition The sample database has time series data in it to walk you through the capabilities Check out www.youtube.com and search on Informix time series Ask me for further details Tom Rieger IBM tomrieger@us.ibm.com 952-221-6514