Analytics and Big Data with the PI System Part 2: Statistical Analytics



Similar documents
The PI System and Hadoop: Unleash the Power of Big Data

Extend your analytic capabilities with SAP Predictive Analysis

Copyright 2016 OSIsoft, LLC

Copyright 2013 OSIsoft, LLC. 1

Using Tableau Software with Hortonworks Data Platform

ANALYTICS CENTER LEARNING PROGRAM

HDP Enabling the Modern Data Architecture

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

A Case Study of Hadoop in Healthcare

Cisco Data Preparation

Big Data and Data Science: Behind the Buzz Words

Analytics Industry Trends Survey. Research conducted and written by:

The 4 Pillars of Technosoft s Big Data Practice

FUTURE DATA Yes, it s finally coming to the PI System!

Actionable Knowledge from Refined Data with Microsoft Business Intelligence

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

This Symposium brought to you by

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns

Advanced Big Data Analytics with R and Hadoop

The Future of Data Management

Big Data and Data Science. The globally recognised training program

Building Your Big Data Team

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Architecture & Experience

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Predictive analytics for the business analyst: your first steps with SAP InfiniteInsight

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

MapR: Best Solution for Customer Success

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look

HADOOP. Revised 10/19/2015

COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES

Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012

The BIg Picture. Dinsdag 17 september 2013

SAP Big Data Helping Government Run Like Never Before

Business Intelligence with the PI System and PowerPivot

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

What is a Petabyte? Gain Big or Lose Big; Measuring the Operational Risks of Big Data. Agenda

DATA VISUALIZATION: CONVERTING INFORMATION TO DECISIONS DAVID FRONING, PRINCIPAL PRODUCT MANAGER

Cisco IT Hadoop Journey

Big Data Analytics with Spark and Oscar BAO. Tamas Jambor, Lead Data Scientist at Massive Analytic

Big Data Executive Survey

Big Data Can Drive the Business and IT to Evolve and Adapt

Sunnie Chung. Cleveland State University

Big Data? Definition # 1: Big Data Definition Forrester Research

Integrating Predictive Models and Microsoft BI

Industry-Driven Master Certificate in

Big Data Scenario mit Power BI vs. SAP HANA Gerhard Brückl

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

Apache Hadoop: The Big Data Refinery

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Oracle Big Data SQL Technical Update

SAP HANA SPS 09 - What s New? HANA IM Services: SDI and SDQ

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

BIG DATA What it is and how to use?

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Microsoft SQL Server 2012 with Hadoop

Comprehensive Analytics on the Hortonworks Data Platform

Cost-Effective Business Intelligence with Red Hat and Open Source

We are building the next generation of Big Data and Analytics solutions!

HDP Hadoop From concept to deployment.

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Value Realization at Johnson Controls using SAP HANA smart data integration Steve Carpenter Johnson Controls Ryan Champlin - SAP

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Big Data and the SAP Data Platform Including SAP HANA and Apache Hadoop Balaji Krishna, SAP HANA Product Management

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

Big Data Technologies Compared June 2014

Data processing goes big

SAP and Hortonworks Reference Architecture

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting an Industrial Sensor Data Platform for Big Data Analytics

The Definitive Guide to Data Blending. White Paper

Leveraging Machine Data to Deliver New Insights for Business Analytics

Data Management in SAP Environments

How To Handle Big Data With A Data Scientist

BIG DATA & DATA SCIENCE

Empower Your organization with

An Overview of Predictive Analytics for Practitioners. Dean Abbott, Abbott Analytics

Salesforce.com and MicroStrategy. A functional overview and recommendation for analysis and application development

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Microsoft Big Data. Solution Brief

Understanding NoSQL on Microsoft Azure

May 2015 Robert Gibbon & Jochen Stroobants

In-Memory or Live Reporting: Which Is Better For SQL Server?

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Microsoft Business Intelligence

High Performance Predictive Analytics in R and Hadoop:

Business Intelligence. Advanced visualization. Reporting & dashboards. Mobile BI. Packaged BI

Big Data Services From Hitachi Data Systems

How to Run a Successful Big Data POC in 6 Weeks

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

Online Courses. Version 9 Comprehensive Series. What's New Series

Apache Hadoop Patterns of Use

Transcription:

Analytics and Big Data with the PI System Part 2: Statistical Analytics Presented by Matt Ziegler, Product Manager, OSIsoft Wes Dyk, Data Scientist, Noble Energy

PI Integrators PI Integrators reduce the complexity of analyzing real world industrial data All PI System data delivered on your terms, in your language, to the tools you use, and to the people that can make a difference.

Big Expectations 64% of large enterprises plan to implement a big data project. 85% will be unsuccessful. Prep Analysis 85% Data cleansing and preparation tasks can take 50-80% of the development time and cost. https://hbr.org/2014/04/the-sexiest-job-of-the-21st-century-is-tedious-and-that-needs-to-change/

Downtime events Startup Phases Fleet Assets and Different Brands Communications and incomplete data Plant turn-around

New Persona: Information Architect Works in an IT or OPs IT role Responsible for integrating many data systems to achieve business goals Data Lifecycle Management Access Policies & Distribution Logical and Physical Architecture Creation and Receipt (Trust) Maintenance & Disposition PI System is just 1 system Business Expert / Business Enabler / Not subject matter expert (engineer)

New Persona: PI Data Scientist source: http://www.marketingdistillery.com/wp-content/uploads/2014/08/mds.png

Noble Energy Company Highlights DJ Basin Marcellus Shale Deepwater Gulf of Mexico West Africa Israel and Cyprus Falkland Islands Northeast Nevada Levant Basin 7

Me @ Production Analysis and Optimization Demonstrate the potential value of Data Science I have been charged with showing value using data science. There is a plethora of work to build on. We have worked to pick use cases and some just fall out of the sky.

Me @ Production Analysis and Optimization I didn t start as a data scientist I ve been a developer, database administrator, system analyst, system architect, etc. I really love math. I have a BS in Applied Mathematics and am completing a MS in the same. Getting into the data science mindset has taken work.

Me @ Production Analysis and Optimization Interdependence / Community It is pretty straightforward to do advanced analytics, once the data is in hand. Getting the data in the right place is difficult. A team dedicated to all aspects of data science is essential for success. Resources have been critical OSIsoft Hortonworks Academia (UC Denver and Carnegie Mellon University)

Our Approach Data as a Strategic Asset Gain New Insights Into Production Dynamics Across a wide variety of disparate data sources and variables such as well information systems, SCADA data, sensors, and unstructured data Build Executive Consensus and Business Sponsorship Start with a single business unit prove the value Recognition that data provides sustainable competitive advantage Widespread, systemic value creation because data is managed as professionally as capital or labor Rely on Trusted Partners to Assist Combination of Noble team members along with Hortonworks (Hadoop) and OSIsoft (PI / CAST) High performing operational and data science team (Python)

Data Science / Operations Research Concept of a Record The record or atom is essential and not much different than a row in a spreadsheet or record in a database Create an atom for method input Supervised learning Collection of these records for training Predictions can be made from individual atoms Unsupervised learning Clustering of individual atoms Neural Networks

Data Science / Operations Research Some Exciting (for me) Projects Facility Downtime Analysis Search data for downtime (Binary Search) Use generated data to predict system pressure (Random Forest) Use system pressure to infer production volume (Direct Calculation)

Current Historical Data Science / Operations Research Some Exciting (for me) Projects Well Downtime Analysis Combine data into records for a model Build the predictive model from history (Random Forest) Make predictions using the most current data RDBMS Train Model 3 rd Party PI: Time Series Predict Report

Data Science / Operations Research Some Exciting (for me) Projects Hauling and Inventory Optimization Mixed integer linear programming Decision problem using Model Predictive Control (MPC) Capacity RDBMS: Targets PI: Tank Levels Optimization Execute

Data Science Tools Primary Tools Hadoop Hive vs Hbase vs plain HDFS Python vs Java Specialized Packages in Python scikit-learn for machine learning cvxopt for convex optimization GLPK for integer programming Challenges Distribution of model training Tools and data have been difficult to integrate Successes Ad hoc analysis PI Integrator has saved the day with delivering usable data sets

Demo

20

PI Integrators Roadmap Subject to change 1Q 2Q 2015 3Q 4Q 1Q 2Q 2016 3Q 4Q Wave 1 Business Intelligence PI Integrator Lighthouse Program PI Integrator for Business Intelligence PI Integrator for SAP HANA PI Integrator for Hadoop PI Integrator for Microsoft Wave 2 Statistical Analytics Machine Learning and Predictions

V1.0 Features Q3 2015 Data perfectly suited for BI Row Column Data (Dimensional model) Data alignments: Evenly spaced in Time, Reference attribute Time, By Events Comprehensive coverage of AF Event Frames As Filters, As Data View updates on a schedule Caveats: V1.0 does not detect history changes

V1.0 Features Q3 2015 Platform Integration SAP HANA Automatically creates Virtual Tables via Smart Data Access (SDA) PI View Access decision ready data via ODBC with first class support for Tableau, Spotfire, and Microsoft BI (Excel, PowerPivot, PowerView, Power BI) Works with any software with an ODBC data source connection

Feature Focus Areas Subject to change 1Q 2Q 2015 3Q 4Q 1Q 2Q 2016 3Q 4Q Wave 1 Business Intelligence Creating PI Views Windows and Linux ODBC Access Data Pull Scenarios Wave 2 Statistical Analytics? Modeling Workflows - Event Driven Push Scenarios - Writing back Predictions? Business Workflows - Archive Changes - Out of order data - Update View History 24

PI Integrator Lighthouse Program Jump start your big data or analytics project with help from OSIsoft. PIIntegrators@osisoft.com https://www.surveymonkey.com/s/pxw5l6x 6 week engagements through September 2015.

Questions Please wait for the microphone before asking your questions State your name & company

Matt Ziegler mziegler@osisoft.com Product Manager OSIsoft, LLC

Wes Dyk Wes.Dyk@nblenergy.com Noble Energy