Tackling the big data challenges in E&P. Dr Duncan Irving, EMEA Oil and Gas Practice Lead



Similar documents
BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Collecting and Analyzing Big Data for O&G Exploration and Production Applications October 15, 2013 G&G Technology Seminar

The Internet of Things and Big Data: Intro

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Microsoft Analytics Platform System. Solution Brief

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

How To Use Big Data For Business

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Investor Presentation. Second Quarter 2015

ADVANCED ANALYTICS AND FRAUD DETECTION THE RIGHT TECHNOLOGY FOR NOW AND THE FUTURE

The Future of Data Management

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Big Data Analytics Nokia

HDP Hadoop From concept to deployment.

How To Understand The Oil And Gas Analytics

Exploiting Prestack Seismic from Data Store to Desktop

Big Data Can Drive the Business and IT to Evolve and Adapt

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Introduction to the Event Analysis and Retention Dilemma

NextGen Infrastructure for Big DATA Analytics.

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Getting Started Practical Input For Your Roadmap

Safe Harbor Statement

Unleashing the Potential of your Social Media and CRM Data. Markus Hirsch Sales Manager

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Teradata Unified Big Data Architecture

Architecting for the Internet of Things & Big Data

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Parallel Data Warehouse

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & MANAGEMENT INFORMATION SYSTEM (IJITMIS)

The Construction of Seismic and Geological Studies' Cloud Platform Using Desktop Cloud Visualization Technology

IBM Data Warehousing and Analytics Portfolio Summary

A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

Oracle Big Data SQL Technical Update

Modernizing Your Data Warehouse for Hadoop

Advanced In-Database Analytics

IBM Netezza High Capacity Appliance

Evolution from Big Data to Smart Data

Il mondo dei DB Cambia : Tecnologie e opportunita`

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

Big Data Processing: Past, Present and Future

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

IBM - Fueling the Oil & Gas Industry


An Oracle White Paper October Oracle: Big Data for the Enterprise

SQL Server 2012 Business Intelligence Boot Camp

Four Ways High-Speed Data Transfer Can Transform Oil and Gas WHITE PAPER

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

W H I T E P A P E R. Building your Big Data analytics strategy: Block-by-Block! Abstract

Luncheon Webinar Series May 13, 2013

Big Data Analytics for the Upstream Domain The art of the possible

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Why Big Data in the Cloud?

The Enterprise Data Hub and The Modern Information Architecture

In-memory computing with SAP HANA

Big Data Just Noise or Does it Matter?

A Case Study of Hadoop in Healthcare

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

HDP Enabling the Modern Data Architecture

Oracle Big Data Strategy Simplified Infrastrcuture

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Information systems architecture for the Oil and Gas industry

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Bringing Big Data into the Enterprise

Tapping the power of big data for the oil and gas industry

High Performance Data Management Use of Standards in Commercial Product Development

Integrated Big Data: Hadoop + DBMS + Discovery for SAS High Performance Analytics

Data Centric Computing Revisited

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Performance Testing of Big Data Applications

Azure Data Lake Analytics

Virtual Data Warehouse Appliances

Protecting Big Data Data Protection Solutions for the Business Data Lake

Cost-Effective Business Intelligence with Red Hat and Open Source

Innovation Session BIG DATA. HP EMEA Software Performance Tour 2014

UNIFY YOUR (BIG) DATA

Data Virtualization A Potential Antidote for Big Data Growing Pains

5 Signs You Might Be Outgrowing Your MySQL Data Warehouse*

The Future of Data Management with Hadoop and the Enterprise Data Hub

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Big Fast Data Hadoop acceleration with Flash. June 2013

Evolution to Revolution: Big Data 2.0

So What s the Big Deal?

We are Big Data A Sonian Whitepaper

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Transcription:

Tackling the big data challenges in E&P Dr Duncan Irving, EMEA Oil and Gas Practice Lead

What if you could perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC? Seismic Volumes Reservoir Models Well Sensors

What if you could perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC?

What if you could perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC? + +

What if you could perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC? guarantee data custodianship based on standards? know who knew what, and when?

What is inhibiting this? perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC? guarantee standardsbased data custodianship? know who knew what, and when? browser capabilities network bandwidth data access data volumes compute loads poor data model/structures 70% of time managing data analytical compartmentalisation application-centric view file/transfer formats rule no community ownership no granularity temporally-enabled data governance

What is out there to help? perform all E&P analytical activities through a web browser? work collaboratively on a single instance of the data? guide the science rather than drive the PC? guarantee standardsbased data custodianship? know who knew what, and when? browser capabilities network bandwidth data access data volumes compute loads poor data model/structures 70% of time managing data analytical compartmentalisation RDBMS application-centric view file/transfer formats rule High Performance no community XXXX ownership no granularity Big Data Solution temporally-enabled data governance appliances Data Warehouse

Why can t I make sense of it all? I am old I am an enterprise architect, not a web programmer! I am a web programmer, not an enterprise architect! I see red and twitch involuntarily whenever I hear Big Data I have all of these things and I m still confused Even with all these pieces, no one has yet managed to bring all necessary data to bear on business questions in a useful timeframe RDBMS High Performance XXXX Big Data Solution appliances Data Warehouse

Am I building it properly? (A lesson from history) 10 15 Subsurface 10 14 volumes (bytes) 10 13 10 12 10 11 Disk Capacity (bytes) 10 10 10 9 10 8 10 7 10 6 10 5 Interconnect Speed Network (bps) Speed (bps) Data Transfer (bps) 10 4 10 3 Transistors on a CPU 1980 1990 2000 2010

Am I building it properly? Subsurface volumes (bytes) Data Transfer (bps) Questioning power is not keeping up with the potential value of the answer.

Application-centric approach??? The quest to integrate decision making led to: Application Service Provision Collaborative Visualisation Virtualized workstations Proprietary file formats No long-term stewardship of: File formats Decision milestones Data dependencies Information management

Analytical Compartments v Data Flow Action Decision Data Retention Insight Knowledge Knowledge Discovery Information Decision Support Wells Borehole Sensors Data Subsurface Models Seismic

E&P Data Usage Modes 50 45 40 35 30 25 20 15 10 5 Could be "Operationalized" Could be used in "Knowledge Discovery" Data of low business value Knowledge Discovery 0 Shell BP Aramco Qatar O&G Statoil Total Repsol Gazprom Maersk Centrica ENI GDF-Suez British Gas NPD DONG KOC Lukoil OMV RWE Gupco Subsurface Data Volumes (Petabytes) Caveat: These numbers are half-baked estimates. Error is ~50% Data Retention Decision Support

The E&P analytical landscape Data Volume / yr Petabytes Exploration Production Terabytes Refining Distribution Gigabytes Logistics Core and Borehole hrs days months Data Latency years

The E&P analytical landscape Many other industries are curing their big data problems: Analytical Integration Massive data volumes Mixed workloads Query Concurrency Exploration Production Derived and Duplicated Seismic, 5000 Exploration Seismic, 5000 Other, 660 Production Seismic, 2000 Geological Interpretation, 100 Reservoir Models, 1000 Production Sensor, 500 Asset and Logistics, 500 Trading, 50 Retail and Marketing, 10 ERP, 100 Refining Distribution Logistics Approximate volumes in Terabytes

Integrated Operations Analytics Wellfield Data Warehousing in a supermajor How to do a rapid comparison between new and everything ever seen and make rapid decisions on it. Data Retention Knowledge Discovery Decision Support Carry out $100M+ seismic survey every three years to re-image producing reservoir Spend 2-3 months reprocessing data and reincorporate into workflow Hope geoscientists and engineers can control reservoir flows at the weekly scale based on imaging from the year+ timeframe Current approach Install seafloor seismic imaging array and stimulate with in-reservoir tectonic events, and supply-vessel based airguns Spend 2-3 days reprocessing data and reincorporate into workflow Allow geoscientists and engineers to respond to HSE and production issues to see how the reservoir is evolving in a right-time manner! Shortened Timeframe Ideal approach

Best of Breed Big Data Architecture ETL Data Retention Seismic Imaging Seismic Modelling Reservoir Char n Reservoir Monitoring Reservoir Modelling Operations Knowledge Discovery MR/Hadoop ~5 concurrent users Fast Loading Data Assimilation Online Archival SQL-MR ~25 concurrent users Decision Support Data Discovery Knowledge Generation Pattern Detection Data Warehouse ~100+ concurrent users Decision Support Data Dependencies Predictive Analytics

MapReduce/Hadoop Is Not a database HDFS No schema, indexes, optimizer No high availability, security Not high performance Not a data warehouse No integrated data, no history Severe data skew Not mature technology Early open source A few ISV tools integrating Single points of failure Not a cloud technology, per se Uses Seismic Processing Trace Sorting 1D/2D filtering and transformation Online Seismic Archiving Repurposing WITSML and other sensor feeds It is a great place to get to know your data.

SQL-MapReduce Is a database MPP database schema, indexes, optimizer security Is high performance Not a data warehouse No integrated data but it can talk to a DW via SQL Not mature technology A few ISV tools integrating Single points of failure Can enable AaaS Uses: Pattern matching Feature extraction Spot the difference Statistical investigations: Clustering Likelihoods Associations Fail fast hypothesis reduction: Seismic Modelling Reservoir Modelling frameworks

How do I describe a pattern in SQL-MR? Simple partitioning in-trace (vertical) analytics and adjacent trace analytics Broader pattern matching how do I state spatial relationships, mesoscale textures? easy with SQL-MR! How do I find everything that looks like a channel in my 10 Tb 3D (inc. pre-stack) image volume? 2D Profile through a 3D volume showing a cross-section across a filled channel. The Power of SQL-MR Java MR functions perform the standard textural descriptions (reflectivity, variance, etc) and SQL asks the questions above, inside, below. 1. Capping boundary: Strong, broadly convex reflection (npath) 2. Basal boundary: Very strong, grossly concave reflection with local minima (npath) 3. Overburden facies: laterally moderately continuous with high amplitude stratigraphy (npath) 4. Incised facies: laterally continuous (low variance) and high amplitude stratigraphy (npath) 5. Internal facies: moderate variance, poor lateral continuity and steeply-dipping beds (statistical description)

The Active Data Warehouse in E&P Operationalised DSS Master Store for all data data Sensor is stored Data in a manner that allows it to be useful Features Fully-integrated Highly relational Fully-decomposed ACID compliant Enterprise Grade Decision Control Systems Asset Data Remove scaling and complexity barriers Asset Management Current Problems: Lack of integration Performance barriers Poor scaling to large data volumes and higher complexity Cannot provide answers to big strategic questions Integrated Single Instance Establish workflows to integrated operations Integrate Subsurface insights with Logging Asset Data and Maintenance actions Link production activities LWD/LWP reporting via + Logical real time analytics Data Models = Datamining while Drilling to other areas of the business Permanent Seismic Monitoring Fracturation Models and regimes Hydrocarbon Accounting

Enabling the E&P cloud Seismic Archiving Seismic Imaging Reservoir Characterisation Reservoir Modelling Reservoir Monitoring Asset Management Integrated Operations Hydrocarbon Accounting End-to-end workflow management guarantee data custodianship based on standards know who knew what, and when? Data dependency management