EXPLORATION TECHNOLOGY REQUIRES A RADICAL CHANGE IN DATA ANALYSIS

EMC Isilon solutions for oil and gas

EMC PERSPECTIVE

TABLE OF CONTENTS

INTRODUCTION: THE HUNT FOR MORE RESOURCES
KEEPING PACE WITH BOOMING DEMAND AND LOW PRICES
OPTIMIZING COMPUTATIONAL WORKFLOWS
EMC ISILON AS YOUR TECHNOLOGY PARTNER

ESSENTIALS

With a need to make rapid drill/no-drill decisions, oil and gas exploration organizations need highly honed computational workflows in place to handle the workloads.

What's needed to support today's oil and gas exploration computational workflows is a storage solution that is highly scalable in capacity and performance at low operating costs.

INTRODUCTION: THE HUNT FOR MORE RESOURCES

Despite some recent flattening in consumption, demand for oil and gas is projected to rise substantially over the next few decades. Much of the need for new energy will continue to come from developing regions of the world, such as China and India. And even though there has been growing interest in other forms of energy, most experts believe fossil fuels will remain a significant source of energy for the foreseeable future, especially in this current period of low oil prices.

To meet the growing energy needs whilst working within low oil price constraints, exploration and production companies are trying to improve their recovery rates from existing wells. But realistically, the only way to truly meet the expected energy demands is to find new reserves. The problem is that much of the easy oil has already been found. Most recent oil and gas discoveries have been in remote locations, requiring expensive recovery techniques such as hydraulic fracturing, or complex deep-water installations.

Finding new reserves in these locations requires the use of a new generation of exploration technologies for better operational efficiency. These technologies generate vast amounts of data that must be analyzed and visualized, using ever-more sophisticated applications. New exploration techniques such as Reverse-Time Migration, Waveform Inversion, and 3D and 4D technologies generate orders of magnitude more data than previous technologies.
Additionally, the computational analysis requires the use of multiple routines, each of which places widely varying demands for I/O operations per second (IOPS) and throughput on data storage systems.

KEEPING PACE WITH BOOMING DEMAND AND LOW PRICES

U.S. oil and gas exploration is booming. The industry consists of about 5,000 companies with combined annual revenues of about $290 billion, according to Hoovers. To find new reserves and increase production from existing wells, exploration companies are using new seismic imaging equipment, equipment that generates huge volumes of data. Some are predicting that seismic surveys could grow from one petabyte today to ten petabytes in twelve to eighteen months. And all of this data is now being analyzed, modeled, and visualized using more sophisticated algorithms to produce 3D Earth models. The raw data and the processed information derived from the data must also be integrated and processed with other data sources, including geological, petrophysical, and, in some cases, production data.

OPTIMIZING COMPUTATIONAL WORKFLOWS

With a need to make rapid drill/no-drill decisions, oil and gas exploration organizations need highly honed computational workflows in place to handle the workloads. Most organizations rely on HPC clusters to perform computations. Increasingly, exploration analysis algorithms are being modified to run more efficiently on these systems (parallelizing them to spread computations to hundreds or thousands of nodes), and to take advantage of a hardware-assisted speedup by running them on graphics processing units (GPUs). With these systems in place, computational workflows can be optimized and honed to speed analysis.

However, use of these HPC technologies can significantly change the IOPS and throughput demands on a storage system. Making matters more complicated, the many routines that must be run on the data each have different and varying throughput and IOPS requirements. This leads to highly unpredictable workloads and demands on storage systems.

In the past, one way to keep pace with the explosion in exploration data was to throw storage capacity at the problem. However, doing so increases operating costs. More devices must be managed, more rack space is required, and more electricity is needed to power and cool the storage units. And, worse still, relying on the addition of raw storage capacity does not address performance issues.

What's needed to support today's oil and gas exploration computational workflows is a storage solution that is highly scalable in both capacity and performance. The solution must also offer varying price/performance-tiered storage to support today's mixed and unpredictable computational workloads. Finally, the storage solution must also provide simplified data management. This leads to a storage solution that is a combination of a robust file system and data migration, data availability, and data protection technologies.

An effective energy exploration storage solution must also support storage virtualization to make more efficient use of storage capacity and to simplify data management tasks. And, as sophisticated analytics applications come online (many using Hadoop for mixed workload analytics), an ideal storage solution implements an entire Exploration & Production Data Lake. A data lake is a single place to put all the data you need, including structured data drawn from traditional databases, and unstructured data like text and images. Having all data in a single place lends itself to analytics that can see all of the relevant data, ultimately leading to better answers.

New scale-out storage systems offer higher performance and lower power consumption than the aging equipment found in most labs.
This means fewer devices are needed, which lowers management requirements. These devices use less electricity as well.

EMC ISILON AS YOUR TECHNOLOGY PARTNER

EMC Isilon scale-out storage offers the capacity to meet the growing data storage needs of the oil and gas industry. Isilon enables you to unify vast libraries of exploration and production (E&P) data into one accessible shared data pool, increasing the productivity of your geoscientists and engineers. Isilon scale-out NAS platforms deliver industry-leading scalability and excellent throughput and I/O speeds in a single file system.

Isilon hardware platforms are designed for simplicity, value, and outstanding performance. Organizations can mix and match various hardware elements to meet their specific needs. For example, the EMC Isilon S-Series delivers the performance needed for IOPS-intensive applications, the X-Series is ideal for high-concurrent and sequential throughput workflows, and the NL-Series provides economical storage that enables organizations to keep data online and available for longer periods of time.

Every Isilon solution can seamlessly scale on the fly, enabling organizations to add hundreds of terabytes of storage or expand performance in minutes. At the same time, the Isilon modular architecture and intelligent software make deployment and management simple. Powered by the award-winning EMC Isilon OneFS operating system, every Isilon cluster is a single pool of storage with a global namespace, eliminating the need to support multiple volumes and file systems.
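
For capacity planning, the growth projection quoted earlier (seismic survey data growing from one petabyte to ten petabytes in twelve to eighteen months) can be turned into a compound monthly growth rate. The following is a minimal sketch of that arithmetic only; the function names are illustrative, and it assumes smooth compound growth, which real survey data will not follow exactly.

```python
import math


def monthly_growth_rate(start_pb: float, end_pb: float, months: float) -> float:
    """Compound monthly growth rate that takes start_pb to end_pb in `months`."""
    return (end_pb / start_pb) ** (1.0 / months) - 1.0


def doubling_time_months(rate: float) -> float:
    """Months needed for capacity to double at a given compound monthly rate."""
    return math.log(2.0) / math.log(1.0 + rate)


# The two ends of the "twelve to eighteen months" range from the text.
for months in (12, 18):
    rate = monthly_growth_rate(1.0, 10.0, months)
    print(
        f"{months} months: {rate:.1%} per month, "
        f"doubling every {doubling_time_months(rate):.1f} months"
    )
```

Under these assumptions the projection works out to roughly 14% to 21% growth per month, which is one way to see why adding capacity in large increments, rather than device by device, matters.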

OneFS combines the three layers of traditional storage architectures (file system, volume manager, and data protection) into one unified software layer, creating a single intelligent file system that spans all nodes within a cluster. Unlike simple NAS namespace aggregation products, the Isilon OneFS operating system is truly distributed and intelligently stripes data across all nodes in a cluster to create a single, shared pool of storage. OneFS offers unsurpassed mission-critical reliability and industry-leading drive rebuild times.

OneFS also delivers unique cluster-aware symmetric multiprocessing (SMP) capabilities that enable the system to move tasks between processors for extremely efficient workload balancing. In conjunction with the OneFS operating system's ability to stripe data across all nodes in a cluster, Isilon solutions achieve the high aggregate bandwidth and transactional performance required to power next-generation enterprise data centers.

With these capabilities, OneFS enables:

- Scalability of performance and capacity to achieve up to 2.6 million I/Os per second, 200 GB/s concurrent throughput, and more than 50 petabytes of capacity in a single file system
- A single point of management for large and rapidly growing data repositories
- Mission-critical reliability and high availability with state-of-the-art data protection
- An analytics-ready data lake architecture that greatly facilitates implementation of today's and future analytics applications, including those based on Apache Hadoop and its Hadoop Distributed File System (HDFS)

As data management becomes a more essential core element of storage, there is a growing need for software applications to protect and secure the data. To that end, Isilon offers many software solutions to help meet critical data protection, access, management, and availability needs.
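
The general idea of striping data across all nodes, mentioned above, can be illustrated with a toy sketch. This is not the OneFS layout algorithm, which this paper does not describe; it is a plain round-robin scheme with no parity or protection, and the function names, node count, and stripe unit size are all assumptions chosen for illustration.

```python
from collections import defaultdict
from typing import Dict, List


def stripe(data: bytes, node_count: int, stripe_unit: int) -> Dict[int, List[bytes]]:
    """Split data into fixed-size units and deal them round-robin to nodes."""
    if node_count < 1 or stripe_unit < 1:
        raise ValueError("node_count and stripe_unit must be positive")
    layout: Dict[int, List[bytes]] = defaultdict(list)
    for index, offset in enumerate(range(0, len(data), stripe_unit)):
        # Unit i lands on node (i mod node_count).
        layout[index % node_count].append(data[offset:offset + stripe_unit])
    return dict(layout)


def reassemble(layout: Dict[int, List[bytes]], node_count: int) -> bytes:
    """Rebuild the original bytes by reading units back in round-robin order."""
    total_units = sum(len(units) for units in layout.values())
    out = bytearray()
    for index in range(total_units):
        # Unit i is the (i // node_count)-th unit stored on node (i mod node_count).
        out += layout[index % node_count][index // node_count]
    return bytes(out)


# Hypothetical example: a 1,024-byte "file" spread over 4 nodes in 100-byte units.
payload = bytes(range(256)) * 4
placement = stripe(payload, node_count=4, stripe_unit=100)
units_per_node = [len(placement[node]) for node in range(4)]
```

Because every node holds a share of each large file, a read or write can draw on all nodes at once, which is the intuition behind the aggregate bandwidth claim. A production file system would add data protection and rebalancing on top of this.
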
The combination of Isilon hardware, file system, and management software helps deliver the performance required in today's oil and gas exploration organizations, all while simplifying data management, providing robust data protection, and lowering operating costs.

CONTACT US

To learn more about how EMC products, services, and solutions can help solve your business and IT challenges, contact your local representative or authorized reseller, visit www.emc.com, or explore and compare products in the EMC Store.

EMC², EMC, and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United States and other countries. Copyright 2015 EMC Corporation. All rights reserved. Published in the USA. 01/15 EMC Perspective H10824.3

EMC believes the information in this document is accurate as of its publication date. The information is subject to change without notice.