How Big Data Transforms Data Protection and Storage



Similar documents
Big Data Tips the Power Balance Between IT and Business Users

Thin Provisioning: Using Intelligent Storage Virtualization Technology for More Efficient Use of Storage Assets

What Can Software as a Service Do for Your Business?

Software as a Service: A Transformative Way to Deliver Applications

I D C V E N D O R S P O T L I G H T. S t o r a g e Ar c h i t e c t u r e t o Better Manage B i g D a t a C hallenges

Modernizing Data Protection With Backup Appliances

I D C E X E C U T I V E B R I E F

What's on the Mind ??? June 2011 Sponsored by BT Benelux IDC

Optimizing Information Management in the Cloud

I D C A N A L Y S T C O N N E C T I O N. T h e C r i t i cal Role of I/O in Public Cloud S e r vi c e P r o vi d e r E n vi r o n m e n t s

I D C T E C H N O L O G Y S P O T L I G H T

I D C E X E C U T I V E B R I E F

Self-Service Big Data Analytics for Line of Business

Got Files? Get Cloud!

How does Big Data disrupt the technology ecosystem of the public cloud?

O p t i m i z i n g t h e N e t w o r k t o M e e t T o m o r r o w ' s I C T D e m a n d s

Improving Small Business Profitability by Optimizing IT Management

I D C T E C H N O L O G Y S P O T L I G H T. B i g D a t a a n d E C M : Making Smarter Decisions

INSIGHT. Cisco' s Continuing Services Evolution: Remote Management Services IDC OPINION IN THIS INSIGHT SITUATION OVERVIEW

Network Management Services: A Cost-Effective Approach to Complexity

I D C V E N D O R S P O T L I G H T

Demystifying Big Data Government Agencies & The Big Data Phenomenon

V E N D O R P R O F I L E. F i c s t a r : S i m p l i f y i n g W e b D a t a E x t r a c t i o n I D C O P I N I O N

Journey to 3rd Platform Digital Customer Experience

Video Telephony: Completing the Picture of Unified Communications Effectiveness

I D C V E N D O R S P O T L I G H T

I D C T E C H N O L O G Y S P O T L I G H T. L e ve r a g i n g N e tw o r k Virtualization for B u s i n e s s D i fferentiation

Global Headquarters: 5 Speen Street Framingham, MA USA P F

and Analytic s i n Consu m e r P r oducts

Cloud Computing in the Midmarket: Assessing the Options

T a c k l i ng Big Data w i th High-Performance

WHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression

INSIGHT. Symantec Optimizes Veritas Cluster Server for Use in VMware Environments IDC OPINION IN THIS INSIGHT SITUATION OVERVIEW. Jean S.

Maintaining Business Continuity with Disk-Based Backup and Recovery Solutions

How To Balance Business Innovation With It Cost Control In Europea

I D C T E C H N O L O G Y S P O T L I G H T

The Customer Still Comes First: Defining the Mission of the Modern Contact Center

CRM Analytics: Turning Data into Action

Data Analytics. SPAN White Paper. Turning information into insights

DevOps and the Cost of Downtime: Fortune 1000 Best Practice Metrics Quantified

T r a n s f o r m i ng Manufacturing w ith the I n t e r n e t o f Things

I D C M A R K E T S P O T L I G H T. P r i va t e a n d H yb r i d C l o u d s E n a b l e New L e ve l s o f B u s i n e s s and IT Collaboration

W H I T E P A P E R E d u c a t i o n a t t h e C r o s s r o a d s o f B i g D a t a a n d C l o u d

Exploiting Data at Rest and Data in Motion with a Big Data Platform

I D C T E C H N O L O G Y S P O T L I G H T

University of Kentucky Leveraging SAP HANA to Lead the Way in Use of Analytics in Higher Education

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Windows Server 2003 Migration: Take a Fresh Look at Your IT Infrastructure

IBM's Fraud and Abuse, Analytics and Management Solution

C l o u d - B a s e d S u p p l y C h a i n s : T r a n s f o rming M a n u f a c t u r ing Performance

I D C E X E C U T I V E B R I E F

Embedded advanced storage efficiency capabilities (e.g. thin provisioning, file dedupe, and compression)

The Future of Data Management

How To Protect Data From A Virtual Machine

Human Capital Management in the Public Cloud

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Agile Information Life-Cycle Management: Controlling Data Growth and Managing Complexity with Virtual Databases

The Copy Data Problem: An Order of Magnitude Analysis

M A N A G I N G D A T A G R O W T H W H I L E B E T T E R M O N E T I Z I N G I N F O R M A T I O N V A L U E

Taming IT Management Chaos

IDC PlanScape: The Essentials of Internet of Things Investment for Smart Cities

Using Converged Infrastructure to Enable Rapid, Cost-Effective Private Cloud Deployments

Data Growth Presents Challenges And Opportunities

Data Management: Foundational Technologies for Health Insurance Exchange Success

I D C S P O T L I G H T. Ac c e l e r a t i n g Cloud Ad o p t i o n w i t h Standard S e c u r i t y M e a s u r e s

Global Headquarters: 5 Speen Street Framingham, MA USA P F

The Rise of Intelligent Systems: Connecting Enterprises and Smart Devices in Seamless Networks

Worldwide Problem Management Software Market Shares, 2014: 3rd Platform Technologies and Delivery Models Drive Growth

I N D U S T R Y S P O T L I G H T. T h e Grow i n g Appeal of Ad va n c e d a n d P r e d i c ti ve Analytics f o r the Utility I n d u s t r y

DIGITAL UNIVERSE UNIVERSE

IDC MarketScape: Western Europe Network Virtualization Solutions 2013 Vendor AssessmentEnter the sponsors here

On-Demand vs. On-Premise Customer Relationship Management: A New Hybrid Emerges

I D C V E N D O R S P O T L I G H T

WHITE PAPER Making Cloud an Integral Part of Your Enterprise Storage and Data Protection Strategy

Global Headquarters: 5 Speen Street Framingham, MA USA P F

INFORMATION EVERYWHERE, BUT WHERE' S THE KNOWLEDGE?

Worldwide Advanced and Predictive Analytics Software Market Shares, 2014: The Rise of the Long Tail

I D C V E N D O R S P O T L I G H T

W H I T E P A P E R. M a k e Y o u r B a c k u p s a C o m p e t i t i v e A d v a n t a g e f o r G r e a t e r P r o d u c t i v i t y

The BIG Five Benefits. One File and Content Storage Family

I D C M A R K E T S P O T L I G H T

I D C V E N D O R S P O T L I G H T. H yb r i d C l o u d Solutions for ERP

I D C T E C H N O L O G Y S P O T L I G H T

Standards for Big Data in the Cloud

I D C T E C H N O L O G Y S P O T L I G H T. I m p r o ve I T E f ficiency, S t o p S e r ve r S p r aw l

End Small Thinking about Big Data

The Business Value of Predictive Analytics

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

I D C T E C H N O L O G Y S P O T L I G H T. T i m e t o S c ale Out, Not Scale Up

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation

Virtualization in Healthcare: Less Can Be More

Gridstore is seeking to simplify this situation, specifically for midsize companies:

SMART ARCHIVING. The need for a strategy around archiving. Peter Van Camp

SaaS BI Tools: Better Decision Making for the Rest of Us

I D C M A R K E T S P O T L I G H T. T a m i n g D a t a M a n a g e m e nt Costs in a " C l o u d y" I T W o rld

I D C V E N D O R F O C U S. C l o u d S e r vi c e s : U s i n g Virtual Priva t e C l o u d s t o I m p r o ve B u s i n e s s Ag i l i t y

WHITE PAPER Embedding Additional Value into Applications: What Enterprises Need Most from Application Vendors

W H I T E P A P E R C l i m a t e C h a n g e : C l o u d ' s I m p a c t o n I T O r g a n i z a t i o n s a n d S t a f f i n g

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi

Leveraging Information For Smarter Business Outcomes With IBM Information Management Software

Transcription:

I D C E X E C U T I V E B R I E F How Big Data Transforms Data Protection and Storage August 2012 Written by Carla Arend Sponsored by CommVault Introduction: How Big Data Transforms Storage Omøgade 8 P.O.Box 2609 2100 Copenhagen, Denmark P.45.39.16.2222 Big Data is one of the transformative forces that are impacting the IT industry today. Attitudes to Big Data range from sarcastic to enthusiastic, but IDC is certain that Big Data will transform the way that we architect and use IT, and even more importantly, Big Data will change the way that business decisions are taken based on the accuracy and timeliness of data available for decision making. This paper discusses how the emergence of Big Data use cases will impact and transform storage infrastructure requirements. What is Big Data? Big Data has an analytics dimension and a storage dimension, and much of the discussion of Big Data is focused on how companies can gain competitive advantage from analyzing existing and emerging data sources in real time. These new requirements on the analytics side have an impact on the way storage is architected as well. IDC defines Big Data as follows: "Big Data technologies describe a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high velocity capture, discovery, and/or analysis."

Figure 1 The Four Elements of Big Data Source: IDC, 2012 Big Data is described using the following four elements: Volume. The challenge of handling ever-larger data volumes is nothing new for storage administrators. However, Big Data might actually drive companies to the limits of their current architecture faster. Variety. Big Data enables organizations to analyze data that has been generated outside of the organization, such as social media data and weather data, as well as data that is generated from sensors, point of sales systems, RFID tags, video surveillance cameras, etc. These new data types ask new questions around information governance and add to the volume of data stored. Velocity. Data is coming into the organization at an increasing speed, and Big Data analytics want to take advantage of it in real time. Consequently, performance is a key element of the underlying IT infrastructure. Value. Big Data analysis is done to create a unique competitive advantage for organizations, through understanding their customers' preferences better, to segment customers more granularly, and to target specific offers at precise segments. But 2 2012 IDC

public sector organizations are also using Big Data to prevent fraud and save tax payer money and to provide better services to citizens, for example in healthcare. Big Data use cases are emerging in every single industry enthusiasm and creativity is what they have in common. Overall, Big Data approaches can be divided into two: those who are optimizing current data and analytics processes with new technology, and those who are using technology to open new business opportunities for their organizations, and think out of the box. Storage Challenges Stemming From Big Data How do these four parameters change the need for data protection? What are the challenges IT managers are facing? Volume. Increasing data volumes is the most commonly understood challenge for storage managers. They struggle with shrinking backup windows, yet longer backup cycles due to the larger volumes. They also struggle with requirements for shorter restore processes. Big Data accelerates these challenges, and raises the question about rearchitecting backup processes as well as questions about the value of data, and if all data should be treated equal. Variety. Different data types, not all of which are generated within the organization, raise the question of information governance. How do you protect data that has been generated on the social Web? How can you apply policy to data that lives in the cloud, is analyzed in the cloud, but forms the basis for important business decisions? Velocity. Performance is a key attribute of Big Data, and shorter time to decision is one of its benefits. This increases the requirement for performance on the storage infrastructure. Value. The purpose of Big Data analytics is to create additional value to the organization. This raises the old question about the value of data that is stored. Differentiating between data continues to be a challenge, and many companies treat all data equal, for lack of an efficient alternative. Another dimension of value is to find the relevant data and make it available in the decision process, particularly unstructured information. Benefits From Big Data How can storage help derive value and competitive advantage from data? Even though most of the competitive advantage is created from advances on the analytics side, storage also plays a key role in enabling Big Data by: Providing policy-based data management. Information architecture and information governance should be reviewed when organizations are starting to embrace Big Data. Making data searchable through intelligent indexing. Finding data and making it available for management and decision making is another way of adding value. 2012 IDC 3

Ensuring storage performance. Performance is a key parameter within Big Data, as value comes from real-time analysis of data. Efficient data management ensures optimum storage performance. Storing data very efficiently to contain the storage footprint (dedupe, single instancing, compression, thin provisioning, snapshots). Storage efficiency is one key means to providing value to the organization. If the data is stored in the most efficient way, the storage footprint can be contained to a minimum, and the organization can free up resources and money to deploy for innovation. Providing data access from mobile devices. Data is increasingly being accessed from mobile workers and through smart mobile devices. This is particularly true for data for decision making. IT managers need to plan for this. Using cloud storage where appropriate. Some Big Data is created, analyzed, and stored in the cloud. The movement of large amounts of data over networks remains a performance challenge, so cloud storage needs to be part of the storage mix where appropriate. Storage Best Practices to Support Big Data From the currently known use cases, we can highlight some emerging best practices around data management for Big Data: Revisit your storage architecture. Some Big Data datasets require multiple active copies that are protected with replication instead of traditional backup. Many companies are using a mix of snapshots, replication, and backup to protect Big Data datasets. Take a point of departure in your current storage infrastructure, and understand how you can evolve it by benefitting from a new architecture or new technologies. Understand your data. Especially in Big Data, not all data is equally important and needs the same protection. When looking at the Big Data process, input data will most likely need to be stored, but in some cases it is transient and just passes through the organization without being kept. The algorithms are usually the most valuable part because they are a unique differentiator for any organization. The outcomes of the analysis do not necessarily need to be stored because some datasets are faster to recreate than to restore. Data governance gets more complex. When using additional data types in the analytics mix, organizations need to understand the privacy regulations associated with this data. This is true for data generated externally to the organization, but also for data that lives in the cloud. 4 2012 IDC

Conclusion: Big Data Will Transform Storage How Can You Benefit? As Big Data is adopted across Europe, organizations will need to evolve their storage infrastructures as well. However, many of the storage challenges created by Big Data are well-known and wellunderstood, just on a smaller scale. So organizations are advised to evolve their storage infrastructure and not to rip and replace what they currently have. Storage vendors are constantly innovating in order to tackle the rising challenges, and Big Data is on the roadmap for some of them already. Consult your storage vendor or channel partner and ask them about their vision for the Big Data market. IDC also recommends the use of architectural services to understand the impact of Big Data usage in your organization. Big Data use cases differ greatly by industry and company size, and so does the value of the data used for analysis. Understanding the value creation throughout the process helps architect an efficient storage infrastructure. COPYRIGHT NOTICE The analyst opinion, analysis, and research results presented in this IDC Executive Brief are drawn directly from the more detailed studies published in IDC Continuous Intelligence Services. Any IDC information that is to be used in advertising, press releases, or promotional materials requires prior written approval from IDC. Contact IDC Go-to-Market Services at gms@idc.com or the GMS information line at 508-988-7610 to request permission to quote or source IDC or for more information on IDC Executive Briefs. Visit www.idc.com to learn more about IDC subscription and consulting services or www.idc.com/gms to learn more about IDC Go-to-Market Services. Copyright 2012 IDC. Reproduction is forbidden unless authorized. 2012 IDC 5