The Convergence of Big Data Processing and Integrated Infrastructure




Research Report Abstract: The Convergence of Big Data Processing and Integrated Infrastructure

By Evan Quinn, Senior Principal Analyst, and Bill Lundell, Senior Research Analyst
With Brian Babineau, Vice President of Research and Analyst Services

July 2012

Introduction

Research Objectives

In order to assess current data analytics and processing trends, as well as plans for the next 12-18 months, ESG recently surveyed 399 North American IT and business professionals representing midmarket (100 to 999 employees) and enterprise-class (1,000 employees or more) organizations. Respondents were familiar with their organization's current data analytics environment and processes, as well as forward-looking strategies involving the infrastructure and platforms necessary to support data analytics initiatives. The survey was designed to answer the following questions:

How important is the enhancement of data analytics capabilities relative to all of an organization's business and IT priorities?
What is associated with the term "big data"?
What are the trends for current usage and planned adoption of MapReduce framework technology?
What is the size of the largest data set upon which an organization conducts data analytics activities?
How many unique sources do organizations integrate as part of their largest data sets?
How frequently do organizations update their largest data set?
What kind of tools do organizations use to integrate the data sources populating their largest data sets?
What sources and data types comprise organizations' largest data sets?
What data analytics and/or processing challenges do organizations face with respect to their largest data sets?
What types of data analytics platforms have organizations deployed to support their largest data sets? What benefits have they derived from these platforms?
What types of data analytics platforms do organizations anticipate deploying in support of their fastest growing data sets? What requirements are driving these changes?
Are the sources populating organizations' largest data sets geographically dispersed? What challenges does this present?
What are the must-have data management features/functionality for data analytics platforms and infrastructure?
What kind of storage technologies do organizations use to support their data analytics and processing activities? Which are most pervasive and how will this change going forward?
How much downtime can organizations tolerate when it comes to their data analytics platforms? What data protection technologies do they have in place to support these requirements?

Survey participants represented a wide range of industries including manufacturing, financial services, communications and media, health care, and retail. For more details, please see the Research Methodology and Respondent Demographics sections of this report.

Research Methodology

To gather data for this report, ESG conducted a comprehensive online survey of IT professionals from private- and public-sector organizations in North America (United States and Canada) between March 5, 2012 and March 12, 2012. To qualify for this survey, respondents were required to be IT or business professionals personally responsible for their organization's data analytics and processing environment, including the software/applications and/or the underlying platforms and systems. All respondents were provided an incentive to complete the survey in the form of cash awards and/or cash equivalents.

After filtering out unqualified respondents, removing duplicate responses, and screening the remaining completed responses (on a number of criteria) for data integrity, we were left with a final total sample of 399 IT and business professionals. Please see the Respondent Demographics section of this report for more information on these respondents.

Note: Totals in figures and tables throughout this report may not add up to 100% due to rounding.

Respondent Demographics

The data presented in this report is based on a survey of 399 qualified respondents. The figures below detail the demographics of the respondent base, including individual respondents' current job responsibility, technology responsibility, and job function, as well as the respondent organizations' total number of employees, primary industry, and annual revenue.

Respondents by Data Analytics Job Responsibility

The breakdown of current job responsibility within an organization among survey respondents is shown in Figure 1.

Figure 1. Survey Respondents, by Data Analytics Job Responsibility
Which of the following best describes your current responsibility with respect to your organization's data analytics and processing environment? (Percent of respondents, N=399)
IT Operations: my primary responsibility includes supporting the underlying data analytics and processing infrastructure, 50%
IT Application Development & Support: my primary responsibilities include the support and maintenance of data analytics and processing software, 28%
Line-of-business support (non-IT): my responsibilities include data analytics and processing support for the business, 22%

Respondents by Technology Responsibility

IT operations respondents' primary area of technology responsibility is shown in Figure 2.

Figure 2. Survey Respondents, by Technology Responsibility
Which of the following would you consider to be your primary area of technology responsibility? (Percent of respondents, N=199)
IT operations, 29%
Applications/database, 22%
IT architecture/planning, 20%
General IT, 16%
Data protection, 6%
Servers, 5%
Storage / SAN, 2%
Other, 1%

Respondents by Job Function

The primary job function among survey respondents responsible for their organization's application environment is shown in Figure 3.

Figure 3. Survey Respondents, by Job Function
Which of the following best describes your primary job function? (Percent of respondents, N=200)
Business manager, 41%
Business analyst, 15%
Applications/database, 13%
Data analyst, 11%
Other, 10%
Data warehouse/business intelligence, 5%
Data scientist, 3%
Reports administrator, 3%

Respondents by Number of Employees

The number of employees in respondents' organizations is shown in Figure 4.

Figure 4. Survey Respondents, by Number of Employees
How many total employees does your organization have worldwide? (Percent of respondents, N=399)
100 to 249, 18%
250 to 499, 17%
500 to 999, 11%
1,000 to 2,499, 9%
2,500 to 4,999, 12%
5,000 to 9,999, 8%
10,000 to 19,999, 8%
20,000 or more, 18%

Respondents by Industry

Respondents were asked to identify their organization's primary industry. In total, ESG received completed, qualified responses from individuals in 20 distinct vertical industries, plus an "Other" category. Respondents were then grouped into the broader categories shown in Figure 5.

Figure 5. Survey Respondents, by Industry
What is your organization's primary industry? (Percent of respondents, N=399)
Manufacturing, 18%
Financial (banking, securities, insurance), 13%
Government (Federal/National, State/Province/Local), 13%
Communications & Media, 10%
Business Services (accounting, consulting, legal, etc.), 9%
Retail/Wholesale, 8%
Health Care, 6%
Other, 25%

Respondents by Annual Revenue

The annual revenue of respondents' organizations is shown in Figure 6.

Figure 6. Survey Respondents, by Annual Revenue
What is your organization's total annual revenue ($US)? (Percent of respondents, N=399)
Less than $50 million, 20%
$50 million to $99 million, 15%
$100 million to $499 million, 15%
$500 million to $999 million, 8%
$1 billion to $4.999 billion, 12%
$5 billion to $9.999 billion, 7%
$10 billion to $19.999 billion, 5%
$20 billion or more, 10%
Not applicable (e.g., public sector, nonprofit), 9%

Contents

List of Figures
List of Tables
Executive Summary
  Report Conclusions
Introduction
  Research Objectives
Research Findings
  The Increasing Importance of Analytics: Thank You, Big Data
  The Impact of Big Data on Analytics
  Big Data Analytics Platforms
  Security Considerations for Big Data
  Data Analytics Storage and IT Infrastructure Requirements
  Increasing Interest in Hadoop MapReduce Framework Technology
Conclusion
  Research Implications for Technology Vendors
  Research Implications for IT Professionals
Research Methodology
Respondent Demographics
  Respondents by Data Analytics Job Responsibility
  Respondents by Technology Responsibility
  Respondents by Job Function
  Respondents by Number of Employees
  Respondents by Industry
  Respondents by Annual Revenue

List of Figures

Figure 1. Importance of Enhancing Data Processing and Analytics Activities
Figure 2. Meaning of the Term "Big Data"
Figure 3. Size of Largest Data Set for Data Analytics and Processing Functions
Figure 4. Number of Data Sources Integrated to Support Data Analytics Activities on Largest Data Set
Figure 5. Number of Data Sources Integrated to Support Data Analytics Activities on Largest Data Set, by Company Size
Figure 6. Update Frequency of Largest Data Set
Figure 7. Primary Method of Integrating Data Sources in Largest Data Set
Figure 8. Primary Method of Integrating Data Sources in Largest Data Set, by Largest Data Set Update Frequency
Figure 9. Sources Responsible for Populating Largest Data Set
Figure 10. Types of Data in Largest Data Set
Figure 11. Types of Data Processing and Analytics Activities Conducted on Largest Data Set
Figure 12. Data Processing and/or Analytics Challenges with Largest Data Set
Figure 13. Data Processing and Analytics Platforms Currently Deployed to Support Largest Data Set
Figure 14. Key Benefits Organizations Have Derived from Data Analytics Platforms
Figure 15. Plans to Deploy New Data Analytics Platform to Support Fastest Growing Data Set
Figure 16. Data Analytics Platform Organizations Plan to Deploy to Support Fastest Growing Data Set
Figure 17. Requirements Driving Organizations to Evaluate New Data Analytics Solutions for Fastest Growing Data Set
Figure 18. Geographic Dispersion of Largest Data Set
Figure 19. Challenges of a Geographically Dispersed Data Set
Figure 20. Importance of Features/Functionality in Considering Data Analytics Infrastructure and Platforms
Figure 21. Disk-based Storage Used to Support Data Analytics and Processing Activities
Figure 22. Percent of Total Volume of Data Analytics/Processing Activity Stored on Disk-based Storage
Figure 23. Challenges Scaling Storage Environment to Support Data Analytics and/or Processing Activities
Figure 24. Infrastructure for Data Analytics and Processing Activities
Figure 25. Amount of Downtime Data Analytics Platforms Can Tolerate
Figure 26. Data Protection / Availability Technologies Currently Deployed to Support Data Analytics Platforms
Figure 27. Interest in MapReduce Technology
Figure 28. Interest in MapReduce Technology, by Company Size
Figure 29. Interest in MapReduce Technology, by Size of Largest Data Set
Figure 30. Survey Respondents, by Data Analytics Job Responsibility
Figure 31. Survey Respondents, by Technology Responsibility
Figure 32. Survey Respondents, by Job Function
Figure 33. Survey Respondents, by Number of Employees
Figure 34. Survey Respondents, by Industry
Figure 35. Survey Respondents, by Annual Revenue

List of Tables

Table 1. Size of Largest Data Set for Data Analytics and Processing Functions, by Company Size
Table 2. Sources Responsible for Populating Largest Data Set, by Company Size
Table 3. Data Processing and/or Analytics Challenges with Largest Data Set, by Role
Table 4. Geographic Dispersion of Largest Data Set, by Company Size
Table 5. Challenges of a Geographically Dispersed Data Set, by Company Size
Table 6. Disk-based Storage Used to Support Data Analytics and Processing Activities, by Role and Size of Largest Data Set
Table 7. Percent of Total Volume of Data Analytics/Processing Activity Stored on SAN-based Storage, by Size of Largest Data Set

All trademark names are property of their respective companies. Information contained in this publication has been obtained by sources The Enterprise Strategy Group (ESG) considers to be reliable but is not warranted by ESG. This publication may contain opinions of ESG, which are subject to change from time to time. This publication is copyrighted by The Enterprise Strategy Group, Inc. Any reproduction or redistribution of this publication, in whole or in part, whether in hard-copy format, electronically, or otherwise to persons not authorized to receive it, without the express consent of The Enterprise Strategy Group, Inc., is in violation of U.S. copyright law and will be subject to an action for civil damages and, if applicable, criminal prosecution. Should you have any questions, please contact ESG Client Relations at 508.482.0188.

20 Asylum Street Milford, MA 01757 Tel: 508.482.0188 Fax: 508.482.0128 www.enterprisestrategygroup.com