The Forrester Wave : Big Data Hadoop- Optimized Systems, Q2 2016



Similar documents
The Forrester Wave : Big Data Hadoop Cloud Solutions, Q2 2016

The Forrester Wave : Big Data Hadoop Distributions, Q1 2016

The Forrester Wave : Application Release Automation, Q2 2015

Brief: GE Positions Itself As A Digital Industrial Leader

The Forrester Wave : Enterprise Data Warehouse, Q4 2015

The Forrester Wave : Digital Agencies In China Strategy And Execution, Q1 2015

The Forrester Wave : Big Data Search And Knowledge Discovery Solutions, Q3 2015

The Forrester Wave : Big Data Hadoop Solutions, Q1 2014

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Market Overview: Big Data Integration

Data Warehouse Appliances: The Next Wave of IT Delivery. Private Cloud (Revocable Access and Support) Applications Appliance. (License/Maintenance)

Is Your Data Management Ready For Systems Of Insight?

EXECUTIVE SUMMARY. For IT Infrastructure & Operations Professionals

WHITE PAPER. Building Blocks of the Modern Data Center

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

The Forrester Wave : Enterprise Data Warehouse, Q4 2013

The Forrester Wave : Enterprise Mobile Management, Q4 2015

Overview and Frequently Asked Questions

The Forrester Wave : Traditional Disaster Recovery Service Providers, Q1 2014

Predictions 2016: The Path From Data To Action For Marketers

May 6, 2011 The Forrester Wave : Database Auditing And Real-Time Protection, Q2 2011

The Forrester Wave : Bid Management Software Providers, Q4 2012

Oracle Big Data SQL Technical Update

The Forrester Wave : Loyalty Program Service Providers, Q4 2013

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

September 27, 2007 PremiTech s Passive End User Experience Monitoring Agent Is Performance-Oriented The Forrester Wave Vendor Summary, Q3 2007

The Forrester Wave : In-Memory Database Platforms, Q3 2015

OWB Users, Enter The New ODI World

Interactive data analytics drive insights

The Forrester Wave : Enterprise Data Virtualization, Q1 2015

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

VCE PROFESSIONAL SERVICES PORTFOLIO OVERVIEW

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Dell Cloudera Syncsort Data Warehouse Optimization ETL Offload

The Forrester Wave : Real-Time Interaction Management, Q3 2015

April 15, 2008 The Forrester Wave : Data Center Automation, Q by Evelyn Hubbert for IT Infrastructure & Operations Professionals

Cisco Data Preparation

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

The Forrester Wave : Web Analytics, Q2 2014

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

The Forrester Wave : BPM Platforms For Digital Business, Q4 2015

BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS

Building & Optimizing Enterprise-class Hadoop with Open Architectures Prem Jain NetApp

Platfora Big Data Analytics

Quickly Deploy Microsoft Private Cloud and SQL Server 2012 Data Warehouse on Hitachi Converged Solutions. September 25, 2013

Table of Contents. Technical paper Open source comes of age for ERP customers

What Is Microsoft Private Cloud Fast Track?

Dell* In-Memory Appliance for Cloudera* Enterprise

EXECUTIVE SUMMARY. For IT Infrastructure & Operations Professionals

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

ebay Enterprise Is A Strong Performer Among Omnichannel Order

Hadoop in the Hybrid Cloud

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Oracle Database Cloud Exadata Service. Transforming The Cloud for Mission-critical Business Operations

Ubuntu and Hadoop: the perfect match

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

OnX Oracle Cloud Services

The Forrester Wave : Customer Analytics Solutions, Q4 2012

The Forrester Wave : Master Data Management Solutions, Q1 2014

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief

IBM PureFlex System. The infrastructure system with integrated expertise

Microsoft SQL Server versus IBM DB2 Comparison Document (ver 1) A detailed Technical Comparison between Microsoft SQL Server and IBM DB2

Extend your analytic capabilities with SAP Predictive Analysis

Introducing Oracle Exalytics In-Memory Machine

The Forrester Wave : Big Data Streaming Analytics, Q1 2016

Database-As-A-Service Saves Money, Improves IT Productivity, And Speeds Application Development

Oracle Big Data Building A Big Data Management System

Oracle Big Data Management System

VMware vrealize Operations. Management Pack for. PostgreSQL

The Forrester Wave : Enterprise Backup And Recovery Software, Q2 2013

SAP HANA FAQ. A dozen answers to the top questions IT pros typically have about SAP HANA

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

The Forrester Wave : Application Security, Q4 2014

How to leverage SAP HANA for fast ROI and business advantage 5 STEPS. to success. with SAP HANA. Unleashing the value of HANA

The Forrester Wave : SEO Platforms, Q4 2012

ORACLE INFRASTRUCTURE AS A SERVICE PRIVATE CLOUD WITH CAPACITY ON DEMAND

Dell s SAP HANA Appliance

The Forrester Wave : Cross-Channel Campaign Management, Q3 2014

The Forrester Wave : Enterprise File Sync And Share Platforms, Hybrid Solutions, Q2 2016

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Information Architecture

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

The Inside Scoop on Hadoop

Zenoss for Cisco ACI: Application-Centric Operations

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

WHITE PAPER Get Your Business Intelligence in a "Box": Start Making Better Decisions Faster with the New HP Business Decision Appliance

Brochure. Update your Windows. HP Technology Services for Microsoft Windows 2003 End of Support (EOS) and Microsoft Migrations

Transcription:

The Forrester Wave : Big Data Hadoop- Optimized Systems, Q2 2016 by Noel Yuhanna and Mike Gualtieri Why Read This Report On-premises Hadoop deployments just got easier. General-purpose hardware infrastructure requires considerable time and effort to install, configure, tune, upgrade, and monitor Hadoop platforms. Hadoop-optimized systems help overcome these issues by optimizing the infrastructure with automation, balanced system resources, and integrated testing. Enterprise architecture (EA) pros can find the best solution for their needs with Forrester s 26-criteria evaluation of seven leading vendors: Cisco Systems, Cray, Dell, IBM, NetApp, Oracle, and Teradata. Key Takeaways We Found Seven Viable Hadoop-Optimized System Vendors Among the seven vendors evaluated, we found three Leaders (Oracle, Teradata, and Cisco Systems) and four Strong Performers (Cray, IBM, Dell, and NetApp). All offer competitive solutions that provide enterprise scale and proven Hadoop systems. EA Pros Look At Time-To-Value And This market is growing because EA pros see Hadoop-optimized systems as a strategic platform to support their big data initiatives. When selecting a system, enterprises should look at modularity, automation, performance, and overall price/performance attributes. Scale,, And Tooling Are Key Differentiators While all of the vendors offer compelling value and features, we found three key differentiating factors: automation to simplify most tasks, modular scalability, and comprehensive tooling for hardware and software. forrester.com

by Noel Yuhanna and Mike Gualtieri with Gene Leganza and Shreyas Warrier Table Of Contents Notes & Resources 2 3 6 8 11 Optimized Systems Accelerate Hadoop Deployments Big Data Hadoop-Optimized Systems Evaluation Overview Evaluated Vendors And Inclusion Criteria Choose Any Of These Vendors For A Fast, Stable On-Prem Deployment Vendor Profiles Leaders Strong Performers Supplemental Material Forrester conducted product evaluations in March 2016 and interviewed seven vendor and user companies: Cisco Systems, Cray, Dell, IBM, NetApp, Oracle, and Teradata. Related Research Documents Big Data Fabric Drives Innovation And Growth The Forrester Wave : Big Data Hadoop Cloud Solutions, Q2 2016 The Forrester Wave : Big Data Hadoop Distributions, Q1 2016 Forrester Research, Inc., 60 Acorn Park Drive, Cambridge, MA 02140 USA +1 617-613-6000 Fax: +1 617-613-5000 forrester.com 2016 Forrester Research, Inc. Opinions reflect judgment at the time and are subject to change. Forrester, Technographics, Forrester Wave, RoleView, TechRadar, and Total Economic Impact are trademarks of Forrester Research, Inc. All other trademarks are the property of their respective companies. Unauthorized copying or distributing is a violation of copyright law.

Optimized Systems Accelerate Hadoop Deployments Enterprises agree that speedy deployment of big data Hadoop platforms has been critical to their success, especially as use cases expand and proliferate. However, deploying Hadoop systems is often difficult, especially when supporting complex workloads and dealing with hundreds of terabytes or petabytes of data. Architects need a considerable amount of time and effort to install, tune, and optimize Hadoop. Hadoop-optimized systems offer a new opportunity to deliver next-generation insights quickly rather than first dealing with hardware and technology challenges. Unlike generic hardware infrastructure, Hadoop-optimized systems are preconfigured systems that integrate hardware and software components to deliver optimal performance and support various big data workloads. As a result, organizations spend less time installing, tuning, troubleshooting, patching, upgrading, and dealing with integration- and scale-related issues. Forrester defines Hadoop-optimized systems as: Preconfigured hardware platforms comprising CPU, memory, and/or disk that are optimized to run a Hadoop distribution, which includes value-added tools, Hadoop ecosystem software, and/ or add-ons appropriate for use by enterprises. Although this is a relatively new market, it promises to gain significant traction in the coming years as enterprises look to accelerate and scale big data deployments. EA pros choose Hadoop-optimized systems to support: Faster time-to-value. Hadoop-optimized systems are preconfigured and ready to run you ll no longer spend weeks or months acquiring and configuring all the components of a Hadoop cluster. Optimized systems accelerate the deployment of big data use cases, allowing enterprise architects to enable more use cases to be done in a given time frame. They are also designed for quick expansion when data and compute needs grow. Reduced cost. Hadoop-optimized systems require high upfront capital cost, often more than generic hardware infrastructure. But, in the long run, they save money via deferred system upgrades to address optimization needs, improved administrator productivity, their extensibility because of modular design, and minimized downtime. Minimized administration effort. Today, organizations have many Hadoop administrators who manage Hadoop clusters to support their big data initiatives. These administrators spend a considerable amount of time and effort on installing, configuring, tuning, upgrading, and expanding their clusters. Our customer interviews show that organizations that are currently using Hadoopoptimized systems reduce their administration efforts by between 25% and 65% when compared with generic hardware platforms. 2

Modular expansion. Unlike cloud deployments, on-premises Hadoop deployments often require preplanning to deploy the hardware configuration, such as the number of cores, amount of memory, and storage blades. With Hadoop-optimized systems, most vendors support a modular system architecture, whereby staff can add additional hardware components like memory, CPU cores, nodes, and storage as needed. Big Data Hadoop-Optimized Systems Evaluation Overview To assess the state of the big data Hadoop-optimized systems market and see how the vendors stack up against each other, Forrester evaluated the strengths and weaknesses of seven Hadoopoptimized systems vendors: Cisco Systems, Cray, Dell, IBM, NetApp, Oracle, and Teradata. IBM was a nonparticipating vendor. After examining past research, user need assessments, and vendor and expert interviews, we developed a comprehensive set of 26 evaluation criteria, which we grouped into three high-level buckets: Current offering. We evaluated each product s configuration options, scalability, performance, high availability, disaster recovery, administration, security, core data capabilities, development tools, and other features to establish the capabilities of the vendor s current offering. Strategy. We reviewed each vendor s strategy to assess its ability to compete and grow in the Hadoop-optimized systems market. Key criteria include Forrester s level of confidence in the vendor s ability to execute on its stated strategy and to support current and future customers. We also reviewed each vendor s product road map to assess how it will affect the vendor s competitive position compared with the others in this evaluation. Market presence. To determine each vendor s market presence, we evaluated overall Hadoop revenue, the number of paying customers, the global distribution of paying customers, market awareness of the vendor s product, and partnerships with other technology and services firms. Evaluated Vendors And Inclusion Criteria Each of the evaluated vendors has (see Figure 1): A comprehensive Hadoop-optimized system offering. The vendors included in this evaluation provide hardware infrastructure that includes server and storage to support Hadoop deployments in a clustered configuration. Vendors have tested the systems to be functional with one or more Hadoop distributions and optimized them for scale, performance, security, and integration. They also offer tools that help with installation, configuration, and management of Hadoop systems. The vendors included must provide installation and support services directly or through partners. Optimized their systems for Hadoop. We only included vendors that deliver systems for Hadoop to support various enterprise big data use cases. These were standalone systems running only Hadoop and Hadoop-related software. 3

A referenceable install base. Included vendors have at least two unique enterprise customers using the Hadoop-optimized system offering. Client inquiries or noteworthy technologies that put the vendor on Forrester s radar. Forrester monitors which vendors and systems clients discuss through inquiries. We included vendors that were mentioned more often than others. Alternatively, the vendor may, in Forrester s judgment, warrant inclusion or exclusion because of technology trends and market presence. A publicly available offering. The participating vendors must have actively marketed a Hadoopoptimized system as of January 1, 2016. 4

FIGURE 1 Evaluated Vendors: Product Information And Selection Criteria Vendor Cisco Systems Cray IBM NetApp Oracle Teradata Dell Product Cisco UCS Urika-XA 1.0 IBM Power Systems including IBM BigInsights, other IBM Software components, and IBM Data Engine for Analytics NetApp Solutions for Hadoop is the broad name. Specific products include NetApp E5600, E2700, EF560, and NFS Connection for Hadoop as well as SANtricity OS 8.20 and SANtricity Storage Manager 11.20. Big Data Appliance X5-2 Teradata Appliance for Hadoop 5; supported by Teradata Unified Data Architecture; Teradata Database 15.10; Aster Discovery Platform 6.1; Teradata QueryGrid 15.10; Teradata Listener; and Teradata Cloud Dell Cloudera Apache Hadoop Solution (CDH 5.5) Vendor inclusion criteria Hadoop hardware infrastructure. The vendors included in this evaluation provide hardware infrastructure that includes server and storage to support Hadoop deployments in a clustered configuration. Vendors have tested the system to be functional with one or more Hadoop distributions and optimized them for scale, performance, security, and integration. The vendors also offer tools that help with installation, configuration, and management of Hadoop systems. The vendors included must provide installation and support services directly or through partners. Standalone Hadoop system. We only included Hadoop systems that are not technologically embedded into any particular application, business intelligence (BI), predictive analytics, ETL, or middleware stacks. It must be supported in a standalone system running Hadoop. Referenceable install base. There should be more than two enterprise customers using the Hadoop-optimized system. Publicly available. The participating vendors must have actively marketing a Hadoop-optimized system as of January 1, 2016. Client inquiries and/or technologies that put the vendor on Forrester s radar. Forrester clients often discuss the vendors and products through inquiries; alternatively, the vendor may, in Forrester s judgment, warrant inclusion or exclusion in this evaluation because of technology trends and market presence. 5

Choose Any Of These Vendors For A Fast, Stable On-Prem Deployment Forrester s evaluation of big data Hadoop-optimized systems uncovered a market with three Leaders and four Strong Performers (see Figure 2): Oracle, Teradata, and Cisco Systems lead the pack. The Leaders have demonstrated mature and comprehensive Hadoop-optimized system offerings for sophisticated big data applications, large and complex deployments, and broad tooling to deliver enterprise-scale business value. They were early to offer Hadoop systems and leveraged their significant experience with both hardwareand software-optimized solutions in data warehousing to build an optimized Hadoop solution. Cray, IBM, Dell, and NetApp offer competitive options. Strong Performers provide competitive Hadoop-optimized systems to organizations that demand improved performance from their existing generic hardware platforms. 6

FIGURE 2 The Forrester Wave : Big Data Hadoop-Optimized Systems, Q2 16 Challengers Contenders Strong Strong Performers Leaders Oracle Current offering NetApp Dell Teradata Cisco Systems Cray IBM Go to Forrester.com to download the Forrester Wave tool for more detailed product evaluations, feature comparisons, and customizable rankings. Market presence Weak Weak Strategy Strong 7

FIGURE 2 The Forrester Wave : Big Data Hadoop-Optimized Systems, Q2 16 Forrester s Weighting Cisco Systems Cray Dell IBM NetApp Oracle Teradata CURRENT OFFERING Solution configuration System architecture System administration Data management System integration 50% 10% 40% 30% 10% 10% 3.55 4.40 4.10 3.10 2.40 3.24 2.80 3.70 3.60 1.00 2.98 3.80 3.10 2.80 2.20 2.75 2.20 2.60 2.30 3.32 4.30 3.20 2.40 1.00 4.44 2.80 4.30 4.80 4.34 4.20 3.90 4.80 4.20 STRATEGY System and support cost Ability to execute System support Solution fixes Road map 50% 0% 30% 10% 10% 50% 0.00 1.00 3.70 0.00 1.00 3.20 0.00 1.00 3.90 0.00 2.80 0.00 1.00 4.80 0.00 4.30 0.00 MARKET PRESENCE Company financials Customer base Partnerships 0% 30% 60% 10% 2.70 3.20 2.30 3.60 1.59 1.30 1.70 1.80 3.30 3.60 2.40 3.30 3.20 3.20 4.20 2.97 2.30 3.20 3.60 4.17 4.10 4.20 4.20 3.11 2.30 3.20 All scores are based on a scale of 0 (weak) to 5 (strong). Vendor Profiles This evaluation of the big data Hadoop-optimized systems market is intended to be a starting point only. We encourage clients to view detailed product evaluations and adapt criteria weightings to fit their individual needs through the Forrester Wave Excel-based vendor comparison tool. Clients can also schedule an inquiry with the authors of this report to have a conversation about market trends and specific vendor products. In addition, be sure to read our related Forrester Wave evaluations of big data Hadoop distributions and big data Hadoop cloud solutions. 1 Leaders Oracle s mature Big Data Appliance continues to make its impact. Oracle announced Oracle Big Data Appliance (BDA) in October 2011, making it one of the first vendors to offer a Hadoopoptimized system. Oracle BDA is a fully preconfigured, pretested, and pretuned system that includes a full complement of software components, including Cloudera Enterprise Data Hub Edition, Oracle NoSQL Database CE, Oracle R Distribution, Oracle Linux, Oracle Data Integrator, Oracle Loader for Hadoop, Oracle R Advanced Analytics for Hadoop, and Oracle Spatial and 8

Graph. Oracle BDA can run a diverse set of workloads, from Hadoop and Spark only to interactive, all-encompassing interactive SQL queries using Oracle Big Data SQL. Currently, Cloudera is the only distribution supported on Oracle BDA. Customers like Oracle BDA for its ability to quickly stand up enterprise-grade Hadoop and Spark clusters and to integrate with Oracle Exadata and Oracle Database. In addition, Oracle is one of the few vendors that have comprehensive data security for Hadoop. Teradata s fifth-generation Hadoop appliance is efficient and refined. Teradata is one of the few vendors that have had significant success with data management appliances over the past decade. Teradata Appliance for Hadoop comes integrated with multiple software components (open source and commercial), including the latest versions of Hadoop from Cloudera or Hortonworks, Teradata Connector for Hadoop, and Teradata Hadoop Command Line Interface. The data-center-ready stack includes high-performance and high-availability components in Teradata BYNETV5 and InfiniBand for faster networking speeds and the flexibility to optimize hardware configurations for performance, capacity, and balance. It also includes Teradata Vital Infrastructure for proactive monitoring, troubleshooting, and support. Teradata QueryGrid enables on-the-fly data access and pushdown processing from other Teradata data warehouse appliances. Teradata also provides solutions consulting through its ThinkBig consulting group to design and build integrated business solutions. Cisco Systems provides a viable midsized system at attractive price point. Cisco UCS Integrated Infrastructure for Big Data provides a secure and scalable infrastructure to support enterprise requirements. Cisco s UCS solution comes pretested and prevalidated for Cloudera, Hortonworks, IBM, and MapR, providing a lower-cost and scalable storage platform to support Hadoop deployments. Management tools such as Cisco UCS Manager and Cisco UCS Director allow for simple configuration of big data Hadoop clusters that can adapt dynamically to changing workloads. Cisco s key differentiators lie in its ability to offer a wide range of configurations, its strong focus on internet-of-things (IoT) use cases, and its broad partner ecosystem. Strong Performers Cray brings its high-performance computing experience to Hadoop. When it comes to supercomputers, Cray is a brand that has been around for more than 40 years, and this tradition and experience is now making its way into big data architectures. Cray offers strong capabilities for interconnected, memory-centric architectures, and its optimized performance lets it handle various workloads. The Cray system is shipped installed with a pre-integrated and optimized software stack using high-density on-node memory as well as SSD and HDD drives to efficiently support Hadoop and Spark workloads. 2 Cray includes fast interconnects and an on-node dense memory hierarchy architecture that positions its analytics platform well for supporting new and emerging in-memory workloads, such as graph analytics, machine learning, and IoT. Cray s strong highperformance computing history, good technical support, and continued focus and commitment to analytics and big data will appeal to a larger audience in the coming years. 9

IBM s new Hadoop system on power will increase its competitiveness. Today, IBM s customers use Linux-based IBM Power Systems in integrated solutions for Hadoop and Spark deployments, including IBM BigInsights and other software components. IBM Data Engine for Analytics uses a centralized storage model that eliminates data replication, which is ideal for large-scale environments. Our scoring reflects the product available as of January 1, 2016, but the recent announcement of IBM Data Engine for Hadoop and Spark will expand IBM s competitiveness. 3 These products include IBM s newly added line of OpenPower Linux servers designed for big data, with support for larger caches, memory bandwidth, and optimized resource management. The vendor is also likely to extend its capabilities to support vertical-specific and use-case-driven systems. Partners such as Avnet, Mainline, and Vion continue to build fully integrated, end-to-end, custom-built Hadoop systems based on IBM s reference architecture. IBM is a nonparticipating vendor in this Forrester Wave. Dell takes a simplified reference architecture approach to Hadoop. Dell offers engineered solutions for a variety of workloads, including data warehouse optimization using ETL offload and Hadoop deployments. 4 Dell s Hadoop reference architecture utilizes Cloudera support for the Hadoop distribution in a highly configurable, optimized hardware package. Besides leveraging various partners from the Hadoop ecosystem, it supports native integration with Dell Boomi for data integration and Syncsort for data transformation jobs in Hadoop. While Dell continues to develop and expand its Hadoop appliance portfolio, its commitment to supporting a broader range of servers, its growing data management capabilities, and its attractive price and performance have proven appealing to many enterprises. NetApp s scalable storage solution attaches to Hadoop. NetApp is a computer storage and data management vendor that supports various use cases, including big data for enterprises. It provides validated and supported designs as well as a certified architecture for major Hadoop distributions, including Cloudera, Hortonworks, and MapR. NetApp also offers SANtricity software to check storage performance and ensure optimization for various workloads, and NetApp AutoSupport provides proactive monitoring and predictive analytics on the NetApp storage system. While the vendor has a strong storage offering and NetApp NFS Connector for Hadoop, it relies on third-party vendors for servers, such as its integration with Cisco UCS Servers. Enterprises looking for optimized Hadoop storage solutions will find NetApp to be a viable option, especially for those that run into petabyte-sized deployments. 10

Engage With An Analyst Gain greater confidence in your decisions by working with Forrester thought leaders to apply our research to your specific business and technology initiatives. Analyst Inquiry Ask a question related to our research; a Forrester analyst will help you put it into practice and take the next step. Schedule a 30-minute phone session with the analyst or opt for a response via email. Learn more about inquiry, including tips for getting the most out of your discussion. Analyst Advisory Put research into practice with in-depth analysis of your specific business and technology challenges. Engagements include custom advisory calls, strategy days, workshops, speeches, and webinars. Learn about interactive advisory sessions and how we can support your initiatives. Supplemental Material Online Resource The online version of Figure 2 is an Excel-based vendor comparison tool that provides detailed product evaluations and customizable rankings. Data Sources Used In This Forrester Wave Forrester used a combination of vendor briefings and customer interviews to assess the strengths and weaknesses of each solution. We evaluated the vendors participating in this Forrester Wave, in part, using materials that they provided to us by May 16, 2016. Hands-on lab evaluations. Vendors spent one day with a team of analysts who performed a hands-on evaluation of the product using a scenario-based testing methodology. We evaluated each product using the same scenario(s), creating a level playing field by evaluating every product on the same criteria. Vendor surveys. Forrester surveyed vendors on their capabilities as they relate to the evaluation criteria. Once we analyzed the completed vendor surveys, we conducted vendor calls where necessary to gather details of vendor qualifications. 11

Product demos. We asked vendors to conduct demonstrations of their products functionality. We used findings from these product demos to validate details of each vendor s product capabilities. Customer reference calls. To validate product and vendor qualifications, Forrester also conducted reference calls with two of each vendor s current customers. The Forrester Wave Methodology We conduct primary research to develop a list of vendors that meet our criteria to be evaluated in this market. From that initial pool of vendors, we then narrow our final list. We choose these vendors based on: 1) product fit; 2) customer success; and 3) Forrester client demand. We eliminate vendors that have limited customer references and products that don t fit the scope of our evaluation. After examining past research, user need assessments, and vendor and expert interviews, we develop the initial evaluation criteria. To evaluate the vendors and their products against our set of criteria, we gather details of product qualifications through a combination of lab evaluations, questionnaires, demos, and/or discussions with client references. We send evaluations to the vendors for their review, and we adjust the evaluations to provide the most accurate view of vendor offerings and strategies. We set default weightings to reflect our analysis of the needs of large user companies and/or other scenarios as outlined in the Forrester Wave evaluation and then score the vendors based on a clearly defined scale. We intend these default weightings to serve only as a starting point and encourage readers to adapt the weightings to fit their individual needs through the Excel-based tool. The final scores generate the graphical depiction of the market based on current offering, strategy, and market presence. Forrester intends to update vendor evaluations regularly as product capabilities and vendor strategies evolve. For more information on the methodology that every Forrester Wave follows, go to http://www.forrester.com/marketing/policies/forrester-wave-methodology.html. Integrity Policy We conduct all our research, including Forrester Wave evaluations, in accordance with our Integrity Policy. For more information, go to http://www.forrester.com/marketing/policies/integrity-policy.html. Endnotes 1 Forrester s evaluation of five leading Hadoop distribution vendors Cloudera, Hortonworks, IBM, MapR Technologies, and Pivotal Software is based on 35 criteria. Forrester s 37-criteria evaluation of eight leading big data Hadoop cloud solution vendors Altiscale, Amazon Web Services (AWS), Google, IBM, Microsoft, Oracle, Qubole, and Rackspace will help EA pros understand the available solutions and recommend the best one for their organization. See the The Forrester Wave : Big Data Hadoop Distributions, Q1 2016 Forrester report and see the The Forrester Wave : Big Data Hadoop Cloud Solutions, Q2 2016 Forrester report. 2 SSD: solid-state drive. HDD: hard disk drive. 3 IBM Data Engine for Hadoop and Spark is a new offering that combines the recently added line of OpenPower Linux Servers designed for big data and analytics with open source Apache Hadoop and Spark distribution along with optional advanced analytics capabilities. Source: IBM Data Engine for Hadoop and Spark - Power Systems Edition: 12

An integrated and optimized solution for high-performance Hadoop-based and Spark-based analytics, IBM, February 9, 2016 (http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?subtype=ca&infotype=an&supplier=897&letternum=en US116-011). 4 ETL: extract, transform, load. 13

We work with business and technology leaders to develop customer-obsessed strategies that drive growth. Products and Services Core research and tools Data and analytics Peer collaboration Analyst engagement Consulting Events Forrester s research and insights are tailored to your role and critical business initiatives. Roles We Serve Marketing & Strategy Professionals CMO B2B Marketing B2C Marketing Customer Experience Customer Insights ebusiness & Channel Strategy Technology Management Professionals CIO Application Development & Delivery Enterprise Architecture Infrastructure & Operations Security & Risk Sourcing & Vendor Management Technology Industry Professionals Analyst Relations Client support For information on hard-copy or electronic reprints, please contact Client Support at +1 866-367-7378, +1 617-613-5730, or clientsupport@forrester.com. We offer quantity discounts and special pricing for academic and nonprofit institutions. Forrester Research (Nasdaq: FORR) is one of the most influential research and advisory firms in the world. We work with business and technology leaders to develop customer-obsessed strategies that drive growth. Through proprietary research, data, custom consulting, exclusive executive peer groups, and events, the Forrester experience is about a singular and powerful purpose: to challenge the thinking of our clients to help them lead change in their organizations. For more information, visit forrester.com. 123865