MapR Enterprise Edition & Enterprise Database Edition

MapR Enterprise Edition & Enterprise Database Edition Reference Architecture
A PSSC Labs Reference Architecture Guide, June 2015

Introduction

PSSC Labs continues to bring innovative compute server and cluster platforms to market. Focusing on specific applications where performance and reliability are critical highlights PSSC Labs' strengths. The Apache Hadoop data framework requires substantial compute and storage capabilities coupled tightly together. MapR Enterprise Edition & Enterprise Database Edition adds high availability, enhanced performance, and extensive data management capabilities.

PSSC Labs is the first manufacturer to design a server platform specifically for Hadoop. Introducing the world's highest-density, lowest-power-consuming, enterprise-ready Big Data server platform designed specifically for Hadoop workloads. The Big Data BD12000 Enterprise Server offers the absolute highest possible compute and storage density combined with high-performance data I/O throughput. PSSC Labs already delivers the Big Data BD12000 for deployments ranging from small POCs to large production clusters spanning hundreds or thousands of nodes.

Key Features:
o MapR Enterprise 4.1 Certified
o Reduce Data Center Footprint By 50%
o Reduce Power Consumption By 40%
o Nearly 50% Greater Data IO Rates
o Patent Pending Tool Free Maintenance

Technical Specifications:
o Up to 12 x 3.5" SATA III Hard Drives or 14 x SSDs in 1U Rack Space
    o 72 TB using twelve x 6 TB Hard Drives
o Supports Up to 2 x Intel Xeon E5 Series Processors
o Supports Up to 256 GB ECC Enterprise Memory
o 10 GigE, 40 GigE and Infiniband Network Support
o Redundant Power Supply
o Redundant Operating System Hard Drives
o Red Hat, CentOS, Ubuntu, MS Windows Compatible

"The Big Data BD 12000 platform offers a unique, highly optimized system design with a direct data path for disk I/O for up to 12 SATA/SAS devices in a 1U rackmount chassis. This hardware explicitly lacks a RAID controller and a backplane, as its intended use is for Hadoop."
Jim Scott, Director, Enterprise Strategy & Architecture
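The capacity and density figures in the specifications can be checked with a few lines of arithmetic. The sketch below is illustrative only: the drive count and drive size come from the specification list, while the 2U comparison chassis holding the same twelve drives is an assumption introduced here, not a figure from this guide.

```python
def raw_capacity_tb(drive_count: int, drive_size_tb: float) -> float:
    """Raw capacity of one chassis in TB, before formatting or replication."""
    return drive_count * drive_size_tb


# From the specification list: twelve 6 TB drives in a 1U chassis.
bd12000_tb = raw_capacity_tb(12, 6)

# Density in TB per rack unit (U).
density_1u = bd12000_tb / 1
# Assumption for comparison only: the same twelve drives in a 2U chassis.
density_2u = bd12000_tb / 2

print(f"{bd12000_tb:.0f} TB per node")
print(f"{density_1u:.0f} TB/U in 1U vs {density_2u:.0f} TB/U in a hypothetical 2U chassis")
```

Under that assumption the 1U chassis holds twice the raw capacity per rack unit, which is the kind of comparison behind a 50% footprint reduction.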

Other Big Data Server Platforms

In addition to our Big Data BD 12000 server platform, PSSC Labs offers additional servers that may better complement MapR workloads. Below are two additional server options that hold 24 hard drives. Depending on the use case, MapR does recommend 24-drive server configurations.

SureStore U2000SD:
o Up to 26 x 2.5" SSD or SAS Hard Drives
o 2U Rackmount Chassis with Redundant Power
o Supports Up to 2 x Intel Xeon E5 Series Processors
o Supports Up to 512 GB ECC Enterprise Memory
o 10 GigE, 40 GigE and Infiniband Network Support
o Redundant Power Supply
o Redundant Operating System Hard Drives
o Red Hat, CentOS, Ubuntu, MS Windows Compatible

Big Data BD 24000:
o Up to 24 x 3.5" SATA III, SSD or SAS Hard Drives
o 2U Rackmount Chassis with Redundant Power
o Supports Up to 2 x Intel Xeon E5 Series Processors
o Supports Up to 512 GB ECC Enterprise Memory
o 10 GigE, 40 GigE and Infiniband Network Support
o Redundant Power Supply
o Redundant Operating System Hard Drives
o Red Hat, CentOS, Ubuntu, MS Windows Compatible

Big Data BD12000 Sample Configurations

Every organization and use case has different computing needs. The Big Data BD12000 offers the greatest flexibility possible. Below are three proposed architectures for different workloads: high-density storage, high computational requirements and a balanced configuration.

Big Data BD12000 High Density
o 48 TB Total Storage
o 12 Xeon E5 Processor Cores
o 64 GB ECC Memory
o 2 x 10GigE Network Ports
o 2 x GigE Network Ports
o Power Draw Estimate: 215 Watt Idle / 285 Watt Full Load

Big Data BD12000 High Compute
o 24 TB Total Storage
o 20 Xeon E5 Processor Cores
o 256 GB ECC Memory
o 2 x 10GigE Network Ports
o 2 x GigE Network Ports
o Power Draw Estimate: 265 Watt Idle / 355 Watt Full Load

Big Data BD12000 Balanced
o 36 TB Total Storage
o 16 Xeon E5 Processor Cores
o 128 GB ECC Memory
o 2 x 10GigE Network Ports
o 2 x GigE Network Ports
o Power Draw Estimate: 230 Watt Idle / 320 Watt Full Load

A Sample of Organizations Currently Using PSSC Labs for Hadoop.
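One way to compare the three sample configurations is to reduce each to a few ratios. The sketch below is a minimal illustration: the numbers are the ones listed above, while the `NodeConfig` class and the choice of ratios (storage per core, memory per core, full-load watts per TB) are assumptions made here for comparison, not metrics defined by PSSC Labs or MapR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeConfig:
    """One BD12000 sample configuration (hypothetical helper type)."""
    name: str
    storage_tb: int
    cores: int
    memory_gb: int
    idle_watts: int
    load_watts: int

    def tb_per_core(self) -> float:
        return self.storage_tb / self.cores

    def gb_per_core(self) -> float:
        return self.memory_gb / self.cores

    def load_watts_per_tb(self) -> float:
        return self.load_watts / self.storage_tb


# Values copied from the three sample configurations.
CONFIGS = [
    NodeConfig("High Density", 48, 12, 64, 215, 285),
    NodeConfig("High Compute", 24, 20, 256, 265, 355),
    NodeConfig("Balanced", 36, 16, 128, 230, 320),
]

for cfg in CONFIGS:
    print(
        f"{cfg.name:<13}"
        f"{cfg.tb_per_core():6.2f} TB/core"
        f"{cfg.gb_per_core():7.2f} GB/core"
        f"{cfg.load_watts_per_tb():7.2f} W/TB at full load"
    )
```

Ratios like these make the trade-off explicit: the High Density node carries the most storage per core, and the High Compute node carries the most memory per core.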

Big Data RAX: MapR Enterprise Edition Validated Turn-Key Cluster Solutions

PSSC Labs offers a complete, turn-key cluster that is ready to run on delivery. With more than 2,000 Computing & Big Data Clusters delivered to date, PSSC Labs understands everything that is necessary for a successful deployment. All necessary hardware, including servers, network equipment, power and infrastructure, is included. PSSC Labs Cluster Engineers preconfigure network, storage, operating system and BIOS settings to the end user's specifications. MapR Enterprise Edition can be installed at the PSSC Labs factory. The final step is running sample data sets to ensure proper functionality and performance.

Each Big Data RAX Turn-Key Cluster is custom configured for specific needs and budget. Below is an overview of each server platform PSSC Labs offers for MapR Enterprise Edition turn-key deployments. Depending on the complexity of the environment, some software resources can be installed on different server platforms.

MapR-OOP 12000 DATA NODE

Tech Specs
o 1U High Density Form Factor
o 2 x Intel Xeon E5 Processors
o 12 x SATA III or SAS Hard Drives or 14 x SSDs
o 12 TB to 72 TB Storage Capacity
o 32 GB to 256 GB ECC Memory
o 2 x GigE Network Adapters
o Optional 10 GigE, 40 GigE, Infiniband Support
o Dedicated IPMI / iKVM

Key Features
o Enterprise Platform
o Redundant Power Supply
o Improved Data IO Throughput
o 40% Reduction in Power Consumption
o 2 x the Density of Standard Server
o Flexible Configuration Options
o 3 Year Warranty Included (24 x 7 x 365 NBD Available)

Software Resources
o YARN/MRv1 Daemons
o MapR-Fileserver Daemons
o MapR-DB
o Zookeeper
o Apache Drill Daemon
o Solr Servers

SURESTORE U2000 MANAGEMENT NODE (ONLY RECOMMENDED FOR 50+ NODE CLUSTERS)

Tech Specs
o 2 x Intel Xeon E5 Processors
o 1 TB to 24 TB SATA III, SAS, SSD Hard Drives
o 32 GB to 256 GB ECC Memory
o 2 x GigE Network Adapters
o Optional 10GigE, 40GigE, Infiniband Support
o Dedicated IPMI / iKVM

Key Features
o Enterprise Platform
o Redundant Power Supply
o Redundant Storage
o RAID Levels 0, 1, 5, 6, 10, 50
o Flexible Configuration Options
o 3 Year Warranty Included (24 x 7 x 365 NBD Available)

Software Resources
o MapR-MCS management
o CLDB
o JobTracker
o Standby JobTracker
o Zookeeper
o Oozie

SURESTORE U1000 EDGE NODE

Tech Specs
o 2 x Intel Xeon E5 Processors
o 1 TB to 12 TB SATA III, SAS, SSD Hard Drives
o 32 GB to 256 GB ECC Memory
o 2 x GigE Network Adapters
o Optional 10GigE, 40GigE, Infiniband Support
o Dedicated IPMI / iKVM

Key Features
o Enterprise Platform
o Redundant Power Supply
o Redundant Storage
o RAID Levels 0, 1, 5, 6, 10, 50
o Flexible Configuration Options
o 3 Year Warranty Included (24 x 7 x 365 NBD Available)

Software Resources
o Hive command line client
o Hadoop command line client
o Drill command line client
o Flume Agents
o Hue Server
o MapR-FS POSIX client

MapR-Rax Turn-Key Cluster Sample Configurations

MapR Rax 300
o 300 TB Total Storage
o 6 Data Nodes
o 36 Xeon E5 Processor Cores
o 196 GB ECC Memory
o 10GigE Network Backplane
o MapR Installation Service
o MapR Validation Service

MapR Rax 500
o 500 TB Total Storage
o 1 Edge Node
o 10 Data Nodes
o 160 Xeon E5 Processor Cores
o 1280 GB ECC Memory
o 10GigE Network Backplane
o MapR Installation Service
o MapR Validation Service
o Rack & Roll Service

MapR Rax 1500
o 1500 TB Total Storage
o 1 Edge Node
o 30 Data Nodes
o 480 Xeon E5 Processor Cores
o 3840 GB ECC Memory
o 10GigE Network Backplane
o MapR Installation Service
o MapR Validation Service
o Rack & Roll Service

"We believe strongly in our ability to deliver the highest performance, highest reliability server platforms to MapR Enterprise end users. Our experience delivering clusters ranging from several hundred TBs to several dozen PBs ensures a successful MapR Enterprise deployment."
Larry Lesser, PSSC Labs, CTO
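The cluster totals above can be broken down into per-node averages to see which node configuration each cluster implies. This is a rough sketch under two assumptions made here, not stated in the guide: that the listed totals are spread evenly across the data nodes, and that the edge node does not contribute to them.

```python
# Totals copied from the three sample cluster configurations.
RAX_CLUSTERS = {
    "MapR Rax 300": {"data_nodes": 6, "storage_tb": 300, "cores": 36, "memory_gb": 196},
    "MapR Rax 500": {"data_nodes": 10, "storage_tb": 500, "cores": 160, "memory_gb": 1280},
    "MapR Rax 1500": {"data_nodes": 30, "storage_tb": 1500, "cores": 480, "memory_gb": 3840},
}


def per_data_node(cluster: dict) -> dict:
    """Average storage, cores and memory per data node (assumes an even split)."""
    nodes = cluster["data_nodes"]
    return {key: cluster[key] / nodes for key in ("storage_tb", "cores", "memory_gb")}


for name, cluster in RAX_CLUSTERS.items():
    avg = per_data_node(cluster)
    print(
        f"{name}: {avg['storage_tb']:.0f} TB, "
        f"{avg['cores']:.0f} cores, "
        f"{avg['memory_gb']:.1f} GB per data node"
    )
```

Under these assumptions the Rax 500 and Rax 1500 both average 16 cores and 128 GB of memory per data node, the same core and memory figures as the Balanced BD12000 sample configuration.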

Total Cost of Ownership Comparison

PSSC Labs' goal is to offer solutions with the absolute lowest total cost of ownership. The chart below compares different server manufacturers' solutions for a 1 petabyte (raw) Hadoop environment. PSSC Labs Big Data BD 1000 requires 50% less rack space and consumes 40% less power. Coupled with MapR's built-in transparent compression, this maximizes usable capacity and lowers the cost per usable TB for data within your environment.

Configurations compared:
o PSSC Labs Big Data RAX 1000 for 1PB Total Storage Space
o Dell Configuration for 1PB Total Storage
o HP Configuration for 1PB Total Storage
o Cisco Configuration for 1PB Total Storage

Required Data Center Footprint
o PSSC Labs: Single x 42U Rack
o Dell: Two x 42U Rack
o HP: Two x 42U Rack
o Cisco: Two x 42U Rack

Power Consumption Estimate*
o PSSC Labs: 4300 Watts Total @ Idle, 5500 Watts Total @ Load
o Dell: 5800 Watts Total @ Idle, 8500 Watts Total @ Load
o HP: 6000 Watts Total @ Idle, 8800 Watts Total @ Load
o Cisco: 5700 Watts Total @ Idle, 8700 Watts Total @ Load

Required Power Circuits
o PSSC Labs: Single x 30 Amp / 208V /
o Dell: Two x 30 Amp / 208V /
o HP: Two x 30 Amp / 208V /
o Cisco: Two x 30 Amp / 208V /

Pre-installation and Validation of Cloudera Enterprise 5 at Factory
o PSSC Labs: Yes. MapR Enterprise Edition preinstalled and tested.

Onsite Physical Installation
o PSSC Labs: Yes. Cluster arrives preracked, cabled and labeled.

Cluster Management Training
o PSSC Labs: Yes.

Dedicated Remote Monitoring Capabilities
o PSSC Labs: Yes. IPMI 2.0 Network Standard

Hardware Warranty

For the Dell, HP and Cisco configurations, the entries in the rows from factory pre-installation through hardware warranty read "... and fees required." and "Yes." for each vendor.

*Dell, HP and Cisco power estimates are based on the power draw estimates published on the manufacturers' websites.
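The power comparison can be expressed as a percentage reduction relative to each competing configuration. The sketch below uses only the idle and load wattages from the chart; the `reduction_pct` helper is a hypothetical name introduced here for illustration.

```python
# (idle watts, load watts) for each 1 PB configuration, from the chart.
POWER_WATTS = {
    "PSSC Labs": (4300, 5500),
    "Dell": (5800, 8500),
    "HP": (6000, 8800),
    "Cisco": (5700, 8700),
}


def reduction_pct(baseline_watts: float, candidate_watts: float) -> float:
    """Percentage by which the candidate draws less power than the baseline."""
    return 100.0 * (baseline_watts - candidate_watts) / baseline_watts


pssc_idle, pssc_load = POWER_WATTS["PSSC Labs"]
for vendor, (idle, load) in POWER_WATTS.items():
    if vendor == "PSSC Labs":
        continue
    print(
        f"vs {vendor}: {reduction_pct(idle, pssc_idle):.1f}% less at idle, "
        f"{reduction_pct(load, pssc_load):.1f}% less at load"
    )
```

With the chart's figures, the reduction works out to roughly 25 to 28 percent at idle and 35 to 38 percent at full load, depending on the vendor compared.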