HP Reference Architecture for MapR M5


Technical white paper

HP Reference Architecture for MapR M5
HP Converged Infrastructure with MapR M5 for Apache Hadoop

Table of contents
Executive summary
MapR overview
Pre-deployment considerations
Operating system
Computation
Memory
Storage
Network
Switches
Data movement into and out of the MapR cluster
HP Insight Cluster Management utility
Server selection
Reference Architectures
Single Rack configuration
Medium Cluster configuration
Large Cluster configuration
Edge Nodes
Summary
For more information

Executive summary

HP and MapR allow you to derive new business insights from Big Data by providing a platform to store, manage and process data at scale. However, Apache Hadoop is complex to deploy, configure, manage and monitor. This HP Converged Infrastructure with MapR M5 for Hadoop white paper provides several performance-optimized configurations for deploying MapR M5 distribution for Apache Hadoop clusters of varying sizes on HP infrastructure. The reference architecture configurations for MapR M5 provide a significant reduction in complexity, faster time to value and an improvement in performance.

This white paper has been created to assist in the rapid design and deployment of MapR M5 software on HP infrastructure for clusters of various sizes. It is also intended to concretely identify the software and hardware components required in a solution in order to simplify the ordering process. The recommended HP Software, HP ProLiant servers and HP Networking switches and their respective configurations have been carefully tested with a variety of I/O, CPU, network and memory bound workloads to ensure the best designs possible.

HP Big Data solutions provide best-in-class performance and availability, with integrated software, services, infrastructure, and management all delivered as one proven solution as described at hp.com/go/hadoop. In addition to the benefits described above, the solution in this white paper also includes the following features that are unique to HP:

For networking, the HP 5830AF-48G 1GbE Top of Rack switch and the HP 5900AF-48XG-4QSFP Aggregation switch provide IRF Bonding and sFlow, which simplify the management, monitoring and resiliency of the customer's Hadoop network. In addition, the HP 5830AF-48G delivers 1GB packet buffers to increase Hadoop network performance by seamlessly handling burst scenarios such as Shuffle, Sort and Block Replication, which are common in a Hadoop network.
For management, HP Insight Cluster Management Utility (CMU) provides push-button scale out and provisioning with industry leading provisioning performance (deployment of 800 nodes in 30 minutes), reducing deployments from days to hours. In addition, CMU provides real-time and historical infrastructure and Hadoop monitoring with 3D visualizations, allowing customers to easily characterize Hadoop workloads and cluster performance. This reduces complexity and improves system optimization, leading to improved performance and reduced cost. HP Insight Management and HP Service Pack for ProLiant allow for easy management of firmware and the server.

For servers, the HP ProLiant DL380e provides:
- Up to 14 Large Form Factor disk drives, providing increased storage capacity and I/O performance for the MapR File System.
- The HP Smart Array P420 controller, which provides increased [1] I/O throughput performance, resulting in a significant performance increase for I/O bound Hadoop workloads (a common use case), and the flexibility for the customer to choose the desired amount of resilience in the Hadoop Cluster with either JBOD or various RAID configurations.
- Two sockets with the fastest 6 core processors and the Intel C600 Series Chipset, providing the performance required for fastest time to completion for CPU bound Hadoop workloads.
- The HP iLO Management Engine on the servers, which contains HP Integrated Lights-Out 4 (iLO 4) and features a complete set of embedded management features for HP Power/Cooling, Agentless Management, Active Health System, and Intelligent Provisioning, reducing node and cluster level administration costs for Hadoop.

All of these features reflect HP's balanced building blocks of servers, storage and networking, along with integrated management software and bundled support.
Target audience: This document is intended for decision makers, system and solution architects, system administrators, and experienced users who are interested in reducing time to design or purchase an HP and MapR solution. An intermediate knowledge of Apache Hadoop and scale out infrastructure is recommended. Those already possessing expert knowledge about these topics may proceed directly to the Pre-deployment considerations section.

[1] Compared to the previous generation of Smart Array controllers

MapR overview

MapR makes Hadoop a reality for the enterprise. MapR's distribution for Hadoop is designed to not only meet core enterprise needs but also provide exceptional performance. MapR provides these distinct advantages:

Lights Out Data Center capability
MapR provides high availability across the entire stack including storage, MapReduce, NFS layers and more. In the storage layer, the No NameNode architecture provides high availability with self-healing and support for multiple, simultaneous failures, with no additional hardware whatsoever. In the MapReduce layer, MapR's JobTracker HA makes JobTracker failures transparent to applications: the currently running tasks continue to execute during the failover process. In the NFS layer, MapR automatically manages virtual IP addresses and balances them between the nodes so that failures are transparent to clients that are reading and writing data via NFS. In addition, these capabilities are combined with both data protection (snapshots) and disaster recovery (mirroring), providing protection against user errors and enabling greater resiliency. Automated rolling upgrades, in turn, eliminate the need to bring the cluster down to move to the latest software release of MapR.

Ease of Use
MapR provides the easiest interface to integrate other enterprise software with Hadoop. The NFS layer allows reading and writing directly to a file in the cluster, enabling near real-time streaming of data into a MapR cluster. Support for the industry standard ODBC API facilitates seamless connection to SQL based systems. On the administration side, the MapR Control System provides a management console to set the frequency of snapshots, the number of replications and the mirroring policies, and to better understand user defined cluster health criteria. Furthermore, because the physical cluster can be logically partitioned, it is easy to share the cluster across different users, groups and applications.
Performance
MapR delivers faster performance with enterprise ready features such as snapshots and remote mirroring, and is the only distribution that has near limitless scale-out capability. The MapR Distribution for Hadoop includes almost all of the associated Apache projects (see Figure 1) and is 100% API compatible.

Figure 1. MapR's software overview

Note
MapR provides the aforementioned features via services that run on nodes throughout the cluster. Services that manage the cluster and coordinate the jobs ("control services") run on a small number of nodes that the user can designate; the services responsible for the actual execution of work ("worker services") run on most or all nodes in the cluster.

The Reference Architectures we provide in this document are prescriptive as to which nodes the control and worker services are deployed to. By mapping the services onto specific nodes within the HP infrastructure for clusters of varying sizes, we have simplified your deployment.

Table 1. Control Services Summary
Service | Package | Recommended Number of Instances
Container Location Database (CLDB) | mapr-cldb | 1-6
Web Server | mapr-webserver | 1 or more
ZooKeeper | mapr-zookeeper | 3, 5
JobTracker | mapr-jobtracker | 1-3
HBase Master | mapr-hbase-master | 1-3

Table 2. Worker Services Summary
Service | Package | Distribution across nodes
FileServer | mapr-fileserver | Most or all nodes
TaskTracker | mapr-tasktracker | Most or all nodes
NFS | mapr-nfs | Most or all nodes
HBase RegionServer | mapr-hbase-regionserver | Varies

Pre-deployment considerations

There are a number of key factors one should consider prior to designing and deploying a Hadoop Cluster. The following subsections articulate the design decisions made in creating the baseline configurations for the reference architectures. The rationale provided includes the information needed to take the configurations and modify them to suit a particular custom scenario.

Table 3. Overview of Functional Components and Configurable Value
Functional Component | Value
Operating System | Improves Availability and Reliability
Computation | Ability to balance Price with Performance
Memory | Ability to balance Price with Capacity and Performance
Storage | Ability to balance Price with Capacity and Performance
Network | Ability to balance Price with Performance

Operating system
MapR supports 64-bit Red Hat Enterprise Linux (RHEL) 5.4 or higher, 64-bit CentOS 5.4 or higher, and 64-bit Ubuntu 9.04 or higher as choices for the operating system. A 64-bit operating system is required for MapR deployment. 64-bit Red Hat Enterprise Linux 5.4 or greater is recommended due to better ecosystem support, more comprehensive functionality for components such as RAID controllers, and compatibility with HP Insight CMU. The Reference Architectures listed in this document were tested with 64-bit Red Hat Enterprise Linux 6.2.

Computation
Map/Reduce slots are configured on a per server basis. Employing Hyper-Threading increases your effective core count, allowing you to configure more Map/Reduce slots. Refer to the Storage section to see how I/O performance issues arise from sub-optimal disk to core ratios (too many slots and too few disks).
For CPU bound workloads, we recommend buying processors with faster clock speeds to remove the bottleneck. For the best cost/performance tradeoff, we recommend 6 core processors with faster clock speeds rather than 8 core processors.

The default number of slots per node in a MapR cluster is set to the following values:

Map Slots = (Total Hyper-Threaded cores per server * 0.75)
Reduce Slots = (Total Hyper-Threaded cores per server * 0.5)

Each server in the configurations in the Reference Architectures section below is a dual socket system with a total of 12 cores, and Hyper-Threading is enabled for 24 logical processors. This gives a default of 18 Map Slots and 12 Reduce Slots per server. Some application mixes may do better with either more or fewer slots per node.

Memory
Error Correcting Code (ECC) memory should be used with MapR clusters and is standard on all HP ProLiant servers. Memory requirements differ between the control services and the worker services. For the worker services, sufficient memory is needed to manage the TaskTracker and FileServer services in addition to the sum of all the memory assigned to each of the Map/Reduce slots. If you have a memory bound Map/Reduce job, we recommend that you increase the amount of memory on all the nodes running worker services.

When increasing memory, one should always attempt to populate all the available memory channels to ensure optimum performance. For instance, with 6 memory channels in total per server, one could begin by populating the channels with 4GB DIMMs, resulting in 24GB of RAM for the server. If that became insufficient, one could replace all the 4GB DIMMs with 8GB DIMMs, resulting in 48GB of RAM for the server. The MapR M5 cluster can also be used for HBase, which is very memory intensive; this is why we recommend more RAM per server. The ProLiant DL380e Gen8 server has a baseline of 48GB of memory and can easily be upgraded to a higher memory capacity while keeping all memory channels populated.

The Reference Architectures section details how to constrain the number of Map/Reduce slots on nodes that are running control services to avoid the services competing for resources.
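The default slot formulas and the DIMM population example above can be checked with a few lines of Python. This is a minimal sketch: the function names are hypothetical, and truncating fractional results to whole slots is an assumption, since the text only gives the ratios.

```python
def default_slots(logical_cores):
    """Return (map_slots, reduce_slots) from the default ratios quoted above.

    Assumption: fractional results are truncated to whole slots.
    """
    map_slots = int(logical_cores * 0.75)
    reduce_slots = int(logical_cores * 0.5)
    return map_slots, reduce_slots


def ram_gb(memory_channels, dimm_gb):
    """Total RAM when every memory channel holds one DIMM of the given size."""
    return memory_channels * dimm_gb


# Dual socket, 6 cores per socket, Hyper-Threading enabled: 24 logical processors.
logical_cores = 2 * 6 * 2
print(default_slots(logical_cores))   # (18, 12)
print(ram_gb(6, 4), ram_gb(6, 8))     # 24 48
```

Changing `logical_cores` shows how the default slot counts move with a different processor choice.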
Storage
Fundamentally, MapR is designed to achieve performance and scalability by moving the compute activity to the data. It does this by distributing the Hadoop jobs to nodes close to their data, ideally running the tasks against data on local disks. Given the architecture of MapR, the data storage requirements for the worker nodes are best met by direct attached storage (DAS) in a Just a Bunch of Disks (JBOD) configuration and not as DAS with RAID or Network Attached Storage (NAS). There are several factors to consider and balance when determining the number of disks a node requires.

Storage capacity
The number of disks and their corresponding storage capacity determines the total FileServer storage capacity for your cluster. We recommend Large Form Factor (3.5 inch) disks due to their lower cost and larger storage capacity compared with the Small Form Factor (2.5 inch) disks.

Redundancy
MapR ensures that a certain number of block copies are consistently available. This number is configurable as a volume property, and is typically set to three. If a MapR node goes down, MapR will replicate the blocks that had been on that server onto other servers in the cluster to maintain the configured number of block copies. For example, if a server with 12 TB of data fails, that 12 TB of data will be replicated onto other servers, generating 12 TB of traffic within the cluster. The failure of a non-redundant TOR (Top of Rack) switch will generate even more replication traffic. Make sure your network, whether 1GbE or 10GbE, has the capacity to handle block replication for your server configurations in case of failure.

I/O performance
Each node has a certain number of Map/Reduce slots available for processing Hadoop tasks. Each slot operates on one block of data at a time. The more disks you have, the less likely it is that you will have multiple tasks accessing a given disk at the same time. This avoids thrashing the disk heads and incurring the resulting I/O performance degradation.
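To get a feel for the re-replication example in the Redundancy discussion, the sketch below estimates a lower bound on the time needed to move 12 TB across the cluster network. It is an illustration under stated assumptions: decimal terabytes, traffic spread evenly over a hypothetical number of parallel links, and links running at full line rate with no protocol overhead. Real re-replication will take longer.

```python
def rereplication_hours(data_tb, link_gbps, links=1, efficiency=1.0):
    """Lower-bound hours to move data_tb terabytes over `links` parallel links.

    Assumptions: 1 TB = 10**12 bytes, and each link sustains
    link_gbps * efficiency gigabits per second for the whole transfer.
    """
    bits = data_tb * 1e12 * 8
    bits_per_second = link_gbps * 1e9 * links * efficiency
    return bits / bits_per_second / 3600


# 12 TB funnelled through a single link, as a worst case.
print(round(rereplication_hours(12, 1), 1))    # 26.7 hours on 1GbE
print(round(rereplication_hours(12, 10), 1))   # 2.7 hours on 10GbE
```

Raising `links` models the traffic being shared across many node-to-node paths, which is the usual case when the surviving replicas are spread over the cluster.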

Customers can choose to use SATA or SAS Midline (MDL) disks in a MapR Cluster. SAS MDL disks are preferred by customers who want disks that are more efficient at driving higher I/O throughput across a larger number of disks, delivering a higher performing cluster. SAS MDL disks are recommended over SAS Enterprise (ENT) disks in a MapR cluster because the SAS ENT disks deliver only marginally higher performance at a higher cost with lower storage capacity.

Network
Configuring only a single Top of Rack (TOR) switch per rack introduces a single point of failure for each rack. In a multi-rack system such a failure will result in a large amount of network traffic as Hadoop re-replicates data, and in a single-rack system such a failure will bring down the whole cluster. Consequently, configuring two TOR switches per rack is recommended for all production configurations as it provides an additional measure of redundancy. This can be further improved by configuring link aggregation between the switches.

In order to balance network bandwidth against disk I/O bandwidth, MapR recommends 2-4 1GbE NICs per node or GbE NICs per node to satisfy the disk throughput requirements. MapR auto detects multiple NICs per node and enables application layer bonding for maximum throughput. Larger clusters with three or more racks will benefit from having TOR switches connected by 10GbE uplinks to core aggregation switches.

During the map phase of Hadoop jobs that utilize the FileServer, the majority of tasks reference data on the server that executes the task (node-local). For those tasks that must access data remotely, the data is usually on other servers in the same rack (rack-local). Only a small percentage of tasks need to access data from remote racks. The shuffle and reduce phases, in contrast, can drive very high levels of network traffic for some kinds of workloads.
TOR switches with deep buffering are recommended to minimize the effects of switch congestion during periods of high network traffic. In particular, it is important to avoid lost packets, since packet loss causes servers to decrease their TCP window sizes, leading to lower throughput.

Switches
MapR clusters contain two types of switches, namely Aggregation switches and Top of Rack switches. Top of Rack switches route the traffic between the nodes in each rack and Aggregation switches route the traffic between the racks.

Aggregation switches
The HP 5900AF-48XG-4QSFP switch is an ideal aggregation switch as it offers exceptional scalability for Hadoop cluster expansion with its 10GbE ports. For more information on the HP 5900AF-48XG-4QSFP, please see the 5900AF QuickSpecs. The configuration for the HP 5900AF-48XG-4QSFP switch is provided below.

Figure 2. HP 5900AF-48XG-4QSFP Aggregation switch

Table 4. HP 5900AF-48XG-4QSFP Single Aggregation Switch options
Qty | Description
1 | HP 5900AF-48XG-4QSFP Switch
2 | HP 58x0AF 650W AC Power Supply
2 | HP 58x0AF Front (port-side) to Back (power-side) Airflow Fan Tray
4 | HP X140 40G QSFP MPO SR4 Transceiver
48 | HP X130 10G SFP+ LC SR Transceiver
48 | HP X240 40G QSFP 5m Direct Attach Copper Cable

Top of Rack (TOR) switches
The HP 5830AF-48G is an ideal TOR switch. It has a 1 GB buffer for very deep buffering, better HA support, two 10GbE uplinks, forty-eight 1GbE ports and the option of adding two more 10GbE ports. A dedicated management switch for iLO traffic is not required, as the ProLiant DL380e servers are able to share iLO traffic over the first NIC. The volume of iLO traffic is minimal and does not degrade performance over that port. For more information on the HP 5830AF-48G switch, please see the 5830AF QuickSpecs.

This switch is rear-facing, in that the cables for the switch are connected on the same side of the rack as the cables that are connected to the NICs at the back of the DL380e servers.

Figure 3. HP 5830AF-48G Top of Rack (TOR) switch

The configuration for the HP 5830AF-48G switch is provided below.

Table 5. HP 5830AF-48G Switch options
Qty | Description
1 | HP 5830AF-48G Switch with 1 Interface Slot
2 | HP 58x0AF 650W AC Power Supply
1 | HP 5500/ port 10GbE SFP+ Module
1 | HP 5830AF-48G Back(power)-Front(prt) Fan Tray

Data movement into and out of the MapR cluster
It is best to isolate the MapR M5 cluster on its own private network in order to ensure that external network traffic does not collide with that of the MapR network. However, one still needs to ensure that data can be moved into and out of the MapR cluster from external networks. A MapR cluster can be mounted by another system as a Network File System (NFS) share. To enable the import and export of data to and from a MapR cluster, we recommend configuring the aggregation switches to allow NFS traffic from external networks.

HP Insight Cluster Management utility
HP Insight Cluster Management Utility (CMU) is an efficient and robust hyperscale cluster lifecycle management framework and suite of tools for large Linux clusters such as those found in High Performance Computing (HPC) and Big Data environments. A simple graphical interface enables an at-a-glance view of the entire cluster across multiple metrics, provides frictionless scalable remote management and analysis, and allows rapid provisioning of software to all the nodes of the system. Insight CMU makes the management of a cluster more user friendly, efficient, and error free than if it were being managed by scripts, or on a node-by-node basis. Insight CMU offers full support for iLO 2, iLO 3, iLO 4 and LO100i adapters on all HP ProLiant servers in the cluster.

HP Insight CMU allows you to easily correlate MapR metrics with cluster infrastructure metrics, such as CPU Utilization, Network Transmit/Receive, Memory Utilization and I/O Read/Write. This allows you to characterize your MapR workloads and optimize the system, thereby improving the performance of the MapR Cluster. CMU TimeView Metric Visualizations will help you understand, based on your workloads, whether your cluster needs more memory, a faster network or processors with faster clock speeds. Insight CMU also greatly simplifies the deployment of MapR, with its ability to create a Golden Image from a node and then deploy that image to up to 4000 nodes. Insight CMU is able to deploy 800 nodes in 30 minutes.

Insight CMU is highly flexible and customizable, offers both GUI and CLI interfaces, and is being used to deploy a range of software environments, from simple compute farms to highly customized, application-specific configurations. Insight CMU is available for HP ProLiant and HP BladeSystem servers with Linux operating systems, including Red Hat Enterprise Linux, SUSE Linux Enterprise, CentOS, and Ubuntu.
Insight CMU also includes options for monitoring Graphics Processing Units (GPUs) and for installing GPU drivers and software. For more information, please see hp.com/go/cmu.

Table 6. HP Insight CMU options
Qty | Description
1 | HP Insight CMU 1yr 24x7 Flex Lic
1 | HP Insight CMU 1yr 24x7 Flex E-LTU
1 | HP Insight CMU 3yr 24x7 Flex Lic
1 | HP Insight CMU 3yr 24x7 Flex E-LTU
1 | HP Insight CMU Media

Figure 4. HP Insight CMU Interface

Figure 5. HP Insight CMU Interface Time View

Server selection
Depending on the size of the cluster, a MapR M5 deployment consists of one or more nodes running control services and one or more nodes running worker services. With highly available, distributed control services, MapR does not require servers with hardware high availability features like RAID. This allows MapR to be run in a fully homogeneous server deployment. However, for smaller cluster deployments, RAID protection on the root partition is recommended, since a server failure will have a greater impact than in a larger cluster.

This section specifies which server to use and the rationale behind it. The Reference Architectures section will provide topologies for the deployment of control and worker services across the nodes for clusters of varying sizes.

Processor configuration
MapR manages the amount of work each server is able to undertake via the number of Map/Reduce slots configured for that server. The more cores available to the server, the more Map/Reduce slots can be configured for the server (see the Computation section for more detail). We recommend 6 core processors for a good balance of price and performance. We recommend that Hyper-Threading is turned on.

Drive configuration
Redundancy is built into the MapR architecture, so there is no need for RAID or additional hardware components to improve redundancy on the server; it is all coordinated and managed in the MapR software. Drives should use a Just a Bunch of Disks (JBOD) configuration, which can be achieved with the HP P420 RAID controller by configuring each individual disk as a separate RAID 0 volume. We recommend disabling array acceleration on the controller to better handle large block I/Os in the Hadoop environment. Lastly, servers should provide a large amount of storage capacity, which increases the total capacity of the distributed file system, and should provide that capacity with at least twelve 2TB Large Form Factor drives for optimum I/O performance. The DL380e supports 14 Large Form Factor (LFF) drives, which allows one to either use all 14 drives for data or use 12 drives for data and the additional 2 for mirroring the operating system and MapR runtime. Hot pluggable drives are recommended so that drives can be replaced without restarting the server.

Memory configuration
Servers running the node processes should have sufficient memory for either HBase or for the number of Map/Reduce slots configured on the server.
A server with a larger RAM configuration will deliver optimum performance for both HBase and Map/Reduce. To ensure optimal memory performance and bandwidth, we recommend using 8GB or 16GB DIMMs to populate each of the 6 memory channels as needed.

Network configuration
The DL380e includes four 1GbE NICs onboard. MapR automatically identifies the available NICs on the server and bonds them via the MapR software to increase throughput. Each of the reference architecture configurations below specifies an additional Top of Rack switch for redundancy. To make the best use of this, we recommend cabling the ProLiant DL380e worker nodes so that NIC 1 is cabled to Switch 1 and NIC 2 is cabled to Switch 2, repeating the same process for NICs 3 and 4. Each NIC in the server should have its own IP subnet instead of sharing the same subnet with other NICs.

HP ProLiant DL380e Gen8
The HP ProLiant DL380e Gen8 (2U) is an excellent choice as the server platform for the worker nodes.

Figure 6. HP ProLiant DL380e Gen8 Server
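The cabling scheme described under Network configuration (NICs alternating between the two TOR switches, one IP subnet per NIC) can be written down as a small plan generator. This is only a sketch: the function name and the `10.10.0.0/22` address block are hypothetical placeholders, not values from this paper.

```python
import ipaddress


def cabling_plan(nic_count=4, switch_count=2, base="10.10.0.0/22"):
    """Return (nic, switch, subnet) tuples for one worker node.

    NICs alternate across the switches, and each NIC gets its own /24
    carved out of `base` (a hypothetical private address block).
    """
    subnets = list(ipaddress.ip_network(base).subnets(new_prefix=24))
    plan = []
    for nic in range(1, nic_count + 1):
        switch = (nic - 1) % switch_count + 1
        plan.append((nic, switch, str(subnets[nic - 1])))
    return plan


for nic, switch, subnet in cabling_plan():
    print(f"NIC {nic} -> Switch {switch}, subnet {subnet}")
```

With the defaults, NICs 1 and 3 land on Switch 1 and NICs 2 and 4 on Switch 2, so the loss of one switch leaves every node with two working links.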

The recommended DL380e configuration includes:
- Dual Six-Core Intel Xeon E 2.4GHz Processors with 15M L3 Cache and Hyper-Threading enabled
- Twelve 2TB 7.2K LFF (3.5 inch) SATA MDL or SAS MDL disks (24 TB for Data)
- Two 500GB SATA 7.2K LFF MDL (Mirrored OS and Runtime)
- 1 x Smart Array P420 Controller
- 48 GB (6 x HP 8GB DDR3) Memory
- 4 x 1GbE NICs

The DL380e has 12 hot swappable drives in the front of the server and 2 at the back of the server. The description above reflects a server that has the front 12 drives allocated for data in a JBOD configuration and the 2 drives at the back used to mirror the operating system and MapR runtime. This provides a very low cost approach to increasing the resiliency of each of the worker nodes. Customers also have the option of not mirroring the OS and MapR runtime and using all 14 of the drives for data. In addition, customers have the option of purchasing a second power supply and fans for additional power redundancy.

The Bill of Materials (BOM) for the server is provided below.

Table 7. The MapR DL380e server configuration
Qty | Description
1 | HP ProLiant DL380e Gen8 12LFF CTO Base Server
1 | HP 2U LFF BB Rail Gen8 Kit
1 | Intel Xeon E (2.4GHz/6-core/15MB/95W) FIO Kit - CPU
1 | Intel Xeon E (2.4GHz/6-core/15MB/95W) Additional CPU
1 | HP DL380e Gen8 CPU1 Riser Kit
6 | HP 8GB 2Rx4 PC3L-10600R-9 (1333MHz) - Memory
1 | HP 2U Gen8 Rear 2LFF Kit
2 | HP 500GB 6G SATA 7.2k 3.5in SC MDL HDD
1 | HP Ethernet 1Gb 4-port 331T Adapter
1 | HP Smart Array P420/1GB FBWC Controller
2 | HP 750W CS Gold Ht Plg Pwr Supply Kit
1 | HP DL380eGen8 HP Fan Kit
12 | HP 2TB 6G SATA 7.2k 3.5in SC MDL HDD

Reference Architectures
This section provides a progression of configurations from single rack to full scale out MapR cluster configurations. Best practices for each of the components within the configurations specified have been articulated earlier in this document.
Single Rack configuration
The Single Rack MapR M5 configuration provides a starting point for a MapR M5 cluster configuration. The configuration reflects the following components.

Rack enclosure
The rack contains nineteen HP ProLiant DL380e servers and two HP 5830AF-48G switches within a 42U rack enclosure. This leaves 2U open for an additional 2U DL380e server or a 1U KVM switch.

Network
As previously described in the Switches section, two HP 5830AF-48G switches are specified for performance and redundancy. The HP 5830AF-48G includes up to four 10GbE uplinks which can be used to connect the switches in the rack into the desired network.

MapR nodes
The ProLiant servers in the rack act as nodes in the MapR cluster and are divided into nodes that run control services and nodes that run exclusively worker services. In the diagram for this configuration (see Figure 8 below), the nodes that run control services are labeled with a number, denoting Node 1, Node 2, and Node 3. It is worth noting that the worker services and control services can be placed on any nodes in the rack and cluster.

Nodes running control services
To provide high availability (HA), it is recommended to configure multiple instances of the control services in the rack. One should have at least 3 ZooKeeper, 2 to 3 CLDB, 2 to 3 JobTracker, and 2 Web Server services for the Single Rack configuration.
We recommend configuring them as follows:

Node 1 runs the following control services:
- Container Location Database (CLDB)
- JobTracker (or HBaseMaster)
- Web Server
- ZooKeeper

Node 2 runs the following control services:
- Container Location Database (CLDB)
- JobTracker (or HBaseMaster)
- Web Server
- ZooKeeper

Node 3 runs the following control services:
- Container Location Database (CLDB)
- JobTracker (or HBaseMaster)
- ZooKeeper
- HP Insight CMU

In addition, each of the nodes listed above runs the following worker services:
- FileServer
- Network File System (NFS)
- TaskTracker (or HBaseRegionServer if you are using HBase)

To ensure that the worker services do not consume the resources required for the control services, we recommend halving the number of configured Map/Reduce slots (see the Computation section) on the nodes running the control services. This means reducing Map slots from 75% to 50% of the total number of Hyper-Threaded cores and Reduce slots from 50% to 25%, which gives 12 Map slots and 6 Reduce slots per server.
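The layout above can be captured as data and tallied against the recommended instance counts, together with the halved slot ratios for control nodes. This is a minimal sketch: the dictionary keys, short service labels and function names are hypothetical, and truncating to whole slots is an assumption.

```python
from collections import Counter

# Control services per node, as listed for the Single Rack configuration.
# HP Insight CMU also runs on Node 3 but is not a MapR service, so it is
# left out of this tally.
CONTROL_LAYOUT = {
    "node1": ["CLDB", "JobTracker", "WebServer", "ZooKeeper"],
    "node2": ["CLDB", "JobTracker", "WebServer", "ZooKeeper"],
    "node3": ["CLDB", "JobTracker", "ZooKeeper"],
}


def service_counts(layout):
    """Count how many nodes run each control service."""
    return Counter(s for services in layout.values() for s in services)


def control_node_slots(logical_cores, map_ratio=0.50, reduce_ratio=0.25):
    """Halved slot ratios for nodes that also run control services."""
    return int(logical_cores * map_ratio), int(logical_cores * reduce_ratio)


print(dict(service_counts(CONTROL_LAYOUT)))
print(control_node_slots(24))   # (12, 6)
```

The tally gives three CLDB, three JobTracker, three ZooKeeper and two Web Server instances, which falls within the ranges recommended for a single rack.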

Nodes running worker services
After the nodes running the control services are configured, there are up to 16 nodes in the rack that can be configured to run worker services. Keep in mind that it is not mandatory to fully populate the rack with nodes running worker services; however, both performance and storage capacity will improve the more you add. The following worker services run on the remaining nodes:
- FileServer
- Network File System (NFS)
- TaskTracker (or HBaseRegionServer if you are using HBase)

Figure 7. Single Rack MapR M5 Configuration
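The rack arithmetic for the Single Rack configuration can be reproduced as a rough sizing sketch. The 1U switch height is an assumption inferred from the statement that 2U remain free, and the usable capacity figure simply divides raw data capacity by the typical replication factor of three, ignoring file system overhead, temporary space and compression.

```python
def rack_summary(servers=19, server_u=2, switches=2, switch_u=1, rack_u=42,
                 data_tb_per_server=24, replication=3):
    """Rack space and storage estimate for one fully populated rack.

    Assumptions: 1U switches, 24 TB of data disks per server (twelve 2TB
    drives), and usable capacity = raw capacity / replication factor.
    """
    used_u = servers * server_u + switches * switch_u
    raw_tb = servers * data_tb_per_server
    return {
        "free_u": rack_u - used_u,
        "raw_tb": raw_tb,
        "usable_tb": raw_tb / replication,
    }


print(rack_summary())
```

Under these assumptions a full rack has 2U free, 456 TB of raw data capacity and roughly 152 TB of capacity after three-way replication.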

Figure 8. Single Rack MapR M5 Configuration

Medium Cluster configuration
The Medium Cluster MapR M5 solution provides a configuration that allows for full scale out without additional changes to the configuration. The configuration reflects the following components:

Rack
This design contains two racks with identical infrastructure configurations but different software configurations. Each rack contains nineteen HP ProLiant DL380e servers and two HP 5830AF-48G switches within a 42U rack enclosure. This leaves 2U open for an additional 2U DL380e server or a 1U KVM switch.

Network
As previously described in the Switches section, two HP 5830AF-48G TOR switches are specified per rack for performance and redundancy. The HP 5830AF-48G includes up to four 10GbE uplinks which can be used to connect the TOR switches into the 5900AF-48XG-4QSFP aggregation switch. The aggregation switch can then be connected to your network of choice.

MapR nodes
The ProLiant servers in the racks act as nodes in the MapR cluster and are divided into nodes that run control services and nodes that run exclusively worker services. The control nodes can also run some worker services if resources permit. In the diagram for this configuration (see Figure 10 below), the nodes that run control services are labeled with a number, denoting Node 1, Node 2, Node 3, Node 4, and Node 5.

Nodes running control services
A minimum of three control nodes is required to provide HA, and the control nodes are spread across both racks to ensure that the services stay available in the event that an entire rack goes offline. It is recommended to have 3 ZooKeeper, 3 CLDB, 2 JobTracker, and 2 Web Server services in the Medium Cluster configuration. Rack 1 contains two nodes that run control services and Rack 2 contains three nodes that run control services. We recommend configuring these nodes as follows:

Node 1 runs the following control services:
- Container Location Database (CLDB)
- Web Server

Node 2 runs the following control services:
- Container Location Database (CLDB)
- Web Server
- ZooKeeper

Node 3 runs the following control services:
- Container Location Database (CLDB)
- Web Server

Node 4 runs the following control services:
- JobTracker (or HBaseMaster)
- ZooKeeper
- HP Insight CMU

Node 5 runs the following control services:
- JobTracker (or HBaseMaster)
- ZooKeeper

In addition, each of the nodes listed above runs the following worker services:
- FileServer
- Network File System (NFS)
- TaskTracker (or HBaseRegionServer if you are using HBase)

To ensure that the worker services do not consume the resources required for the control services, we recommend halving the number of configured Map/Reduce slots (see the Computation section) on the nodes running the control services. This means reducing Map slots from 75% to 50% of the total number of Hyper-Threaded cores and Reduce slots from 50% to 25%, which gives 12 Map slots and 6 Reduce slots per server.

Nodes running worker services
The rest of the nodes in each rack, which do not run control services, run worker services exclusively. Keep in mind that it is not mandatory to fully populate the rack with nodes running worker services; however, both performance and storage capacity will improve the more you have.
The following worker services run on the remaining nodes:
- FileServer
- Network File Share (NFS)
- TaskTracker (or HBaseRegionServer if you are using HBase)
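The slot limits recommended for nodes running control services can be worked through as follows. This is an illustrative sketch only: the function name is hypothetical, and the core count of 24 is an assumption inferred from the figures in the text (12 map slots at 50% of the cores), not a value stated in this section.

```python
def slots_for_node(total_cores, map_percent, reduce_percent):
    """Return (map_slots, reduce_slots) as percentages of the core count."""
    map_slots = total_cores * map_percent // 100
    reduce_slots = total_cores * reduce_percent // 100
    return map_slots, reduce_slots


# Assumption: 24 cores per node, inferred from "12 map slots" at 50%.
TOTAL_CORES = 24

# Nodes running only worker services: 75% map, 50% reduce.
worker_only = slots_for_node(TOTAL_CORES, 75, 50)

# Nodes that also run control services: 50% map, 25% reduce.
control = slots_for_node(TOTAL_CORES, 50, 25)

print(worker_only)  # (18, 12)
print(control)      # (12, 6)
```

Under that assumption, the reduced percentages reproduce the 12 map slots and 6 reduce slots given in the text.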

Figure 9. Medium Cluster MapR M5 Configuration

Figure 10. Medium Cluster MapR M5 Configuration
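The control-service placement shown in the diagram and listed per node earlier can also be written down as data and checked against the recommended service counts. The sketch below is one possible way to do that. The dictionary and function names are hypothetical, and treating the recommended counts as minimums is an assumption made for the check.

```python
from collections import Counter

# Control services per node, as listed in the Medium Cluster configuration.
CONTROL_NODES = {
    "Node 1": ["CLDB", "Web Server"],
    "Node 2": ["CLDB", "Web Server", "ZooKeeper"],
    "Node 3": ["CLDB", "Web Server"],
    "Node 4": ["JobTracker", "ZooKeeper", "HP Insight CMU"],
    "Node 5": ["JobTracker", "ZooKeeper"],
}

# Recommended service counts for the Medium Cluster configuration.
# Assumption: these are treated as minimums for the purpose of this check.
RECOMMENDED = {"ZooKeeper": 3, "CLDB": 3, "JobTracker": 2, "Web Server": 2}


def service_counts(layout):
    """Count how many nodes run each control service."""
    return Counter(svc for services in layout.values() for svc in services)


def shortfalls(layout, recommended):
    """Return {service: missing_instances} for services below the target."""
    counts = service_counts(layout)
    return {svc: need - counts[svc]
            for svc, need in recommended.items()
            if counts[svc] < need}


print(shortfalls(CONTROL_NODES, RECOMMENDED))  # {}
```

An empty result means every recommended service has at least the recommended number of instances in the layout.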

Large Cluster configuration

The Large Cluster design assumes the Medium Cluster design is already in place and extends its scalability. The Medium Cluster design ensures that the required number of control services is in place for large scale-out. For large clusters, one simply adds more racks of the configuration provided below to the Medium Cluster design. This section reflects the design of those racks.

Rack enclosure
The rack contains nineteen HP ProLiant DL380e servers and two HP 5830AF-48G switches within a 42U rack enclosure. This leaves 2U open for an additional 2U DL380e server or a 1U KVM switch.

Network
As previously described in the Switches section, two HP 5830AF-48G switches are specified for performance and redundancy. The HP 5830AF-48G includes up to four 10GbE uplinks which can be used to connect the switches in the rack into an aggregation switch such as the HP 5900AF-48XG-4QSFP and the desired network.

MapR nodes
The ProLiant servers in the rack act as nodes in the MapR cluster. Since the Medium Cluster configuration already has the required control services in place, only worker services are needed on the additional nodes. These are:
- FileServer
- Network File Share (NFS)
- TaskTracker (or HBaseRegionServer if you are using HBase)

Figure 11. Large Cluster MapR M5 Configuration
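Because the Large Cluster design grows by adding identical racks to the two-rack Medium Cluster, the overall size follows directly from the number of racks added. The sketch below illustrates the arithmetic. The function name is hypothetical, and it assumes every rack is fully populated with nineteen servers, which the text notes is not mandatory.

```python
SERVERS_PER_RACK = 19       # HP ProLiant DL380e servers per rack
TOR_SWITCHES_PER_RACK = 2   # HP 5830AF-48G switches per rack
MEDIUM_CLUSTER_RACKS = 2    # the Medium Cluster design uses two racks
CONTROL_NODES = 5           # Node 1 to Node 5 in the Medium Cluster


def cluster_size(extra_racks):
    """Summarize a Large Cluster built from the Medium Cluster plus extra racks.

    Assumes every rack is fully populated with nineteen servers.
    """
    if extra_racks < 0:
        raise ValueError("extra_racks must not be negative")
    racks = MEDIUM_CLUSTER_RACKS + extra_racks
    servers = racks * SERVERS_PER_RACK
    return {
        "racks": racks,
        "servers": servers,
        "worker_only_nodes": servers - CONTROL_NODES,
        "tor_switches": racks * TOR_SWITCHES_PER_RACK,
    }


print(cluster_size(3))
# {'racks': 5, 'servers': 95, 'worker_only_nodes': 90, 'tor_switches': 10}
```

The number of control nodes stays fixed at five as racks are added, since the additional racks run worker services only.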

Figure 12. Large Cluster MapR M5 Configuration

Edge Nodes

Edge Nodes are optional servers that are typically multi-homed into both the MapR private network and a network accessible by users. This allows the Edge Node to run applications that submit jobs to the MapR cluster while keeping the application interfaces accessible to users within the user network. The Edge Node can be used as a staging area where data from other systems located in the external network is stored temporarily before being migrated to the MapR filesystem. Another way to use an Edge Node is to install and configure a third-party BI application for data analytics. This gives users a tool that interacts with MapR to analyze and visualize their data.

HP ProLiant DL360p Gen8
The HP ProLiant DL360p Gen8 (1U) is an excellent choice as an Edge Node for MapR.

Figure 13. HP ProLiant DL360p Gen8 Server

The following base configuration is popular for Edge Nodes:
- Dual Six-Core Intel E GHz Processors
- Smart Array P420i Controller
- Eight 900GB SFF SAS 10K RPM disks
- 64 GB DDR3 Memory
- 4 x 1GbE FlexibleLOM NICs

Bill of materials

Table 8. The HP ProLiant DL360p Gen8 Server Configuration

Qty  Description
1    HP DL360p Gen8 8-SFF CTO Chassis
1    HP DL360p Gen8 E FIO Kit
1    HP DL360p Gen8 E Kit
8    HP 8GB 1Rx4 PC R-11 Kit
8    HP 900GB 6G SAS 10K 2.5in SC ENT HDD
1    HP Ethernet 1GbE 4P 331FLR FIO Adapter
1    HP 512MB FBWC for P-Series Smart Array
2    HP 460W CS Gold Hot Plug Power Supply Kit
1    HP 1U SFF BB Gen8 Rail Kit
1    ProLiant DL36x(p) HW Support

Summary

HP and MapR allow one to derive new business insights from Big Data by providing a platform to store, manage and process data at scale. However, designing and ordering Hadoop clusters can be both complex and time consuming. This white paper provides several reference configurations for deploying clusters of varying sizes on MapR's M5 distribution of Apache Hadoop and HP infrastructure. These configurations leverage HP's balanced building blocks of servers, storage and networking, along with integrated management software and bundled support. In addition, this white paper has been created to assist in the rapid design and deployment of MapR M5 software on HP infrastructure for clusters of various sizes.

For more information

- MapR, mapr.com
- Hadoop on HP, hp.com/go/hadoop
- HP Insight Cluster Management Utility (CMU), hp.com/go/cmu
- HP ProLiant DL380e Gen8, hp.com/servers/dl380e
- HP ProLiant servers, hp.com/go/proliant
- HP Enterprise Software, hp.com/go/software
- HP Networking, hp.com/go/networking
- HP Integrated Lights-Out (iLO) Advanced, hp.com/servers/ilo
- HP Product Bulletin (QuickSpecs), hp.com/go/quickspecs
- HP Services, hp.com/go/services
- HP Support and Drivers, hp.com/go/support
- HP Systems Insight Manager (HP SIM), hp.com/go/hpsim

To help us improve our documents, please provide feedback at hp.com/solutions/feedback.

Sign up for updates: hp.com/go/getupdated

Copyright 2012, 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Intel and Xeon are trademarks of Intel Corporation in the U.S. and other countries.

4AA4-2434ENW, January 2013, Rev. 1

More information

HP Intelligent Management Center Standard Software Platform

HP Intelligent Management Center Standard Software Platform Data sheet HP Intelligent Management Center Standard Software Platform Key features Highly flexible and scalable deployment Powerful administration control Rich resource management Detailed performance

More information

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads Solution Overview Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads What You Will Learn MapR Hadoop clusters on Cisco Unified Computing System (Cisco UCS

More information

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x HPE Vertica Analytic Database Software Version: 7.2.x Document Release Date: 2/4/2016 Legal Notices Warranty The only warranties for Hewlett Packard Enterprise products and services are set forth in the

More information

Configuration Guide February 2015

Configuration Guide February 2015 Configuration Guide February 2015 1 For the latest version of this document, please go to: http://www.overlandstorage.com//pdfs/snapserver_xsr_config_guide.pdf 2015 Overland Storage. All trademarks and

More information

Evaluation Report: HP Blade Server and HP MSA 16GFC Storage Evaluation

Evaluation Report: HP Blade Server and HP MSA 16GFC Storage Evaluation Evaluation Report: HP Blade Server and HP MSA 16GFC Storage Evaluation Evaluation report prepared under contract with HP Executive Summary The computing industry is experiencing an increasing demand for

More information

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

SAN TECHNICAL - DETAILS/ SPECIFICATIONS SAN TECHNICAL - DETAILS/ SPECIFICATIONS Technical Details / Specifications for 25 -TB Usable capacity SAN Solution Item 1) SAN STORAGE HARDWARE : One No. S.N. Features Description Technical Compliance

More information

Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers

Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers WHITE PAPER FUJITSU PRIMERGY AND PRIMEPOWER SERVERS Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers CHALLENGE Replace a Fujitsu PRIMEPOWER 2500 partition with a lower cost solution that

More information

UCS M-Series Modular Servers

UCS M-Series Modular Servers UCS M-Series Modular Servers The Next Wave of UCS Innovation Marian Klas Cisco Systems June 2015 Cisco UCS - Powering Applications at Every Scale Edge-Scale Computing Cloud-Scale Computing Seamlessly Extend

More information

FLOW-3D Performance Benchmark and Profiling. September 2012

FLOW-3D Performance Benchmark and Profiling. September 2012 FLOW-3D Performance Benchmark and Profiling September 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: FLOW-3D, Dell, Intel, Mellanox Compute

More information

HP Systems Insight Manager 7.0 and HP Agentless Management overview

HP Systems Insight Manager 7.0 and HP Agentless Management overview HP Systems Insight Manager 7.0 and HP Agentless Management overview Technology brief Introduction... 2 Overview of HP Agentless Management... 2 Using traditional OS-based management software... 4 Using

More information

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Copyright 2013, Oracle and/or its affiliates. All rights reserved. 1 Oracle SPARC Server for Enterprise Computing Dr. Heiner Bauch Senior Account Architect 19. April 2013 2 The following is intended to outline our general product direction. It is intended for information

More information

Deployment Guide. How to prepare your environment for an OnApp Cloud deployment.

Deployment Guide. How to prepare your environment for an OnApp Cloud deployment. Deployment Guide How to prepare your environment for an OnApp Cloud deployment. Document version 1.07 Document release date 28 th November 2011 document revisions 1 Contents 1. Overview... 3 2. Network

More information

QuickSpecs. What's New. At A Glance. Models. HP StorageWorks SB40c storage blade. Overview

QuickSpecs. What's New. At A Glance. Models. HP StorageWorks SB40c storage blade. Overview DA - 12587 Worldwide Version 5 2.16.2007 Page 1 Overview The HP StorageWorks SB40c is a half-height c-class storage blade supporting up to six hot-plug Small Form Factor (SFF) Serial Attached SCSI (SAS)

More information

HP Intelligent Management Center Basic WLAN Manager Software Platform

HP Intelligent Management Center Basic WLAN Manager Software Platform Data sheet HP Intelligent Management Center Basic WLAN Manager Software Platform Key features Intuitive, easy-to-use interface Unified wired and wireless network management Range of topology management

More information

How To Use An Hp Smart Array P212 (Sas) With A 1-Year Warranty On A 2 Terabyte Hard Drive (Sata) And A 1 Terabyte (Sasa) Hard Drive With A 2-Year License (S

How To Use An Hp Smart Array P212 (Sas) With A 1-Year Warranty On A 2 Terabyte Hard Drive (Sata) And A 1 Terabyte (Sasa) Hard Drive With A 2-Year License (S Overview The HP Smart Array P212 is HP's entry level PCI Express (PCIe) Serial Attached SCSI (SAS) RAID controller. The low profile, half height card has 8 ports and utilizes DDR2-800 memory. The P212

More information

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x HP Vertica Analytic Database Software Version: 7.1.x Document Release Date: 10/14/2015 Legal Notices Warranty The only warranties for HP products and services are set forth in the express warranty statements

More information

QuickSpecs. What's New. Models. ProLiant Essentials Server Migration Pack - Physical to ProLiant Edition. Overview

QuickSpecs. What's New. Models. ProLiant Essentials Server Migration Pack - Physical to ProLiant Edition. Overview Overview Upgrading or replacing your existing server? Migration is now an option! Replicate the server you are replacing using the software, the only product of its kind from a server vendor that provides

More information

Implementing the HP Cloud Map for SAS Enterprise BI on Linux

Implementing the HP Cloud Map for SAS Enterprise BI on Linux Technical white paper Implementing the HP Cloud Map for SAS Enterprise BI on Linux Table of contents Executive summary... 2 How to utilize this HP CloudSystem Matrix template... 2 Download the template...

More information

HP Insight Control for Microsoft System Center integration overview

HP Insight Control for Microsoft System Center integration overview HP Insight Control for Microsoft System Center integration overview Integration note, 2 nd edition Introduction... 3 Integrating Insight Control with System Center... 3 Licensing... 5 Complementary functionality

More information

Virtualizing SQL Server 2008 Using EMC VNX Series and Microsoft Windows Server 2008 R2 Hyper-V. Reference Architecture

Virtualizing SQL Server 2008 Using EMC VNX Series and Microsoft Windows Server 2008 R2 Hyper-V. Reference Architecture Virtualizing SQL Server 2008 Using EMC VNX Series and Microsoft Windows Server 2008 R2 Hyper-V Copyright 2011 EMC Corporation. All rights reserved. Published February, 2011 EMC believes the information

More information

Virtualizing Microsoft Exchange 2010 with HP StoreVirtual 4500 G2 and VMware vsphere 5.0

Virtualizing Microsoft Exchange 2010 with HP StoreVirtual 4500 G2 and VMware vsphere 5.0 Technical white paper Virtualizing Microsoft Exchange 2010 with HP StoreVirtual 4500 G2 and VMware vsphere 5.0 Table of contents Executive summary... 2 Introduction... 2 Benefits of virtualization... 2

More information