ORACLE BIG DATA APPLIANCE X4-2 BIG DATA FOR THE ENTERPRISE OPEN, SECURE AND INTEGRATED KEY FEATURES Massively scalable, open infrastructure to store and manage big data Industry-leading security, performance and the most comprehensive big data tool set on the market all bundled in an easy to deploy appliance. Big Data Connectors delivers load rates of up to 15TB per hour between Big Data Appliance and Oracle Exadata Cloudera s comprehensive software suite including Cloudera Distribution including Apache Hadoop (CDH) delivers managed and proven Hadoop components to the enterprise Oracle Enterprise Manager combined with Cloudera Manager simplifies management of the entire Big Data Appliance Advanced analytics with Oracle R on Hadoop data Handle low-latency unstructured workloads with the pre-installed and configured Oracle NoSQL Database Community Edition InfiniBand connectivity between nodes and across appliances as well as to Oracle Exadata Flexible configuration choices for optimizing both floor space and growth path for Hadoop and Oracle NoSQL Database KEY BENEFITS Optimized, Complete and Secure Big Data Solution Most comprehensive big data tool set integrated in a single appliance Integrated with Oracle Exadata to analyze all your data Risk-free installation and rapid time to value Simplified operations, updates and patch management though a single Oracle Big Data Appliance X4-2 is a comprehensive Big Data platform, engineered for secure data processing with a low overall total cost of ownership. It is optimized for both batch and real-time processing utilizing Cloudera s Distribution for Apache Hadoop, Oracle NoSQL Database, Cloudera Impala and Cloudera Search to satisfy diverse computing requirements. Built using industry-standard hardware from Sun, Big Data Appliance X4-2 delivers the perfect balance between compute power, I/O bandwidth and memory footprint offering 33% more storage capacity than the previous generation appliance. Big Data Appliance X4-2 provides a highly optimized platform with integrated management capabilities that allows you to derive value quickly with lower risk. Comprehensive Big Data Platform Oracle Big Data Appliance is an open, multi-purpose big data platform. It is optimized to run a diverse set of workloads including batch processing jobs as well as interactive applications. Apache Hadoop s MapReduce framework powers the batch capabilities processing massive volumes of data with linear scalability. There are several options for interactive applications each with their own unique properties. Oracle NoSQL Database is a distributed key-value database. It is designed to be highly available and extremely scalable with predictable levels of throughput and latency. Cloudera Impala provides real-time SQL queries over data stored in HDFS enabling business intelligence tools to access data in Hadoop without requiring MapReduce processing. Finally, Cloudera Search offers full-text interactive search over data stored in HDFS with results delivered using a faceted navigation model. In addition to providing the full Cloudera software platform, Big Data Appliance utilizes Oracle Big Data Connectors to simplify data integration and analytics. Big Data Connectors provide high speed access to data in Hadoop from Oracle Exadata and Oracle Database with data transfer rates in the order of 15 TB/hour. Big Data Connectors also enable integrated, highly scalable analytics to run on Big Data Appliance providing native access to Hadoop data and parallel processing using Oracle R Distribution. Finally, Oracle XQuery for Hadoop is a new capability that enables standard XQuery operations to process and transform documents in various formats (JSON, XML, Avro and others), executing in parallel across the Hadoop cluster. The big data domain is marked by continuous innovation; Big Data Appliance embraces these innovations by providing an open environment without compromising tight integration and enterprise-level support. Organizations are free to deploy external software to support new functionality such as graph analytics, natural language processing and fraud detection to meet the needs of the application. Support for non-oracle components is delivered by their respective support channels and not by Oracle. Big Data Appliance X4-2 Software Integrated Software Oracle Linux 6.4 with Unbreakable Enterprise Kernel Oracle Java JDK 7
command utility of the entire stack (OS, Java, Oracle NoSQL Database and the Cloudera stack) Single Management Console integrating Big Data Appliance hardware and software monitoring Single-vendor support for your entire big data solution covering both hardware and software RELATED PRODUCTS AND SERVICES Oracle Big Data Appliance brings a low risk, highly scalable big data platform to the enterprise. RELATED PRODUCTS The following are related products available from Oracle: Oracle Exadata Oracle Big Data Connectors Oracle NoSQL Database Oracle Exalytics Oracle Business Intelligence Enterprise Edition Oracle Endeca Information Discovery Oracle Data Integrator Oracle Enterprise Manager RELATED SERVICES The following services are available from Oracle Support Services: Advanced Customer Services Product Support Services Consulting Services Oracle University Courses Cloudera Software Cloudera s Distribution including Apache Hadoop (CDH) Impala HBase (as well as support for Accumulo) Search Cloudera Manager including: Cloudera Back-up and Disaster Recovery (BDR) Cloudera Navigator Oracle R Distribution Oracle NoSQL Database Community Edition* Oracle Big Data Appliance Enterprise Manager Plug-In Optional Software (separately licensed) Oracle Big Data Connectors Oracle SQL Connector for Hadoop Oracle Loader for Hadoop Oracle XQuery for Hadoop Oracle R Advanced Analytics for Hadoop Oracle Data Integrator Application Adapter for Hadoop Oracle Audit Vault and Database Firewall for Hadoop Auditing Oracle Data Integrator Oracle NoSQL Database Enterprise Edition * Support for Oracle NoSQL Database Community Edition is not a part of Big Data Appliance. It is a separately purchased component Lower TCO than Do-it-Yourself Hadoop Oracle Big Data Appliance lowers the total cost of ownership of a big data platform when compared to a DIY system. Not only are the costs of an initial deployment lower with Big Data Appliance, but more significantly, so are the ongoing costs of maintenance, optimization and system growth. Big Data Appliance provides unique pricing to dramatically reduce the three to four year TCO when compared to a DIY big data platform. Big Data Appliance bundles the hardware (servers, high-speed networking, power distribution units and peripherals), OS support and subscription costs for the Cloudera software into a single price for the life of the system. A single support license covers both the hardware and the integrated software. Organizations do not want to spend valuable intellectual capital assembling and tuning an optimized Hadoop/NoSQL infrastructure, especially when these resources can be applied to delivering high value business solutions. Big Data Appliance delivers a pre-configured, highly tuned environment out-of-box for Apache Hadoop and Oracle NoSQL Database. This optimized environment enables companies to focus their resources on developing compelling business applications lowering the risk for the solution. Additionally, the pre-tuned environment avoids extensive ramp-up time for new applications due to performance and production issues. Simplified Operations Oracle Enterprise Manager provides a single entry point for managing the entire system both hardware and software providing continuity across other Oracle products in the organization. To provide deep management capabilities for Hadoop, Enterprise Manager enables a context-aware integration with Cloudera Manager. Big Data Appliance simplifies day-to-day operations by providing a simple one-command installation, update, patch and expansion utility Mammoth which enables rapid deployment updates (typically quarterly) to the frequently evolving Hadoop stack without incurring significant downtime. Mammoth also enables Oracle-tested, seamless upgrades 2
between Hadoop versions and automated service management to ensure the best balance between Hadoop Master Nodes and Data Nodes. Big Data Appliance is supported by Oracle, giving organizations a single point of support for their hardware, all integrated software (including all Cloudera software) and any additional Oracle software installed. Comprehensive Security Securing data is critical to Big Data solutions in the enterprise; Big Data Appliance provides strong authentication, authorization and auditing of data in Hadoop out of the box. Strong authentication is provided using Kerberos. This ensures that all users are who they claim to be and that rogue services are not added to the system. Big Data Appliance leverages Apache Sentry (an open-source project of which Oracle is a founding member) to authorize SQL access via tools like Hive and Impala. By delivering and developing Sentry, Oracle delivers Big Data Appliance with the highest data security levels currently available for Hadoop. To ensure security and data access compliance, Big Data Appliance integrates with Oracle Audit Vault and Database Firewall. An Oracle Audit Vault agent is pre-installed on Big Data Appliance to track and audit data access on the Hadoop system. By leveraging Oracle Audit Vault and Database Firewall, all auditing across the organization is consolidated into a single audit repository ensuring a comprehensive view across all data. Flexible Configurations Big Data Appliance is designed to expand as your data and requirements grow. Initial big data implementations may start with Big Data Appliance. This six server rack comes fully equipped with a complete set of switches and power distribution units (PDU) required for a full rack. This allows the appliance to easily and efficiently expand in six node hardware increments to larger configurations using the Oracle Big Data Appliance In-. In addition to upgrading within a rack, multiple racks can be connected using the integrated InfiniBand fabric to form even larger configurations; up to 18 racks can be connected in a non-blocking manner by connecting InfiniBand cables without the need for any external switches. Larger non-blocking configurations are supported with additional external InfiniBand switches, larger blocking network configurations can be supported without additional switches. Big Data Appliance is multitenant; it can be configured as a single cluster or as a set of clusters. This provides the flexibility customers need when deploying development, test and production clusters. Big Data Appliance X4-2 Hardware In- 18 x compute/storage nodes 6 x compute/storage nodes 6 x compute/storage nodes Per Node: 2 x Eight-Core Intel Xeon E5-2650 V2 Processors 64 GB Memory (expandable to 512 GB) 3
Disk Controller HBA with 512MB Battery backed write cache 12 x 4TB 7,200 RPM High Capacity SAS Disks 2 x QDR (40Gb/s) Ports 4 x 10 Gb Ethernet Ports 1 x ILOM Ethernet Port 2 x 32 Port QDR InfiniBand Switch 32 x InfiniBand ports 8 x 10Gb Ethernet ports 1 x 36 Port QDR InfiniBand Switch 36 x InfiniBand Ports Additional Hardware Components included: Ethernet Administration Switch 2 x Redundant Power Distributions Units (PDUs) Spares Kit Included: 42U rack packaging 2 x 4 TB High Capacity SAS disk InfiniBand cables Leverages the leaf switches from the Starter Rack Leverages the spine switch from the Starter Rack Leverages the administration switch, PDUs and base rack from the Leverages the spares kit from the Big Data Appliance X4-2 Expansions Multi-Rack Connection Upgradeability: Field upgrade leveraging either a single (6 nodes) or two (2 x 6 nodes) In-Rack Expansions. Expansion supports multiple generations of hardware In- Up to 18 racks can be connected without requiring additional InfiniBand switches InfiniBand cables to connect 3 racks are included in the rack Spares Kits Additional hardware include with each In-: 6 x Compute node with direct attached storage as shown earlier InfiniBand and Ethernet cables to connect all of the components Memory Expansions Additional optical InfiniBand cables required when connecting 4 or more racks Expand the memory in any number of nodes from 64GB per node to 512GB per node. Big Data Appliance X4-2 Environmental Specificaions Physical Dimensions Height Width Depth 42U, 78.66-1998 mm 23.62-600mm 47.24-1200 mm Weight 1037 Lbs 1400 Lbs 4
Power Cooling Airflow 2 1800 Lbs 1 1 1 4.2 KW 3.0 KW 7.7 KW 5.4 KW 10.0KW 7.0 KW 14,052 BTU/hour 9,836 BTU/hour 26,411 BTU/hour 18,487 BTU/hour 34,142 BTU/hour 23,940 BTU/hour Further Environmental Specifications 676 CFM 473 CFM 1223 CFM 856 CFM 1,573 CFM 1,103 CFM Operating temperature/humidity: 5 ºC to 32 ºC (41 ºF to 89.6 ºF), 10% to 90% relative humidity, non-condensing Altitude Operating: Up to 3,048 m, max. ambient temperature is de-rated by 1 C per 300 m above 900 m Regulations 3 Safety: UL 60950-1 2nd Ed, EN60950-1:2006 2nd Ed, CB Scheme with all country differences RFI/EMI: FCC CFR 47 Part 15 Subpart B Class A, EN 55022:2006+A1:2007 Class A, EN 61000-3-11:2000, EN 61000-3-12:2005, ETSI EN 300 386 V1.4.1 (2008) Immunity: EN 55024:1998+A1:2001:+A2:2003 Certifications 3 Safety: UL/cUL, CE, BSMI, GOST R, S-Mark, CSA C22.2 No. 60950-1-07 2nd Ed, CCC EMC: CE, FCC, VCCI, ICES, KCC, GOST R, BSMI Class A, AS/NZ 3548, CCC Other: Complies with WEEE Directive (2002/96/EC) and RoHS Directive (2002/95/EC) 1 power usage varies by application workload 2 Airflow must be front to back 3 In some cases, as applicable, regulatory and certification compliance were obtained at the component level 5
Big Data Appliance Support Services Hardware Warranty: 1 year with a 4 hour web/phone response during normal business hours (Mon-Fri 8AM-5PM), with 2 business day on-site response/parts Exchange Oracle Premier Support for Systems: Oracle Linux and integrated software support and 24x7 with 2 hour on-site hardware service response (subject to proximity to service center) Oracle Premier Support for Operating Systems Oracle Customer Data and Device Retention System Installation Services Software Configuration Services System Expansion Support Services including hardware installation and software configuration Quarterly on-site patch deployment service Oracle Automatic Service Request (ASR) Contact Us For more information about Oracle Big Data Appliance, visit oracle.com or call +1.800.ORACLE1 to speak to an Oracle representative. Copyright 2013, Oracle and/or its affiliates. All rights reserved. This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Cloudera, Cloudera CDH, and Cloudera Manager, Cloudera Navigator and Cloudera BDR are registered and unregistered trademarks of Cloudera, Inc., Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark licensed through X/Open Company, Ltd. 0611 6