Platfora Big Data Analytics

Similar documents
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Get More Scalability and Flexibility for Big Data

Cisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

White Paper. Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference Configurations

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

How To Build A Cisco Ukcsob420 M3 Blade Server

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

How Cisco IT Built Big Data Platform to Transform Data Management

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

Build Your Competitive Edge in Big Data with Cisco. Rick Speyer Senior Global Marketing Manager Big Data Cisco Systems 6/25/2015

Unified Computing Systems

Boost Database Performance with the Cisco UCS Storage Accelerator

Cisco Data Preparation

Building & Optimizing Enterprise-class Hadoop with Open Architectures Prem Jain NetApp

Cisco Unified Computing System Hardware

Cisco, Citrix, Microsoft, and NetApp Deliver Simplified High-Performance Infrastructure for Virtual Desktops

Cisco SmartPlay Select. Cisco Global Data Center Promotional Program

Cisco UCS B-Series M2 Blade Servers

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data

Cisco IT Hadoop Journey

IT Agility Delivered: Cisco Unified Computing System

UCS M-Series Modular Servers

Cisco UCS C220 M3 Server

Cisco Unified Computing System and EMC VNXe3300 Unified Storage System

The Future of Computing Cisco Unified Computing System. Markus Kunstmann Channels Systems Engineer

Cisco UCS B460 M4 Blade Server

Cisco UCS B200 M3 Blade Server

Colgate-Palmolive selects SAP HANA to improve the speed of business analytics with IBM and SAP

Veeam Backup & Replication Enterprise Plus Powered by Cisco UCS: Reliable Data Protection Designed for Virtualized Environments

VXRACK SYSTEM Product Overview DATA SHEET

UCS Storage Options. July Bertalan Dergez Consulting Systems Engineer

Cisco Unified Computing System: Meet the Challenges of Microsoft SharePoint Server Workloads

Cisco UCS C24 M3 Server

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Cisco Unified Computing System: Meet the Challenges of Microsoft SharePoint Server Workloads

C460 M4 Flexible Compute for SAP HANA Landscapes. Judy Lee Released: April, 2015

IBM System x family brochure

Cisco Unified Computing System: Meet the Challenges of Virtualization with Microsoft Hyper-V

Cisco UCS B440 M2 High-Performance Blade Server

I/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges

FlexPod for VMware The Journey to Virtualization and the Cloud

The virtualization of SAP environments to accommodate standardization and easier management is gaining momentum in data centers.

Accelerate Cloud Initiatives with Cisco UCS and Ubuntu OpenStack

Integrated Grid Solutions. and Greenplum

REFERENCE ARCHITECTURE. PernixData FVP Software and Splunk Enterprise

Cloud Ready: Architectural Integration into FlexPod with Microsoft Private Cloud Solution

Large Unstructured Data Storage in a Small Datacenter Footprint: Cisco UCS C3160 and Red Hat Gluster Storage 500-TB Solution

Lenovo ThinkServer and Cloudera Solution for Apache Hadoop

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

ORACLE BIG DATA APPLIANCE X3-2

New Hitachi Virtual Storage Platform Family. Name Date

Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis

The Future of Data Management

Dell Reference Configuration for Hortonworks Data Platform

Cisco UCS C220 M3 Server

MapR Enterprise Edition & Enterprise Database Edition

How To Write An Article On An Hp Appsystem For Spera Hana

Support a New Class of Applications with Cisco UCS M-Series Modular Servers

Dell s SAP HANA Appliance

Cisco UCS Business Advantage Delivered: Data Center Capacity Planning and Refresh

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

How To Build A Cisco Uniden Computing System

Cisco UCS Integrated Infrastructure for Big Data with Splunk Enterprise

Hortonworks Data Platform Reference Architecture

Modernizing Your Data Warehouse for Hadoop

HUAWEI TECHNOLOGIES CO., LTD. HUAWEI FusionServer X6800 Data Center Server

IBM System x family brochure

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

Cisco Data Center Network Manager for SAN

Big Data Performance Growth on the Rise

Maximum performance, minimal risk for data warehousing

Big Data in the Enterprise: Network Design Considerations

SAP HANA - an inflection point

Mit Soft- & Hardware zum Erfolg. Giuseppe Paletta

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM

Cisco and VMware: Transforming the End-User Desktop

Datasheet FUJITSU Integrated System PRIMEFLEX for Hadoop

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

IVA & UCS. Frank Stott UCS Sales Specialist frstott@cisco.com Cisco and/or its affiliates. All rights reserved.

Big Data. Value, use cases and architectures. Petar Torre Lead Architect Service Provider Group. Dubrovnik, Croatia, South East Europe May, 2013

A Platform Built for Server Virtualization: Cisco Unified Computing System

How System Settings Impact PCIe SSD Performance

Cisco-EMC Microsoft SQL Server Fast Track Warehouse 3.0 Enterprise Reference Configurations. Data Sheet

SUN HARDWARE FROM ORACLE: PRICING FOR EDUCATION

HadoopTM Analytics DDN

Oracle Database Reliability, Performance and scalability on Intel Xeon platforms Mitch Shults, Intel Corporation October 2011

Architectural Comparison: Cisco UCS and the Dell FX2 Platform

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software

IRON Big Data Appliance Platform for Hadoop

VBLOCK SOLUTION FOR SAP: SAP APPLICATION AND DATABASE PERFORMANCE IN PHYSICAL AND VIRTUAL ENVIRONMENTS

Transcription:

Platfora Big Data Analytics ISV Partner Solution Case Study and Cisco Unified Computing System Platfora, the leading enterprise big data analytics platform built natively on Hadoop and Spark, delivers business outcomes and competitive advantage. Platfora enables business users and data scientists to visually interact with petabyte-scale data in seconds, allowing them to work with even the rawest forms of transaction, customer interaction, and machine data. When combined with high-performance Cisco Unified Computing System (Cisco UCS ) servers, the solution encourages more use cases and unlocks the enterprise s competitive edge by providing deeper insights into much larger data sets. Analytics for Business Users Platfora s solution runs on a separate dedicated cluster alongside the Apache Hadoop cluster. A Platfora cluster is typically configured with large amounts of RAM to support terabytes of in-memory analysis performed directly by business users, delivering exceptional performance while still allowing users to analyze all the data inside the Hadoop cluster. Platfora s solution is optimized to run on Cisco UCS, which can be seamlessly integrated with Cisco UCS Hadoop deployments. See and Work with 100 Percent of Your Data Business analysts can dig into all the multistructured data in your organization transactions, customer interactions, and machine data via Platfora s self-service access platform. With Platfora, analysts have access to all the data they need without needing to involve IT staff. Improved Time to Value Platfora enables business analysts to access and iterate their workloads in minutes, rather than hours. This faster time-to-insight drives faster reaction to changing business conditions and better support for business users whose budget drives IT adoption. 1

Platfora on Cisco UCS Platfora Cluster Lens data import via HDFS, lens, and segment replication to HDFS HADOOP Local filesystems (replicated 3x) Platfora lenses, segments Platfora Vizboards Data sets HDFS (replicated 3x) Transactional Data Customer Interaction Data Ingest/export via Sqoop, Storm+Kafka, ETL All data to be analyzed Platfora lenses, segments (replicated from Platfora Cluster) Machine Data How Platfora Works Platfora s Interest Driven Pipeline enables business users to derive insight directly vs. engaging IT staff to condition and structure data required for analysis, an approach required by traditional business intelligence tooling. This approach reduces time-to-insight by orders of magnitude. Platfora performs behavioral analysis and iterative segmentation across 100 percent of the data in Hadoop. Traditional business intelligence tools cannot access all the data in Hadoop because they typically throttle through a SQL-over-Hadoop approach and squeeze the data into a single server, precluding exploratory analysis on very large data sets. Platfora delivers a scale-out, in-memory MPP-based accelerator engine, ensuring high-performance, low-latency response for business users. Platfora automatically generates MapReduce and Spark jobs to populate an in-memory acceleration layer, removing the need for business analysts to write SQL, Hive, or MapReduce code. Use Cases Customer Analytics and Insights: Understand Your Audience Better Than Ever Before Platfora encourages you to ask new questions about your data, making it easy for marketing professionals to follow hunches, test theories, and continuously refine analysis until they find exactly what they are looking for all with no coding required. Frequently implemented use cases include golden path to purchase, segmentation analysis, and customer churn drivers. 2

Internet of Things: Combine New Data Sets in Ways Never Before Possible Platfora enables analysts to combine large amounts of data in multiple formats from product telemetry via connected devices to user experience, traffic and parking control, and telemedicine. Frequently implemented use cases include product/feature utilization, sensor analytics, and supportability analysis. Security and Compliance: Practical, Proactive Protection Network security analytics with Platfora uses the power of Hadoop to spot subtle breach patterns across billions of events without waiting two or three months for analysis. Platfora delivers full situational awareness via the ability to rapidly interrogate data to investigate incidents and improve understanding of network-borne threats and adjust the questions being asking based on rapidly changing behaviors. Advanced persistent threat identification is the most frequentlyimplemented use case. Platfora on Cisco UCS Integrated Infrastructure for Big Data Cisco UCS is the first converged data center platform that combines industry-standard x86-architecture servers with networking and storage access into a single converged system. The Cisco UCS innovations of unified fabric and unified management for all connected devices offer simplified management, world-class performance, and exceptional scalability needed to support the Platfora analytic workloads. Cisco UCS Integrated Infrastructure for Big Data is the third generation of Cisco s solution for big data. It extends the Cisco UCS Common Platform Architecture (CPA) for Big Data with improvements in performance and capacity. The solution has been widely adopted for a variety of workloads across all enterprise segments. It accelerates deployment, delivers predictable performance, and reduces total cost of ownership (TCO). Reference Configurations The Platfora on Cisco UCS reference configurations are based on the latest Cisco solution for big data. The Platfora cluster is designed to deploy alongside Hadoop environments (shown in the figure), taking full advantage of the lossless 10Gbps unified fabric connectivity for Platfora s lens-building traffic. The solution extends Hadoop deployments on Cisco UCS CPA for Big Data with Platfora s scaleout, in-memory MPP-based accelerator engine to deliver business intelligence that meets the need of business users. The reference configurations are built with the following components: Cisco UCS 6200 Series Fabric Interconnects provide high-bandwidth, low-latency connectivity for servers, with Cisco UCS Manager providing integrated, unified management for all connected devices. Deployed in redundant pairs, Cisco fabric interconnects offer the full activeactive redundancy, performance, and exceptional scalability needed to support the large number of nodes that are typical in clusters serving big data applications. Cisco UCS Manager enables rapid and consistent server configuration using service profiles, automating ongoing system maintenance activities such as firmware updates across the entire cluster as a single operation. Cisco UCS Manager also offers advanced monitoring with options to raise alarms and send notifications about the health of the entire cluster. Platfora Nodes Hadoop Nodes Platfora and Hadoop Coexisting on Cisco UCS 3

Cisco UCS C220 M4 Rack Servers are designed for performance and density over a wide range of business workloads in a 1-rack unit (1RU) form factor. Cisco UCS C220 M4 servers are powered by dual Intel Xeon E5-2600 v3 series CPUs, and they support up to 768 GB of main memory. These servers support four or eight SAS/SATA/SSD drives as well as Cisco UCS virtual interface cards (VICs) optimized for high-bandwidth and low-latency cluster connectivity, with support for up to 256 virtual devices. Cisco UCS C220 M4 servers are ideal for building Platfora clusters. Cisco UCS C240 M4 Rack Servers are enterprise-class servers that deliver an outstanding combination of performance, flexibility, and efficiency for storage. The Cisco UCS C240M4 servers are 2-socket, 2-rack-unit (2RU) servers based on Intel Xeon E5-2600 v3 series processors supporting up to 768 GB of DDR4 main memory. These servers support up to 24 SFF SAS/SATA/SSD drives or 12 LFF SAS/SATA drives, plus 2 SFF SSD drives. Their expandability and exceptional performance makes them an ideal fit for big data analytics, virtualization, and graphics-rich and bare-metal applications. Cisco UCS C240 M4 servers are ideal for Hadoop deployments. The following table lists the reference configurations of Platfora on Cisco UCS. The options include deploying Platfora with Cisco s performance optimized or capacity optimized solution for big data. The typical ratio of Hadoop nodes to Platfora nodes in deployment ranges from 4:1 to 8:1, Reference Configurations SOLUTION Platfora Servers PLATFORA WITH PERFORMANCE OPTIMIZED HADOOP PLATFORA WITH CAPACITY OPTIMIZED HADOOP 4 Cisco UCS C220 M4 Rack Servers, each with: 2 Intel Xeon processors E5-2620 v3 at 2.4 GHz 256 GB of memory Cisco 12G-2GB RAID Controller 8 1.2TB 10K SFF SAS drives per server (38 TB total) Cisco UCS Solution Accelerator Paks Connectivity Hadoop Servers Hadoop Performance Optimized* 2 Cisco UCS 6296UP 96-Port Fabric Interconnects 16 Cisco UCS C240 M4 Rack Servers, each with: 2 Intel Xeon processor E5-2680 v3 CPUs 256 GB of memory Cisco 12-Gbps SAS Modular Raid Controller with 2-GB FBWC cache 2 120-GB 6-Gbps 2.5-inch Enterprise Value SATA SSDs 24 1.2-TB 10K SFF SAS drives Cisco UCS VIC 1227 (with 2 10GE SFP+ports) Hadoop Capacity Optimized* 2 Cisco UCS 6296UP 96-Port Fabric Interconnects 16 Cisco UCS C240 M4 Rack Servers, each with: 2 Intel Xeon processor E5-2620 v3 CPUs 128 GB of memory Cisco 12-Gbps SAS Modular Raid Controller with 2-GB FBWC cache 2 120-GB 6-Gbps 2.5-inch Enterprise Value SATA SSDs 12 4-TB 7.2K Large Form Factor (LFF) SAS drives Cisco UCS VIC 1227 (with 2 10GE SFP+ports) Storage 460 TB with 42 GBps of bandwidth 768 TB with 16 GBps of bandwidth Optional Software from Cisco Red Hat Enterprise Linux, SUSE Linux Enterprise Cloudera, MapR, or Hortonworks Cisco UCS Director Express for Big Data Red Hat Enterprise Linux SUSE Linux Enterprise Cloudera, MapR, or Hortonworks Cisco UCS Director Express for Big Data * Base rack solution is available as single SKU bundles. Performance Optimized Rack: UCS-SL-CPA3-P. Capacity Optimized: UCS-SL-CPA3-C. 4

depending on workload. Therefore, these reference configurations can be further customized to meet a variety of big data and analytics workload requirements. Up to 160 servers are supported in a single management domain with Cisco Nexus 2232PP 10GE Fabric Extenders. These configurations can be further scaled to thousands of servers using Cisco Nexus 7000 or 9000 Series Switches. Conclusion Platfora Big Data Analytics on the Cisco UCS solution is designed to help organizations derive instant value from their big data deployments. Without the expense entailed in designing and building custom solutions, this solution can help organizations quickly and easily deploy the Platfora analytics platform on a powerful, and secure Hadoop environment with 100 percent of the data available for business users, without additional developer support. More Information www.platfora.com www.cisco.com/go/ucs For more information about Cisco UCS big data solutions, visit www.cisco.com/go/bigdata. For more information about the Cisco CPA for Big Data, visit blogs.cisco.com/datacenter/cpav3. 2015 Cisco and/or its affiliates. All rights reserved. Cisco and the Cisco logo are trademarks or registered trademarks of Cisco and/or its affiliates in the U.S. and other countries. To view a list of Cisco trademarks, go to www.cisco.com/go/trademarks. Third-party trademarks mentioned are the property of their respective owners. Use of the word partner does not imply a partnership relationship between Cisco and any other company. 5