Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack



Similar documents
Platfora Big Data Analytics

Get More Scalability and Flexibility for Big Data

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

Cisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

White Paper. Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference Configurations

Unified Computing Systems

Cisco UCS with ParAccel Analytic Platform Solution: Deliver Powerful Analytics to Transform Business

The Future of Computing Cisco Unified Computing System. Markus Kunstmann Channels Systems Engineer

A Platform Built for Server Virtualization: Cisco Unified Computing System

Cisco Unified Computing System: Meet the Challenges of Virtualization with Microsoft Hyper-V

How Cisco IT Built Big Data Platform to Transform Data Management

Cisco UCS B460 M4 Blade Server

Cisco UCS B-Series M2 Blade Servers

How To Build A Cisco Ukcsob420 M3 Blade Server

Cisco SmartPlay Select. Cisco Global Data Center Promotional Program

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data

UCS M-Series Modular Servers

HadoopTM Analytics DDN

Cisco UCS Business Advantage Delivered: Data Center Capacity Planning and Refresh

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

Support a New Class of Applications with Cisco UCS M-Series Modular Servers

The virtualization of SAP environments to accommodate standardization and easier management is gaining momentum in data centers.

Cisco Data Preparation

The Open Cloud Near-Term Infrastructure Trends in Cloud Computing

IT Agility Delivered: Cisco Unified Computing System

How To Build A Cisco Uniden Computing System

C460 M4 Flexible Compute for SAP HANA Landscapes. Judy Lee Released: April, 2015

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

I/O Performance of Cisco UCS M-Series Modular Servers with Cisco UCS M142 Compute Cartridges

UCS Storage Options. July Bertalan Dergez Consulting Systems Engineer

Cisco and VMware: Transforming the End-User Desktop

AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013

Cisco UCS B440 M2 High-Performance Blade Server

Optimally Manage the Data Center Using Systems Management Tools from Cisco and Microsoft

Cisco UCS B200 M3 Blade Server

Cisco Unified Computing System: Meet the Challenges of Microsoft SharePoint Server Workloads

Virtualizing Apache Hadoop. June, 2012

How To Write An Article On An Hp Appsystem For Spera Hana

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

FlexPod for VMware The Journey to Virtualization and the Cloud

Cisco Unified Computing System Hardware

New Hitachi Virtual Storage Platform Family. Name Date

IVA & UCS. Frank Stott UCS Sales Specialist frstott@cisco.com Cisco and/or its affiliates. All rights reserved.

Mit Soft- & Hardware zum Erfolg. Giuseppe Paletta

SQL Server Consolidation on VMware Using Cisco Unified Computing System

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise

Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.0

SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V

Cisco UCS C24 M3 Server

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA

Dell s SAP HANA Appliance

Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra

How To Build An All In One, Hyperconverged, All Inone, All-In-One, And Integrated Solution With Cisco Unix (Cisco) Cisco.Com Cisco-Uku (Cio) C

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst

Maximum performance, minimal risk for data warehousing

Real-time Data Analytics mit Elasticsearch. Bernhard Pflugfelder inovex GmbH

Dell In-Memory Appliance for Cloudera Enterprise

Veeam Backup & Replication Enterprise Plus Powered by Cisco UCS: Reliable Data Protection Designed for Virtualized Environments

Cisco Desktop Virtualization with UCS: A Blueprint for Success

Cisco Business Intelligence Appliance for SAP

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

Minimize cost and risk for data warehousing

Accelerate Cloud Initiatives with Cisco UCS and Ubuntu OpenStack

Cisco Unified Data Center

Reference Architecture for OmniStack TM Integrated Solution with Cisco UCS C240

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

Building & Optimizing Enterprise-class Hadoop with Open Architectures Prem Jain NetApp

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Sun Constellation System: The Open Petascale Computing Architecture

Cisco, Citrix, Microsoft, and NetApp Deliver Simplified High-Performance Infrastructure for Virtual Desktops

Improve performance and availability of Banking Portal with HADOOP

ENTERPRISE-CLASS MONITORING SOLUTION FOR EVERYONE ALL-IN-ONE OPEN-SOURCE DISTRIBUTED MONITORING

Unified Computing System When Delivering IT as a Service. Tomi Jalonen DC CSE 2015

Practical Approaches to Big Data & Analytics: From Infrastructure to

Private cloud computing advances

Cisco UCS C220 M3 Server

Pivot3 Desktop Virtualization Appliances. vstac VDI Technology Overview

Business Case for BTI Intelligent Cloud Connect for Content, Co-lo and Network Providers

Building a Scalable Big Data Infrastructure for Dynamic Workflows

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cisco Solutions for Big Data and Analytics

Flash Storage Optimizing Virtual Desktop Deployments

Boost Database Performance with the Cisco UCS Storage Accelerator

Cisco Unified Computing System: Meet the Challenges of Microsoft SharePoint Server Workloads

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

How to Build a Sanbolic Platform User Manual

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure

Cisco Catalyst 4500-X Series Switch Family

Cisco UCS Integrated Infrastructure for Big Data with Splunk Enterprise

Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis

The Advantages of Multi-Port Network Adapters in an SWsoft Virtual Environment

Cisco IT Hadoop Journey

REFERENCE ARCHITECTURE. PernixData FVP Software and Splunk Enterprise

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Cisco Data Center Network Manager for SAN

Datasheet FUJITSU Integrated System PRIMEFLEX for Hadoop

Transcription:

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack HIGHLIGHTS Real-Time Results Elasticsearch on Cisco UCS enables a deeper and richer understanding of data converged from multiple sources within the enterprise in real time. Integrates with Cisco UCS CPA for Big Data Elasticsearch on Cisco UCS can coexist with other Cisco UCS CPA for Big Data solutions in the same system, sharing high-bandwidth connectivity to speed the use of analytic results. Ease of Deployment Cisco UCS Manager automates deployment and scaling, reducing the risk of configuration errors that can cause downtime. Architectural Scalability The solution is designed to grow to maximum scale without adding complex layers of switching infrastructure. Choice of Configurations Whether you want to optimize for high IOPs and low capacity or low outputs and high capacity, the Cisco UCS Elasticsearch bundle has you covered. Elasticsearch on Cisco UCS combines hardware and software to deliver real-time actionable insights from your data One challenge that organizations face is how quickly they can glean actionable insight from their large volumes of data. Having a scalable and performant compute, storage and networking infrastructure to support Big Data projects is only part of the equation for organizations to get value out of their data they need to be able to quickly search and analyze it. Elasticsearch bridges that gap with a real-time search and analytics engine. Deployed as part of a comprehensive data center architecture, the Elasticsearch on Cisco Unified Computer System (Cisco UCS) solution delivers real-time business insights over a powerful and flexible infrastructure. Elasticsearch Solutions Elasticsearch is a company dedicated to leveraging the collective power of three massively popular open source projects: Elasticsearch, Logstash, and Kibana. Each tool can be used in isolation for search, logging, and data visualization respectively. However, when all three are combined, they create an end-to-end solution known as the Elasticsearch ELK stack, a powerful analytics platform that can help you glean actionable insights in real time from almost any type of structured and unstructured data source. The ELK stack can be configured as a stand-alone application, it can serve as a platform for custom web applications, or it can be integrated with your organization s existing data applications. Elasticsearch Elasticsearch is a powerful open source, distributed, real-time search and analytics engine built on top of Apache Lucene. Architected from the 1

ground up for use in distributed environments where reliability and scalability are must-haves, Elasticsearch gives you the ability to easily move beyond simple, full-text search. It enables real-time analytics and full text search over stored data. With its robust set of APIs, query DSLs, and clients for the most popular programming languages, Elasticsearch is gaining wide adoption as an open platform for all of your organization s log data. Logstash Logstash aggregates log data as well as time series data, csv, and over 40 data inputs from any system into a single place for additional transformation and processing. Using Elasticsearch as its backend data store, Logstash creates a powerful pipeline for storing, querying, and analyzing log files. Kibana Kibana is Elasticsearch s data visualization engine, allowing users to natively interact with all of their data in Elasticsearch via pre-built and custom dashboards. Elasticsearch for Hadoop Elasticsearch for Apache Hadoop, called es-hadoop, is an adapter that facilitates Hadoop jobs to interact with Elasticsearch through a bi-directional flow of data. It enables real-time search for Hadoop HDFS data. The adapter currently supports Map/Reduce, Cascading, Spark, Pig, Hive, and Storm. This might be a good solution if you want to search on top of your long-term data store. Cisco UCS The Cisco UCS is a next-generation data center platform that unites compute, network, and storage access into a cohesive system designed to meet a variety of scale-out application demands with transparent data and management integration capabilities. Scalability Cisco UCS brings simplified management, greater deployment flexibility, and easier scalability to the scaled-out nature of many of today s applications. Every aspect of server personality, configuration, and connectivity is set on demand, through the Cisco UCS Manager. Using Cisco s powerful service profiles, you can configure Elasticsearch clusters rapidly and automatically without the risk of configuration drift, which can lead to errors that cause downtime. Unified management in Cisco UCS enables greater agility and more rapid deployment. Unified Fabric This solution uses important Cisco UCS innovations of unified fabric, unified I/O, and unified management for all connected devices. Its low-latency, lossless 10-Gbps unified fabric is fully redundant and, through its active-active configuration, delivers higher performance compared to other vendor solutions. Reduced TCO and Improved Staff Efficiency This simplified, intelligent infrastructure reduces your total cost of ownership (TCO) with fewer management endpoints, switches, adapters, cables, power, and cooling components. Through the use of embedded management and automation, your staff s ability to quickly and efficiently deploy and troubleshoot Cisco UCS will lower operating expenses and allow staff to focus on strategic business initiatives rather than infrastructure maintenance. 2

Cisco UCS CPA for Big Data Unified fabric Elasticsearch on Cisco (Hot Option) Elasticsearch on Cisco UCS Elasticsearch on Cisco UCS delivers high performance with exceptional scalability. Cisco UCS unified fabric architecture provides fully redundant, highly scalable, lossless 10-Gbps unified fabric connectivity for Elasticsearch s query, index, and replication data traffic. The reference architecture can easily scale to support a large number of nodes to meet additional demands. Unified Mgmt This solution can connect across the same management plane to other Cisco UCS deployments such as those deployed with Cisco UCS Common Platform Architecture (CPA) for Big Data, thereby radically simplifying datacenter management and connectivity. Figure 1 The solution is built using the following components: Cisco UCS 6200 Series Fabric Interconnects Cisco s UCS 6200 Series Fabric Interconnects provide high-bandwidth, low-latency connectivity for servers, with Cisco UCS Manager providing integrated, unified management for all connected devices. Deployed in redundant pairs, Cisco fabric interconnects offer the full active-active redundancy, performance, and exceptional scalability needed to support the large number of nodes that are typical in clusters serving Big Data applications. Cisco UCS Manager enables rapid and consistent server configuration using service profiles, automating ongoing system maintenance activities such as firmware updates across the entire cluster as a single operation. Cisco UCS Manager also offers advanced monitoring with options to raise alarms and send notifications about the health of the entire cluster. Cisco UCS C220 M4 Rack Servers Cisco UCS C220 M4 Rack Servers are designed for performance and density over a wide range of business workloads in a 1-rack unit (1RU) form factor. Cisco UCS C220 M4 servers are powered by dual Intel Xeon E5-2600 v3 series CPUs, and they support up to 768 GB of main memory. These servers support four or eight SAS/SATA/SSD drives as well as Cisco UCS virtual interface cards (VICs) optimized for high-bandwidth and low-latency cluster connectivity, with support for up to 256 virtual devices. Cisco UCS C240 M4 Rack Servers Cisco UCS C240 M4 Rack Servers are enterprise-class servers that deliver an outstanding combination of performance, flexibility, and efficiency for storage. The Cisco UCS C240M4 servers are 2-socket, 2-rack-unit (2RU) servers based on Intel Xeon E5-2600 v3 series processors supporting up to 768 GB of DDR4 main memory. These servers support up to 26 SFF SAS/SATA/SSD drives or 12 LFF SAS/SATA drives plus 2 SFF SSD drives. Their expandability and exceptional performance makes them an ideal fit for big data analytics, virtualization, graphics-rich and bare-metal applications. 3

Configuration Options Systems that are optimized for performing many fast queries look different than systems optimized for just fast indexing. Figure 1 below shows how your use case impacts how you might want to configure your UCS hardware. The UCS- Elasticsearch bundles (Table-1) come in two flavors - hot and warm - and can be used separately or combined together to meet your deployment needs. The UCS-Elasticsearch bundles (Table-1) come in two flavors, each according to these common use cases: Hot Option Fast Indexing, Many Queries, Low Capacity Use Cases -- This option is for when you want to run fast indexing on today s data and only need to retain it in storage for a short period of time (~2 days). The system is optimized for high IOPS and low storage capacity. Warm Option Many Queries, Medium Capacity Use Cases -- This option is for when you need to retain data for ~3-8+ days and require medium-to-less frequent querying. This option balances I/O bandwidth with medium storage capacity. Events per second (EPS) indexing querying n days Configuration option 2 DAY RETENTION HOT OPTION High IOPs Fast queries Low capacity 3-8+ DAY RETENTION WARM OPTION High output Medium capactity Figure 2 4

Rack Solution By Use Case Hot Option Warm Option Network 2 Cisco UCS 6248UP 48-Port Fabric Interconnects 2 Cisco UCS 6248UP 48-Port Fabric Interconnects Management Cisco UCS Manager Cisco UCS Manager Servers 8 Cisco UCS C220 M4 Rack Servers, each with: 2 Intel Xeon processors E5-2620 v3 at 2.4 GHz 256 GB of memory Cisco 12G-2GB RAID Controller 8 1.2TB 10K SFF SAS drives (76 TB total) 8 Cisco UCS C240 M4 Rack Servers, each with: 2 Intel Xeon processors E5-2620 v3 at 2.4 GHz 256 GB of memory Cisco 12G-2GB RAID Controller 24 1.2TB 10K SFF SAS drives (230 TB total) 2 120GB SSD drives Table 1: Elasticsearch on Cisco UCS Solution Conclusion Big Data technology has become a valuable and compelling asset for organizations of all sizes. Harnessing the value of Big Data analytics in real time from a stable, manageable, and scalable infrastructure has been a long-standing challenge. However, the Elasticsearch on Cisco UCS solution rises above this challenge by delivering a real-time data analytics platform that can be rapidly deployed, scaled on demand, and is easy to use. It also reduces the total cost of ownership by requiring fewer infrastructure components and reducing operating expenses associated with staff time. Organizations can rely on this solution to provide real-time data intelligence and make critical business decisions. For More Information: www.elasticsearch.com www.cisco.com/go/ucs For more information about the Elasticsearch support, please visit: http://www.elasticsearch.com/support For more information about Cisco UCS big data solutions, please visit: http://www.cisco.com/go/bigdata 5