Get More Scalability and Flexibility for Big Data

Similar documents
Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Cisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Platfora Big Data Analytics

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

White Paper. Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference Configurations

Cisco Nexus Planning and Design Service

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

90% of your Big Data problem isn t Big Data.

Cisco, Citrix, Microsoft, and NetApp Deliver Simplified High-Performance Infrastructure for Virtual Desktops

Support a New Class of Applications with Cisco UCS M-Series Modular Servers

Cisco Unified Computing. Optimization Service

IT Agility Delivered: Cisco Unified Computing System

Veeam Backup & Replication Enterprise Plus Powered by Cisco UCS: Reliable Data Protection Designed for Virtualized Environments

Accelerate Cloud Initiatives with Cisco UCS and Ubuntu OpenStack

Cisco UCS Business Advantage Delivered: Data Center Capacity Planning and Refresh

Unified Computing Systems

Boost Database Performance with the Cisco UCS Storage Accelerator

Cisco UCS: Unified Infrastructure Management That HP OneView Still Can t Match

Cisco Unified Computing System: Meet the Challenges of Virtualization with Microsoft Hyper-V

MarkLogic and Cisco: A Next-Generation, Real-Time Solution for Big Data

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

BUILDING A SCALABLE BIG DATA INFRASTRUCTURE FOR DYNAMIC WORKFLOWS

Cisco Virtual Desktop Infrastructure Strategy Service

Managing the Real Cost of On-Demand Enterprise Cloud Services with Chargeback Models

How To Build A Cisco Ukcsob420 M3 Blade Server

Cisco and VMware: Transforming the End-User Desktop

Cisco Extended Care 1.0

How Cisco IT Built Big Data Platform to Transform Data Management

The Production Cloud

Building a Scalable Big Data Infrastructure for Dynamic Workflows

Cisco Smart Business Communications Systems. Cisco Small Business Unified Communications 300 Series

IBM System x reference architecture solutions for big data

Cisco UCS and HP BladeSystem: A Comparison

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

The Future of Computing Cisco Unified Computing System. Markus Kunstmann Channels Systems Engineer

Cisco UCS B-Series M2 Blade Servers

Cisco Smart Business Communications Systems. Cisco Small Business Unified Communications 300 Series

How-to Guide: Top Ways to Improve Contact Center Performance

Unlock the value of data with smarter storage solutions.

Quantium captures new niche in data analytics market

Planning the Migration of Enterprise Applications to the Cloud

WHITE PAPER. Building Blocks of the Modern Data Center

Dell s SAP HANA Appliance

Cisco ASA 5500-X Series ASA 5512-X, ASA 5515-X, ASA 5525-X, ASA 5545-X, and ASA 5555-X

A Platform Built for Server Virtualization: Cisco Unified Computing System

Cisco-EMC Microsoft SQL Server Fast Track Warehouse 3.0 Enterprise Reference Configurations. Data Sheet

The virtualization of SAP environments to accommodate standardization and easier management is gaining momentum in data centers.

Key Messages of Enterprise Cluster NAS Huawei OceanStor N8500

High-Availability Fault Tolerant Computing for Remote and Branch Offices HA/FT solutions for Cisco UCS E-Series servers and VMware vsphere

What Is Microsoft Private Cloud Fast Track?

Cisco UCS B440 M2 High-Performance Blade Server

Big data management with IBM General Parallel File System

Independent Software Vendors: Upgrade and Unify Your On-Premise IT with Cisco

Introduction to VMware EVO: RAIL. White Paper

Accelerating Microsoft Exchange Servers with I/O Caching

Cisco Mobile Collaboration Management Service

VXRACK SYSTEM Product Overview DATA SHEET

Data Center Network Evolution: Increase the Value of IT in Your Organization

Ericsson Introduces a Hyperscale Cloud Solution

An Oracle White Paper November Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager

Cisco Data Center Virtualization Assessment Service

Virtual Compute Appliance Frequently Asked Questions

Scala Storage Scale-Out Clustered Storage White Paper

IBM Storwize V7000 Unified and Storwize V7000 storage systems

Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis

Cisco SecureX Product Brochure

Cisco Data Preparation

Telepresence in an IPv6 World. Simplify the Transition

alcatel-lucent vitalqip Appliance manager End-to-end, feature-rich, appliance-based DNS/DHCP and IP address management

Private cloud computing advances

Cisco UCS Architecture and Management:

Cisco IT Hadoop Journey

What Is Microsoft Private Cloud Fast Track?

Cisco Unified Computing System and EMC VNXe3300 Unified Storage System

Kronos Workforce Central 6.1 with Microsoft SQL Server: Performance and Scalability for the Enterprise

Architectural Comparison: Cisco UCS and the Dell FX2 Platform

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure

Cisco Unified Communications on the Cisco Unified Computing System

HBA Virtualization Technologies for Windows OS Environments

Running Oracle s PeopleSoft Human Capital Management on Oracle SuperCluster T5-8 O R A C L E W H I T E P A P E R L A S T U P D A T E D J U N E

Cloud Ready: Architectural Integration into FlexPod with Microsoft Private Cloud Solution

Software-Defined Networks Powered by VellOS

Cisco Data Center Network Manager for SAN

Microsoft Private Cloud Fast Track Reference Architecture

To Cloud or Not to Cloud? Which Communications Deployment Option is Best for Your Business?

The Cisco Smart Business Communications System

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Cisco Solution for EMC VSPEX Server Virtualization

Transcription:

Solution Overview LexisNexis High-Performance Computing Cluster Systems Platform Get More Scalability and Flexibility for

What You Will Learn Modern enterprises are challenged with the need to store and analyze massive amounts of structured and unstructured data. This is essential to derive crucial insights that may give the company the necessary advantage over its competition. The amount of this data grows at a faster pace than ever before. Timely and accurate analysis of this data is of utmost importance. High-Performance Computing Cluster (HPCC) Systems is an open source big data processing platform from LexisNexis. LexisNexis has collaborated with Cisco to offer a high-performance analytics platform that is scalable, flexible, and cost effective (Figure 1). Figure 1 LexisNexis HPCC Systems Structured, Semistructured, and Unstructured Data Extraction Transformation Loading High-Performance, Highly Concurrent, Real-Time Delivery THOR ROXIE ECL Web Application ECL Developer Using ECL IDE The solution is based on the Cisco Unified Computing System (Cisco UCS ) Common Platform Architecture (CPA) for. The software includes a Thor cluster that acts as the extract, transform, and load (ETL) engine; a Roxie cluster for that acts as a high-performing concurrent data delivery platform; and an Enterprise Control Language (ECL) Machine Learning (ECL-ML) module. HPCC Systems has validated and certified this solution on Cisco UCS CPA for. Challenges of and High Performance Computing Big data refers to the data in various structured and unstructured forms that originates from a multitude of devices and applications connected to the Internet. This data typically doesn t fit into traditional relational models because of its size, velocity, and composition: the amount of big data increases rapidly, and big data is a mix of structured and unstructured data. As a result of the continuing dramatic increase in data, many organizations are overwhelmed by the data in their systems, with the result that they are unable to process the information and use it effectively. The answer to these challenges is a scalable, integrated computer hardware and software architecture designed for parallel processing of data- intensive computing applications. 2

HPCC Systems HPCC Systems, from LexisNexis, is an emerging enterprise platform with massive parallel-processing performance and high levels of reliability, security, and uptime. To manage, sort, link, and analyze billions of records in less than a second, LexisNexis offers a data-intensive platform that has been proven for the past 10 years with customers who need to sort through billons of records. Customers such as leading banks and insurance companies, utilities, law enforcement, and the federal government depend on LexisNexis HPCC Systems technology and information solutions to help them make better decisions faster. HPCC Systems comes with optional subscriptionbased support services. It includes software, services, and support to help organizations discover the insights of new and emerging types of information, a capability that did not exist before. With the extensible set of fully parallel HPCC Systems Machine Learning and Matrix processing modules, enterprise customers are fully equipped to meet the challenges of big data. Cisco UCS CPA for The Cisco solution for HPCC Systems is based on the Cisco UCS CPA for, a highly scalable architecture designed to meet a variety of scale-out application demands with transparent data integration and management integration capabilities. The new Cisco UCS CPA Version 2 (CPAv2) for improves both performance and capacity. With complete, easy-to-order packages that include computing, storage, connectivity, and unified management features, Cisco UCS CPAv2 for supports rapid deployment, delivers predictable performance, and reduces total cost of ownership (TCO). Cisco UCS CPA v2 for is built using the following components: Cisco UCS 6200 Series Fabric Interconnects establish a single point of connectivity and management for the entire system. The fabric interconnects provide high-bandwidth, low-latency connectivity for servers, with integrated, unified management for all connected devices provided by Cisco UCS Manager. Deployed in redundant pairs, Cisco fabric interconnects offer the full active-active redundancy, performance, and exceptional scalability needed to support the large number of nodes typical in clusters that serve big data applications. Cisco UCS Manager enables rapid and consistent server configuration using service profiles, automating ongoing system maintenance activities such as firmware updates across the entire cluster as a single operation. Cisco UCS Manager also offers advanced monitoring with options to activate alarms and send notifications about the health of the entire cluster. Cisco Nexus 2200 platform fabric extenders bring the system s unified fabric to each rack, establishing physically distributed but logically centralized network infrastructure. These low-cost, low-power-consuming devices act as remote line cards for the fabric interconnects, providing connectivity without adding the cost and management complexity required by top-of-rack switches. The result is highly scalable and cost-effective connectivity for many nodes. Cisco UCS C240 M3 Rack Servers support a wide range of computing, I/O, and storage-capacity demands in a compact two-rack-unit (2RU) design. The servers use dual Intel Xeon processor E5-2600 v2 series CPUs and support up to 768 GB of main memory (128 or 256 GB is typical for big data applications) and a range of disk drive options. Cisco UCS virtual interface cards (VICs) are optimized for high-bandwidth and low-latency cluster connectivity, with support for up to 256 virtual devices that are configured on demand through Cisco UCS Manager. 3

Cisco CPAv2 for supports the massive scalability demanded by big data environments. Up to 160 servers are supported in a single management domain with a pair of Cisco fabric interconnects. Additional scaling can be accomplished by interconnecting multiple domains using Cisco Nexus 6000 or 7000 Series Switches. With Cisco UCS Central Software, thousands of servers and hundreds of petabytes (PB) of storage can be managed through a single interface with the same automation provided by Cisco UCS Manager. The HPCC System and Cisco solution is offered as a Cisco UCS CPAv2 reference architecture and as a single SKU bundle (Table 1). The bundle can be used as it is or as a base for larger configurations. Table 1 Cisco CPAv2 for : Performance- and Capacity-Balanced Configuration Network Fabric Management Servers per Rack 2 Cisco UCS 6296UP 96- Port Fabric Interconnects (per domain) 2 Cisco Nexus 2232PP 10 GE Fabric Extenders (per rack) Cisco UCS Manager 16 Cisco UCS C240 M3 Rack Servers, each with: - 2 Intel Xeon processors E5-2665 v2-256 GB of memory - LSI MegaRaid 9271CV-8i storage controller card - 24 x 1-TB 7200-rpm SFF SAS drives (384 TB total) Benefits of Cisco and HPCC Systems High performance and exceptional scalability: Cisco UCS unified fabric architecture provides fully redundant, highly scalable, lossless 10-Gbps unified fabric connectivity for big data traffic. Powered by the latest Intel Xeon processor, the joint Cisco and HPCC Systems solution delivers best-in-class performance and internal storage capacity. The Cisco and HPCC Systems reference configurations can easily scale to support a large number of nodes when required by business demands. The advanced management capabilities of Cisco UCS radically simplify this process with a single point of management that spans all nodes in the cluster. Simplified management: Big data analytics implementations tend to involve very large numbers of servers. In traditional environments, managing these large numbers of servers effectively can be challenging. Cisco UCS Manager delivers unified, model-based management that applies personality and configures server, network, and storage connectivity resources, making it as easy to deploy hundreds of servers as it is to deploy a single server. Additionally, Cisco UCS Manager can perform system maintenance activities such as firmware updates across the entire cluster as a single operation. Coexistence with enterprise applications: In building big data solutions that involve unstructured or NoSQL data, organizations need ways to transfer data transparently between their enterprise applications and big data platforms. This solution can connect, across the same management plane, to other Cisco UCS deployments running enterprise applications, thereby radically simplifying data center management and connectivity. HPCC Systems provides a comprehensive platform for designing and managing solutions that cross the boundaries of traditional and big data platforms. By providing easy-to-use tools and familiar design concepts for both traditional and big data platforms, HPCC Systems empowers organizations to use existing IT skill sets to build big data solutions. 4

Rapid deployment: Deployment of large numbers of servers can take time. Systems need to be racked, networked, configured, and provisioned before they can be put into use. Cisco UCS Manager uses a model-based approach to provision servers, applying the desired configuration to physical infrastructure quickly, accurately, and automatically. The capability to create consistent configurations improves business agility and eliminates a major source of errors that can cause downtime. The tightly integrated HPCC Systems platform addresses the challenges of building end-to-end solutions that encompass the workflow from data acquisition and processing to comprehensive analytics. Enterprise service and support: Enterprises want to know that the vendors providing a solution have the expertise to help them quickly proceed through the design, deployment, and testing processes for strategic big data initiatives. Businesses also need to have confidence that if a critical system fails, they will be able to get timely and competent support. The Cisco reference configurations bring together world-class service and support from Cisco and HPCC Systems. For More Information For more information about HPCC Systems, please visit http://hpccsystems.com/products-and-services/products. For more information about Cisco UCS big data solutions, please visit http://www.cisco.com/go/bigdata. For more information about the Cisco UCS CPAv2 for, please visit http://blogs.cisco.com/datacenter/cpav2. For more information about the Cisco Smart Play program, please visit http://www.cisco.com/go/smartplay. About LexisNexis LexisNexis (www.lexisnexis.com/risk/)is a leader in providing essential information that helps customers across all industries and government predict, assess and manage risk. Combining cutting-edge technology, unique data and advanced scoring analytics, provides products and services that address evolving client needs in the risk sector while upholding the highest standards of security and privacy. LexisNexis is part of Reed Elsevier, a leading publisher and information provider that serves customers in more than 100 countries with more than 30,000 employees worldwide. About Cisco Cisco is the worldwide IT leader that enables businesses to seize the opportunities of tomorrow by proving amazing things happen when you connect the previously unconnected. Cisco brings together people, process, data and things to transform how organizations meet next-generation demands. For more information: www.cisco.com. Americas Headquarters Cisco Systems, Inc. San Jose, CA Cisco Systems (USA) Pte. Ltd. Singapore Europe Headquarters Cisco Systems International BV Amsterdam, The Netherlands Cisco has more than 200 offices worldwide. Addresses, phone numbers, and fax numbers are listed on the Cisco Website at www.cisco.com/go/offices. Cisco Americas and Headquarters the Cisco logo are trademarks or registered trademarks of Cisco and/or its affiliates in the U.S. and other countries. Europe To Headquarters view a list of Cisco trademarks, go to this URL: www.cisco.com/go/trademarks. Cisco Systems, Inc. Third-party trademarks mentioned Cisco are Systems the property (USA) of Pte. their Ltd. respective owners. The use of the Cisco word Systems partner International does not imply BV a Amsterdam, partnership relationship between San Jose, Cisco CA and any other company. (1110R) Singapore The Netherlands Cisco 2014 has Cisco more and/or than 200 its affiliates. offices worldwide. All rights reserved. Addresses, This phone document numbers, is Cisco and fax Public numbers Information. are listed on the Cisco Website at www.cisco.com/go/offices. Cisco and the Cisco logo are trademarks or registered trademarks of Cisco and/or its affiliates in the U.S. and other countries. To view a list of Cisco trademarks, go to this URL: www.cisco.com/go/trademarks. Third-party trademarks mentioned are the property of their respective owners. The use of the word partner does not imply a partnership relationship between Cisco and any other company. (1110R) 2014 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public Information. LexisNexis and the Knowledge Burst logo are registered trademarks of Reed Elsevier Properties Inc., used under license. Other products and services may be trademarks or registered trademarks of their respective companies. Copyright 2014 LexisNexis. All rights reserved. NXR0XXX-0 0214