Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra
A Quick Reference Configuration Guide

Kris Applegate, kris_applegate@dell.com, Solution Architect, Dell Solution Centers
Dave Jaffe, dave_jaffe@dell.com, Solution Architect, Dell Solution Centers
Armando Acosta, armando_acosta@dell.com, Big Data Product Manager, Dell Revolutionary Cloud and Big Data Group
Rob Wilbert, robert_wilbert@dell.com, Solution Architect, Dell Solution Centers
Executive Summary

This document details the configuration setup for DataStax Enterprise (DSE) software on Dell PowerEdge R-Series servers. The intended audience is customers and solution architects looking for information on configuring DSE clusters within their information technology environment for always-on transaction processing. The reference configuration introduces the server setups that can run the DataStax Enterprise stack. The document focuses only on configuration; it does not go into detail about DSE or Apache Cassandra software components, or about resiliency, performance, or software considerations. It also does not cover best practices or a complete architecture for a DSE solution. Additional DataStax Enterprise installation, administration, and optimization guides are available on the websites referenced below. Dell developed this document to help streamline configuration for the DataStax Enterprise software.

THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR IMPLIED WARRANTIES OF ANY KIND.

Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden. For more information, contact Dell. Dell, the DELL logo, and the DELL badge are trademarks of Dell Inc. Intel and Xeon are registered trademarks of Intel Corp. Red Hat is a registered trademark of Red Hat Inc. Linux is a registered trademark of Linus Torvalds. Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and trade names other than its own.
Introduction

In the age of Big Data, applications operate on a global scale, and they must meet the always-on demands of their developers and their users. DataStax Enterprise is uniquely suited to address the database demands of continuously available, globally distributed online applications.

Over the last two to three years, customers have used Hadoop as a tool to help analyze large volumes of structured, semi-structured, and unstructured data. Hadoop is a valuable tool, yet as customers' use cases evolve, new tools are emerging that continue to add value to the Big Data ecosystem. NoSQL database technologies are a prime example of a new tool being integrated with Hadoop to allow low-latency read/write access to data. Apache Cassandra is one such NoSQL database. By rewriting the laws of database architecture, Cassandra provides a single database distributed geographically over multiple data centers, providing unprecedented levels of reliability. Cassandra's efficient architecture for capturing data at extremely high ingest rates is valuable for Internet-of-Things applications that capture large quantities of time-series data, which is then analyzed to provide value to the community of users. DataStax Enterprise enhances the capabilities of Apache Cassandra by providing management services that facilitate cluster operations and maintenance.

DataStax Enterprise and Hadoop are very complementary. There are a number of use cases where a NoSQL database such as DataStax Enterprise serves as the real-time, always-available read/write database while Hadoop serves as the back-end engine that helps users analyze large volumes of structured, semi-structured, and unstructured data in more of a batch fashion. Within this integrated data hub, customers can run algorithms on integrated, disparate data from relational database management systems, enterprise data warehouses, and other sources.
Additionally, a data science workbench may be layered on top to provide analytics tools that transform the results into actionable information using search, data visualization, and reporting/analysis. These new environments are applicable across multiple vertical markets, including Government Intelligence, Healthcare, Financials, Manufacturing, Telco/Media, Retail, Web 2.0, and more.

To help support this customer use case, Dell is partnering with DataStax to execute a reference configuration in the Dell Solution Center. DataStax Enterprise is a NoSQL big data platform powered by production-certified Apache Cassandra. It is architected for today's line-of-business applications and designed to securely manage real-time, analytic, and search data, all in the same database cluster. DataStax Enterprise uses a peer-to-peer distributed architecture in which all nodes inside a cluster are the same. Data is automatically partitioned and distributed among all the nodes. Often, two or more data center locations are used and nodes are distributed among the physical locations. OpsCenter is a global management and monitoring tool that administers Cassandra and DSE clusters.
Reference Configuration

Apache Cassandra is an open source, massively scalable NoSQL (non-relational) database. DataStax is a Dell partner that, in addition to contributing to the Apache Cassandra project, offers a commercialized version in both a community and an enterprise edition. DataStax Enterprise is available for multiple distributions of Linux. This initial configuration targets deployment on bare-metal servers running DataStax Enterprise on Red Hat Enterprise Linux 6.4.

DataStax Enterprise can be used to rapidly ingest transactional data in support of a variety of emerging workloads. These workloads share a common need for a continuously available, distributed, read/write capable database that has no single point of failure.

Use Cases for NoSQL Online Data Ingestion:
- Time-series data
- Device/Sensor/Data exhaust systems
- Distributed applications
- Media streaming
- Online Web retail (transactional, shopping carts, etc.)
- Online gaming
- Recommendation engines
- Real-time data analytics
- Social media capture and analysis
- Web click-stream analysis
- Write-intensive transactional systems

The Cassandra ring topology allows multiple nodes to service both read and write requests with a tunable consistency mechanism (both the number of replicas and the point at which a write is acknowledged).
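The trade-off behind tunable consistency can be illustrated with simple replica arithmetic. The sketch below is not taken from the tested configuration; the function names are hypothetical and are not part of any Cassandra or DataStax API. It relies on the general rule that a read is guaranteed to overlap the most recent acknowledged write when the number of replicas read plus the number of replicas that acknowledged the write exceeds the replication factor.

```python
# Illustrative only: replica-count arithmetic behind tunable consistency.
# Function names are hypothetical and not part of any Cassandra API.


def quorum(replication_factor: int) -> int:
    """Smallest number of replicas that forms a majority."""
    return replication_factor // 2 + 1


def read_overlaps_write(replication_factor: int, write_acks: int, read_acks: int) -> bool:
    """True when every read set must share a replica with every write set."""
    return read_acks + write_acks > replication_factor


if __name__ == "__main__":
    rf = 3  # assumed replication factor for the example
    q = quorum(rf)
    print(f"replication factor {rf}: quorum is {q}")
    print("quorum write + quorum read overlap:", read_overlaps_write(rf, q, q))
    print("single-ack write + single-replica read overlap:", read_overlaps_write(rf, 1, 1))
```

Acknowledging after fewer replicas lowers write latency at the cost of weaker read guarantees, which is the choice the tunable mechanism leaves to each application.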
Figure 1. Logical Diagram of Cassandra Ring (data nodes arranged in a ring, each write replicated n times; application servers read from and write to the ring)

Server Roles

Cassandra Data Node(s)

The data nodes perform the principal functions in a Cassandra cluster (a cluster contains multiple nodes). In order to provide rapid response times during data ingestion, these nodes are configured to allow for rapid input/output (IO) to disk. As IO arrives, the following process commences:

1. Incoming data is assigned to a data node, using a data key determined by hashing the incoming data. Each data node owns a specific hash range, and the incoming data is assigned to the data node that owns the hash range the data key falls into
2. IO is written to a disk-based commit log on the assigned node
3. IO is also simultaneously written to a table in memory
4. Steps 1-3 are repeated on one or more additional data nodes in order to meet replication/durability requirements, if any
5. IO is acknowledged back to the requestor
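The five steps above can be sketched as a small in-memory model. This is a simplified illustration under stated assumptions, not how Cassandra is implemented: the class and function names are hypothetical, MD5 stands in for whichever partitioner a real cluster uses, the hash ranges are split evenly, and replicas are simply the next nodes around the ring.

```python
import bisect
import hashlib

# Illustrative model only. All names are hypothetical; MD5 and the evenly
# split hash ranges are assumptions made to keep the example short.

RING_SIZE = 2 ** 32


def token_for(key: str) -> int:
    """Step 1: hash the data key onto the ring."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


class DataNode:
    def __init__(self, name: str) -> None:
        self.name = name
        self.commit_log = []  # stands in for the disk-based commit log
        self.memtable = {}    # in-memory table

    def write(self, key: str, value: str) -> None:
        self.commit_log.append((key, value))  # step 2: append to commit log
        self.memtable[key] = value            # step 3: update in-memory table


class Ring:
    def __init__(self, node_names, replication_factor: int) -> None:
        if replication_factor > len(node_names):
            raise ValueError("replication factor cannot exceed node count")
        self.nodes = [DataNode(name) for name in node_names]
        self.rf = replication_factor
        step = RING_SIZE // len(self.nodes)
        # Node i owns every token up to and including tokens[i].
        self.tokens = [(i + 1) * step - 1 for i in range(len(self.nodes))]

    def replicas_for(self, key: str):
        """Owner of the key's hash range plus the next rf - 1 nodes."""
        first = bisect.bisect_left(self.tokens, token_for(key)) % len(self.nodes)
        return [self.nodes[(first + i) % len(self.nodes)] for i in range(self.rf)]

    def write(self, key: str, value: str):
        replicas = self.replicas_for(key)
        for node in replicas:      # step 4: repeat on additional data nodes
            node.write(key, value)
        return [node.name for node in replicas]  # step 5: acknowledge


if __name__ == "__main__":
    demo_ring = Ring(["node1", "node2", "node3", "node4", "node5"], replication_factor=3)
    print("acknowledged by:", demo_ring.write("sensor-42", "21.5"))
```

The model acknowledges only after every replica has written; a real cluster can acknowledge earlier depending on the consistency setting chosen for the write.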
This process allows the cluster to maintain a tunable number of replicas across nodes, racks, and datacenters. Since the IO isn't acknowledged until it is written to a disk-based commit log, the commit log should reside on high-performance storage, such as solid-state drives (SSDs). SSDs are also common for read-heavy workloads, since reads can involve many random IOs. Performance may be increased by adding data nodes to the cluster/ring, since Cassandra is linearly scalable.

Application Server(s)

Application servers reside on the outer edge of the cluster/ring. They are the interface between the Cassandra ring and the outside world. Data may be streamed from an application server programmatically (via APIs for all the popular languages) or through Cassandra's built-in query language (CQL).

DataStax OpsCenter Node

The DataStax OpsCenter node runs the management interface. In a production environment, the OpsCenter server may need to run on a dedicated physical node; however, for the testing described in this document, OpsCenter was installed on a virtual machine (VM).

Figure 2. DataStax OpsCenter Interface

Node Count Recommendations

Dell recognizes that use cases for Cassandra range from early-stage development and testing clusters through large multi-datacenter installations. Dell and DataStax have services that can help appropriately size a cluster based on customer budget, performance, security, and data consistency requirements. All node-count recommendations are for the data nodes only. DataStax OpsCenter, application servers, and additional infrastructure services may be needed to complete the environment.
As a starting point, three cluster configurations can be defined for typical use:

DataStax Recommended Starter Cluster
The low-tier configuration is targeted at basic usage for online database applications and, in some cases, may even be built from existing equipment; however, the performance of these types of clusters can be significantly increased if SSDs are added. For this configuration, only a single processor is defined. If more services (such as DataStax Search) are added, performance may suffer.

DataStax Recommended Standard Cluster
This configuration is a good starting point for clusters that have the potential to scale. It includes dual processors to improve performance when using DataStax's search capabilities.

DataStax Recommended Professional Cluster
This configuration represents the top tier of hardware recommended to run Cassandra. Adding performance to individual nodes (e.g. four processors, additional memory, etc.) will result in diminishing benefit. Rather, adding nodes yields a greater return on investment when scaling the cluster.

Table 1. Recommended Cluster Sizes

Component | DataStax Recommended Starter Cluster [3] | DataStax Recommended Standard Cluster [3] | DataStax Recommended Professional Cluster [3] | Dell Tested Configuration
Server Model [1] | (5) PowerEdge R320 | (5) PowerEdge R420 | (5) PowerEdge R620 | (5) PowerEdge R720
Processor(s) | Single Intel Xeon E v2 | Dual Intel Xeon E v2 | Dual Intel Xeon E v2 | Dual Intel Xeon E
RAM | 64 GB | 128 GB | 256 GB | 128 GB
Storage [2] | (4) 1 TB SATA Drives Read Intensive Application | (6) Intel 3700 Series SSD 400GB 3Gbps | (6) Intel 3700 Series SSD 400GB 6Gbps | (6) Intel 3700 Series SSD 800GB 6Gbps
Network Cards | (2) Intel X520 DP 10GbE DA/SFP+ | (2) Intel X520 DP 10GbE DA/SFP+ | (2) Intel X520 DP 10GbE DA/SFP+ | (2) Intel X520 DP 10GbE DA/SFP+
Data Switches | (2) Dell Networking 8164F 10GbE SFP+ | (2) Dell Force 10 S GbE SFP+ | (2) Dell Force 10 S GbE SFP+ | (2) Dell Force 10 S GbE SFP+
Management Switches | (2) Dell Networking 6248 | (2) Dell Networking 6248 | (2) Dell Networking 6248 | (2) Dell Force 10 S60 1GbE
Rack Units | 9U | 9U | 9U | 14U
DataStax Edition | DataStax Enterprise Standard | DataStax Enterprise Pro | DataStax Enterprise Max | DataStax Enterprise Standard

[1] Any Dell server that is capable of running the supported operating systems should work. These specific models were selected for their targeted price brackets.
[2] Only SSDs should be considered for any high-ingestion use cases.
[3] The recommended hardware is for data nodes only. DataStax OpsCenter, application servers, and additional infrastructure services may be needed to complete the environment.
Figure 3. Physical networking diagram

Tested Configuration

For the purposes of this document, a small DataStax cluster was deployed as shown in Table 1. The specific software revisions used in the test are shown in Table 2. The hardware listed should be used as initial guidance only. Additional configurations are possible and will likely be required, as each customer's environment and use case is unique. Customers should consult with DataStax Professional Services to arrive at an optimal design customized to their use case. Common parameters that could differ include:

1. Node Count: Adding nodes is the best way to scale capacity and performance for a Cassandra cluster. In most cases, the benefits of adding nodes outweigh those of efforts to increase disk size and memory amounts
2. Disks: SSD technology is critical for maintaining the performance necessary to ingest data at a high rate. Keeping both the initial commit log and the sorted string table (SST) disk space on SSDs is strongly recommended
3. Memory: Memory should be sized relative to the use case. The cluster will benefit from additional memory when using DataStax Solr Search or other memory-intensive features
4. Processors: Data ingestion is not particularly CPU intensive in and of itself. However, additional processing power is required as additional capability is added (e.g. Solr Search, etc.) or as the workload on a DataStax cluster increases
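The node-count guidance above can be made concrete with a back-of-the-envelope capacity estimate. The sketch below is only an illustration: the function name is hypothetical, the node and drive counts are placeholder values, and the replication factor of 3 and the 50% free-space headroom are assumptions rather than figures from this reference configuration. It also treats every drive as data storage, which a real layout with separate operating system or commit log devices would not.

```python
# Back-of-the-envelope estimate. The function name, the replication factor,
# and the headroom fraction are assumptions chosen for illustration only.


def unique_data_capacity_gb(
    nodes: int,
    drives_per_node: int,
    drive_gb: float,
    replication_factor: int,
    headroom: float = 0.5,
) -> float:
    """Approximate unique (pre-replication) data the cluster can hold."""
    raw_gb = nodes * drives_per_node * drive_gb
    # Reserve a fraction of raw space as free headroom, then divide by the
    # number of copies kept of each piece of data.
    return raw_gb * (1.0 - headroom) / replication_factor


if __name__ == "__main__":
    for node_count in (5, 6, 10):  # placeholder cluster sizes
        capacity = unique_data_capacity_gb(
            nodes=node_count, drives_per_node=6, drive_gb=800, replication_factor=3
        )
        print(f"{node_count} nodes: about {capacity:.0f} GB of unique data")
```

Because capacity grows in proportion to the node count in this model, each added node contributes the same increment, which is consistent with the recommendation to scale out rather than up. Actual sizing should follow the DataStax Professional Services guidance mentioned above.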
Table 2. Software Revisions (As Tested)

Component | Revision
Red Hat Enterprise Linux | 6.4
DataStax Enterprise |
Cassandra Version |

Integration with Other Solutions

For customers interested in using DataStax Cassandra to complement other Big Data solutions, DataStax Cassandra can act as a low-latency point of ingestion for data that can later be fed to other tools, including data warehouses and Dell's Apache Hadoop solutions for running deep and heavy analytics. Displaying data directly from Cassandra is also possible via Dell's data visualization tools, such as the Dell Kitenga Analytics Suite and the Dell Quest TOAD BI Suite.

Figure 4. Physical networking diagram
Dell Solution Centers

The Dell Solution Centers are a global network of connected labs that allow Dell to help customers architect, validate, and build solutions. With multiple footprints in every region, they help customers understand anything from simple hardware platforms to more complex solutions. These engagements range from an informal minute briefing, through a longer half-day workshop, and on to a proof-of-concept that allows customers to kick the tires of their solution prior to signing on the dotted line. Customers may engage with their account team and have them submit a request to take advantage of these free services.

Links
- DataStax Enterprise Cassandra
- Planet Cassandra Community
- Apache Cassandra Open Source Project
White Paper Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III Performance of Microsoft SQL Server 2008 BI and D/W Solutions on Dell PowerEdge
More informationRED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES
RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS Server virtualization offers tremendous benefits for enterprise IT organizations server
More informationIntel RAID SSD Cache Controller RCS25ZB040
SOLUTION Brief Intel RAID SSD Cache Controller RCS25ZB040 When Faster Matters Cost-Effective Intelligent RAID with Embedded High Performance Flash Intel RAID SSD Cache Controller RCS25ZB040 When Faster
More informationSo What s the Big Deal?
So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data
More informationAccelerating Database Applications on Linux Servers
White Paper Accelerating Database Applications on Linux Servers Introducing OCZ s LXL Software - Delivering a Data-Path Optimized Solution for Flash Acceleration Allon Cohen, PhD Yaron Klein Eli Ben Namer
More informationBuilt for Business. Ready for the Future.
Built for Business. Ready for the Future. Addressing End User and IT Needs Introducing 4 th Generation Intel Core Products Addressing Datacenter Needs Introducing Intel in Dell PowerEdge VRTX Usage Model
More informationIntel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms
Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms Ubuntu* Enterprise Cloud Executive Summary Intel Cloud Builder Guide Intel Xeon Processor Ubuntu* Enteprise Cloud Canonical*
More informationMakeMyTrip CUSTOMER SUCCESS STORY
MakeMyTrip CUSTOMER SUCCESS STORY MakeMyTrip is the leading travel site in India that is running two ClustrixDB clusters as multi-master in two regions. It removed single point of failure. MakeMyTrip frequently
More informationCisco for SAP HANA Scale-Out Solution on Cisco UCS with NetApp Storage
Cisco for SAP HANA Scale-Out Solution Solution Brief December 2014 With Intelligent Intel Xeon Processors Highlights Scale SAP HANA on Demand Scale-out capabilities, combined with high-performance NetApp
More informationAccelerating Cassandra Workloads using SanDisk Solid State Drives
WHITE PAPER Accelerating Cassandra Workloads using SanDisk Solid State Drives February 2015 951 SanDisk Drive, Milpitas, CA 95035 2015 SanDIsk Corporation. All rights reserved www.sandisk.com Table of
More informationMicrosoft Exchange 2010 on Dell Systems. Simple Distributed Configurations
Microsoft Exchange 2010 on Dell Systems Simple Distributed Configurations Global Solutions Engineering Dell Product Group Microsoft Exchange 2010 on Dell Systems Simple Distributed Configurations This
More informationDell s SAP HANA Appliance
Dell s SAP HANA Appliance SAP HANA is the next generation of SAP in-memory computing technology. Dell and SAP have partnered to deliver an SAP HANA appliance that provides multipurpose, data source-agnostic,
More informationImproving IT Operational Efficiency with a VMware vsphere Private Cloud on Lenovo Servers and Lenovo Storage SAN S3200
Improving IT Operational Efficiency with a VMware vsphere Private Cloud on Lenovo Servers and Lenovo Storage SAN S3200 Most organizations routinely utilize a server virtualization infrastructure to benefit
More informationDell Microsoft SQL Server 2008 Fast Track Data Warehouse Performance Characterization
Dell Microsoft SQL Server 2008 Fast Track Data Warehouse Performance Characterization A Dell Technical White Paper Database Solutions Engineering Dell Product Group Anthony Fernandez Jisha J Executive
More informationTHE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES
THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES Introduction Amazon Web Services (AWS), which was officially launched in 2006, offers you varying cloud services that are not only cost effective, but also
More informationRed Hat Enterprise Linux is open, scalable, and flexible
CHOOSING AN ENTERPRISE PLATFORM FOR BIG DATA Red Hat Enterprise Linux is open, scalable, and flexible TECHNOLOGY OVERVIEW 10 things your operating system should deliver for big data 1) Open source project
More informationCost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
More informationDELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering
DELL Virtual Desktop Infrastructure Study END-TO-END COMPUTING Dell Enterprise Solutions Engineering 1 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL
More informationCloudera Enterprise Reference Architecture for Google Cloud Platform Deployments
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2016 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and
More informationWHAT IS ENTERPRISE OPEN SOURCE?
WHITEPAPER WHAT IS ENTERPRISE OPEN SOURCE? ENSURING YOUR IT INFRASTRUCTURE CAN SUPPPORT YOUR BUSINESS BY DEB WOODS, INGRES CORPORATION TABLE OF CONTENTS: 3 Introduction 4 Developing a Plan 4 High Availability
More informationReference Architecture for Dell VIS Self-Service Creator and VMware vsphere 4
Reference Architecture for Dell VIS Self-Service Creator and VMware vsphere 4 Solutions for Large Environments Virtualization Solutions Engineering Ryan Weldon and Tom Harrington THIS WHITE PAPER IS FOR
More informationUsing an In-Memory Data Grid for Near Real-Time Data Analysis
SCALEOUT SOFTWARE Using an In-Memory Data Grid for Near Real-Time Data Analysis by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 IN today s competitive world, businesses
More informationThe Open Cloud Near-Term Infrastructure Trends in Cloud Computing
The Open Cloud Near-Term Infrastructure Trends in Cloud Computing Markus Leberecht BELNET Networking Conference 25-Oct-2012 1 Growth & IT Challenges Drive Need for Cloud Computing IT Pros Growth IT Challenges
More informationUsing Red Hat Network Satellite Server to Manage Dell PowerEdge Servers
Using Red Hat Network Satellite Server to Manage Dell PowerEdge Servers Enterprise Product Group (EPG) Dell White Paper By Todd Muirhead and Peter Lillian July 2004 Contents Executive Summary... 3 Introduction...
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationHadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics
In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning
More informationWhite Paper. Recording Server Virtualization
White Paper Recording Server Virtualization Prepared by: Mike Sherwood, Senior Solutions Engineer Milestone Systems 23 March 2011 Table of Contents Introduction... 3 Target audience and white paper purpose...
More informationDell PowerEdge Blades Outperform Cisco UCS in East-West Network Performance
Dell PowerEdge Blades Outperform Cisco UCS in East-West Network Performance This white paper compares the performance of blade-to-blade network traffic between two enterprise blade solutions: the Dell
More informationLenovo ThinkServer and Cloudera Solution for Apache Hadoop
Lenovo ThinkServer and Cloudera Solution for Apache Hadoop For next-generation Lenovo ThinkServer systems Lenovo Enterprise Product Group Version 1.0 December 2014 2014 Lenovo. All rights reserved. LENOVO
More informationMarvell DragonFly. TPC-C OLTP Database Benchmark: 20x Higher-performance using Marvell DragonFly NVCACHE with SanDisk X110 SSD 256GB
PARTNER PERFORMANCE BENCHMARK PAPER / SANDISK Marvell DragonFly With SanDisk X110 SSD 256GB TPC-C Benchmark Test Results TPC-C OLTP Database Benchmark: 20x Higher-performance using Marvell DragonFly NVCACHE
More informationThe Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success
The Modern Online Application for the Internet Economy: 5 Key Requirements that Ensure Success 1 Table of Contents Abstract... 3 Introduction... 3 Requirement #1 Smarter Customer Interactions... 4 Requirement
More informationSolid State Drive vs. Hard Disk Drive Price and Performance Study
Solid State Drive vs. Hard Disk Drive Price and Performance Study A Dell Technical White Paper Dell PowerVault Storage Systems Vamsee Kasavajhala Dell PowerVault Technical Marketing THIS WHITE PAPER IS
More informationRED HAT AND HORTONWORKS: OPEN MODERN DATA ARCHITECTURE FOR THE ENTERPRISE
WHITEPAPER RED HAT AND HORTONWORKS: OPEN MODERN DATA ARCHITECTURE FOR THE ENTERPRISE A Hortonworks and Red Hat whitepaper INTRODUCTION WHAT IS HADOOP? Apache Hadoop is an opensource technology born out
More informationAccelerating Big Data: Using SanDisk SSDs for MongoDB Workloads
WHITE PAPER Accelerating Big Data: Using SanDisk s for MongoDB Workloads December 214 951 SanDisk Drive, Milpitas, CA 9535 214 SanDIsk Corporation. All rights reserved www.sandisk.com Accelerating Big
More informationCassandra in Action ApacheCon NA 2013
Cassandra in Action ApacheCon NA 2013 Yuki Morishita Software Developer@DataStax / Apache Cassandra Committer 1 2 ebay Application/Use Case Social Signals: like/want/own features for ebay product and item