Benchmarking the Availability and Fault Tolerance of Cassandra

Similar documents

LARGE-SCALE DATA STORAGE APPLICATIONS

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB

Yahoo! Cloud Serving Benchmark

Accelerating Cassandra Workloads using SanDisk Solid State Drives

Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk

Comparing NoSQL Solutions In a Real-World Scenario: Aerospike, Cassandra Open Source, Cassandra DataStax, Couchbase and Redis Labs

PostgreSQL Performance Characteristics on Joyent and Amazon EC2

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Self-Adapting Load Balancing for DNS

STeP-IN SUMMIT June 2014 at Bangalore, Hyderabad, Pune - INDIA. Performance testing Hadoop based big data analytics solutions

Performance Testing of a Cloud Service

JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra

Benchmarking Cassandra on Violin

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Highly available, scalable and secure data with Cassandra and DataStax Enterprise. GOTO Berlin 27 th February 2014

C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection

Benchmarking Failover Characteristics of Large-Scale Data Storage Applications: Cassandra and Voldemort

Shared Parallel File System

Advances in Flash Memory Technology & System Architecture to Achieve Savings in Data Center Power and TCO

How To Store Data On An Ocora Nosql Database On A Flash Memory Device On A Microsoft Flash Memory 2 (Iomemory)

NoSQL Failover Characteristics: Aerospike, Cassandra, Couchbase, MongoDB

Oracle Database Scalability in VMware ESX VMware ESX 3.5

DataStax Enterprise Reference Architecture

Configuration Manual Yahoo Cloud System Benchmark (YCSB) 24-Mar-14 SEECS-NUST Faria Mehak

MASTER PROJECT. Resource Provisioning for NoSQL Datastores

Certification Document macle GmbH Grafenthal-S1212M 24/02/2015. macle GmbH Grafenthal-S1212M Storage system

EWeb: Highly Scalable Client Transparent Fault Tolerant System for Cloud based Web Applications

Can the Elephants Handle the NoSQL Onslaught?

How To Run Apa Hadoop 1.0 On Vsphere Tmt On A Hyperconverged Network On A Virtualized Cluster On A Vspplace Tmter (Vmware) Vspheon Tm (

Introduction to Apache Cassandra

IT Business Management System Requirements Guide

Benchmarking Top NoSQL Databases Apache Cassandra, Couchbase, HBase, and MongoDB Originally Published: April 13, 2015 Revised: May 27, 2015

Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra

Performance Evaluation of NoSQL Systems Using YCSB in a resource Austere Environment

TheImpactofWeightsonthe Performance of Server Load Balancing(SLB) Systems

Scalable Cloud Computing Solutions for Next Generation Sequencing Data

Accelerating and Simplifying Apache

Evaluating Apache Cassandra as a Cloud Database WHITE PAPER

Apache Cassandra 1.2

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Configuring Apache Derby for Performance and Durability Olav Sandstå

EXPERIMENTATION. HARRISON CARRANZA School of Computer Science and Mathematics

Use of Hadoop File System for Nuclear Physics Analyses in STAR

Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS)

SERVER CLUSTERING TECHNOLOGY & CONCEPT

Testing Automation for Distributed Applications By Isabel Drost-Fromm, Software Engineer, Elastic

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

HDB++: HIGH AVAILABILITY WITH. l TANGO Meeting l 20 May 2015 l Reynald Bourtembourg

Implementation & Capacity Planning Specification

Performance Modeling and Analysis of a Database Server with Write-Heavy Workload

An Oracle White Paper August Oracle VM 3: Server Pool Deployment Planning Considerations for Scalability and Availability

SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V

Evaluating HDFS I/O Performance on Virtualized Systems

BUILDING HIGH-AVAILABILITY SERVICES IN JAVA

HP reference configuration for entry-level SAS Grid Manager solutions

Getting Started with SandStorm NoSQL Benchmark

Accelerating Big Data: Using SanDisk SSDs for MongoDB Workloads

PARALLELS CLOUD SERVER

Big Data: Beyond the Hype

Introduction to Multi-Data Center Operations with Apache Cassandra and DataStax Enterprise

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

Variations in Performance and Scalability when Migrating n-tier Applications to Different Clouds

A Middleware Strategy to Survive Compute Peak Loads in Cloud

Praktijkexamen met Project VRC. Virtual Reality Check

Mission-critical HP-UX 11i v2 WebSphere Reference Architecture White Paper

Stingray Traffic Manager Sizing Guide

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

Accelerating Big Data: Using SanDisk SSDs for Apache HBase Workloads

Big Data: Beyond the Hype

So What s the Big Deal?

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Benchmarking and Analysis of NoSQL Technologies

DataStax Enterprise Reference Architecture. White Paper

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU

Performance Tuning and Optimizing SQL Databases 2016

Certification Document macle GmbH GRAFENTHAL R2208 S2 01/04/2016. macle GmbH GRAFENTHAL R2208 S2 Storage system

Cloud Storage Solution for WSN Based on Internet Innovation Union

Performance Management for Cloudbased STC 2012

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

CS 6343: CLOUD COMPUTING Term Project

Big Data: Beyond the Hype. Why Big Data Matters to You. White Paper

GraySort on Apache Spark by Databricks

AppDynamics Lite Performance Benchmark. For KonaKart E-commerce Server (Tomcat/JSP/Struts)

VP/GM, Data Center Processing Group. Copyright 2014 Cavium Inc.

DataStax Enterprise, powered by Apache Cassandra (TM)

Case study: CASSANDRA

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

Using Synology SSD Technology to Enhance System Performance Synology Inc.

bla bla OPEN-XCHANGE Open-Xchange Hardware Needs

Evaluating Apache Cassandra as a Cloud Database WHITE PAPER

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Building a Scalable News Feed Web Service in Clojure

Maximum performance, minimal risk for data warehousing

Transcription:

Benchmarking the Availability and Fault Tolerance of Cassandra Marten Rosselli, Raik Niemann, Todor Ivanov, Karsten Tolle, Roberto V. Zicari Goethe-University Frankfurt, Germany Frankfurt Big Data Lab www.bigdata.uni-frankfurt.de 6 th Workshop on Big Data Benchmarking 2015 June 16 th 17 th, Toronto, Canada

Contact Marten Rosselli Goethe-University Frankfurt, Germany Frankfurt Big Data Lab rosselli@dbis.cs.uni-frankfurt.de www.bigdata.uni-frankfurt.de Accenture, Germany Accenture Digital marten.rosselli@accenture.com www.accenture.com - 2 -

Agenda Motivation Approach Background Cassandra and YCSB Setup and Configuration Experimental Results Conclusions and Outlook - 3 -

Motivation With each additional machine in a cluster the likelihood for a hardware failure increases. How is a Cassandra cluster impacted by a machine failure? Cluster remains stable? Impact on the response time for single user? Impact on system throughput? Influence of different operation types on the performance? Cassandra a good choice for high availability use cases? - 4 -

Approach Prepare Cassandra/OS: Drop page cache, remove old datasets, (re-)load dataset, rebalance cluster YCSB 1 -Workload Execution (Ramp-Up time): 100 seconds YCSB-Workload Execution (Pre-Failure): 80 seconds Shutdown of a Cassandra node (after 3 minutes) Repeated 3 times for each workload (throughput and latency monitored, average values taken) YCSB-Workload Execution (Post-Failure): 120 seconds 1 Yahoo! Cloud Serving Benchmark - 5 -

Cassandra (version 2.0.8.39 used) Peer-to-peer ring architecture Wide-column NoSQL store SQL-like interface (CQL v3 used) Source: DataStax OpsCenter (Screenshot) - 6 -

Yahoo! Cloud Serving Benchmark (YCSB) Cooper et al., Benchmarking Cloud Serving Systems with YCSB, SOCC 10-7 -

Setup Component 7 Cassandra Nodes YCSB Client Data Center Fujitsu BX620S3 Blade Center CPU AMD Operton 870 (2.0 GHz) AMD Opteron 890 (2.8 GHz) Main Memory 16 GByte DDR-2 reg. 32 GByte DDR-R reg. Hard Drives 2x 146 GByte (RAID-0) 2x 300 GByte (RAID-0) NIC Operating System Cassandra Broadcom NetXtreme BCM5704S, 1 GBit/s transfer speed Ubuntu Server 12.04 64bit DataStax Enterprise Server v4.5.1 (Apache Cassandra v2.0.8.39) YCSB v0.1.4 with a CQL-based Cassandra binding 1 JRE Oracle Java Runtime Environment v1.7.60 1 YCSB Cassandra binding based on CQL: https://github.com/jbellis/ycsb - 8 -

Setup (cont.) Seven DataStax Cassandra nodes Setup using the DataStax installation routines Source: DataStax OpsCenter (Screenshot) - 9 -

Cassandra Configuration Parameters to tune the consistency level: Replication Factor = 3 Write Consistency = 1 Read Consistency = 1-10 -

Dataset used 600 million data record (1 KB record size) 200 million records replication factor 3 Single record consists of ten fields plus PK Data uniformly distributed Replication Strategy: Replicas placed clockwise ( SimpleStrategy ) - 11 -

Data Loading - 12 -

Workloads used Read Workload: 10 million read operations (benchmark configured to read all record fields) Update Workload: 10 million update operations (an update replaces a single field on an existing record) Mixed Workload: 5 million read and 5 million update operations. Defined by Cooper et al. 1 to simulate a session store recording recent user actions. 1 Cooper et al., Benchmarking Cloud Serving Systems with YCSB, SOCC 10-13 -

Throughput Read Workload Throughput -7.5 % - 14 -

Latency Read Workload Latency +8.6 % - 15 -

Throughput Update Workload Throughput -10.2 % (for reads it was -7.5 %) - 16 -

Latency Update Workload Latency +11.9 % (for reads it was +8.6 %) - 17 -

Throughput Mixed Workload Throughput -8.7 % (close to the average based on the results of the read/update workloads) - 18 -

Latency Mixed Workload Read Latency +3.6 % (read-only was +8.6 %) Update Latency +27.1 % (update-only was 11.9 %) - 19 -

Conclusions The cluster remained stable during the node failure. Updates were more negatively affected (throughput and latency) by a node outage compared to reads. For the mixed workload, the throughput decrease was close to the average based on the results of the read/update workloads. Updates were slowed down by concurrent reads. Read operations had a negative impact on concurrent update operations in terms of latency. - 20 -

Recommendations and Outlook Recommendations (for comparable setups): Cassandra is well suited for applications that require high availability and stable read response times. For applications with mixed workloads, the latency of the updates drop significantly after failure. Reducing the number of concurrent reads helps to avoid this behaviour. Outlook: Simulation of multiple node failures Evaluation of the impact of the cluster size Inclusion of additional use case oriented workloads Benchmarking of additional database systems - 21 -