[Hadoop, Storm and Couchbase: Faster Big Data]



Similar documents
Top 10 Enterprise NoSQL Use Cases Table of Contents

Big Data Analytics - Accelerated. stream-horizon.com

Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk

Performance and Scalability Overview

Big Data Analytics - Accelerated. stream-horizon.com

BIG DATA FOR MEDIA SIGMA DATA SCIENCE GROUP MARCH 2ND, OSLO

Real-Time Analytics for Big Market Data with XAP In-Memory Computing

Big Data Success Step 1: Get the Technology Right

Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook

How To Handle Big Data With A Data Scientist

INTRODUCTION TO CASSANDRA

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Scalable Architecture on Amazon AWS Cloud

So What s the Big Deal?

Cloudera Enterprise Data Hub in Telecom:

Enterprise Operational SQL on Hadoop Trafodion Overview

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

How To Use Big Data For Telco (For A Telco)

Performance and Scalability Overview

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

<Insert Picture Here> Big Data

Big Data for Investment Research Management

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Modern IT Operations Management. Why a New Approach is Required, and How Boundary Delivers

Putting Apache Kafka to Use!

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges

Big Data Analytics: Today's Gold Rush November 20, 2013

The 4 Pillars of Technosoft s Big Data Practice

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May Santa Clara, CA

GigaSpaces Real-Time Analytics for Big Data

Actian SQL in Hadoop Buyer s Guide

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

Scala Storage Scale-Out Clustered Storage White Paper

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Big Data: A Storage Systems Perspective Muthukumar Murugan Ph.D. HP Storage Division

Moving From Hadoop to Spark

Building your Big Data Architecture on Amazon Web Services

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

Hadoop in the Enterprise

Realtime Apache Hadoop at Facebook. Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Hadoop: Embracing future hardware

Protect Data... in the Cloud

Big Data Performance Growth on the Rise

Software-defined Storage Architecture for Analytics Computing

Apache HBase. Crazy dances on the elephant back

STREAM PROCESSING AT LINKEDIN: APACHE KAFKA & APACHE SAMZA. Processing billions of events every day

redborder IPS redborder Just common sense IPS overview Common sense

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Database Scalability and Oracle 12c

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

The Future of Data Management

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

Upcoming Announcements

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns

Big data management with IBM General Parallel File System

From Spark to Ignition:

FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase

Zynga Analytics Leveraging Big Data to Make Games More Fun and Social

Introducing Oracle Exalytics In-Memory Machine

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

NoSQL Data Base Basics

Traditional v/s CONVRGD

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

I/O Considerations in Big Data Analytics

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Analyzing Big Data with AWS

How To Create A Business Intelligence (Bi)

INTRODUCING RETAIL INTELLIGENCE

Oracle Database 12c Plug In. Switch On. Get SMART.

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

The StrikeIron API Management Solution

Choosing The Right Big Data Tools For The Job A Polyglot Approach

Big Data for Investment Research Management

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Building Scalable Big Data Infrastructure Using Open Source Software. Sam William

Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra

Introduction to Multi-Data Center Operations with Apache Cassandra and DataStax Enterprise

Dell* In-Memory Appliance for Cloudera* Enterprise

Realizing the True Potential of Software-Defined Storage

More Data in Less Time

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

Cisco Solutions for Big Data and Analytics

Cloud Based Application Architectures using Smart Computing

Microsoft Private Cloud Fast Track

Transcription:

[Hadoop, Storm and Couchbase: Faster Big Data] With over 8,500 clients, LivePerson is the global leader in intelligent online customer engagement. With an increasing amount of agent/customer engagements, LivePerson required a next generation architecture designed to meet today s real time big data requirements. After internal benchmarking and testing, the platform engineering team chose to integrate Apache Kafka, Apache Hadoop, Apache Storm, and Couchbase Server to enable Faster Big Data. LivePerson by the Numbers Over 8,500 customers; 8 of the top 10 Fortune 500 1.8 billion visits/month 20 million engagements/month 13 TB of compressed data/month Four data centers worldwide

LivePerson s Technology Challenge With billions of visits and over 20 million engagements made via the LivePerson platform every month, the company needed a scalable backend capable of streaming, processing, and storing big data in real time. LivePerson s legacy platform was somewhat monolithic. The web tier collected visitor information in the form of events and stored them in memory. It was impossible to scale out, and the ability to scale up was limited. With the massive amount of events being generated, LivePerson needed to create a distributed, service- oriented platform. This platform would rely on multiple deployments in a multi- master topology allowing events to be processed as messages in multiple data centers.

All of the messages had to be available to web agents, but the messages were being stored and processed in multiple data centers. Looking for a solution, LivePerson looked to NoSQL. LivePerson s Database Requirements The LivePerson team evaluated three NoSQL databases: Cassandra, MongoDB, and Couchbase Server. After internal testing and benchmarking, LivePerson found that Couchbase Server was the only NoSQL database that met all of its requirements: Always On: With Couchbase Server, there was no downtime; not even during SW/HW upgrades, backups, etc. Linear Scale: With Couchbase Server, it was easy for LivePerson to add additional nodes without interrupting production Searchable Document Database and Key / Value Store

High Throughput: LivePerson s application required very high read/write throughput with most writes being updates, a scenario which Couchbase Server supports XDCR: With an active- active architecture, Couchbase Server s cross data center replication was an important component of LivePerson s application success Moving to Couchbase Server and a full data stack In order to stream, process, and store messages at this scale, the team had to design a real time big data architecture based on Apache Kafka, Apache Hadoop, Apache Storm, and Couchbase Server. The monitoring and communication services respond to customer events by publishing messages to Apache Kafka. The messages are consumed by offline and real time systems for processing and analysis. Batch Track Apache Hadoop stores historical data with both HP Vertica and Couchbase Server deployed to analyze it. For example, LivePerson uses map / reduce tasks within Couchbase Server to index and aggregate data. In addition, MicroStrategy is used for business intelligence (BI).

Real Time Track Apache Storm processes streams of events by consuming the messages published to Apache Kafka with data now being persisted to Couchbase Server instead of an Apache Cassandra distribution. This enables web agents to access the data via a dashboard. The End Result For LivePerson, Couchbase Server is a strategic choice. In addition to their current usage of the technology, LivePerson plans to leverage Couchbase Server in future applications: Visitor Session State Cross Session State Server Side Cookies Fast Read/Write Persistent Store Caching See the LivePerson video presentation at http://www.youtube.com/watch?v=tzdxu2nelkg&feature=youtu.be About LivePerson At LivePerson, our mission is to Create Meaningful Connections. Connection at the workplace inspires us to serve our clients exceptionally, to work together efficiently, and to get personally involved in our local communities. Our emphasis on connection leads to more opportunity whether it s increased value for our customers, product innovation, or community growth. With over 8,500 clients, LivePerson is the global leader in intelligent online customer engagement. Our solutions empower clients to engage customers with the most appropriate, most cost- effective type of interaction such as chat, voice, targeted content, and video delivering rich, personalized, cross- channel experiences. We deliver the right experience at the right moment, on mobile, website or tablet, based on customer and business intelligence. From the largest global enterprises to small businesses, our customers trust us to deliver increased conversions, higher order values, and lower support costs.