Ground up Introduction to In-Memory Data (Grids)



Similar documents
In Memory Accelerator for MongoDB

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Hadoop: Embracing future hardware

BigMemory & Hybris : Working together to improve the e-commerce customer experience

JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers

Scaling Database Performance in Azure

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Hadoop Architecture. Part 1

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Oracle TimesTen In-Memory Database on Oracle Exalogic Elastic Cloud

Overview: X5 Generation Database Machines

GridGain In- Memory Data Fabric: UlCmate Speed and Scale for TransacCons and AnalyCcs

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

BigMemory: Providing competitive advantage through in-memory data management

Cloud Computing and Amazon Web Services

Amazon EC2 Product Details Page 1 of 5

Arif Goelmhd Goelammohamed Solutions Hyperconverged Infrastructure: The How-To and Why Now?

Bigdata High Availability (HA) Architecture

Exadata for Oracle DBAs. Longtime Oracle DBA

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

In-Memory BigData. Summer 2012, Technology Overview

Future-Proofed Backup For A Virtualized World!

Practical Cassandra. Vitalii

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Amazon Cloud Storage Options

Hyper-V over SMB Remote File Storage support in Windows Server 8 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

Scalability of web applications. CSCI 470: Web Science Keith Vertanen

Near Real Time Indexing Kafka Message to Apache Blur using Spark Streaming. by Dibyendu Bhattacharya

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

Hadoop Distributed File System (HDFS) Overview

Social Networks and the Richness of Data

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

SQL Server Virtualization 101. David Klee, Group Principal and Practice Lead. SQL PASS Virtualization VC,

Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014

PADS GPFS Filesystem: Crash Root Cause Analysis. Computation Institute


Exploring Amazon EC2 for Scale-out Applications

ebay Storage, From Good to Great

Traditional v/s CONVRGD

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Big Data Trends and HDFS Evolution

Big Data Analytics - Accelerated. stream-horizon.com

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Michael Kagan.

SMB Direct for SQL Server and Private Cloud

RAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University

Scalable Architecture on Amazon AWS Cloud

Apache Hadoop. Alexandru Costan

Inge Os Sales Consulting Manager Oracle Norway

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Cloud Computing at Google. Architecture

2009 Oracle Corporation 1

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Cloud Based Application Architectures using Smart Computing

Oracle to SQL Server 2005 Migration

JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra

JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

IBM System x SAP HANA

Building a Private Cloud with Eucalyptus

Zadara Storage Cloud A

Big Data Performance Growth on the Rise

Liferay Performance Tuning

Monitoring IBM WebSphere extreme Scale (WXS) Calls With dynatrace

Virtualizing Microsoft SQL Server on Dell XC Series Web-scale Converged Appliances Powered by Nutanix Software. Dell XC Series Tech Note

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Amazon Elastic Compute Cloud Getting Started Guide. My experience

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

New Features in SANsymphony -V10 Storage Virtualization Software

Business Intelligence Competency Partners

Protect Data... in the Cloud

Oracle Database Public Cloud Services

How To Use An Org.Org Cloud System For A Business

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

Virtual SAN Design and Deployment Guide

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

White paper. ATCA Compute Platforms (ACP) Use ACP to Accelerate Private Cloud Deployments for Mission Critical Workloads. Rev 01

Parallels Cloud Storage

Hadoop Cluster Applications

WebLogic on Oracle Database Appliance: Combining High Availability and Simplicity

GraySort on Apache Spark by Databricks

Leveraging Public Clouds to Ensure Data Availability

Challenges for Data Driven Systems

Hadoop Size does Hadoop Summit 2013

Maximizing Your Server Memory and Storage Investments with Windows Server 2012 R2

A virtual SAN for distributed multi-site environments

Maximizing SQL Server Virtualization Performance

Performance and Scale in Cloud Computing

Accelerating Real Time Big Data Applications. PRESENTATION TITLE GOES HERE Bob Hansen

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Transcription:

Ground up Introduction to In-Memory Data (Grids) QCON 2015 NEW YORK, NY 2014 Hazelcast Inc.

Why you here? 2014 Hazelcast Inc.

Java Developer on a quest for scalability frameworks Architect on low-latency projects NoSQL practitioner IMDG newbie Data enthusiast

Who am I? 2014 Hazelcast Inc.

About me Viktor Gamov @gamussa Solutions Architect at Hazelcast http://hazelcast.com/services/solutions-team/ International Java Conferences speaker http://lanyrd.com/gamussa/ Co-author of O Reilly s «Enterprise Web Development» Co-organizer of User Groups Princeton JUG http://www.meetup.com/njflex/ Hazelcast User Group NYC http://www.meetup.com/hug-nyc/

Agenda 2014 Hazelcast Inc.

The In-Memory landscape you need to know Distributed Data Applications Making In-memory Reliable, Scalable and Durable QA

Why In-Memory Data? 2014 Hazelcast Inc.

Problem 2014 Hazelcast Inc.

Anatomy of a JAVA Application Operating System JVM RAM Application JVM Heap Disk

Anatomy of an Application: Tiered Data Storage Tiered Storage Speed (TPS) 2,000,000+ JVM In-Memory Heap Store Disk Size (GB) 2 1,000,000 100,000 Local Off-Heap Store *** Local Disk Store (Re-startable) 1,000 10,000+ 1,000s External Data Source (e.g., Database)

Common Enterprise Application Landscape Load Balancer Load Balancer Load Balancer Load Balancer App Server App Server Application Server Application Application JVM Heap JVM Heap JVM Heap App Server App Server Application Server Application Application JVM Heap JVM Heap JVM Heap App Server App Server Application Server Application Application JVM Heap JVM Heap JVM Heap App Server App Server Application Server Application Application JVM Heap JVM Heap JVM Heap J D B C J D B C J D B C J D B C J D B C J D B C Network J D B C J D B C J D B C J D B C J D B C J D B C Database Databases Database Databases Database Databases Database Databases

Performance & Scalability Challenges JVM memory constraints Heap size limitations GC pauses and tuning Handling peak loads Swapping CPU limitations Limited processor power Many applications sharing CPU cycles CPU intensive processes

Performance & Scalability Challenges Reliance on the database All transactions go through the database Expensive and complex to scale Reliance on DBA(s) to maintain and tune Network I/O Bandwidth limitations Latency State-aware cluster Session replication

Even More Common Enterprise Application Landscape Load Balancer Load Balancer Load Balancer Load Balancer Enterprise Java Dotnet Devices Phones Tablets Web HTML5 Javascript Rich Clients Next Big Trend???Universal Data Transfer Solution?? Database Databases Database Databases Database Databases Database Databases

In-Memory Data Lanscape 2014 Hazelcast Inc.

Messaging Middleware NoSQL DATABASES SQL DATABASES IMDG In-Memory Computing

Why In-Memory Now? 70 Network 8 RAM per Commodity Server 52.5 6 35 17.5 μs 4 2 TB 0 Fast Ethernet DDR Infiniband 100 GbE 0 2010 2011 2012 2013 Addressable Memory 32 4GB 64 18EB

Distributed Data Applications 2014 Hazelcast Inc.

Cache-as-a-Service Application Scaling Database Caching Distributed Computing Reactive / Smart Clients

Cache-as-a-Service

Durability of application data

Application Scaling Elastic Scalability (just add more servers and they cluster automatically) Super Speeds (in-memory transaction speeds) High Availability (can deploy in backup pairs or even WAN replicated) Fault Tolerance (no single-point-of-failure) Cloud Readiness (deploy right into EC2, GCE, Docker)

Database Caching

Distributed Computing

Reactive Clients The clients (part of an application) enable you to do the operations without being a member of the grid. Clients can be: smart: this means that they immediately can send an operation like map.get(key) to the member that owns that specific key. dumb: it will connect to a random member in the cluster and send requests to this member. This member then needs to send the request to the correct member. High availability When the connected cluster member dies, client will automatically switch to another live member.

Making In-Memory Reliable, Scalable and Durable 2014 Hazelcast Inc.

Data Distribution http://www.ministryoftofu.com/2012/01/photos-upcoming-chinese-new-year-creates-backlogs-for-courier-companies/

Replication vs. Partitioning Replication - Copying an entire dataset onto multiple servers. Used for improving speed of access to reference records such as master data. Partitioning - Splitting up a large monolithic dataset into multiple smaller sets based on data cohesion.

Data Partitioning

Data Partitioning

Data Partitioning

Rebalance Data On New Node 2014 Hazelcast Inc.

Distributed Maps Fixed number of partitions (default 271) Each key falls into a partition partitionid = hash(keydata)%partition_count Partition ownerships are reassigned upon membership A B C

New Node Added A B C D

Migration A B C D

Migration A B C D

Migration A B C D

Migration A B C D

Migration A B C D

Migration A B C D

Crash Migration Complete A B C D

Data Safety When Node Dies 2014 Hazelcast Inc.

Node Crashes A B C D Crash

Backups are Restored A B C D Crash

Backups are Restored A B C D Crash

Backups are Restored A B C D Crash

Backups are Restored A B C D Crash

Data is Recovered from backup A B C D Crash

Data is Recovered from backup A B C D Crash

Backup for Recovered Data A B C D Crash

Backup for Recovered Data A B C D Crash

Backup for Recovered Data A B C D Crash

All Safe A C D

Scalability: Up or Out? 2014 Hazelcast Inc.

Scale Out http://fc07.deviantart.net/fs71/i/2012/156/3/b/bruce_banner_by_lightning_stroke-d52erxe.png

Durability Persist to or read from any external data store (SQL database, NoSQL data stores, REST service, EIS, file, etc.) Durability of application data (user sessions, shopping carts, master data, configs) Write through and write behind

High Density Caching

HD vs Heap Heap: 0 MB Native, 4000 MB Heap, 9 Major GC (4900 ms), 31 Minor GC (4217 ms) HD Memory v1:3000 MB Native, 600 MB Heap, 3 Major GC (2800 ms), 100 Minor GC (2018 ms) HD Memory v2:3300 MB Native, 57 MB Heap, 0 Major GC (0 ms), 356 Minor GC (349 ms)

Do I really need NoSQL? 2014 Hazelcast Inc.

NoSQL Landscape http://hazelcast.com/use-cases/nosql/nosql-data-store/

QA 2014 Hazelcast Inc.

Bonus! Go to http://hazelcast.org/t-shirts/ enter hazelcastic and get your t-shirt!

Question, please?