GridGain In- Memory Data Fabric: UlCmate Speed and Scale for TransacCons and AnalyCcs

Similar documents
Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

INTRODUCING APACHE IGNITE An Apache Incubator Project

In Memory Accelerator for MongoDB

In-Memory BigData. Summer 2012, Technology Overview

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

GridGain gets open source in-memory accelerator out of the blocks

How To Create A Data Visualization With Apache Spark And Zeppelin

In-Memory Computing : Premiere

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

The Inside Scoop on Hadoop

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

In-memory computing with SAP HANA

Big Data Analytics - Accelerated. stream-horizon.com

Ground up Introduction to In-Memory Data (Grids)

Oracle Database 12c Plug In. Switch On. Get SMART.

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

HADOOP MOCK TEST HADOOP MOCK TEST I

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

The evolution of database technology (II) Huibert Aalbers Senior Certified Executive IT Architect

Big Data Technologies Compared June 2014

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

Introduction to Spark

Next-Gen Big Data Analytics using the Spark stack

IN-MEMORY DATA FABRIC: Data Grid

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

Table Of Contents. 1. GridGain In-Memory Database

NoSQL for SQL Professionals William McKnight

<Insert Picture Here> Oracle In-Memory Database Cache Overview

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Apache Hadoop. Alexandru Costan

SAP and Hortonworks Reference Architecture

Near Real Time Indexing Kafka Message to Apache Blur using Spark Streaming. by Dibyendu Bhattacharya

Workshop on Hadoop with Big Data

Introduction to Big Data Training

How Companies are! Using Spark

Unified Big Data Processing with Apache Spark. Matei

Moving From Hadoop to Spark

THE HADOOP DISTRIBUTED FILE SYSTEM

Hadoop: Embracing future hardware

Case Study : 3 different hadoop cluster deployments

BIG DATA SOLUTION DATA SHEET

Integrating Apache Spark with an Enterprise Data Warehouse

Hadoop Architecture. Part 1

Big data blue print for cloud architecture

I/O Considerations in Big Data Analytics

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Unified Big Data Analytics Pipeline. 连 城

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Federated SQL on Hadoop and Beyond: Leveraging Apache Geode to Build a Poor Man's SAP HANA. by Christian

Extending Hadoop beyond MapReduce

Introducing Oracle Exalytics In-Memory Machine

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Real-Time Analytics for Big Market Data with XAP In-Memory Computing

P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

The Top 10 7 Hadoop Patterns and Anti-patterns. Alex

Large scale processing using Hadoop. Ján Vaňo

Architectures for Big Data Analytics A database perspective

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

Real-time Big Data Analytics with Storm

Big Fast Data Hadoop acceleration with Flash. June 2013

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Hadoop IST 734 SS CHUNG

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Safe Harbor Statement

Scalable Architecture on Amazon AWS Cloud

FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Katta & Hadoop. Katta - Distributed Lucene Index in Production. Stefan Groschupf Scale Unlimited, 101tec. sg{at}101tec.com

CIO Guide How to Use Hadoop with Your SAP Software Landscape

Hadoop and Map-Reduce. Swati Gore

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

MongoDB Developer and Administrator Certification Course Agenda

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Luncheon Webinar Series May 13, 2013

In-Memory Data Grids

An Oracle White Paper November Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

Business Intelligence for Big Data

Big Data Course Highlights

SAP HANA Vora : Gain Contextual Awareness for a Smarter Digital Enterprise

Spark in Action. Fast Big Data Analytics using Scala. Matei Zaharia. project.org. University of California, Berkeley UC BERKELEY

STeP-IN SUMMIT June 2014 at Bangalore, Hyderabad, Pune - INDIA. Performance testing Hadoop based big data analytics solutions

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Transcription:

GridGain In- Memory Data Fabric: UlCmate Speed and Scale for TransacCons and AnalyCcs DMITRIY SETRAKYAN Founder & EVP Engineering @dsetrakyan www.gridgain.com #gridgain

Agenda EvoluCon of In- Memory CompuCng GridGain In- Memory Data Fabric Distributed Cluster & Compute Coding Example Distributed Data Grid Coding Examples Distributed Streaming & CEP Plug- n- Play Hadoop Accelerator

What is In- Memory CompuFng High Performance & Low Latencies Faster than Disk and Flash Cost EffecCve Distributed or Not Caching, Streaming, ComputaCons Data Querying SQL or Unstructured VolaCle and Persistent OLAP and OLTP Use Cases

EvoluFon of In- Memory CompuFng Streaming Data Grid Clustering & Compute Grid Database IM opcons Hadoop accelerators Streaming BI accelerators In- Memory Data Grids IMDBs Distributed Caching Caching 2014 GridGain Systems, Inc. Hadoop Acceleration

ExisFng Market is Fragmented Company Product Proprietary/ Open Source CharacterizaFon Oracle In-Memory Option for Oracle Database Proprietary Cost Option Oracle Times Ten Proprietary Point Solution IMDB Oracle Coherence Proprietary Point Solution IMDG SAP Hana Proprietary Point Solution - IMDB Microsoft SQL Server 2014 Proprietary Feature Upgrade DataBricks Apache Spark Open Source Point Solution - Hadoop VoltDB VoltDB Open Source Point Solution IMDB Aerospike Aerospike Open Source Point Solution NoSQL DB IBM DB2 with BLU Acceleration Proprietary Feature Upgrade Software AG Terracotta Open Source Point Solution - IMDG Hazelcast Hazelcast Open Source Point Solution - IMDG

GridGain In- Memory Data Fabric: Strategic Approach to IMC Supports all Apps Streaming Data Grid Clustering & Compute Grid Hadoop Acceleration Open Source Apache 2.0 Simple Java APIs 1 JAR Dependency High Performance & Scale Automatic Fault Tolerance Management/Monitoring Runs on Commodity Hardware Supports existing & new data sources No need to rip & replace

Direct API for MapReduce Direct API for Fork/Join Zero Deployment Cron- like Task Scheduling State Checkpoints Early and Late Load Balancing AutomaCc Failover Full Cluster Management Pluggable SPI Design Clustering & Compute

AutomaFc Cluster Discovery

Closure ExecuFon

Closure ExecuFon

In- Memory Caching and Data Grid Distributed In- Memory Key- Value Store Replicated and ParCConed TBs of data, of any type On- Heap and Off- Heap Storage Backup Replicas / AutomaCc Failover Distributed ACID TransacCons SQL queries and JDBC driver CollocaCon of Compute and Data

Cache OperaFons

Cache TransacFon

Distributed Java Data Structures Distributed Map (cache) Distributed Set Distributed Queue CountDownLatch AtomicLong AtomicSequence AtomicReference Distributed ExecutorService

Client- Server vs Affinity ColocaFon Client- Server Affinity ColocaCon

In- Memory Streaming & CEP Streaming Data Never Ends Branching Pipelines CEP Sliding Windows Pluggable RouCng Real Time Analysis At Least Once Guarantee

Plug- n- Play Hadoop Accelerator Up to 100x AcceleraCon In- Memory NaCve MapReduce In- Process Data ColocaCon Eager Push Scheduling GGFS In- Memory File System Pure In- Memory Write- Through to HDFS Read- Through from HDFS Sync and Async Persistence

In- Memory NaFve MapReduce In- Memory NaCve MapReduce Zero Code Change Use exiscng MR code Use exiscng Hive queries No Name Node No Network Noise In- Process Data ColocaCon Eager Push Scheduling

DevOps Management and Monitoring

THANK YOU www.gridgain.com #gridgain @dsetrakyan