<Insert Picture Here> Getting Coherence: Introduction to Data Grids South Florida User Group



Similar documents
ORACLE COHERENCE 12CR2

Oracle Coherence Sales Playbook. April 2007

Oracle Application Grid

Aplicações empresariais de elevada performance com Oracle WebLogic e Coherence. Alexandre Vieira Middleware Solutions Team Leader

GigaSpaces Real-Time Analytics for Big Data

<Insert Picture Here> Oracle In-Memory Database Cache Overview

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Scaling Healthcare Applications to Meet Rising Challenges of Healthcare IT

Real-Time Analytics for Big Market Data with XAP In-Memory Computing

This talk is mostly about Data Center Replication, but along the way we'll have to talk about why you'd want transactionality arnd the Low-Level API.

Table Of Contents. 1. GridGain In-Memory Database

Meeting Real Time Risk Management Challenge XAP In-Memory Computing

TIBCO Live Datamart: Push-Based Real-Time Analytics

be architected pool of servers reliability and

IN-MEMORY DATA FABRIC: Data Grid

In Memory Accelerator for MongoDB

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

White Paper. Optimizing the Performance Of MySQL Cluster

IBM WebSphere Distributed Caching Products

Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit

Big data management with IBM General Parallel File System

WHITE PAPER. GemFireTM on Intel for Financial Services (GIFS) Reference Architecture for Trading Systems. Overview

Project Convergence: Integrating Data Grids and Compute Grids. Eugene Steinberg, CTO Grid Dynamics May, 2008

ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE WEBLOGIC SERVER STANDARD EDITION

CitusDB Architecture for Real-Time Big Data

From Spark to Ignition:

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

INTRODUCING APACHE IGNITE An Apache Incubator Project

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

SOA management challenges. After completing this topic, you should be able to: Explain the challenges of managing an SOA environment

TIBCO ActiveSpaces Use Cases How in-memory computing supercharges your infrastructure

IBM Tivoli Composite Application Manager for WebSphere

JBoss & Infinispan open source data grids for the cloud era

Configuration Management of Massively Scalable Systems

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY" Tokyo. Koln Sebastopol. Cambridge Farnham.

Comparison of the High Availability and Grid Options

Learn Oracle WebLogic Server 12c Administration For Middleware Administrators

F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar (adegtiar@cmu.edu) /21/2013

An Oracle White Paper October Maximize the Benefits of Oracle SOA Suite 11g with Oracle Service Bus

In-Memory BigData. Summer 2012, Technology Overview

Ecomm Enterprise High Availability Solution. Ecomm Enterprise High Availability Solution (EEHAS) Page 1 of 7

Developing Scalable Java Applications with Cacheonix

Continuous Data Protection for any Point-in-Time Recovery: Product Options for Protecting Virtual Machines or Storage Array LUNs

Scaling Spring Applications in 4 Easy Steps

WEBLOGIC SERVER MANAGEMENT PACK ENTERPRISE EDITION

Automatic Service Migration in WebLogic Server An Oracle White Paper July 2008

EII - ETL - EAI What, Why, and How!

Real-Time Enterprise IT Building Blocks Tangosol s Coherence Clustered Data Caching

PolyServe Matrix Server for Linux

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

High Availability Solutions for the MariaDB and MySQL Database

Running a Workflow on a PowerCenter Grid

Database Decisions: Performance, manageability and availability considerations in choosing a database

Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications

A Survey of Shared File Systems

Top 10 Reasons why MySQL Experts Switch to SchoonerSQL - Solving the common problems users face with MySQL

GemFire Enterprise Architectural Overview

Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition

Solutions for detect, diagnose and resolve performance problems in J2EE applications

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Find the Information That Matters. Visualize Your Data, Your Way. Scalable, Flexible, Global Enterprise Ready

Using In-Memory Computing to Simplify Big Data Analytics

Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.

Tier Architectures. Kathleen Durant CS 3200

Connectivity. Alliance Access 7.0. Database Recovery. Information Paper

WEB11 WebSphere extreme Scale et WebSphere DataPower XC10 Appliance : les solutions de caching élastique WebSphere

Connectivity. Alliance Access 7.0. Database Recovery. Information Paper

EMC VPLEX FAMILY. Continuous Availability and Data Mobility Within and Across Data Centers

I/O Considerations in Big Data Analytics

Using In-Memory Data Grids for Global Data Integration

Chapter 2 TOPOLOGY SELECTION. SYS-ED/ Computer Education Techniques, Inc.

BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use

IBM RATIONAL PERFORMANCE TESTER

ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE WEBLOGIC SERVER STANDARD EDITION

High Availability in a J2EE Enterprise Application Environment

JBOSS ENTERPRISE APPLICATION PLATFORM MIGRATION GUIDELINES

Using an In-Memory Data Grid for Near Real-Time Data Analysis

Improving Grid Processing Efficiency through Compute-Data Confluence

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Oracle TimesTen and In-Memory Database Cache 11g

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework

An Oracle White Paper November Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

ORACLE DATABASE 10G ENTERPRISE EDITION

Big Data Analytics - Accelerated. stream-horizon.com

How To Build A Clustered Storage Area Network (Csan) From Power All Networks

Scaling Web Applications on Server-Farms Requires Distributed Caching

WebLogic Server Foundation Topology, Configuration and Administration

Ken Bond Vice President Investor Relations

SCALABILITY AND AVAILABILITY

Implementing efficient system i data integration within your SOA. The Right Time for Real-Time

WEBLOGIC ADMINISTRATION

CDH AND BUSINESS CONTINUITY:

Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk

Protect Data... in the Cloud

In-Memory Computing Principles and Technology Overview

Oracle Reference Architecture and Oracle Cloud

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Technology Brochure New Technology for the Digital Consumer

Transcription:

<Insert Picture Here> Getting Coherence: Introduction to Data Grids South Florida User Group Cameron Purdy Cameron Purdy Vice President of Development

Speaker Cameron Purdy is Vice President of Development at Oracle. Prior to joining Oracle, Mr. Purdy was the CEO of Tangosol, whose revolutionary Coherence Data Grid product provides reliable and scalable data management across the enterprise. Mr. Purdy has over ten years of experience with Java and Java-related technology; he regularly participates in industry standards development and is a specification lead for the Java Community Process.

Coherence History <Insert Picture Here>

Tangosol Coherence 2000-2007 Pioneer of Data Grid technology First to provide Coherent Clustered Caching First to provide Dynamic HA Partitioning First to localize Data Processing First to achieve Parallel Aggregation Consistent annual growth over 100% Innovation driven by a loyal, growing customer base #1 in Data Grids #1 in Transaction Volumes #1 in Install Base

Tangosol acquires Oracle 2007 Kick in the Afterburner Continuing Innovation: Increased investment in development Growing Adoption: Leveraging Oracle s scale Vibrant Ecosystem: Worldwide support, service, consulting, partners Increased annual growth Innovation continues to be driven by our customers Like a kid in a toy-store Berkeley DB TopLink Grid

Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid: jrockit RealTime and Coherence WebLogic Suite and WebLogic Application Grid WebLogic Liquid Operations Control WebLogic Server and Portal Integration CEP Engine Integration (OSGi) Oracle now leads in every middleware category according to Gartner Magic Quadrants! Leading Developer Mindshare & Best of Breed Technology

<Insert Picture Here> Data Grid Introduction The Oracle Coherence In-Memory Data Grid is a data management system for application objects that are shared across multiple servers, require low response time, very high throughput, predictable scalability, continuous availability and information reliability.

Data Grid Who: Oracle Coherence, IBM WebSphere XS Also Gemstone Gemfire, Gigaspaces, ScaleOut Software What: Using a grid infrastructure to host both reference and transactional data, eliminating data bottlenecks and improving performance and efficiency Where: Inside the data center When: Usually 24x7x366, but sometimes only for How: Using clustering technology Partitioning for scale (and reliability) Replicating for availability (and performance)

Oracle Coherence: A Unique Approach Peer-to-Peer Clustering and Data Management Technology No Single Points of Failure No Single Points of Bottleneck No Masters / Slaves / Registries etc All members have responsibility to; Manage Cluster Health & Data Perform Processing and Queries Work as a team in parallel Communication is point-to-point (not TCP/IP) and/or one-to-many Scale to limit of the back-plane Use with commodity infrastructure Linearly Scalable By Design (c) Copyright 2008. Oracle Corporation

Oracle Coherence: Real World Results From a presentation by Royal Bank of Scotland: Further competitive evaluation conducted that ratified Coherence as best of breed Easiest to configure Stable performance under load Much harder to break Coherence Best of rest

Oracle Coherence: A Unique Approach Data is automatically partitioned and load-balanced across the Server Cluster Data is synchronously replicated for continuous availability Servers monitor the health of each other When in doubt, servers work together to diagnose status Healthy servers assume responsibility for failed server (in parallel) Continuous Operation: No interruption to service or data loss due to a server failure (c) Copyright 2008. Oracle Corporation

Data Grid: Capability Summary A Data Grid combines data management with data processing in a scale-out environment Some or all of the servers in the grid are responsible for reliably managing live information Any or all of the servers in the grid are able to simultaneously access and manipulate a shared, consistent view of that information Processing power far exceeds aggregate network bandwidth, so data access and manipulation must be parallelized and localized within the grid Data locality is both dynamic and transparent

<Insert Picture Here> Oracle Coherence: Technical Illustrations

Partitioned Topology : Data Access Data spread and backed up across Members Transparent to developer Members have access to all Data All Data locations are known no lookup & no registry!

Partitioned Topology : Data Update Synchronous Update Avoids potential Data Loss & Corruption Predictable Performance Backup Partitions are partitioned away from Primaries for resilience No engineering requirement to setup Primaries or Backups Automatically and Dynamically Managed

Partitioned Topology : Recovery Membership changes (new members added or members leaving) Other members, using consensus, recover and repartition automatically No in-flight operations lost, no availability gap! Some latencies (due to higher priority of asynchronous recovery) Information Reliability & Continuous Availability are the priorities

Partitioned Topology : Local Storage Some members are used to manage data Other members are temporary in a cluster, or do not have memory to spare for managing data They should not cause repartitioning Specialization of roles Specialization of roles within a Data Grid: Clients and Servers

Read-Through & Write-Through Access to the data sources go through the Data Grid. Read and write operations are always managed by the node that owns the data within the Data Grid. Concurrent accesses are combined, greatly reducing database load. Write-Through keeps the in-memory data and the database in sync.

Write-Behind Write-Behind accepts data modifications directly into the Data Grid The modifications are then asynchronously written back to the data source, optionally after a specified delay All write-behind data is synchronously and redundantly managed, making it resilient to server failure

Topology Composition : Near Topology

Features : Observable Interface

Features : QueryMap Interface

Concurrency Implicit: Queueing of operations Virtual queue & thread per entry Explicit: Pessimistic locking Grid-Wide Mutex Transactions: Unit of work management Both optimistic and pessimistic transactions Isolation levels from read-committed through serializable Integrated with JTA

Features : InvocableMap Interface

Data Grid Use Cases <Insert Picture Here>

General Use Cases for a Data Grid Caching Applications request data from the Data Grid rather than backend data sources Analytics Applications ask the Data Grid questions from simple queries to advanced scenario modeling Transactions Data Grid acts as a transactional System of Record, hosting data and business logic Events Automated processing based on event

Use Case: Caching Applications request data from the Data Grid rather than backend data sources Enable faster access to frequently accessed data Reduce load on shared data sources Obviates the impact of data source outages Use Cases Reference data User preferences Market data Coherence Manageable and scalable host for the cache Guarantees consistent data and data integrity Broad industry support as a plug-in cache

Use Case: Analytics Applications ask the Data Grid questions from simple queries to advanced scenario modeling: Enables query rates beyond what a database can handle Larger data sets available in memory Complex analytics through massive parallel processing across grid Increased availability Use Cases Risk management Portfolio management Derivatives pricing Coherence Built-in query support User-defined parallel calculations Stable results even with server failure

Use Case: Transactions The Data Grid acts as a transactional System of Record, hosting data and business logic: Higher transaction volumes More predictable scalability More predictable cost of scale Flexible capacity Use Cases Order book, execution, fill and reconciliation E-Commerce, real-time inventory Large-scale data collection: Buffered transactions Coherence Multiple isolation levels, both optimistic and pessimistic modes Reliability is key to transactional integrity

Use Case: Events In addition to transactions, the Data Grid reliably manages each resulting event and delivers it to the related business logic: Real-time management for both data-driven and event-driven architectures Flexible data, event and processing capacity Use Cases Market Data-driven processes such as Algorithmic Trading Fraud Detection Large-scale work-flow: SEDA / Staged Business Event Transitions Coherence Co-located processing of data and events for low latency and high throughput Reliable Once-And-Only-Once processing for events Scalable to millions of events per second without sacrificing reliability

Financial Institution: Risk Analytics Large Data Sets Parallel Aggregation Capable of Absorbing Real Time Feeds Calc on Demand / Recalc on Changes 100% Data Locality for computations True Linear Scale Real World Results: 50 Days -> 1 Hour Scenarios Node1 Node2 Node3 Aggregated Risk

Financial Institution: Foreign Exchange.. executes any number of Business Logic driven by State Changes Once-and-Only-Once Guarantees Encapsulated Transactions No Global-TX 100% Data Locality for State Change Event Business Logic Business Logic execution Real World Results: From Big Boxes to Blades 1000x throughput increase Continuous Availability.. creates any number of

<Insert Picture Here> Coherence Update The latest news about Oracle Coherence and related projects coming out of Oracle, as of Q2/FY09.

Oracle Coherence 3.4 Launched at Oracle OpenWorld http://www.oracle.com/technology/products/coherence/index.html POF: End-to-end support for Portable Object Format Pluggable Serializers: Control wire & memory formats Feature parity across Java, C# &.NET, C++ (New!) Triggers: Validate, modify and/or reject operations Event Transformers: Alter events at their origin Backup-Count-After-Write-Behind: Purges backup copies once data has been committed to a database JMX Reporter: Cluster-wide black-box recorder Java Platform Mbeans: Cluster-wide JVM statistics

Oracle TopLink Grid Launched at Oracle OpenWorld http://www.oracle.com/technology/products/ias/toplink/tl_grid.html JPA for Oracle Coherence Oracle TopLink is the JPA Reference Implementation TopLink Grid stores all or part of an application s domain model in a Coherence Data Grid Features: Simple configuration using annotations that align with standard JPA Ability to choose which entities are stored in the grid versus those stored directly in the backing data store Support for queries being executed against either the Grid or directly against the data store Support for sequence generation and locking using either the Grid or the backing data store

Oracle Incubator Launched 15 October 2008! http://coherence.oracle.com/display/incubator/home Command Pattern - This is a scale-out and Highly Available distributed implementation of the classic "Command Pattern". Store and Forward Messaging Pattern - This is a scaleout and Highly Available framework for distributed Store and Forward Messaging. Push Replication Pattern - This is a powerful framework for replicating the transactional data in real-time systems from one data center to another; it supports bidirectional WAN replication with applicationcustomizable collision-reconciliation policies.

<Insert Picture Here> Q&A Learn more online: http://www.oracle.com/technology/products/coherence/