Design Patterns for Large Scale Data Movement. Aaron Lee aaron.lee@solacesystems.com



Similar documents
Enabling Real-Time Sharing and Synchronization over the WAN

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

Enabling Cloud Architecture for Globally Distributed Applications

Integrating Web Messaging into the Enterprise Middleware Layer

Challenges of Sending Large Files Over Public Internet

Comparing Solace s Appliance- Based Guaranteed Messaging with Software Brokers

Vortex White Paper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

Security Design.

Getting the Most Out of Your Existing Network A Practical Guide to Traffic Shaping

TIBCO ActiveSpaces Use Cases How in-memory computing supercharges your infrastructure

Video Architectures Eyes on the Future: The Benefits of Wireless Technology for Fixed Video Surveillance

Tactical Service Bus: The flexibility of service oriented architectures in constrained theater environments

VitalPBX. Hosted Voice That Works. For You

PORTrockIT. Veeam : accelerating virtual machine replication with PORTrockIT

Understanding Latency in IP Telephony

I. TODAY S UTILITY INFRASTRUCTURE vs. FUTURE USE CASES...1 II. MARKET & PLATFORM REQUIREMENTS...2

1 Product. Open Text is the leading fax server vendor in the world. *

Best Effort gets Better with MPLS. Superior network flexibility and resiliency at a lower cost with support for voice, video and future applications

Real Time Big Data Processing

Getting Real Real Time Data Integration Patterns and Architectures

Network Simulation Traffic, Paths and Impairment

Fog Computing and the Internet of Things: Extend the Cloud to Where the Things Are

Communication Networks. MAP-TELE 2011/12 José Ruela

WAN & Carrier Networks

Portable Wireless Mesh Networks: Competitive Differentiation

JoramMQ, a distributed MQTT broker for the Internet of Things

Scalability in Log Management

WAN optimization and acceleration products reduce cost and bandwidth requirements while speeding throughput.

PORTrockIT. Spectrum Protect : faster WAN replication and backups with PORTrockIT

Managing Mobile Devices Over Cellular Data Networks

CISCO WIDE AREA APPLICATION SERVICES (WAAS) OPTIMIZATIONS FOR EMC AVAMAR

SAN/iQ Remote Copy Networking Requirements OPEN iscsi SANs 1

Enabling the SmartGrid through Cloud Computing

Satellite Basics. Benefits of Satellite

Solving the Big Dilemma of Big Data

Clearing the Way for VoIP

Internet of things (IOT) applications covering industrial domain. Dev Bhattacharya

Improving Effective WAN Throughput for Large Data Flows By Peter Sevcik and Rebecca Wetzel November 2008

Choosing the right Internet solution for your business.

White paper. Latency in live network video surveillance

Top IT Pain Points: Addressing the bandwidth issues with Ecessa solutions

Global Network Mobility RIPE 48

Hybrid Cloud Architectures for Operational Performance Management

Streaming Analytics Market by Verticals - Worldwide Market Forecast & Analysis ( )

Data Driven Success. Comparing Log Analytics Tools: Flowerfire s Sawmill vs. Google Analytics (GA)

White paper. Business Applications of Wide Area Ethernet

Flow Visualization Using MS-Excel

WAN and VPN Solutions:

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Improving Information Flow for Equities OMS & Trading Platforms

Broadband Bonding Network Appliance TRUFFLE BBNA6401

TRUFFLE Broadband Bonding Network Appliance BBNA6401. A Frequently Asked Question on. Link Bonding vs. Load Balancing

HIGH-SPEED BRIDGE TO CLOUD STORAGE

Four Ways High-Speed Data Transfer Can Transform Oil and Gas WHITE PAPER

Research of User Experience Centric Network in MBB Era ---Service Waiting Time

Broadband Bonding Network Appliance TRUFFLE BBNA6401

Network Architecture and Topology

VoIP Application Note:

Infoblox vnios Software for CISCO AXP

Unified Messaging Infrastructure for Global FX Trading Platforms

Industrial Dr. Stefan Bungart

Distributed Systems LEEC (2005/06 2º Sem.)

HOW PUBLIC INTERNET IS FINALLY READY FOR HD VIDEO BACKHAUL

Deploying Riverbed Cascade and Steelheads. A Best Practices Whitepaper

The Next Wave of Big Data Analytics: Internet of Things and Sensor Data. November 6, 2014 Hannah Smalltree, Director

CHOOSING A WIRELESS IMPLEMENTATION STRATEGY AND APPLICATIONS

FatPipe Networks

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

TRUFFLE Broadband Bonding Network Appliance. A Frequently Asked Question on. Link Bonding vs. Load Balancing

Smartphone Enterprise Application Integration

VMware vsphere Data Protection

What is this Course All About

FACT SHEET INTERNATIONAL DATA SERVICES GLOBAL IP VPN

Unified Threat Management, Managed Security, and the Cloud Services Model

Streaming Analytics and the Internet of Things: Transportation and Logistics

Product Overview. Steve Erickson

Introduction to LAN/WAN. Network Layer

Solving big data problems in real-time with CEP and Dashboards - patterns and tips

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

IBM's Strategic Approach to System- Centric MFT with Sterling Connect:Direct and WebSphere MQ Advanced

IoT / practical usage

Managing Dynamic Configuration

PipeCloud : Using Causality to Overcome Speed-of-Light Delays in Cloud-Based Disaster Recovery. Razvan Ghitulete Vrije Universiteit

Exploiting Data at Rest and Data in Motion with a Big Data Platform

October Field Area Communication Networks for Digital Oil and Gas Fields

Introduction, Rate and Latency

Adaptive Real-time Infrastructure for a complete OMG Model Driven Architecture.

NETWORK ISSUES: COSTS & OPTIONS

Big Data and Advanced Analytics Technologies for the Smart Grid

Telecommunications, Networks, and Wireless Computing

End-to-End M2M and IoT Services

Transcription:

Design Patterns for Large Scale Data Movement Aaron Lee aaron.lee@solacesystems.com

Data Movement Patterns o The right solution depends on the problem you re solving Real-time or intermittent? Any weird networks? Acceptable latency? Humans or machines? Update rates? Fan-in or fan-out? Payload size? Guarantee required? 2 http://www.dreamstime.com/stock-images-wispy-blue-spirals-pattern-image1983204

Latency Required o Some not sensitive at all Batch updates o Seconds often good enough Database sync User interfaces o Others measure in milli- or micro-seconds Algo trading Industrial controls Required Latency Low as Possible Not Critical 3

Network Distance o Co-location for max speed Minimize speed of light o LAN for many apps 10GigE networks o Long distance WAN Expensive, limited pipes Creates mismatches with other networks Network Distance Global WAN Co-location 4

Number of Messages o Few Batch updates Simple applications o Moderate Risk management Order routing o Insane Market data Click stream analysis Whatever Number of Messages OMG 5

Degree of Distribution o Point-to-point o Fan-out (many subs) o Fan-in (many pubs) o Mesh Synching data between many endpoints 1:1 Millions of Endpoints Degree of Distribution 6

Message Size o Small Status updates, activity logging events o Medium Orders, product BOMs o Large Batch updates, media files, product catalogs o Very different stresses on system based on message size and frequency. Small Huge Size of Messages 7

Importance of Delivery Guarantee o Best effort fine for some scenarios o Others require once and only once o Sequence matters for some o Some demand failsafe even in DR scenarios Very Delivery Guarantee Importance Not 8

Other Considerations o Message Format Protocol Structured/Unstructured o Network Availability RTT Bandwidth cost o Robustness Archival Caching Acceptable MTBF HA switchover times DR requirements 9

Combination of Factors Yields Design Patterns o Some attributes tend to correlate # of messages and degree of distribution Required Latency Network Distance Number of Messages o Others usually contradict Network distance and latency Guarantee and latency o Tradeoffs and creative solutions Delivery Guarantee Importance Degree of Distribution Size of Messages 10

Identifying Patterns in Real-World Use Cases Use cases unique, but patterns emerge Examples in this section: Trade Order Flow Manufacturing Data Sync Oil and Gas Monitoring Real Time Sports Betting 11 http://www.dreamstime.com/stock-images-wispy-blue-spirals-pattern-image1983204

Order Flow o Latency matters, but not every microsecond o Usually localized Required Latency Network Distance Number of Messages o Continuous, high-rate message flow o Mid-sized messages (1-2Kb) o Messages absolutely must be guaranteed Delivery Guarantee Importance Degree of Distribution Size of Messages 12

Order Flow; Architecture Management & Monitoring Smart Order Router Back Office Applications Slow Subscribers Message Bus Real Time Sync Client Gateways Exchange Gateways Disaster Recovery Site Clients Exchanges 13

Order Flow; Similar Use Cases Need a way to correlate which use case is which color on the chart. o Credit card processing Long-distance WANs latency in hundreds of milliseconds Required Latency Network Distance Number of Messages o E-commerce Higher volumes Higher guarantee required o Logistics scheduling Less latency sensitive More likely to include WANs Delivery Guarantee Importance Degree of Distribution Size of Messages 14

Manufacturing Data Sync Build from the background image on prior slide o Geographically distributed o 100% delivery guarantee required o Data rate is use case specific will assume lots of medium (< 5K) messages. o Number of endpoints use case specific, assume 10 manufacturing locations Required Latency Delivery Guarantee Importance Network Distance Number of Messages Degree of Distribution Size of Messages 15

Manufacturing Data Sync; Architecture Applications & Databases Fanout at Edge Maximizing Bandwidth Smart Buffering 16

Manufacturing Data Sync; Similar Use Cases o Real Time Risk Management Smaller messages Latency more important o Retail Global Inventory Messages can be larger Distribution can be more Required Latency Network Distance Number of Messages o Real Time Financials Messages larger Distribution less (collecting to 1 location) Delivery Guarantee Importance Size of Messages Degree of Distribution 17

Oil & Gas Pipeline Monitoring o Wifi, Satellite, proprietary and other unreliable networks o Degree of distribution off the charts. In this case, fan-in. Required Latency Network Distance Number of Messages o Messages usually pretty small, unless batch o Latency unimportant o Level of guarantee use case specific, assume status messages (ie. guarantee not essential) Delivery Guarantee Importance Size of Messages Degree of Distribution 18

Oil & Gas Pipeline Monitoring; Architecture Pipeline Sensors Wireless Unreliable Networks Analytics Engines Message Bus Real Time vs. Delayed Analytics Collection Caches Big Data & Databases Big Data Loading 19

Oil & Gas Pipeline Monitoring; Similar Use Cases o Smart Grid Small messages Massive distribution o Transportation Monitoring Fewer endpoints Bigger messages Required Latency Network Distance Number of Messages o Retail Point of Sale More predictable networks Guarantee more important Delivery Guarantee Importance Degree of Distribution Size of Messages 20

Real-Time Sports Betting o Huge message volumes (in this case fan-out) o Low level of guarantee for any one outbound message Required Latency Network Distance Number of Messages o High level of guarantee for inbound messages o Tiny messages o Network is the internet + mobile carriers o Latency (beyond network latency) is important Delivery Guarantee Importance Size of Messages Degree of Distribution 21

Real-Time Sports Betting; Architecture Highlight the degree of fan out, connection counts, event logging, real time analysis for odds adjustment Odds & Analytics Low Latency Clickstream & Marketing Big Data Message Bus Streaming Odds Data Data Streaming Huge Connection Counts Mobile Customers Customer & Betting Apps Security & Fraud Detection Web Customers 22

Real-Time Sports Betting; Similar Use Cases o Mobile Social Updates Latency less important Distribution far greater Required Latency Network Distance Number of Messages o Real Time Travel Alerting Each message more important Volumes much lower o Market Data Distribution Latency even more important Volumes often much higher Loss often tolerable Delivery Guarantee Importance Size of Messages Degree of Distribution 23

Network Distance Required Latency Number of Messages Delivery Guarantee Importance Degree of Distribution Size of Messages

Summary Questions? 25