Riding the Data Wave. New Capabilities New Techniques. Bill Chute Acadiant Limited



Similar documents
Cognos Performance Troubleshooting

Practical Cassandra. Vitalii

Cloud Scale Distributed Data Storage. Jürmo Mehine

So What s the Big Deal?

Open source large scale distributed data management with Google s MapReduce and Bigtable

GigaSpaces Real-Time Analytics for Big Data

Using distributed technologies to analyze Big Data

Database Scalability and Oracle 12c

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

Real World Hadoop Use Cases

[Hadoop, Storm and Couchbase: Faster Big Data]

Can the Elephants Handle the NoSQL Onslaught?

NoSQL: Going Beyond Structured Data and RDBMS

NoSQL Databases. Nikos Parlavantzas

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

NoSQL. Thomas Neumann 1 / 22

NOSQL INTRODUCTION WITH MONGODB AND RUBY GEOFF

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Preparing Your Data For Cloud

NoSQL web apps. w/ MongoDB, Node.js, AngularJS. Dr. Gerd Jungbluth, NoSQL UG Cologne,

An Approach to Implement Map Reduce with NoSQL Databases

Big Data for everyone Democratizing big data with the cloud. Steffen Krause Technical

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Comparing SQL and NOSQL databases

MULTICULTURAL CONTENT MANAGEMENT SYSTEM

L7_L10. MongoDB. Big Data and Analytics by Seema Acharya and Subhashini Chellappan Copyright 2015, WILEY INDIA PVT. LTD.

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

PHP and MongoDB Web Development Beginners Guide by Rubayeet Islam

MySQL for Beginners Ed 3

INTRODUCING DRUID: FAST AD-HOC QUERIES ON BIG DATA MICHAEL DRISCOLL - CEO ERIC TSCHETTER - LEAD METAMARKETS

Current Data Security Issues of NoSQL Databases

A Distributed Storage Schema for Cloud Computing based Raster GIS Systems. Presented by Cao Kang, Ph.D. Geography Department, Clark University

Real World Big Data Architecture - Splunk, Hadoop, RDBMS

NoSQL Database Systems and their Security Challenges

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

How To Scale Out Of A Nosql Database

these three NoSQL databases because I wanted to see a the two different sides of the CAP

Mule Enterprise Service Bus (ESB) Hosting

Evaluation of NoSQL databases for large-scale decentralized microblogging

Real Time Big Data Processing

The Cloud to the rescue!

Learning Web App Development

OBSERVEIT DEPLOYMENT SIZING GUIDE

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 12

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island

MongoDB: document-oriented database

Big Data Success Step 1: Get the Technology Right

Scaling out a SharePoint Farm and Configuring Network Load Balancing on the Web Servers. Steve Smith Combined Knowledge MVP SharePoint Server

the missing log collector Treasure Data, Inc. Muga Nishizawa

Cloud Big Data Architectures

Apache HBase. Crazy dances on the elephant back

NoSQL Data Base Basics

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

Informatica Data Director Performance

Introduction to Hbase Gkavresis Giorgos 1470

Harnessing the Potential Raj Nair

Unlocking The Value of the Deep Web. Harvesting Big Data that Google Doesn t Reach

Bigtable is a proven design Underpins 100+ Google services:

MongoDB. Or how I learned to stop worrying and love the database. Mathias Stearn. N*SQL Berlin October 22th, gen

Big Data and Fast Data combined is it possible?

Moving From Hadoop to Spark

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

CAPTURING & PROCESSING REAL-TIME DATA ON AWS

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB

Sisense. Product Highlights.

API Analytics with Redis and Google Bigquery. javier

Sentimental Analysis using Hadoop Phase 2: Week 2

Hurtownie Danych i Business Intelligence: Big Data

Primex Wireless OneVue Architecture Statement

Vector Web Mapping Past, Present and Future. Jing Wang MRF Geosystems Corporation

August 2014 San Antonio Texas The Power of Embedded Analytics with SAP BusinessObjects

Comparing Scalable NOSQL Databases

Not Relational Models For The Management of Large Amount of Astronomical Data. Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF)

RazorSafe Mail Archiving Appliances

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Cost-Effective Business Intelligence with Red Hat and Open Source

API documentation - 1 -

How To Choose Between A Relational Database Service From Aws.Com

SECURE Web Gateway Sizing Guide

Log Analysis: Overall Issues p. 1 Introduction p. 2 IT Budgets and Results: Leveraging OSS Solutions at Little Cost p. 2 Reporting Security

Ad Hoc Analysis of Big Data Visualization

IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop

Transcription:

Riding the Data Wave New Capabilities New Techniques Bill Chute Acadiant Limited

There are new challenges New technologies are on your side 2

MiFID II & MIFIR Basel III NAV Volcker VaR Dodd-Frank MAD II & MIR FATCA EMIR 3

MiFID II & MIFIR MAD II & MIR Basel III Dodd-Frank EMIR Volcker FATCA NAV VaR Who loaned that security? Which transaction hedged that other transaction? Which model calculated that ratio? What s your exposure to? 4

Historic Scarcity Modern Abundance Constrained Storage Constrained CPU Constrained RAM Constrained Bandwidth Computing was Expensive Cheap Storage on Demand Cheap CPU on Demand Cheap RAM on Demand Cheap Bandwidth on Demand Unit Costs Tend Towards Zero Response to Scarcity: Structured Query Language Offline Storage Batch Processing Response to Abundance: NoSQL / Document Oriented DBs Online Archive Asynchronous Processing 5

Scarcity Built the Grid Row / Column was a crude, cheap way to organise data RDBMS, Spreadsheets, Fixed Data Formats Rigid Reporting Systems Time Grids coped with limited time, limited power Batch: gather once, process once BREAK THE GRID 6

BREAK THE GRID Exploit Mobile Social Technologies Loosely Structured Data Store everything, parse when needed Flexible Query Systems Asynchronous computing Process data whenever available 7

Ride That Wave NASDAQ OMX UltraFeed 8 hours per day: ~230GB per day Sustained 8Mbps, peaks ~3X to 4X Packed Binary, Optimised for Real-Time Trading Twitter Firehose 24 hours per day: ~900GB per day Sustained 10Mbps, peaks ~3X to 4X JSON, Optimised for Rich Data 8

What is in a tweet? Along with our new #Twitterbird, we've also updated our Display Guidelines: https://t.co/ed4omjys ^JC 9

What is in a tweet? 1. { 2. "coordinates": null, 3. "favorited": false, 4. "truncated": false, 5. "created_at": "Wed Jun 06 20:07:10 +0000 2012", 6. "id_str": "210462857140252672", 7. "entities": { 8. "urls": [ 9. { 10. "expanded_url": "https://dev.twitter.com/terms/display-guidelines", 11. "url": "https://t.co/ed4omjys", 12. "indices": [ 13. 76, 14. 97 15. ], 16. "display_url": "dev.twitter.com/terms/display-\u2026" 17. } 18. ], 19. "hashtags": [ 20. { 21. "text": "Twitterbird", 22. "indices": [ 23. 19, 24. 31 25. ] 26. } 27. ], 28. "user_mentions": [ 29. 30. ] 31. }, 32. "in_reply_to_user_id_str": null, 33. "contributors": [ 34. 14927800 35. ], 36. "text": "Along with our new #Twitterbird, we've also updated our Display Guidelines: https://t.co/ed4omjys ^JC", 37. "retweet_count": 66, 38. "in_reply_to_status_id_str": null, 101. "show_all_inline_media": false, 102. "screen_name": "twitterapi" 103. }, 104. "in_reply_to_screen_name": null, 105. "source": "web", 106. "in_reply_to_status_id": null 107. } 10

Process Data At Rest Use an Aggregation Framework like MapReduce Store in Structures like BigTable, Cassandra, DynamoDB, MongoDB Think About Your Data Server & Application Server Use many CPUs 11

Process Data At Rest Use an Aggregation Framework like MapReduce Store in Structures like BigTable, Cassandra, DynamoDB, MongoDB Think About Your Data Server & Application Server Use many CPUs And In Motion Update asynchronously. Do not wait for batch time. Use an Enterprise Service Bus Use many CPUs 11

Use the Cloud Unit Cost Tends Towards Zero 2ECU, 4GB RAM, 24x365: 400 88ECU, 60GB RAM, 24x365: 11,000 Data Warehouse 1TB: 600/year Online Archive 1TB: 100/year 12

Use the Cloud Security, Audit, Compliance Can Be Managed PPCI, ITAR, FIPS, HIPAA, ISO27001 An Opportunity for Enhanced Governance 13

Acadiant Every data element is timestamped and attributed for audit Nothing is ever deleted or overwritten History is always available Multilingual Multiple Character Sets Multi Currency 14

Acadiant Every data element is timestamped and attributed for audit Nothing is ever deleted or overwritten History is always available Multilingual Multiple Character Sets Multi Currency 14

Acadiant Every data element is timestamped and attributed for audit Nothing is ever deleted or overwritten History is always available Multilingual Multiple Character Sets Multi Currency 14

Acadiant Modern Graphics: SVG Server Side: Calculation and Storage Client Side: Display and Interaction 15

Back End Stack: Asynchronous Services Ruby Golang R MongoDB Node.js ESB Front End Stack: Asynchronous Mobile Clients HTML JavaScript Cascading Style Sheets Scalable Vector Graphics Intelligent Local Cache Protocols Optimised for High Latency Mobile Networks 16

A New Model for Collaboration Define a Product, Not a One-Off Fix Reduce Implementation Risk Scale Up From a Small Proof of Concept 17

Thank You bill.chute@acadiant.com @AcadiantLtd