MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant Arvind.Hosagrahara@mathworks.



Similar documents
Tackling Big Data with MATLAB Adam Filion Application Engineer MathWorks, Inc.

2015 The MathWorks, Inc. 1

Bringing Big Data Modelling into the Hands of Domain Experts

What s New in MATLAB and Simulink

Turning Data into Actionable Insights: Predictive Analytics with MATLAB WHITE PAPER

Origins, Evolution, and Future Directions of MATLAB Loren Shure

Solving Big Data Problems in Computer Vision with MATLAB Loren Shure

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

2015 Ironside Group, Inc. 2

Harnessing the power of advanced analytics with IBM Netezza

MATLAB in Production Systems, Database Integration, and Big Data Eugene McGoldrick

Speeding up MATLAB and Simulink Applications

Microsoft Research Windows Azure for Research Training

Microsoft Research Microsoft Azure for Research Training

Implement Hadoop jobs to extract business value from large and varied data sets

WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

Assignment # 1 (Cloud Computing Security)

Algorithmic Trading with MATLAB Martin Demel, Application Engineer

BIG DATA TRENDS AND TECHNOLOGIES

Find the Hidden Signal in Market Data Noise

In-Database Analytics

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

From Raw Data to. Actionable Insights with. MATLAB Analytics. Learn more. Develop predictive models. 1Access and explore data

Architectures for Big Data Analytics A database perspective

Big Data Analytics. Chances and Challenges. Volker Markl

HPC ABDS: The Case for an Integrating Apache Big Data Stack

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

BIG DATA SOLUTION DATA SHEET

Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising

Deploying MATLAB -based Applications David Willingham Senior Application Engineer

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Cisco Data Preparation

Microsoft Big Data. Solution Brief

Customized Report- Big Data

Big Data on Microsoft Platform

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

An enterprise- grade cloud management platform that enables on- demand, self- service IT operating models for Global 2000 enterprises

Integrating a Big Data Platform into Government:

IBM Netezza High Capacity Appliance

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

The Inside Scoop on Hadoop

Introducing Oracle Exalytics In-Memory Machine

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

IBM Netezza Analytics

WHAT S NEW IN SAS 9.4

Scalable Architecture on Amazon AWS Cloud

Integrating Apache Spark with an Enterprise Data Warehouse

IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

The 3 questions to ask yourself about BIG DATA

Spotfire and Tableau Positioning. Summary

Introduction to Engineering Using Robotics Experiments Lecture 18 Cloud Computing

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Big Data & the Cloud: The Sum Is Greater Than the Parts

Data Warehouse as a Service. Lot 2 - Platform as a Service. Version: 1.1, Issue Date: 05/02/2014. Classification: Open

Next-Generation Cloud Analytics with Amazon Redshift

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Netezza and Business Analytics Synergy

Mr. Apichon Witayangkurn Department of Civil Engineering The University of Tokyo

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

CLOUD ARCHITECTURE DIAGRAMS AND DEFINITIONS

Machine Learning with MATLAB David Willingham Application Engineer

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

In-memory computing with SAP HANA

Ø Teaching Evaluations. q Open March 3 through 16. Ø Final Exam. q Thursday, March 19, 4-7PM. Ø 2 flavors: q Public Cloud, available to public

Big Data. Lyle Ungar, University of Pennsylvania

Oracle Big Data Building A Big Data Management System

BI VERDICT. The ultimate report on Business Intelligence. TIBCO Spotfire 5. [Analysts: Dr. Christian Fuchs, Larissa Seidler, April 2013]

IBM Netezza Analytics

MATLAB for Use in Finance Portfolio Optimization (Mean Variance, CVaR & MAD) Market, Credit, Counterparty Risk Analysis and beyond

Oracle Data Integration: CON7926 Oracle Data Integration: A Crucial Ingredient for Cloud Integration

Where is... How do I get to...

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Databricks. A Primer

CLOUD COMPUTING. A Primer

Cloud Courses Description

Parallel Computing with MATLAB

Resource Sizing: Spotfire for AWS

Big Data Executive Survey

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Why Big Data in the Cloud?

Cloud Courses Description

Customer Case Study. Sharethrough

The Cloud as a Computing Platform: Options for the Enterprise

Big Data Technologies Compared June 2014

Overview of HPC Resources at Vanderbilt

Transcription:

MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant Arvind.Hosagrahara@mathworks.com 310-819-3970 2014 The MathWorks, Inc. 1

Outline Problem Statement The Big Picture / Financial Applications Reference Architectures and integrations TIBCO Spotfire IBM PureData for Analytics (powered by Netezza) Microsoft SQL Server Handling Big Data and out-of-memory problems (Hadoop, NoSQL databases) MATLAB in Business Critical Applications Conclusions 2

Problem Statement Agility: Is it possible to accelerate the development of data analytics? Democratization: Is it possible to make the results and insights from these analytics available to all stakeholders in an organization? Plan / Design Code Build / Test Production: Is it possible to build data analytics algorithms in a flexible, scalable manner that is suitable for production usage and rigor? Operate/ Monitor Deploy Release 3

Taking MATLAB into production Prototyping Test Production Agility Lightweight processes Visualization Access to data Expressive highlevel language Integration with best-in-class tooling Verification and validation Correctness Performance Automation of testing Test-driven development Performance Scalability Reliability Governance Auditability Maintenance Recoverability Support and Services 4

The Big Picture Reference architectures offer solutions to finance industry applications with an emphasis on selection of the right tool for the task leading to efficient workflows. Big Data Frameworks (eg: Hadoop, Databases) Cloud Solutions (eg: Amazon, Azure, PaaS, IaaS) Analytics Platforms (eg: TIBCO Spotfire ) In-house Enterprise Software Visualization (eg: HighCharts JS) Enterprise Messaging Queues (eg: AMQP) 5

Finance Industry Applications Energy Trading Load Forecasting Model and Storage Assets Price Energy Options Risk Modeling Portfolio Optimization MathWorks Consultants were wellqualified, professional, and fast. They understood not only the technical issues but also the business goals, which is essential when working on a core business system. We got more than we expected from MathWorks Consulting. - Dr. Norbert Tönder, RWE The tools we developed with MATLAB are more reliable, scalable, and maintainable than our spreadsheet-based approach. We know the tools will work, we can add new capabilities, and we can update the production system without getting IT involved. - Manuel Arancibia, Horizon Wind Energy 6

Reference Architecture MATLAB Analytics with TIBCO Spotfire 7

MATLAB Analytics with TIBCO Spotfire 8

MATLAB Analytics with TIBCO Spotfire Rapidly create analytics for use in desktop, web, and mobile visualizations and dashboards Run MATLAB scripts or programs within MATLAB sessions (small scale) Access MATLAB analytics on MATLAB Production Server (large scale) MATLAB Develop Refine Test MATLAB Production Server MATLAB TIBCO Spotfire Develop MATLAB Refine Analytics Test 9

Reference Diagram Mobile Web Desktop MathWorks. MPSExtension TIBCO Spotfire Web Player MathWorks. MPSExtension HTTP(s) MATLAB Production Server MATLAB Analytics TIBCO Spotfire Server 10

MATLAB Analytics Library MATLAB analytics available for use in Spotfire clients LargeTableOut SmallTableOut OptionsPrice OptionsPriceShort 11

Increasing Capacity and Redundancy Mobile Web Desktop Load Balancer MathWorks. MPSExtension TIBCO Spotfire TIBCO TIBCO Web Spotfire Web Spotfire Player Player Web Player MathWorks. MPSExtension Load Balancer MATLAB Production Server MATLAB Production Server MATLAB Analytics TIBCO TIBCO Spotfire Spotfire Server Server 12

Benefits Rapid development and deployment of MATLAB analytics MATLAB Central management of locked down analytics Scalable and reliable architecture MATLAB Compiler SDK MATLAB Production Server MATLAB Analytics available to all business users TIBCO Spotfire Server MATLAB MATLABProduction Server Devel op MATLAB Refin Analytics e Test 13

Reference Architecture MATLAB Analytics with IBM PureData / Netezza 14

MATLAB Analytics with IBM PureData IBM PureData (Netezza) IBM PureData (Netezza) Off-Database Pure Data ODBC/JDBC Driver On-Database Low Performance Processes (UDAP or AE) High-Performance Function (UDF / UDTF / UDA) ODBC Database Toolbox JDBC Python Java MATLAB MDCS MPS MCR C++ Lua Workflow Compute Cluster MATLAB Compiler MATLAB Coder 15

MATLAB Analytics with IBM PureData 16

Building and Running the MATLAB Analytics 1 Generate Code and move to the IBM PureData Appliance 2 Compile the generated code into a shared object for the host and the SPU 3 Register the shared object as a library 4 Use the MATLAB analytic in the User-Defined Function / Table Function / Aggregate [ SELECT Histogram(*) ] 17

Benefits Rapid development and deployment of MATLAB analytics MATLAB High Performance Analytics written in MATLAB and generated as C++ MATLAB Analytics available to downstream users as a Stored Procedure/Function MATLAB Coder and/or Simulink Coder with Embedded Coder IBM PureData for Analytics powered by Netezza or Microsoft Netezza 18

Reference Architecture MATLAB Analytics with Microsoft SQL Server 19

MATLAB Analytics with Microsoft SQL Server Microsoft SQL Server Off-Database ODBC/JDBC Driver On-Database Processes Transactions ODBC JDBC C# Common Language Runtime(CLR) Database Toolbox.NET Assembly MATLAB Component Runtime MPS Client Library MATLAB MATLABProduction Server Devel op MATLAB Refin Analytics e Test MATLAB Compiler SDK 20

MATLAB Analytics with Microsoft SQL Server 1 2 MATLAB MATLABProduction Server Devel op MATLAB Refin Analytics e Test 3 21

Benefits Rapid development and deployment of MATLAB analytics MATLAB Analytics expressed in MATLAB with nearly all available toolbox functionality MATLAB Analytics available to downstream users as a Stored Procedure/Function MATLAB Compiler SDK MATLAB Production Server Microsoft SQL Server MATLAB MATLABProduction Server Devel op MATLAB Refin Analytics e Test 22

Big Data with MATLAB 23

Techniques for Big Data in MATLAB Load, Analyze, Discard parfor, datastore, MapReduce Distributed Memory SPMD and distributed arrays out-of-memory in-memory Embarrassingly Parallel Non- Partitionable Complexity 24

Big Data Capabilities in MATLAB Memory and Data Access 64-bit processors Memory Mapped Variables Disk Variables Programming Constructs Databases Streaming Datastores Block Processing Parallel-for loops GPU Arrays SPMD and Distributed Arrays MapReduce Platforms Desktop (Multicore, GPU) Clusters Cloud Computing (MDCS on EC2) Hadoop 25

MATLAB in Business Critical Applications 26

Conclusions: MATLAB Analytics in Production Agility: Yes. Democratization: Yes. Production Quality: Yes. Access and explore data from within MATLAB during prototype development Develop and integrate analytics with enterprise systems Compact, precise language for expression of complex analytics High-quality libraries and support Open and extensible platform for analytics Data Visualization: TIBCO Spotfire, Hadoop: Apache Data warehouses: IBM PureData, Microsoft SQL Server MATLAB products and Services provide a single-stack solution when used with supporting technologies to address production data analytics demands. 27