Solving Big Data Problems in Computer Vision with MATLAB Loren Shure



Similar documents
2015 The MathWorks, Inc. 1

Tackling Big Data with MATLAB Adam Filion Application Engineer MathWorks, Inc.

Bringing Big Data Modelling into the Hands of Domain Experts

MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant

Origins, Evolution, and Future Directions of MATLAB Loren Shure

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

What s New in MATLAB and Simulink

Speeding up MATLAB and Simulink Applications

Massive Cloud Auditing using Data Mining on Hadoop

Machine Learning with MATLAB David Willingham Application Engineer

Matlab on a Supercomputer

Parallel Computing with MATLAB

Comprehensive Analytics on the Hortonworks Data Platform

Data Analysis with MATLAB The MathWorks, Inc. 1

Introduction to MATLAB for Data Analysis and Visualization

Deploying MATLAB -based Applications David Willingham Senior Application Engineer

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

3D Model based Object Class Detection in An Arbitrary View

How Companies are! Using Spark

Introduction to MATLAB Gergely Somlay Application Engineer

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

Distributed Computing and Big Data: Hadoop and MapReduce

Hadoop Technology for Flow Analysis of the Internet Traffic

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

Research of Railway Wagon Flow Forecast System Based on Hadoop-Hazelcast

Colorado School of Mines Computer Vision Professor William Hoff

CLOUDDMSS: CLOUD-BASED DISTRIBUTED MULTIMEDIA STREAMING SERVICE SYSTEM FOR HETEROGENEOUS DEVICES

Object Recognition. Selim Aksoy. Bilkent University

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Parallel Programming Map-Reduce. Needless to Say, We Need Machine Learning for Big Data

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Parallel Computing using MATLAB Distributed Compute Server ZORRO HPC

Oracle Big Data SQL Technical Update

What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER

Soma: Linked Data Infrastructure

A Multilevel Secure MapReduce Framework for Cross-Domain Information Sharing in the Cloud

The Principles of the Business Data Lake

MATLAB as a Collaboration Platform Marta Wilczkowiak Senior Applications Engineer MathWorks

Using an In-Memory Data Grid for Near Real-Time Data Analysis

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel

Ø Teaching Evaluations. q Open March 3 through 16. Ø Final Exam. q Thursday, March 19, 4-7PM. Ø 2 flavors: q Public Cloud, available to public

Mobile Cloud Computing for Data-Intensive Applications

MapReduce and Hadoop Distributed File System

Data Refinery with Big Data Aspects

The future: Big Data, IoT, VR, AR. Leif Granholm Tekla / Trimble buildings Senior Vice President / BIM Ambassador

MATLAB in Production Systems, Database Integration, and Big Data Eugene McGoldrick

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Human behavior analysis from videos using optical flow

Using In-Memory Computing to Simplify Big Data Analytics

Analysis of Web Archives. Vinay Goel Senior Data Engineer

SAP and Hortonworks Reference Architecture

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

Using MATLAB to Measure the Diameter of an Object within an Image

BigMemory and Hadoop: Powering the Real-time Intelligent Enterprise

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Echtzeittesten mit MathWorks leicht gemacht Simulink Real-Time Tobias Kuschmider Applikationsingenieur

Data processing goes big

A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories

10- High Performance Compu5ng

TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service

An Industrial Perspective on the Hadoop Ecosystem. Eldar Khalilov Pavel Valov

COMP9321 Web Application Engineering

How To Build A Trading Engine In A Microsoft Microsoft Matlab (A Trading Engine)

Introduction to Cloud Computing

Implement Hadoop jobs to extract business value from large and varied data sets

MapReduce and Hadoop Distributed File System V I J A Y R A O

Eye with Intelligence. Connecting CCTV to Big Data, Hadoop and R

Integrating MATLAB into your C/C++ Product Development Workflow Andy Thé Product Marketing Image Processing Applications

Azure Data Lake Analytics

Computer Vision Technology. Dave Bolme and Steve O Hara

KNIME & Avira, or how I ve learned to love Big Data

CS 378 Big Data Programming. Lecture 2 Map- Reduce

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER

Introduction to Cloud Computing

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

Parallel Computing with Mathematica UVACSE Short Course

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

Copyright Soleran, Inc. esalestrack On-Demand CRM. Trademarks and all rights reserved. esalestrack is a Soleran product Privacy Statement

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control


How To Use Hadoop For Gis

CS555: Distributed Systems [Fall 2015] Dept. Of Computer Science, Colorado State University

Transcription:

Solving Big Data Problems in Computer Vision with MATLAB Loren Shure 2015 The MathWorks, Inc. 1

Why Are We Talking About Big Data? 100 hours of video uploaded to YouTube per minute 1 Explosive increase in number of imaging devices Webcams Smartphone Cameras IP Cameras Industrial Cameras 1- KPCB 2013 Internet Trends http://www.kpcb.com/blog/2013-internet-trends 2

2. Can I find patterns or models to represent my image data? 5. What if my data is too big to fit into memory? Goal: Show you new functionality in MATLAB to augment existing workflows to solve these challenges 3

Problem: Image Category Classification Given a large set of images of cars, planes and motorbikes How to model the data to recognize which category an image belongs to? Image Processing Algorithm motorbike Data from Caltech 101: L. Fei-Fei, R. Fergus and P. Perona. Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. IEEE. CVPR 2004, Workshop on Generative-Model Based Vision. 2004 http://www.vision.caltech.edu/image_datasets/caltech101/ 4

imageset 2. Can I find patterns or models to represent my image data? 5. What if my data is too big to fit into memory? 5

imageset 2. Can I find patterns or models to represent my image data? 5. What if my data is too big to fit into memory? 6

Machine Learning Workflow Using Images Training Data Feature Extraction Learning or Modelling imageset bagoffeatures trainimagecategoryclassifier Classifier / Model hat imageset Input Image bagoffeatures Feature Extraction evaluate Classification 8

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox 5. What if my data is too big to fit into memory? 9

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox 5. What if my data is too big to fit into memory? 10

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox Image Batch Processor App 5. Can I acquire large sets of images using MATLAB? 11

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox Image Batch Processor App 5. What if my data is too big to fit into memory? 12

Parallel Computing with MATLAB MATLAB Distributed Computing Server MATLAB Parallel Computing Toolbox 13

14

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox Image Batch Processor App Parallel Computing Toolbox, MATLAB Distributed Computing Server 5. What if my data is too big to fit into memory? 15

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox Image Batch Processor App Parallel Computing Toolbox, MATLAB Distributed Computing Server 5. What if my data is too big to fit into memory? 16

Big Data Stored on Hadoop Easily migrate programs from workstation to Hadoop On Your Workstation Access, explore, and analyze out-of-memory data Prototype MapReduce based algorithms Access portions of data sets on HDFS Connected to Hadoop Run algorithms on Hadoop against data in HDFS MATLAB Distributed Computing Server Production Hadoop Create applications or components that execute on Hadoop MATLAB Compiler 17

Scaled Analysis Prototype Access Explore Share/Deploy Scale Load, Analyze, Discard datastore, parfor MapReduce Distributed Memory SPMD out-of-memory in-memory Embarrassingly Parallel Complexity Non-Partitionable 18

imageset to manage large collections of images 2. Can I find patterns or models to represent my image data? Computer Vision System Toolbox, Statistics and Machine Learning Toolbox Image Batch Processor App Parallel Computing Toolbox, MATLAB Distributed Computing Server 5. What if my data is too big to fit into memory? MapReduce, distributed arrays, SPMD, Hadoop 19

Thank You! 25