Automated Machine Learning For Autonomic Computing



Similar documents
Oracle Big Data SQL Technical Update

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Towards Smart and Intelligent SDN Controller

The Internet of Things

How To Use Numenta Geospatial Tracking

Disrupting The Market: Predictive Analytics As A Service

Cisco IT Hadoop Journey

Bayesian networks - Time-series models - Apache Spark & Scala

Moving From Hadoop to Spark

Big Data for everyone Democratizing big data with the cloud. Steffen Krause Technical

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

ENVI Services Engine: Scientific Data Analysis and Image Processing for the Cloud

The Enterprise Data Hub and The Modern Information Architecture

Brett Gaines Senior Consultant, CGI Federal Geospatial and Data Analytics Lead Developer

I. TODAY S UTILITY INFRASTRUCTURE vs. FUTURE USE CASES...1 II. MARKET & PLATFORM REQUIREMENTS...2

Business Intelligence for Big Data

The Big Data Paradigm Shift. Insight Through Automation

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Predictive Maintenance

Zabbix for Hybrid Cloud Management

Big Data Explained. An introduction to Big Data Science.

Analyzing Big Data with AWS

Winning Against All Odds: Big Data for the Budget Travel Industry. Silviu Preoteasa Head of Marketing Technology

Big Data Use Case: Business Analytics

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

Dashboard Engine for Hadoop

Sunnie Chung. Cleveland State University

Search and Real-Time Analytics on Big Data

Ansible in Depth WHITEPAPER. ansible.com

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

The Analytics Value Chain Key to Delivering Value in IoT

Hadoop IST 734 SS CHUNG

Open source Google-style large scale data analysis with Hadoop

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS

Data Science & Big Data Practice

Comprehensive Analytics on the Hortonworks Data Platform

Thing Big: How to Scale Your Own Internet of Things.

Big data blue print for cloud architecture

Building a SaaS Application. ReddyRaja Annareddy CTO and Founder

Cloud Computing Training

Unified Batch & Stream Processing Platform

Informix The Intelligent Database for IoT

Student Project 2 - Apps Frequently Installed Together

So What s the Big Deal?

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Schema Design Patterns for a Peta-Scale World. Aaron Kimball Chief Architect, WibiData

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May Santa Clara, CA

XpoLog Center Suite Log Management & Analysis platform

Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising

Customized Report- Big Data

Fast Innovation requires Fast IT

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

The Quest for Conformance Testing in the Cloud

White paper: Delivering Business Value with Apache Mesos

An Evaluation of Machine Learning Method for Intrusion Detection System Using LOF on Jubatus

A programming model in Cloud: MapReduce

Building Out Your Cloud-Ready Solutions. Clark D. Richey, Jr., Principal Technologist, DoD

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

How To Handle Big Data With A Data Scientist

Apache Stratos Building a PaaS using OSGi and Equinox. Paul Fremantle CTO and Co- Founder, WSO2 CommiCer, Apache Stratos

Data Centers and Cloud Computing

Cloud Computing at Google. Architecture

Building a Scalable Big Data Infrastructure for Dynamic Workflows

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

XpoLog Competitive Comparison Sheet

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Online Content Optimization Using Hadoop. Jyoti Ahuja Dec

Testing Big data is one of the biggest

Hadoop-based Open Source ediscovery: FreeEed. (Easy as popcorn)

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Ganzheitliches Datenmanagement

Investigating Hadoop for Large Spatiotemporal Processing Tasks

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

SAP Predictive Analytics Roadmap Charles Gadalla SAP SESSION CODE: #####

Outline. What is Big data and where they come from? How we deal with Big data?

ArcGIS for Server in the Cloud

Vortex White Paper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components

WHAT S NEW IN SAS 9.4

EVERYTHING THAT MATTERS IN ADVANCED ANALYTICS

Deploying Predictive Analytics Solutions in the Rail Industry and Seeing a Return on Investment

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

From Spark to Ignition:

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

How telcos can benefit from streaming big data analytics

Health monitoring & predictive analytics To lower the TCO in a datacenter

A New Era Of Analytic

Technology Enablement

Transcription:

Automated Machine Learning For Autonomic Computing ICAC 2012 Numenta Subutai Ahmad

Autonomic Machine Learning ICAC 2012 Numenta Subutai Ahmad

35% 30% 25% 20% 15% 10% 5% 0% Percentage of Machine Learning Papers in ICAC 2009 2010 2011

Numenta! Small company, located in Redwood City, CA About 22 people

Numenta! Created in 2005 Jeff Hawkins, founder of Palm Computing and Handspring Author of On Intelligence

Early Years! 2005 2010 Focused on developing a new theory of machine intelligence and machine learning Inspired by biology and specifically neuroscience

2012! Grok (BETA) A product designed for automated streaming analytics Data Dynamically Generated Models Predictions

Talk Outline What are the challenges of traditional machine learning? What are the properties of autonomic machine learning? Numenta s algorithm and Grok product architecture The future where should we go?

Machine Learning Challenges Extremely manual

Support Vector Machines or Back propagation

Machine Learning Is Labor Intensive Select the right algorithm Experiment with parameters Construct good input features Robust testing and cross-validation

Application: Smart Meters

Application: Smart Meters

Machine Learning Challenges Extremely manual Automated

Machine Learning Challenges Extremely manual Automated Static models

Application: Smart Meters

Application: Smart Meters

Machine Learning Challenges Extremely manual Static models Automated Continuous learning

Machine Learning Challenges Extremely manual Static models Automated Continuous learning Non-temporal

Application: Smart Meters

Application: Smart Meters

Machine Learning Challenges Extremely manual Static models Non-temporal Automated Continuous learning Inherently temporal

Machine Learning Challenges Extremely manual Static models Non-temporal Automated Continuous learning Inherently temporal Batch focused

Machine Learning Challenges Extremely manual Static models Non-temporal Batch focused Automated Continuous learning Inherently temporal Streaming architecture

Properties Of Autonomic Machine Learning Automated Continuous learning Inherently temporal Streaming architecture

Talk Outline What are the challenges of traditional machine learning? What are the properties of autonomic machine learning? Numenta s algorithm and Grok product architecture The future where should we go?

Grok A prediction engine for automated streaming analytics Grok Models Predictions Data Anomaly detection

Grok Architecture A cloud based architecture running on Amazon AWS Customer App Customer App Customer App Data Predictions WebApp Quickstart, Developer Dashboard API Layer REST + Java, Python, JS Grok Engine Modified Hadoop (streaming) Compute Nodes Rabbit MQ Models Optimization Stream processing Hbase User Metadata Models

Neocortex Cortical Learning Algorithm How does this structure learn?

The Cortical Learning Algorithm 1. Learns to encode common static patterns 2. Learns the common sequences of those patterns 3. Makes predictions about future patterns 4. Continuously creates, strengthens and weakens cell connections

Static pattern (time =1)

Static pattern (time =2)

Cells form connections to previously active cells. Predicts its own future activity.

Multiple predictions can occur simultaneously.

High order sequences are enabled with multiple cells per column.

Typical Implementation Size Columns 2000 Cells per column 30 Segments per cell 128 Connections per 40 segment Storage capacity for patterns and sequences is huge

Automated Streaming Architecture Grok automatically estimates model parameters. Users stream data and it starts making predictions within minutes.

Grok Learning Demo

Application: Smart Meters

Typical Energy Profile

Grok Model Predictions

Long Range Temporal Predictions At midnight, Grok makes predictions for next 24 hours

Sequence Memory

Smart Meter: Results Model creation time about 60 minutes Overall prediction accuracy within 2.5% to 5% of actuals Grok able to make hourly, daily, weekly, and monthly predictions

Managing Server Capacity Grok used to predict server demand Actual Predicted Used to provision instances ahead of time Results show approximately 15% reduction in AWS cost Date Incoming server demand, Actual vs. Predicted

Predictive Maintenance Grok used to detect anomalies in gear bearing temperature Can detect anomalies based on temporal characteristics Can be used to proactively optimize maintenance schedules Gear bearing temperature & Grok Anomaly Score

Numerous Applications For Grok & Autonomous Machine Learning Energy prediction Auto provisioning Preventive Maintenance Ad network optimization Credit card fraud detection Power management Server monitoring Malware detection Inventory management

Properties Of Autonomic Machine Learning Automated Continuous learning Inherently temporal Streaming architecture

What s Next? Autonomic Machine Learning is inevitable Machine Learning as a field needs to expand its focus beyond just algorithms, and also address system level issues Are there lessons from Autonomic Computing that should be applied to machine learning? Hope: Autonomic Computing community is able to act as a catalyst, working with the Machine Learning community Oh, and contact me if Grok sounds interesting for your application!

Thank you Contact Info: Subutai Ahmad sahmad@numenta.com