CIRUS : A Cloud Infrastructure for Real-time Ubilytics (aka ubiquitous big data analytics)



Similar documents
Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Hybrid Cloud Architectures for Operational Performance Management

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Real Time Big Data Processing

Vortex White Paper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

BIG DATA. Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management. Author: Sandesh Deshmane

The Internet of Things

Systems of Discovery The Perfect Storm of Big Data, Cloud and Internet-of-Things

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

So What s the Big Deal?

The 4 Pillars of Technosoft s Big Data Practice

Could IoT be WebRTC's greatest source of innovation?

Da Internet delle persone a Internet delle cose. Giovanni Pirola Regional Service Manager Red Hat, Inc. November 19th 2014

Internet of Things. Opportunity Challenges Solutions

Big Data Analytics - Accelerated. stream-horizon.com

Big Data Analytics. Chances and Challenges. Volker Markl

Challenges for Data Driven Systems

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Enterprise Application Enablement for the Internet of Things

NoSQL for SQL Professionals William McKnight

BIG DATA TRENDS AND TECHNOLOGIES

Big-Data Computing with Smart Clouds and IoT Sensing

Cloud Computing Now and the Future Development of the IaaS

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Analyzing Big Data with AWS

Big Data Analytics. Lucas Rego Drumond

Internet of things (IOT) applications covering industrial domain. Dev Bhattacharya

Big Data and Cloud Computing

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Emerging Technology for the Next Decade

Certified Cloud Computing Professional VS-1067

Cloud Computing Training

Big Data and Industrial Internet

Building Out Your Cloud-Ready Solutions. Clark D. Richey, Jr., Principal Technologist, DoD

SnapLogic extends beyond cloud and big-data integration into the Internet of Things

Informix The Intelligent Database for IoT

MASHUPS FOR THE INTERNET OF THINGS

How To Understand Cloud Computing

Big Data & Its Bigger Possibilities In The Cloud

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Azure Data Lake Analytics

Evolving from SCADA to IoT

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Thing Big: How to Scale Your Own Internet of Things.

ANALYTICS BUILT FOR INTERNET OF THINGS

Big Data Success Step 1: Get the Technology Right

Data Centers and Cloud Computing. Data Centers

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

STREAM PROCESSING AT LINKEDIN: APACHE KAFKA & APACHE SAMZA. Processing billions of events every day

Security Infrastructure for Trusted Offloading in Mobile Cloud Computing

Internet of Things, 5G, Big Smart Data Interplay. The Convergence and Integration of Mobile Communications, Internet, and Smart Data Processing

Introduction to Cloud Computing

Demystifying the Cloud Computing

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

From Spark to Ignition:

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

NextGen Infrastructure for Big DATA Analytics.

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Delivering secure, real-time business insights for the Industrial world

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

SMART IoT PROTOCOLS. Creating the Living Network. Chonggang Wang Innovation Lab, InterDigital Communications. December 8, 2014

Mobile Cloud Computing: Paradigms and Challenges 移 动 云 计 算 : 模 式 与 挑 战

Big Data Pipeline and Analytics Platform

Cloud computing - Architecting in the cloud

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop

OpenStack Benelux. Dan Chester. Seagate Cloud Systems & Solutions

Deploying Big Data to the Cloud: Roadmap for Success

Consumption IT. Michael Shepherd Business Development Manager. Cisco Public Sector May 1 st 2014

COMP9321 Web Application Engineering

Cloud Computing An Introduction

Peter Zhang, Ph.D. Principal consultant, ZHTECH Corporation Adjunct Professor, Cal State University, Fullerton

The Top 10 7 Hadoop Patterns and Anti-patterns. Alex

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

Convex Optimization for Big Data: Lecture 2: Frameworks for Big Data Analytics

Cloud Computing Technology

OpenShift 3.0 in the Sogeti Services Factory

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

The Private Cloud Your Controlled Access Infrastructure

Big Data and Analytics in Government

Architecting for the Internet of Things & Big Data

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

Are You Ready for Big Data?

Introducing Oracle Exalytics In-Memory Machine

Disrupting The Market: Predictive Analytics As A Service

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

How To Handle Big Data With A Data Scientist

ENABLING RELIABLE COMMUNICATION FOR EMERGING M2M/IOT APPLICATIONS

Data Centers and Cloud Computing. Data Centers

An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture

Cloud Computing Summary and Preparation for Examination

Transcription:

CIRUS : A Cloud Infrastructure for Real-time Ubilytics (aka ubiquitous big data analytics) Didier Donsez Université de Grenoble LIG / ERODS P.N@imag.fr From Processing.org 1

Thanks to Manfred for the introduction of my talk 3

Web X.0 (X > 3) Emerging ICT domains Cloud Computing Big Data Analytics Internet of (Every)Things Social Networks Mobile computing Crowd sourcing Open data... 4

Internet of Things (IoT) Industrial IoT (IIoT) RFID / NFC phones SCADA Instrumentation Communication Mediation Decision Action Mining sensor nodes robot 5

Big Picture of Internet of (Every)Things, Data and Services Network cell size WAN MAN LAN WLAN WSN PAN BAN Home Automation SOHO 21/06/2014 Smart Public Space Smart Building Industry 4.0 Smart Cities Urban Spaces Geographic Scale 6 6

Big Picture of Internet of (Every)Things, data and services 7

Internet(s) of Data, Things and Services Internet of (Chatty) Things Internet of Everything Internet of People Internet of My Things Industrial Internet of Things (IIoT) : Industry 4.0 Fog Computing Cyber-Physical Systems... 8

What is Cloud Computing? On-demand computing Advantages IaaS, PaaS, SaaS Public, Private, Hybrid, Community, User-Centric, Souverain Virtualization, TCO, Resilience, Elasticity, Energy efficency, Big Data Analytics Drawbacks Confidentiality (Privacy,Industrial properties, ) Souverainety D. Donsez, Intergciels IoT 9

Cloud Services Models Software as a Service Saleforce, Steam,... Platform as a Service Google App Engine, Amazon Hadoop... Infrastructure as a Service Amazon EC2,... Virtual/Physical Infrastructure (FaaS) IT cooling Smart Green Grid H2 10

Cloud Computing : UbiCloud, Cloud of Things,... UbiCloud Clouds with/for Ubi-terminals (smartphones, tablet cars, IDS,...) Cloud of Things (CoT) Cloud for Things (data collection and long-term storage...) Things are facilities in the FaaS D. Donsez, Intergciels IoT 11

Cloud of Things Xively, Axeda, (on-demand access to any applications) Eurotech Platform as a Service (PaaS) Software as a Service (SaaS) (on-demand platform of delivering your own application) Inrastructure as a Service (IaaS) (on-demand cpu/storage/nw infrastructures) Virtual/Physical Infrastructure (FaaS) IT cooling Deltadrone, Cloud Robotics Smart Green Grid H2 D. Donsez, Intergciels IoT 12

What is Big Data (Analytics)? 13

Big data is like teenage sex Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it Dan Ariely, Professor at Duke University, TED speaker 14

The Data Deluge and the next IoT Data Deluge1ZB=10^12 GB http://www.snia.org/sites/default/files2/abds2012/tutorials/robpeglar_introduction_analytics%20_big%20data_hadoop.pdf 15

The 4+1V of Big Data Volume Velocity Variety Veracity + Value 16

IoT Big Data 5V Volume Velocity Variety Veracity Value 17

Big Data and IoT Ubilytics : Ubiquitous big data analytics Realtime prediction on the sensor data flows For realtime decision (ie action) Mixin with other data sources Corporate data Open Data (gov,...) Crowd-sourced data Social networks posts/tweets... Voir Talk ICAR 2013 : Big Data par Jean-Laurent Philippe, http://erods.liglab.fr/icar2013/programme.html#intel D. Donsez, Intergciels IoT 18

How analyze the Iot Data Deluge? Fastly (hours) and Very Fastly (milliseconds) For speeding and improving decision supports Business Intelligence tools (OLTP, OLAP, ) can't! Now computing models are avalaible Massively distributed, on-demand, fault tolerant But All old-fashioned statistical/prediction methods must be rethink 19

Computing Models for Big Data Post-Processing Batch TBs / PBs of stored data Continuous Event Streaming 100 MB/s of live data High-latency Decision Support Low-latencyDecisionSupport Map Reduce Event Stream Processing Hadoop, SciDB, Spark, Giraph,... Map-Update MUD8P Storm, S4, Samza, Millwheel,... Discretized Stream Processing Spark Streaming 20

Event Stream Processing Massively distributed processing of continuous flows of events (sensors data,...) Low-latency (few millisec after) mutable state mutable state node 1 node 3 input records output records node 5 input records node 2 Node 4 21

Example : Event Stream Processing Trending Topics VoIPSTREAM (VS) Twitter Sentiment Analysis (SA) From Maycon Bordin's ms thesis 22

Lambda Architecture Nathan Marz (Twitter, Backtype) Combine Low and High Latency BD stacks An aly tic al M ode l Batch layer can compute the analytics model of the Speed Layer http://jameskinley.tumblr.com/post/37398560534/the-lambda-architecture-principles-for-architecting

Ubilytics Ubiquitous Big Data Analytics Motivation PaaS for «Ubilytics» End-to-End From sensors, gateways and lambda architecture (cloud) «Simple as Possible» Autonomic : scalability, fault tolerance for IoT SMEs & their IoT data scientists Problem Huge variety of needs Huge variety of technologies 24

New trendy Job : IoT Data Scientist Gartner says big data creates big jobs: 4.4 million IT jobs globally to support big data by 2015. http://www.gartner.com/newsroom/id/22 07915 The U.S. could face a shortage by 2018 of 140,000 to 190,000 people with "deep analytical talent" and of 1.5 million people capable of analyzing data in ways that enable business decisions. (McKinsey & Co) Big Data industry is worth more than $100 billion growing at almost 10% a year (roughly twice as fast as the software business) http://nirvacana.com/thoughts/becoming-a-data-scientist/ How can this guy deal with this deluge of technologies? How to make this guy productive?

Who is able to develop/ deploy Ublilytics infrastructures? IoT Data Scientist Cloud Computing Big Data Analytics Internet of Things D. Donsez, CAL & CIEL 2014 26

Towards "Ubilytics" PaaSs Send selected sensors meausrements M2M Gateway @ Home, Office, City, Warehouse (OpenHAB, IoTSys) In Elastic Hybrid Cloud NoSQL Store MongoDB, Cassandra, HDFS In elastic Cloud Message MessageBroker Broker orormaas or PSaaS MaaS or PSaaS Mosquitto, RabbitMQ, Mosquitto, RabbitMQ, Protocols : MQTT, AMQP, Protocols : MQTT, AMQP, STOMP, XMPP, CoAP, STOMP, XMPP, CoAP, WebRTC, Motwin... WebRTC, Motwin... SmartPhone @ Car, City, Embedded boards smartphones by millions Sensors data messages ie energy Consumption, temperature, images,... MapReduce Hadoop Computed prediction model Realtime ESP Storm, Samza, S4, MQTT, AMQP, Spark Streaming, STOMP... topologies Mashup, Reporting, Dashboard,. (history charts,...) Storing agregates Predictions Trends,... Monitoring Placement (static,dynamic) Deployment & (Re)Configuration (Roboconf) 27

Ubilytics Example Energy Consumption Forecast Domain : Smart Grid 2125 individual smart plugs in 40 houses measuring and sending instant load (W) and cumulative load (kw) Sensor Dataset*: Events contain instant load (W) and cumulative load (kw) 130 millions events/day (on one month) 3 GB/day Challenge* : Forecasts loads at 1min, 5min, 15min, 60min and 120min Goal : anticipate electricity demand for ajusting the production (ie. save energy and avoid blackout) * DEBS GC 2014 http://www.cse.iitb.ac.in/debs2014/?page_id=42# 28

Ubilytics Example Energy Consumption Forecast MQTT Broker + Storm Topology + Cassandra DB on a Azure VM cluster instant load (W) and cumulative load (kw) 29

Smart Campus (ie Small Smart City) OpenHAB, Galileo, MQTT, Storm, Azure,... 30

Smart Campus 31

Smart Campus 32

SmartCampus 33

Smart Campus 34

Q&A D. Donsez, CAL & CIEL 2014 35