Beyond Object Storage A Unified Storage Architecture for IoT. Sasikanth Eda, Sandeep Patil IBM

Similar documents
IBM Storage Technical Strategy and Trends

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

SwiftStack Filesystem Gateway Architecture

Understanding Object Storage and How to Use It

The Future of Data Management

IBM Spectrum Protect in the Cloud

IBM ELASTIC STORAGE SEAN LEE

OpenStack Manila File Storage Bob Callaway, PhD Cloud Solutions Group,

EMC BACKUP MEETS BIG DATA

Reducing Configuration Complexity with Next Gen IoT Networks

VMware and Primary Data: Making the Software-Defined Datacenter a Reality

WHITE PAPER. Software Defined Storage Hydrates the Cloud

Fast Innovation requires Fast IT

GPFS Cloud ILM. IBM Research - Zurich. Storage Research Technology Outlook

SOFTWARE DEFINED STORAGE IN ACTION

SMB in the Cloud David Disseldorp

RED HAT STORAGE PORTFOLIO OVERVIEW

Hadoop in the Hybrid Cloud

The Design and Implementation of the Zetta Storage Service. October 27, 2009

High Performance Computing OpenStack Options. September 22, 2015

I D C A N A L Y S T C O N N E C T I O N. T h e C r i t i cal Role of I/O in Public Cloud S e r vi c e P r o vi d e r E n vi r o n m e n t s

Journey to the cloud. Sergei Butenko District Manager EMC

SOFTWARE-DEFINED STORAGE IN ACTION

Red Hat Storage Server

Vortex White Paper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

StorReduce Technical White Paper Cloud-based Data Deduplication

Jitterbit Technical Overview : Salesforce

Driving Datacenter Change

Consumption IT. Michael Shepherd Business Development Manager. Cisco Public Sector May 1 st 2014

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

Securing the Internet of Things Opportunities and Challenges with scaling IoT solutions

Integrated Application and Data Protection. NEC ExpressCluster White Paper

Addressing Storage Management Challenges using Open Source SDS Controller

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

HP Converged Cloud Cloud Platform Overview. Shane Pearson Vice President, Portfolio & Product Management

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

SOLUTIONS PRODUCTS INDUSTRIES RESOURCES SUPPORT ABOUT US ClearCube Technology, Inc. All rights reserved. Contact Support

Seagate Cloud Systems & Solutions

SOFTWARE DEFINED NETWORKING

Amazon Cloud Storage Options

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN

THE FUTURE OF STORAGE IS SOFTWARE DEFINED. Jasper Geraerts Business Manager Storage Benelux/Red Hat

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Scale out NAS on the outside, Object storage on the inside

Object-based Storage in Big Data and Analytics. Ashish Nadkarni Research Director Storage IDC

How To Manage A Cloud System

SUSE Storage. FUT7537 Software Defined Storage Introduction and Roadmap: Getting your tentacles around data growth. Larry Morris

How To Handle Big Data With A Data Scientist

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Diagram 1: Islands of storage across a digital broadcast workflow

Virtualizing Apache Hadoop. June, 2012

Enabling the Future of Networks, Enterprises & Clouds Steve Smith, CEO

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Big Data Trends and HDFS Evolution

Big Data. White Paper. Big Data Executive Overview WP-BD Jafar Shunnar & Dan Raver. Page 1 Last Updated

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

Data Services Advisory

NetApp Big Content Solutions: Agile Infrastructure for Big Data

Assignment # 1 (Cloud Computing Security)

A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief

2011 FileTek, Inc. All rights reserved. 1 QUESTION

EMC Isilon: Data Lake 2.0

IBM Smart Business Storage Cloud

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

NextGen Infrastructure for Big DATA Analytics.

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Luncheon Webinar Series May 13, 2013

Jitterbit Technical Overview : Microsoft Dynamics AX

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

How Our Cloud Backup Solution Protects Your Network

Big Data Management and Security

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

Nutanix Solutions for Private Cloud. Kees Baggerman Performance and Solution Engineer

Johan Hallberg Research Manager / Industry Analyst IDC Nordic Services & Sourcing Digital Transformation Global CIO Agenda

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Big Data: Overview and Roadmap eglobaltech. All rights reserved.

THE REALITIES OF NOSQL BACKUPS

Software-defined Storage

Apache HBase. Crazy dances on the elephant back

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Revitalising your Data Centre by Injecting Cloud Computing Attributes. Ricardo Lamas, Cloud Computing Consulting Architect IBM Australia

Microsoft Private Cloud Fast Track

Understanding Enterprise NAS

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

Modern App Architecture for the Enterprise Delivering agility, portability and control with Docker Containers as a Service (CaaS)

IBM Global Technology Services September NAS systems scale out to meet growing storage demand.

Jitterbit Technical Overview : Microsoft Dynamics CRM

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Customized Report- Big Data

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

Global Big Data Market: Trends & Opportunities ( ) June 2015

Product Brochure. Hedvig Distributed Storage Platform Modern Storage for Modern Business. Elastic. Accelerate data to value. Simple.

Improve your IT Analytics Capabilities through Mainframe Consolidation and Simplification

Transcription:

Beyond Object Storage A Unified Storage Architecture for IoT Sasikanth Eda, Sandeep Patil IBM

Agenda Internet of Things (IoT): Market Estimates And Forecasts IoT: Challenges IoT Platform Architectures IoT Platform: Storage Requirements Enhancements to Object Storage to support IoT workloads Better 2

IoT: Evolution of Devices According to IDC - The Internet of Things (IoT) is a network of networks of uniquely identifiable endpoints (or "things") that communicate without human interaction using IP connectivity - whether locally or globally. Source: http://www.gartner.com/newsroom/id/2636073 3

IoT: Market Estimates And Forecasts (Industry Specific) More than 50% of manufacturers identify lower operational costs as the most significant driver of their organization s Internet of Things (IoT) initiatives over the next 12 24 months (05/06/2015, IBM Institute for Business Value). The value of Internet of Things in healthcare is forecasted to reach $163.24 billion by 2020, with a CAGR of 38.1% for 2015-2020 (01/05/2016, emarketer). By 2018, it is expected to drive development of over 200,000 new IoT apps and services (11/03/2015, IDC). 4

IoT: Market Estimates And Forecasts (Cloud Specific) 36% of hybrid leaders are using hybrid cloud for Internet of Things. (02/09/2016, IBM Center for Applied Insights) 52% of IoT developers implement cognitive computing and artificial intelligence in their development work. (11/09/2015, Evans Data) By 2019, 80% of new applications using the Internet of Things (IoT) will analyze data in motion as well as collect this information for analysis of data at rest. (12/03/2015, Gartner) 5

IoT: Challenges (Technology) Cost of connectivity Solutions are expensive (high infrastructure, maintenance costs, service costs of middlemen). Internet after trust In IoT networks trust (privacy, anonymity) is either expensive or difficult to guarantee. Lack of functional value - Simple connectivity enablement may or may not make a device smarter or better. - Connectivity with intelligence makes a better product and experience. 6

IoT: Challenges (Business) Not future-proof - Refresh cycle for basic pieces of IoT infrastructure (door locks, bulbs etc.) is quite long (may last for years, even decades). Broken business models - Unlike PC or smart phones, IoT devices (locks, bulbs etc.) worked without apps and service contracts, which make revenue expectations unrealistic. - End to End IoT solutions (Smart TV to speak to the toaster, Smart washing machine to speak to detergent vendor) get cumbersome quickly. 7

IoT: Current State Needs a Reboot IoT landscape is been evolving at a rapid pace and there has been little innovation to address the IoT market from a storage infrastructure standpoint. Currently no storage vendor is offering specific storage solutions for IoT deployments. There is no single best approach to data management in the context of IoT. Need for a common framework, enhancements in performance and security. 8

IoT Platform: Centralized Architecture IoT centralized architecture uses a hub (typically powered by the cloud) that controls the execution of nodes (smart devices). Uses Platform-as-a-Service model. Current database architecture and data management strategy may not be sufficient to handle large scale IoT networks. Big data architecture gets complex, as IoT devices produce hundreds of thousands of data points per second. Offered Services: Event processing, Device discovery, Device management, Event Notifications, Real Time Analytics. 9

IoT Platform: Decentralized Architecture ADEPT platform Autonomous Decentralized Peer-To-Peer Telemetry An effort to prove the foundational concepts around decentralized approach. Key Objectives: Distributed Transaction Processing & Applications: Robust Security Privacy by design and default Designed for commerce and market places 10

IoT Platform: Decentralized Architecture Key Solution Components: Trustless peer-to-peer messaging Secure distributed data sharing / file transfers Autonomous Device coordination. 11

IoT Platform: Storage Requirements 1. Scalable: Cost-effective scalability, and the ability to decouple capacity and performance. 2. Self-healing: Traditional RAID mechanisms may be ineffective in large scale environments. Having a self-healing architecture with configurable data availability classes. 3. Data awareness: Ability to create a data-aware storage platform, which provides both context to the data and policy management, as well as visualization tools to harness the value of that data for better business insights. * Source: Gartner, Reassess Storage Requirements for Successful IoT Implementations 12

Why Object Storage For IoT Early predictions (why object storage suits best for IoT workload) Designed to support large volumes and velocity of data. Native HTTP / REST support. Linear Scale out architecture (Scale in all directions including IOPS, latency) Distributed Architecture (Performance, load distribution) No single point of failure. Support for heterogeneous data types, different data protection policies. 13

Why Object Storage is falling short? Micro services of IoT Type of Storage Appropriate Storage Platforms Batch Processing Primary Storage Distributed Filesystems Real-Time Processing Primary Storage Memory-centric Filesystems, NoSQL Data Stores Active Archiving Secondary Storage Distributed Filesystems, Object Storage Cold Storage Primary Storage Object Storage with Spin down capabilities, Cloud Storage, Tape Libraries * Source: Gartner 2015 Based on a recent survey on IoT implementations, it is observed that object storage is not considered for real time processing of IoT data (However success of IoT implementation hugely depends on analytics platform). 14

Factors that help Object Storage to hold its promise 1: In-Place Analytics 2: IoT data collection and Privacy filters 3: Native Event, Cross Device Notification, Life Cycle Management 15

Issue-1: In-Place Analytics Analytics on Traditional Object Store Analytics With Unified File and Object Access Explicit Data movement Object (https) In-Place Analytics Spark or Hadoop MapReduce Results returned In place Data ingested as Objects <Unified_Fileset>/<Device> Clustered Filesystem Results Published as Objects with no data movement 1. Data to be migrated from object store to dedicated analytic cluster. 2. Perform the analysis and copy results back to object store for publishing. Reference: https://aws.amazon.com/elasticmapreduce/ Object data available as Files on the same fileset. Analytics systems (Hadoop, Spark) can directly leverage this data analytics. No data movement / In-Place immediate data analytics 16

Swift-on-File for In-Place Analytics This object: http://swift.example.com/v1/acct/cont/obj was stored with Swift here: /mnt/sdb1/2/node/sdb2/objects/981/f79/f566bd022b9285b05e665f d7b843bf79/1401254393.89313.data But is now stored with SwiftOnFile here: /mnt/scaleoutfs/acct/cont/obj For more details refer: OpenStack SwiftOnFile - User Identity for Cross Protocol Access Demystified 17

Unified Identity Between Object And File Object Authentication via keystone Object Access SwiftOnFile device File Access NFS / CIFS POSIX File Authentication IBM Spectrum Scale Common AD/LDAP Common set of Object and File users using same directory service (AD+RFC 2307 or LDAP) Objects created using Swift API will be owned by the user performing the Object operation (PUT) Note that if object already exists, existing ownership of object will be retained 18

Issue-2: IoT Data Collection and Privacy Filters Data collection and maintaining privacy on IoT devices is a big deal. Lack of trust on details collected by commodity IoT devices. Storing these details on cloud makes situation more uncomfortable. End user configurable Middleware to filter out the privacy parameters per IoT device. End user configurable Expiry of objects, metadata per IoT device. Automated placement of data per IoT device. Object Server Middleware App Object Server IBM Spectrum Scale Filter custom data 19

Issue-3: Event, Cross Device Notification, Life Cycle Management Avoids need of database triggers or performance Object-x Object-y Cross Device Notification Engine IBM Spectrum Scale Object-x Slow Disk High Disk hampering analytic engine to act on a particular set of received IoT data. Policies, rules for cross device notification can applied based on received object content, type, device type, metadata details and generated event type. Auto-migration / placement of data based on its dependency for cross device notification. 20

IBM Spectrum Scale Data management at scale OpenStack and Spectrum Scale helps clients manage data at scale Business: I need virtually unlimited storage An open & scalable cloud platform HDFS POSIX NFS SMB Glance Cinder Spectrum Scale Manila Swift Operations: I need a flexible infrastructure that supports both object and file based storage Operations: I need to minimize the time it takes to perform common storage management tasks A single data plane that supports Cinder, Glance, Swift, Manila as well as NFS, et. al. A fully automated policy based data placement and migration tool SSD Fast Disk Slow Disk Tape Collaboration: I need to share data between people, departments and sites with low latency. Sharing with a variety of WAN caching modes Avoid vendor lock-in with true Software Defined Storage and Open Standards Seamless performance & capacity scaling Automate data management at scale Enable global collaboration Results Converge File and Object based storage under one roof Employ enterprise features to protect data, e.g. Snapshots, Backup, and Disaster Recovery Support native file, block and object sharing to data.

THANK YOU Acknowledgements Dean Hildebrand Bill Owen Shyama VenuGopal