Open Source for Cloud Infrastructure



Similar documents
Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Has been into training Big Data Hadoop and MongoDB from more than a year now

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Scalable Architecture on Amazon AWS Cloud

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Bringing Big Data to People

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Implement Hadoop jobs to extract business value from large and varied data sets

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer

Upcoming Announcements

Case Study : 3 different hadoop cluster deployments

HDP Hadoop From concept to deployment.

Big data blue print for cloud architecture

Building an Open Source Private Cloud

Red Hat Enterprise Linux is open, scalable, and flexible

ebay Storage, From Good to Great

Peers Techno log ies Pv t. L td. HADOOP

HDP Enabling the Modern Data Architecture

Dominik Wagenknecht Accenture

Virtualizing Apache Hadoop. June, 2012

Oracle Big Data SQL Technical Update

Certified Cloud Computing Professional VS-1067

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Sentimental Analysis using Hadoop Phase 2: Week 2

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

The Future of Data Management

APP DEVELOPMENT ON THE CLOUD MADE EASY WITH PAAS

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

6.S897 Large-Scale Systems

the missing log collector Treasure Data, Inc. Muga Nishizawa

Assignment # 1 (Cloud Computing Security)

Intel IT s Cloud Journey. Speaker: [speaker name], Intel IT

Qsoft Inc

The Open Cloud Near-Term Infrastructure Trends in Cloud Computing

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Drive new Revenue With PaaS/IaaS. Ruslan Synytsky CTO, Jelastic

HP CLOUD SYSTEM. The most complete, integrated platform for building and managing clouds featuring Intel technologies.

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation

How to choose the right PaaS Platform?

Cloud Computing: Making the right choices

Enabling High performance Big Data platform with RDMA

Open Source Software. The Foundation for Tomorrow s Infrastructure. Al Gillen. Program VP, System Software IDC April 2013

RED HAT CLOUD SUITE FOR APPLICATIONS

HYBRID CLOUD SUPPORT FOR LARGE SCALE ANALYTICS AND WEB PROCESSING. Navraj Chohan, Anand Gupta, Chris Bunch, Kowshik Prakasam, and Chandra Krintz

BIG DATA TRENDS AND TECHNOLOGIES

How To Create A Data Visualization With Apache Spark And Zeppelin

Change the Game with HP Helion

Cloud Application Development (SE808, School of Software, Sun Yat-Sen University) Yabo (Arber) Xu

INTRODUCTION TO APACHE HADOOP MATTHIAS BRÄGER CERN GS-ASE

Java, PHP & Ruby - Cloud Hosting

Workshop on Hadoop with Big Data

Apache Hadoop: Past, Present, and Future

Deploying Hadoop with Manager

Delivering Managed Services Using Next Generation Branch Architectures

Cloud Computing. Big Data. High Performance Computing

Huawei and Open Source. Industry development department Shi Hao

Accelerating Enterprise Big Data Success. Tim Stevens, VP of Business and Corporate Development Cloudera

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

TRAINING PROGRAM ON BIGDATA/HADOOP

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Applied Micro development platform. ZT Systems (ST based) HP Redstone platform. Mitac Dell Copper platform. ARM in Servers

Cloud Computing and Big Data What Technical Writers Need to Know

Modernizing Your Data Warehouse for Hadoop

Application and practice of parallel cloud computing in ISP. Guangzhou Institute of China Telecom Zhilan Huang

HADOOP AND MAINFRAMES CRAZY OR CRAZY LIKE A FOX? Mike Combs, VP of Marketing mcombs@veristorm.com

How Cisco IT Built Big Data Platform to Transform Data Management

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

Savanna Hadoop on. OpenStack. Savanna Technical Lead

Comprehensive Analytics on the Hortonworks Data Platform

Data Security in Hadoop

Architecting Open source solutions on Azure. Nicholas Dritsas Senior Director, Microsoft Singapore

How To Handle Big Data With A Data Scientist

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Evolution from Big Data to Smart Data

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Big Data for Big Science. Bernard Doering Business Development, EMEA Big Data Software

Open Source Technologies on Microsoft Azure

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Cisco IT Hadoop Journey

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

The Future of Data Management with Hadoop and the Enterprise Data Hub

How Bigtop Leveraged Docker for Build Automation and One-Click Hadoop Provisioning

How To Get The Most Out Of Redhat.Com

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM

CLOUD TECH SOLUTION AT INTEL INFORMATION TECHNOLOGY ICApp Platform as a Service

Transcription:

Open Source for Cloud Infrastructure June 29, 2012 Jackson He General Manager, Intel APAC R&D Ltd.

Cloud is Here and Expanding More users, more devices, more data & traffic, expanding usages >3B 15B Connected Users 1 Connected Devices 2 >11X Increase in global mobile data traffic 5 Expanding Media Usages Exabytes 100 80 60 40 20 0 670% GROWTH IN STORAGE CAPACITY SHIPPED 3 2009 2010 2011 2012 2013 2014 Forecast 2000 1500 1000 500 >1500 EXABYTES OF TRAFFIC 4 Forecast 1. Cisco Global Cloud Index Nov 2011 2. Intel ECG Worldwide Device Estimates Year 2020 - Intel One Smart Network Work forecast 3. IDC Worldwide enterprise storage systems forecast 2010-2014, Dec 2010 4. Cisco Global Cloud Index Nov 2011 5. Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2011 2016, Feb 2012 2 0 Video Search Video Surveillance 70% of all Global Mobile Data Traffic will be Video 5 Live Broadcasts

Intel Open Cloud Vision FEDERATED Share data securely across public and private clouds AUTOMATED IT can focus more on innovation and less on management CLIENT AWARE Optimizing services based on device capability

Realizing Business Opportunities in Open Source INTEL INTERNAL CONFIDENTIAL

Platform of Choice in Open Source Open-source-based solutions, optimized for Intel architecture, unlock new business opportunities Reach new markets and expand existing markets Differentiate your offering atop a solid, standard platform Prosper in a unique, growing ecosystem Deliver exceptional experiences that delight end-customers INTEL CONFIDENTIAL

Advancing Open Source On Many Levels Technology Innovator Catalyst for innovation through advanced hardware and software technologies INTEL IN OPEN SOURCE Igniting the sparks of technology innovation across the computing landscape. Ecosystem Builder Helps create a vibrant, thriving open-source ecosystem Project Contributor Actively contributes to a breadth of open-source projects across every layer of the software stack Funding Support Strong financial position makes Intel a healthy, open-source contributor

Open source cloud SW Components (sample) Developer IDE IDE Eclipse Compilers GNU Tools Threading Languages C/C++ Java Ruby PHP Python Javascript Perl Scala Other OS Linux Tizen Load balancer Asd Security Snort/Base Tools Memcached Management Middleware Other Service Provider Monitoring Ganglia Nagios Cacti Zenoss Zabbix Cloud Operating Env. OpenStack OpenQRM Object Store Swift Hbase Cassandra MongoDB CouchDB Database Cassandra Message Queue RabbitMQ PaaS OpenShift Cloud Foundry Analytics Hadoop Billing jbilling Reporting Logwatch RRDtool

Open Source for Cloud Solution Stack Server Graphics Big Data LAMP Legacy Enterprise App OpenStack KVM Xen 8

OpenStack Test Bed 9

OpenStack Solution Exploraiton in China SHJTU research platform China Telecom GZ Research Institute test bed Wasu OpenStack test bed OpenStack solution interest group with Sina and Sohu 10

Intel Hadoop Distribution Sqoop (1.4.1) Structured Data Collector Flume (1.1.0) Log Data Collector Hadoop Manager Deployment, Configuration, Monitoring, Alerting and Kerberos Zookeeper (3.3.5) Coordination Pig (0.9.2) Data manipulation Hive (0.9.0) SQL-like Query Map/Reduce (1.0.3) Distributed Processing Framework HBase (0.90.7) Real-time Distributed Big Table HDFS (1.0.3) Hadoop Distributed File System Stable, Enterprise ready hadoop and HBase Adding features for vertical segments, initial focus on Telco, FSI, ITS, etc. Bring Real-time BigData analysis to Hadoop, by differentiating at HBase layer. Optimized for Intel Architecture HBase 0.94 upgrade in Q4, 2012 with specific feature additions for boosting Real-time analysis performance 1 1

Intel Hadoop: Optimized for Enterprise Performance comparison Performance number gathered from 8 node cluster Server Configuration: 2S E5-2680 8-core, 64GB RAM,8x 7200rpm SATA Disks, 1Gbps Ethernet query/s 3500 3000 2500 2000 1500 1000 500 0 700 Open Source HBase (0.90.3) 3500 Optimized HDFS I/O insertion/s 90000 80000 70000 60000 50000 40000 30000 20000 10000 0 25000 Open Source HBase (0.90.3) 82000 Advanced Region Balancing HBase as the data store Inserting 10000 records/second/server (2-way, 32GB) in average (record size: 1KB in average) Read from disk: >400 query/second/server, latency within one second (0.05s~0.8s under different load) A query is a scan to get all CDR within one month for one user. 1 2

Challenges for China Cloud Open Source SW Revenue model: through support rather than licensing Lead SI: Cloud solutions are often delivered through an SI, rather than an ISV need leading cloud open source SIs Unique ecosystem: Open source support models need to comprehend for China business environment Gaps: Ecosystem for open source cloud middleware, e.g. OpenStack SI for cloud solutions, e.g. Hadoop Big Data solutions

Igniting Sparks of Innovation through Open Source

Intel Open Data Center Vision (per DCSG) Open compute, storage, network, & comms infrastructure built from a converged platform architecture designed for security, efficiency, & automation Cloud Security Multi-tenant privacy, trusted execution, and seamless federation between public & private datacenters Cloud Management Maximum utilization efficiency & automation with coordinated workload placement, QoS, and power mgt. through common system mgt. APIs Open Compute Secure, efficient, & automated servers optimized for emerging cloud workloads Open Storage Immensely scalable storage platforms built on standard high volume hardware Open Networking Programmable network platforms built on standard highvolume hardware Open Communications Rapid SW-based commsinfrastructure innovation on standard high-volume hardware