Virtualizing Apache Hadoop. June, 2012
|
|
|
- Charlene Maxwell
- 10 years ago
- Views:
Transcription
1 June, 2012
2 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING HADOOP IN A VIRTUALIZED ENVIRONMENT... 6 CONCLUSION... 8 REFERENCES... 8
3 Executive Summary Key business and technology trends are disrupting the traditional data management and processing landscape. Big data analytics is increasingly being viewed as a competitive advantage and businesses are embracing Big data technologies to gain significant insight into their business for continued success. Apache Hadoop is emerging as one of the leading application in the big data space and is being used by enterprises across verticals for Big data analytics to help make better business decisions based on large data sets. This document introduces the benefits and use cases for virtualizing Hadoop and dispels some common myths. It also describes some of the initiatives being taken by VMware in support of an optimal virtualized platform for Apache Hadoop. Introduction The amount of digital data being generated and stored has exploded in recent years. 7 exabytes of digital data was added in the enterprise in the US last year alone [1]. Data is increasing in complexity as enterprises look to exploit the value locked-up in a variety of data to get insight into its business for continued growth and success. Conventional BI systems, data warehouses, and database systems are simply not able to meet the ever increasing demands of this new situation for several reasons. The amount of data is far too large to store in relational database systems efficiently and maintain the desired level of performance. Further the data is often in unstructured format making it unsuitable for systems that only support structured schemas. Finally, the hardware required for traditional BI and Data Warehousing applications is too costly at large scale, making analytics effectively inaccessible to IT. Apache Hadoop is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. It has grown to be one of the leading Big data applications to address several of the issues discussed above in a cost effective manner, making it a natural fit as an analytics, transformation (ETL) and integration platform. These capabilities of Hadoop along with unstructured data explosion are causing CIOs to reconsider Enterprise data strategy. Figure 1: Industry Trends (Source: Forrester survey of 60 CIO s, September 2011)
4 Virtualizing Apache Hadoop Introduction to vsphere TM VMware s vsphere TM 5.0, being a cloud operating system, virtualizes the entire IT infrastructure such as servers, storage, and networks. It groups these heterogeneous resources and transforms the rigid, inflexible infrastructure into a simple and unified manageable set of elements in the virtualized environment. Broadly, vsphere TM offers two sets of services: Infrastructure Services: Virtualize and Aggregate Hardware Resources Application Services: Built-in Service Level Controls for Applications Figure 2: vsphere TM 5.0 services Use Cases and advantages of virtualizing Hadoop Apache Hadoop is emerging as the de facto standard for big data processing, however, deployment and operational complexity, the need for dedicated hardware, and concerns about security and service level assurance prevent many enterprises from leveraging the power of Hadoop. By decoupling Hadoop nodes from the underlying physical infrastructure, VMware can bring the benefits of cloud infrastructure rapid deployment, high-availability, optimal resource utilization, elasticity, and secure multi-tenancy to Hadoop. Discussed below are some of the advantage and use cases for running Apache Hadoop on a virtualized infrastructure. Rapid Provisioning: Using various tools and virtualization capabilities such as cloning, using templates, and resource allocation, significantly increases the speed of deployment of Hadoop. This is especially applicable for workloads like Hadoop that need to deploy and configure multiple nodes. On demand Hadoop instances, which are started ondemand, and shut down when not necessary are possible. VMware just launched a new open source project, Serengeti, to enable enterprises to quickly deploy, manage and scale Apache Hadoop in virtual and cloud environments. [4]
5 High Availability (HA) and Fault Tolerance (FT): Although Hadoop is known to provide reliability via replication for storing data, there are several major components that are single points of failure in the system. Examples include the namenode, the jobtracker and other supporting components such as Pig, Hive, Zookeeper, HBase, etc. Virtualizing Hadoop can address the high availability needs of all these components in a generic way with vsphere TM vmotion TM, High Availability (HA) and Fault Tolerance (FT) features and keeping the system running with minimal or no downtime. For example, vsphere TM HA and vmotion TM technology can reduce downtime when nodes need to be brought down for planned upgrades and maintenance. Datacenter efficiency: Virtualizing Hadoop can increase datacenter efficiency by increasing the types of mixed workloads that can be run on a virtualized infrastructure. This includes running different versions of Hadoop itself on the same cluster, or running Hadoop along side other applications forming an elastic environment. Shared resources lead to higher consolidation ratios that leads to requirement of less hardware, software, and infrastructure to run the customer s required set of business apps, thereby reducing the CapEx. Figure 3: Virtualized infrastructure leads to data center consolidation Efficient Resource Utilization: Co-locating Hadoop VMs and other kinds of workloads on the same hosts and applying resource controls based on priority often allows better overall utilization by consolidating applications that use different kinds of resources. Multi-tenancy: Hadoop is a multi-tenant application. Running it on a virtualized environment can improve the Quality of Service (QoS) and offered SLA s to the tenants by virtue of instance isolation and VMware resource pools. Also, in a virtualized environment, different tenants can run mixed workloads other than Hadoop on the same physical cluster, addressing yet another variance of multi-tenancy. Security: A virtualized environment provides organizational boundaries to secure the data and isolate it amongst users. An entire cluster can be run in an isolated group of virtual machines, providing full data isolation and security, while sharing the same underlying physical hardware. Time sharing: Taking advantage of unused capacity is simplified in a virtualized environment by running jobs during periods of low hardware usage by spinning up and down virtual machines easily. Easy maintenance and movement of environment: A cluster of Hadoop nodes running in a virtualized environment can be easily replicated or moved from one environment to another. This includes use cases such as moving the VM s
6 from staging to production, from one cluster to another within a data center or even deploying Hadoop in a Hybrid Cloud model. Hadoop-as-a-service: VMware platform enables Hadoop to run in a Cloud environment. VMware vcloud TM director can be configured to offer a full Hadoop-as-a-Service solution in a private or public Cloud in order to offer an agile, controlled, elastic, cost-effective, secure, and a multi-tenant service, while benefiting from the management, deployment, and provisioning tools included with it. vcenter Chargeback can account for resource usage by multiple tenants of the cluster who can then be billed back accordingly. Myths about running Hadoop in a virtualized environment This section dispels some of the myths around virtualization as a platform for Hadoop. Performance: VMware and partners have done considerable amount of work on evaluating Hadoop performance in a virtualized environment. Results show that Hadoop works quite well on vsphere TM, and in fact does better than native under certain configurations. Running 2 or 4 smaller VMs per physical machine usually resulted in better performance, often exceeding native performance. For further details, refer to [2]. SAN, NAS or Local Disk vsphere TM supports local disks and Hadoop can be configured to use local disk with same performance and functionality as native for HDFS. Local disks are recommended for cost and performance reasons and large scale. Hadoop also runs well in a shared SAN environment for small to medium sized clusters but has different performance and cost metrics. With advent of high bandwidth networks, such as10 GB Ethernet, FoE, iscsi etc., accessing data over the network is becoming less of a concern. Total Cost of Ownership (TCO) - Another concern among users is that virtualization increases the TCO of running Hadoop clusters due to acquisition cost of hardware and additional licensing costs (i.e. CAPEX). However, datacenter efficiency and hardware consolidation resulting from a virtualized infrastructure can reduce the physical hardware footprint, and bring CAPEX in line with purely commodity hardware. Further, virtualized infrastructure reduces OPEX through enabling automation, higher utilization, more efficient management and provisioning of hardware, configuration, turning etc. [3] Virtualization can minimize any potential lost revenue associated with downtime, outages, and failures resulting in reduced TCO and increased ROI.
7 VMware s support for virtualized Apache Hadoop for enterprises Apache Hadoop has the potential to transform business by allowing enterprises to harness very large amounts of data for competitive advantage. VMware is working with the Hadoop community to allow enterprise IT to deploy and manage Hadoop easily in their virtual and cloud environments and make VMware vsphere TM the best platform for scalable, highly available Enterprise Hadoop. Project Serengeti: VMware has recently launched Project Serengeti to enable enterprises to quickly deploy, manage, and scale Apache Hadoop in virtual and cloud environments. [4] Available for free download under the Apache 2.0 license, Serengeti, is a one-click deployment toolkit, that allows enterprises to leverage VMware vsphere TM platform to deploy a highly available Hadoop cluster in minutes, including common Hadoop components such as HDFS, MapReduce, Pig, and Hive on a virtual platform. By using Serengeti to run Hadoop on VMware vsphere TM, enterprises can easily leverage the high-availability, fault tolerance, and live migration capabilities of the world s most trusted and widely deployed virtualization platform to ensure the availability and manageability of Hadoop clusters. Serengeti supports multiple Hadoop based distributions from a range of vendors including: Apache Hadoop, Cloudera Distribution, Greenplum HD, and Hortonworks Data Platform. Serengeti s open architecture makes it easy to rapidly add support for additional distributions. Figure 4: Overview of Project Serengeti To further simplify and speed the enterprise use of Apache Hadoop, VMware is working with the Apache Hadoop community to contribute changes to enhance the support for failure and locality topologies by making Hadoop virtualization-aware. The topology changes help to achieve optimal data placement on a virtual infrastructure, thereby improving performance and reliability. This enables the enterprises to achieve a truly elastic and secure Hadoop cluster. Hadoop Virtualization Extensions work with multiple hypervisors. [5] VMware has also updated Spring for Apache Hadoop, an open source project first launched in February 2012 to make it easy for enterprise developers to build distributed processing solutions with Apache Hadoop. These applications range from small standalone applications to integration and workflow applications based on the Spring Integration and Batch projects. [6] The current release of Spring for Apache Hadoop enables developers to create, configure, and execute all types of Hadoop jobs including Map-Reduce, Streaming, Hive, Pig, and Cascading. The newly announced updates allow Spring developers to easily build enterprise applications that integrate with the HBase database, the Cascading library, and Hadoop security. Spring for Apache Hadoop is free to download and available now under the
8 open source Apache 2.0 license. Java workloads run well on vsphere TM. VMware has published Java best practices guidelines and these also apply to Hadoop running on a virtualized infrastructure. [7] Together, these projects and contributions will help accelerate Hadoop adoption and enable enterprises to leverage Big data analytics applications, such as Cetas Software, to obtain real-time and intelligent insight into large quantities of data. VMware acquired Cetas [8] in April 2012 and the Cetas analytics service is currently available at Conclusion In conclusion, infrastructure virtualization brings several benefits to Hadoop deployments that include: Rapid provisioning HA solution Hardware consolidation Multi-tenancy and security through isolation of resources Automation References 1. Big data: The Next Frontier for Innovation, Competition and Productivity : tion 2. A Benchmarking Case Study of Virtualized Hadoop Performance on VMware vsphere TM : Jeff Buell, VMware VMware ROI/TCO calculator: 4. Project Serengeti : 5. Apache Hadoop Virtualization extensions (HVE) : 6. Spring for Apache Hadoop : 7. Java Best practices on VMware : 8. VMware acquires Cetas: 9. Hadoop and VMware : VMware Cloud portfolio of products:
Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware
Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray ware 2 Agenda The Hadoop Journey Why Virtualize Hadoop? Elasticity and Scalability Performance Tests Storage Reference
Deploying Virtualized Hadoop Systems with VMware vsphere Big Data Extensions A DEPLOYMENT GUIDE
Deploying Virtualized Hadoop Systems with VMware vsphere Big Data Extensions A DEPLOYMENT GUIDE Table of Contents Introduction.... 4 Overview of Hadoop, vsphere, and Project Serengeti.... 4 An Overview
Proact whitepaper on Big Data
Proact whitepaper on Big Data Summary Big Data is not a definite term. Even if it sounds like just another buzz word, it manifests some interesting opportunities for organisations with the skill, resources
Hadoop as a Service. VMware vcloud Automation Center & Big Data Extension
Hadoop as a Service VMware vcloud Automation Center & Big Data Extension Table of Contents 1. Introduction... 2 1.1 How it works... 2 2. System Pre-requisites... 2 3. Set up... 2 3.1 Request the Service
How To Run Apa Hadoop 1.0 On Vsphere Tmt On A Hyperconverged Network On A Virtualized Cluster On A Vspplace Tmter (Vmware) Vspheon Tm (
Apache Hadoop 1.0 High Availability Solution on VMware vsphere TM Reference Architecture TECHNICAL WHITE PAPER v 1.0 June 2012 Table of Contents Executive Summary... 3 Introduction... 3 Terminology...
Introduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
Adobe Deploys Hadoop as a Service on VMware vsphere
Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and
Hadoop Virtualization
Hadoop Virtualization Courtney Webster Hadoop Virtualization Courtney Webster Hadoop Virtualization by Courtney Webster Copyright 2015 O Reilly Media, Inc. All rights reserved. Printed in the United States
Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
Agenda. Big Data & Hadoop ViPR HDFS Pivotal Big Data Suite & ViPR HDFS ViON Customer Feedback #EMCVIPR
1 Agenda Big Data & Hadoop ViPR HDFS Pivotal Big Data Suite & ViPR HDFS ViON Customer Feedback 2 A World of Connected Devices Need a new data management architecture for Internet of Things 21% the % of
MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products
MaxDeploy Ready Hyper- Converged Virtualization Solution With SanDisk Fusion iomemory products MaxDeploy Ready products are configured and tested for support with Maxta software- defined storage and with
MaxDeploy Hyper- Converged Reference Architecture Solution Brief
MaxDeploy Hyper- Converged Reference Architecture Solution Brief MaxDeploy Reference Architecture solutions are configured and tested for support with Maxta software- defined storage and with industry
ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE
ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics
Journey to the Private Cloud. Key Enabling Technologies
Journey to the Private Cloud Key Enabling Technologies Jeffrey Nick Chief Technology Officer Senior Vice President EMC Corporation June 2010 1 The current I/T state: Infrastructure sprawl Information explosion
How Customers Are Cutting Costs and Building Value with Microsoft Virtualization
How Customers Are Cutting Costs and Building Value with Microsoft Virtualization Introduction The majority of organizations are incorporating virtualization into their IT infrastructures because of the
CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data
Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with
Maximizing Hadoop Performance and Storage Capacity with AltraHD TM
Maximizing Hadoop Performance and Storage Capacity with AltraHD TM Executive Summary The explosion of internet data, driven in large part by the growth of more and more powerful mobile devices, has created
Learn How to Leverage System z in Your Cloud
Learn How to Leverage System z in Your Cloud Mike Baskey IBM Thursday, February 7 th, 2013 Session 12790 Cloud implementations that include System z maximize Enterprise flexibility and increase cost savings
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
Building & Optimizing Enterprise-class Hadoop with Open Architectures Prem Jain NetApp
Building & Optimizing Enterprise-class Hadoop with Open Architectures Prem Jain NetApp Introduction to Hadoop Comes from Internet companies Emerging big data storage and analytics platform HDFS and MapReduce
High Performance IT Insights. Building the Foundation for Big Data
High Performance IT Insights Building the Foundation for Big Data Page 2 For years, companies have been contending with a rapidly rising tide of data that needs to be captured, stored and used by the business.
VMware Solutions for Small and Midsize Business
SOLUTION BRIEF VMware Solutions for Small and Midsize Business Protect Your Business, Simplify and Save on IT, and Empower Your Employees AT A GLANCE VMware is a leader in virtualization and cloud infrastructure
HDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
Virtualization Essentials
Virtualization Essentials Table of Contents Introduction What is Virtualization?.... 3 How Does Virtualization Work?... 4 Chapter 1 Delivering Real Business Benefits.... 5 Reduced Complexity....5 Dramatically
VMware and Primary Data: Making the Software-Defined Datacenter a Reality
VMware and Primary Data: Making the Software-Defined Datacenter a Reality CONTENTS About This Document... 3 Freeing Data From Physical Storage Silos... 3 Dynamically Move Virtual Disks to Meet Business
VMware Virtual Infrastucture From the Virtualized to the Automated Data Center
VMware Virtual Infrastucture From the Virtualized to the Automated Data Center Senior System Engineer VMware Inc. [email protected] Agenda Vision VMware Enables Datacenter Automation VMware Solutions
TECH TIPS. Integer eleif end conse quat molestie morbi ac eros sagittis. ebook
//ebook 2012 Integer eleifend consequat molestie morbi ac eros sagittis diam ferm entum congue sed laoreet tincidunt libero TECH vitae tincidunt, nulla vestib ulum justo at leo pulvinar nec vene natis
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack HIGHLIGHTS Real-Time Results Elasticsearch on Cisco UCS enables a deeper
VMware Software-defined Data Center Technical Strategy and Customer Benefits
VMware Software-defined Data Center Technical Strategy and Customer Benefits 2014 VMware Inc. All rights reserved. Past IT Focuses on Non-Value-Added Service 30% Application maintenance 23% Application
Oracle Platform as a Service (PaaS) FAQ
Oracle Platform as a Service (PaaS) FAQ 1. What is Platform as a Service (PaaS)? Platform as a Service (PaaS) is a standardized, shared and elastically scalable application development and deployment platform
Consolidate and Virtualize Your Windows Environment with NetApp and VMware
White Paper Consolidate and Virtualize Your Windows Environment with NetApp and VMware Sachin Chheda, NetApp and Gaetan Castelein, VMware October 2009 WP-7086-1009 TABLE OF CONTENTS 1 EXECUTIVE SUMMARY...
Big Data and Apache Hadoop Adoption:
Expert Reference Series of White Papers Big Data and Apache Hadoop Adoption: Key Challenges and Rewards 1-800-COURSES www.globalknowledge.com Big Data and Apache Hadoop Adoption: Key Challenges and Rewards
Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage
Clodoaldo Barrera Chief Technical Strategist IBM System Storage Making a successful transition to Software Defined Storage Open Server Summit Santa Clara Nov 2014 Data at the core of everything Data is
How To Compare The Cost Of A Microsoft Private Cloud To A Vcloud With Vsphere And Vspheon
A Comparative Look at Functionality, Benefits, and Economics November 2012 1 1 Copyright Information 2012 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and
Top 5 Reasons to choose Microsoft Windows Server 2008 R2 SP1 Hyper-V over VMware vsphere 5
Top 5 Reasons to choose Microsoft Windows Server 2008 R2 SP1 Hyper-V over VMware Published: April 2012 2012 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and
vcloud Suite Architecture Overview and Use Cases
vcloud Suite Architecture Overview and Use Cases vcloud Suite 5.8 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by a new
vsphere 6.0 Advantages Over Hyper-V
v3c Advantages Over Hyper-V The most trusted and complete virtualization platform 2015 Q1 2015 VMware Inc. All rights reserved. The Most Trusted Virtualization Platform Hypervisor Architecture Broad Support
Implement Hadoop jobs to extract business value from large and varied data sets
Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
TRANSFORM YOUR BUSINESS: BIG DATA AND ANALYTICS WITH VCE AND EMC
TRANSFORM YOUR BUSINESS: BIG DATA AND ANALYTICS WITH VCE AND EMC Vision Big data and analytic initiatives within enterprises have been rapidly maturing from experimental efforts to production-ready deployments.
With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments
RED HAT ENTERPRISE VIRTUALIZATION DATASHEET RED HAT ENTERPRISE VIRTUALIZATION AT A GLANCE Provides a complete end-toend enterprise virtualization solution for servers and desktop Provides an on-ramp to
MANAGEMENT AND ORCHESTRATION WORKFLOW AUTOMATION FOR VBLOCK INFRASTRUCTURE PLATFORMS
VCE Word Template Table of Contents www.vce.com MANAGEMENT AND ORCHESTRATION WORKFLOW AUTOMATION FOR VBLOCK INFRASTRUCTURE PLATFORMS January 2012 VCE Authors: Changbin Gong: Lead Solution Architect Michael
Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director
Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director Graeme Gordon Senior Systems Engineer, VMware 2013 VMware Inc. All rights reserved Traditional IT Application
Addressing Open Source Big Data, Hadoop, and MapReduce limitations
Addressing Open Source Big Data, Hadoop, and MapReduce limitations 1 Agenda What is Big Data / Hadoop? Limitations of the existing hadoop distributions Going enterprise with Hadoop 2 How Big are Data?
Building the Virtual Information Infrastructure
Technology Concepts and Business Considerations Abstract A virtual information infrastructure allows organizations to make the most of their data center environment by sharing computing, network, and storage
Software Defined Hybrid IT. Execute your 2020 plan
Software Defined Hybrid IT Execute your 2020 plan Disruptive Change Changing IT Service Delivery Cloud Computing Social Computing Big Data Mobility Cyber Security 2015 Unisys Corporation. All rights reserved.
VMware Software-Defined Storage Vision
VMware Software-Defined Storage Vision Lee Dilworth (@leedilworth) Principal Systems Engineer 2014 VMware Inc. All rights reserved. The Software-Defined Data Center Expand virtual compute to all applications
Microsoft Private Cloud
Microsoft Private Cloud Lorenz Wolf, Solution Specialist Datacenter, Microsoft SoftwareOne @ Au Premier Zürich - 22.03.2011 What is PRIVATE CLOUD Private Public Public Cloud Private Cloud shared resources.
CONVERGE APPLICATIONS, ANALYTICS, AND DATA WITH VCE AND PIVOTAL
CONVERGE APPLICATIONS, ANALYTICS, AND DATA WITH VCE AND PIVOTAL Vision In today s volatile economy, an organization s ability to exploit IT to speed time-to-results, control cost and risk, and drive differentiation
Hadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
Master Hybrid Cloud Management with VMware vrealize Suite. Increase Business Agility, Efficiency, and Choice While Keeping IT in Control
Master Hybrid Cloud Management with VMware vrealize Suite Increase Business Agility, Efficiency, and Choice While Keeping IT in Control Empower IT to Innovate The time is now for IT organizations to take
VMware Software-Defined Storage and EVO:RAIL
VMware Software-Defined Storage and EVO:RAIL Gaetan Castelein, Sr. Director, Storage Product Marketing Michael McDonough, Sr. Director, EVO 9/14/2014 2014 VMware Inc. All rights reserved. Agenda VMware
VMware's Cloud Management Platform Simplifies and Automates Operations of Heterogeneous Environments and Hybrid Clouds
VMware's Cloud Platform Simplifies and Automates Operations of Heterogeneous Environments and Hybrid Clouds Ekkarat Klinbubpa Senior Business Development Manager, VMware 2009 VMware Inc. All rights reserved
EMC ENTERPRISE HYBRID CLOUD 2.5 FEDERATION SOFTWARE- DEFINED DATA CENTER EDITION
Solution Guide EMC ENTERPRISE HYBRID CLOUD 2.5 FEDERATION SOFTWARE- DEFINED DATA CENTER EDITION Hadoop Applications Solution Guide EMC Solutions Abstract This document serves as a reference for planning
COMPARISON OF VMware VSHPERE HA/FT vs stratus
COMPARISON OF VMware VSHPERE HA/FT vs stratus ftserver SYSTEMS White Paper 2 Ensuring Availability of Virtualized Business-Critical Applications in an Always-On World Introduction Virtualization has become
Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop
Lecture 32 Big Data 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop 1 2 Big Data Problems Data explosion Data from users on social
The New Economics of SAP Business Suite powered by SAP HANA. 2013 SAP AG. All rights reserved. 2
The New Economics of SAP Business Suite powered by SAP HANA 2013 SAP AG. All rights reserved. 2 COMMON MYTH Running SAP Business Suite on SAP HANA is more expensive than on a classical database 2013 2014
Is Hyperconverged Cost-Competitive with the Cloud?
Economic Insight Paper Is Hyperconverged Cost-Competitive with the Cloud? An Evaluator Group TCO Analysis Comparing AWS and SimpliVity By Eric Slack, Sr. Analyst January 2016 Enabling you to make the best
Apache Hadoop Storage Provisioning Using VMware vsphere Big Data Extensions TECHNICAL WHITE PAPER
Apache Hadoop Storage Provisioning Using VMware vsphere Big Data Extensions TECHNICAL WHITE PAPER Table of Contents Apache Hadoop Deployment on VMware vsphere Using vsphere Big Data Extensions.... 3 Local
A Guide to Disaster Recovery in the Cloud. Simple, Affordable Protection for Your Applications and Data
A Guide to Disaster Recovery in the Cloud Simple, Affordable Protection for Your Applications and Data Table of Contents Introduction Cloud-Based Disaster Recovery................................... 3
VMware s Virtualization & Cloud Computing Solutions for Enterprise
VMware s Virtualization & Cloud Computing Solutions for Enterprise Huynh Phuc Yem Quan Country Manager,VMware VietNam E: [email protected] M: 0903730404 2009 VMware Inc. All rights reserved VMware: The
Keith Luck, CISSP, CCSK Security & Compliance Specialist, VMware, Inc. [email protected]
1 Keith Luck, CISSP, CCSK Security & Compliance Specialist, VMware, Inc. [email protected] Agenda Cloud Computing VMware and Security Network Security Use Case Securing View Deployments Questions 2 IT consumption
The next step in Software-Defined Storage with Virtual SAN
The next step in Software-Defined Storage with Virtual SAN VMware vforum, 2014 Lee Dilworth, principal SE @leedilworth 2014 VMware Inc. All rights reserved. The Software-Defined Data Center Expand virtual
CA Big Data Management: It s here, but what can it do for your business?
CA Big Data Management: It s here, but what can it do for your business? Mike Harer CA Technologies August 7, 2014 Session Number: 16256 Insert Custom Session QR if Desired. Test link: www.share.org Big
SOFTWARE DEFINED NETWORKING
SOFTWARE DEFINED NETWORKING Bringing Networks to the Cloud Brendan Hayes DIRECTOR, SDN MARKETING AGENDA Market trends and Juniper s SDN strategy Network virtualization evolution Juniper s SDN technology
Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure
TECHNICAL WHITE PAPER Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure A collaboration between Canonical and VMware
Private Cloud: A Key Strategic Differentiator
Automation and Orchestration Drive Virtualization into Private Clouds Table of Contents After Virtualization........................................3 Private Cloud: A Key Strategic Differentiator.................3
vcloud Virtual Private Cloud Fulfilling the promise of cloud computing A Resource Pool of Compute, Storage and a Host of Network Capabilities
vcloud Virtual Private Cloud A Resource Pool of Compute, Storage and a Host of Network Capabilities Fulfilling the promise of cloud computing FULFILLING THE PROMISE OF CLOUD COMPUTING Businesses are looking
Simplified Private Cloud Management
BUSINESS PARTNER ClouTor Simplified Private Cloud Management ClouTor ON VSPEX by LOCUZ INTRODUCTION ClouTor on VSPEX for Enterprises provides an integrated software solution for extending your existing
A Guide to Hybrid Cloud An inside-out approach for extending your data center to the cloud
A Guide to Hybrid Cloud An inside-out approach for extending your data center to the cloud Inside Introduction Create a Flexible IT Environment With Hybrid Cloud Chapter 1 Common Business Drivers for Hybrid
VMware Virtualization and Cloud Management Solutions. A Modern Approach to IT Management
VMware Virtualization and Cloud Management Solutions A Modern Approach to IT Management Transform IT Management to Enable IT as a Service Corporate decision makers are transforming their businesses by
<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises
Infrastructure as a Service (IaaS) Cloud Computing for Enterprises Speaker Title The following is intended to outline our general product direction. It is intended for information
Understanding Virtualization and Cloud in the Enterprise
Understanding Virtualization and Cloud in the Enterprise James Staten Vice President, Principal Analyst Forrester Research Virtualization is evolving toward cloud but won t be subsumed by it 2 What s different
Protecting Data and Applications in Private Clouds for VMware environments
ROV I U S Solution Overview Protecting Data and Applications in Private Clouds for VMware environments OVERVIEW AT-A-GLANCE Rovius, an Accelerite product, is an application and data replication software
Introducing the New Hitachi Storage Virtualization Operating System and Hitachi Virtual Storage Platform G1000
Introducing the New Hitachi Storage Virtualization Operating System and Hitachi Virtual Storage Platform G1000 Greg Knieriemen - technology evangelist Bob Madaio - senior director, Product Marketing April
IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures
IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Introduction
Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ. Cloudera World Japan November 2014
Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ Cloudera World Japan November 2014 WANdisco Background WANdisco: Wide Area Network Distributed Computing Enterprise ready, high availability
Enabling High performance Big Data platform with RDMA
Enabling High performance Big Data platform with RDMA Tong Liu HPC Advisory Council Oct 7 th, 2014 Shortcomings of Hadoop Administration tooling Performance Reliability SQL support Backup and recovery
Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN
Whitepaper NexentaConnect for VMware Virtual SAN Full Featured File services for Virtual SAN Table of Contents Introduction... 1 Next Generation Storage and Compute... 1 VMware Virtual SAN... 2 Highlights
Clouds. Microsoft Private Cloud- Making It Real
Clouds IT Microsoft Private Cloud- Making It Real Contents Copyright Information... 3 Built for the Future. Ready Now.... 4 A Private Cloud: Today s Datacenter... Optimized... 4 Why Microsoft?... 5 System
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
Building Private Cloud Architectures
Building Private Cloud Architectures Chandra Rangan Sr. Director, Storage & Availability Management Group Symantec Corporation SNW Spring 2011: Building Private Cloud Architectures 1 State of the infrastructure
Cloud Infrastructure Services for Service Providers VERYX TECHNOLOGIES
Cloud Infrastructure Services for Service Providers VERYX TECHNOLOGIES Meeting the 7 Challenges in Testing and Performance Management Introduction With advent of the cloud paradigm, organizations are transitioning
Best Practices for Managing Storage in the Most Challenging Environments
Best Practices for Managing Storage in the Most Challenging Environments Sanjay Srivastava Senior Product Manager, Symantec The Typical Virtualization Adoption Path Today, 20-25% of server workloads are
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
NETAPP WHITE PAPER USING A NETWORK APPLIANCE SAN WITH VMWARE INFRASTRUCTURE 3 TO FACILITATE SERVER AND STORAGE CONSOLIDATION
NETAPP WHITE PAPER USING A NETWORK APPLIANCE SAN WITH VMWARE INFRASTRUCTURE 3 TO FACILITATE SERVER AND STORAGE CONSOLIDATION Network Appliance, Inc. March 2007 TABLE OF CONTENTS 1 INTRODUCTION... 3 2 BACKGROUND...
BIG DATA-AS-A-SERVICE
White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers
OPTIMIZING SERVER VIRTUALIZATION
OPTIMIZING SERVER VIRTUALIZATION HP MULTI-PORT SERVER ADAPTERS BASED ON INTEL ETHERNET TECHNOLOGY As enterprise-class server infrastructures adopt virtualization to improve total cost of ownership (TCO)
Next-Generation Cloud Analytics with Amazon Redshift
Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional
CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment
CloudCenter Full Lifecycle Management An application-defined approach to deploying and managing applications in any datacenter or cloud environment CloudCenter Full Lifecycle Management Page 2 Table of
MICROSOFT CLOUD REFERENCE ARCHITECTURE: FOUNDATION
Reference Architecture Guide MICROSOFT CLOUD REFERENCE ARCHITECTURE: FOUNDATION EMC VNX, EMC VMAX, EMC ViPR, and EMC VPLEX Microsoft Windows Hyper-V, Microsoft Windows Azure Pack, and Microsoft System
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE
BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.
Big Data Trends and HDFS Evolution
Big Data Trends and HDFS Evolution Sanjay Radia Founder & Architect Hortonworks Inc Page 1 Hello Founder, Hortonworks Part of the Hadoop team at Yahoo! since 2007 Chief Architect of Hadoop Core at Yahoo!
Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.
Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!
