S3IT: Service and Support for Science IT. Scaling R on cloud infrastructure Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT
|
|
|
- Blaze Lindsey
- 10 years ago
- Views:
Transcription
1 S3IT: Service and Support for Science IT Scaling R on cloud infrastructure Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT Zurich,
2 Who am I? Sergio Maffioletti: Cloud and Application specialist Head of the IS/Cloud Services unit Head of the S3IT User support
3 What is S3IT? Connect IT and Science Zentrale Informatik ScienceIT support unit Dedicated support for computations and data analysis SPEED : faster time to solution ACCESS : to competitive infrastructure ENABLE : remove barriers new possibilities
4 Disclaimer What is presented here is *not* an authoritative analysis of how R can be scaled. It is just the result of few years of experience in supporting and helping research groups in improving R performances.
5 What are we going to talk about today? How to scale your R code on a cloud infrastructure 1. What is a could infrastructure 2. What problem are we trying to address 3. Possible scenario for scaling R
6 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources Customization and control of the environment Multi-tenancy
7 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Virtual Machines (VM) Virtual Storage block devices Virtual private network Self-provisioning and Elasticity of resources Customization and control of the environment Multi-tenancy
8 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources End-users can allocate and release resources when needed. Customization and control of the environment Multi-tenancy
9 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources Customization and control of the environment End-users can tailor the research infrastructure to his/her specific needs. Multi-tenancy
10 What problem are we trying to address? You normally run your R scripts on your local workstation At best: 8 cores, 16GB RAM, GB SSD drive This configuration is quite sufficient for most of the small/medium scale analysis problems
11 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply parallel: built on multicore and snow
12 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply Single node, multi-core Relatively easy to use within your script Scaling limited to single node availability parallel: built on multicore and snow
13 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply parallel: built on multicore and snow Can use the CPUs/cores on a single machine (multicore), or several machines, using MPI (snow) Need to prepare the cluster of resources before initialization Need to apply LoadBalancing to distribute tasks Not widely used in community
14 mclapply example library(multicore) workerfunc <- function(n) return(nˆ2) values <- 1:100 library(parallel) numworkers <- 8 res <- mclapply(values, workerfunc, mc.cores = numworkers) print(unlist(res))
15 using parallel package workerfunc <- function(n) return(nˆ2) values <- 1:100 library(parallel) numworkers <- 8 cl <- makecluster(numworkers, type = "MPI") res <- parlapply(cl, values, workerfunc) stopcluster(cl) mpi.exit() print(unlist(res)) Running: mpirun -n 1 R -slave -f simple_mpi.r
16 Common limitation All make the assumption the infrastructure is available On a cloud infrastructure, provision (and release) of resources is a necessary step
17 How do you scale your R code on a cloud infrastructure? Scale-up Bigger workstation RStudio Scale-out
18 Scale-up: Bigger workstation
19 Scale-up: Bigger workstation Drawbacks: Cannot scale indefinitely (limit is the node size) Higher specs for the node = higher costs
20 Scale-up: RStudio Web-based interface to run R scripts Same single point of access (group, individual user)
21 Scale-up: RStudio Web-based interface to run R scripts Same single point of access (group, individual user) Drawbacks: Still runs on single node
22 Scale-out: GC3Pie Scaling outside of your R script Divide large execution in smaller chunks Run those chunks independently from each other Provision of the infrastructure based on script requirements
23 Let s see an example: gweight Usecase from Business department Run GetWeight function over 3.5M forum entries Each GetWeight takes 2 That s still 75 days on a 64 cores node Each item in the forum could be processed independently from the others
24 gweight workflow Takes initial 3.5M entries input (in.csv format) Create smaller.csv files of size chunk (default 1000) For each chunk file, create a dedicated VM On the VM, run GetWeight function with the specific.csv chunk file Terminate VM if no more.csv files have to be processed Aggregate results at the end into a single large.csv result file
25 Let s see how it work...
26 Conclusions Scaling R effectively on cloud infrastructure requires knowledge on how to split your computation S3IT provides support and tools (e.g. GC3Pie) Get in touch with us: [email protected], Visit our website:
Introducing ScienceCloud
Zentrale Informatik Introducing ScienceCloud Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT Zurich, 10.03.2015 What are we going to talk about today? 1. Why are we building ScienceCloud?
Big Data and Parallel Work with R
Big Data and Parallel Work with R What We'll Cover Data Limits in R Optional Data packages Optional Function packages Going parallel Deciding what to do Data Limits in R Big Data? What is big data? More
Parallel Options for R
Parallel Options for R Glenn K. Lockwood SDSC User Services [email protected] Motivation "I just ran an intensive R script [on the supercomputer]. It's not much faster than my own machine." Motivation "I
Scaling up to Production
1 Scaling up to Production Overview Productionize then Scale Building Production Systems Scaling Production Systems Use Case: Scaling a Production Galaxy Instance Infrastructure Advice 2 PRODUCTIONIZE
HPC performance applications on Virtual Clusters
Panagiotis Kritikakos EPCC, School of Physics & Astronomy, University of Edinburgh, Scotland - UK [email protected] 4 th IC-SCCE, Athens 7 th July 2010 This work investigates the performance of (Java)
Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000
Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000 Alexandra Carpen-Amarie Diana Moise Bogdan Nicolae KerData Team, INRIA Outline
SURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop doc.hpccloud.surfsara.nl UvA workshop 2016-01-25 UvA HPC Course Jan 2016 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current
Hardware/Software Guidelines
There are many things to consider when preparing for a TRAVERSE v11 installation. The number of users, application modules and transactional volume are only a few. Reliable performance of the system is
ADAM 5.5. System Requirements
ADAM 5.5 System Requirements 1 1. Overview The schema below shows an overview of the ADAM components that will be installed and set up. ADAM Server: hosts the ADAM core components. You must install the
Running R from Amazon's Elastic Compute Cloud
Running R on the Running R from Amazon's Elastic Compute Cloud Department of Statistics University of NebraskaLincoln April 30, 2014 Running R on the 1 Introduction 2 3 Running R on the Pre-made AMI Building
Enterprise Application Integration (Middleware)
Enterprise Application Integration (Middleware) Gustavo Alonso Systems Group Computer Science Department - ETH Zurich [email protected] http://www.systems.inf.ethz.ch/ EAI Course Administration Lecture:
Veeam Cloud Connect. Version 8.0. Administrator Guide
Veeam Cloud Connect Version 8.0 Administrator Guide April, 2015 2015 Veeam Software. All rights reserved. All trademarks are the property of their respective owners. No part of this publication may be
BT Ireland and the Cloud
BT Ireland and the Cloud Beyond the Cloud v1 5.9.11 v1 5.9.11 The Promise Ease of Use New Features Agility Reduced Cost v1 5.9.11 It is everything you expect or More expensive than you think Contracts
CLUSTER COMPUTING TODAY
David Chappell June 2011 CLUSTER COMPUTING TODAY WHAT S CHANGED AND WHY IT MATTERS Sponsored by Microsoft Corporation Copyright 2011 Chappell & Associates One way to make an application run faster is to
SURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop www.cloud.sara.nl Tutorial 2014-06-11 UvA HPC and Big Data Course June 2014 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current
DevOps with Containers. for Microservices
DevOps with Containers for Microservices DevOps is a Software Development Method Keywords Communication, collaboration, integration, automation, measurement Goals improved deployment frequency faster time
Openstack. Cloud computing with Openstack. Saverio Proto [email protected]
Openstack Cloud computing with Openstack Saverio Proto [email protected] Lugano, 23/03/2016 Agenda SWITCH role in Openstack and Cloud Computing What is Virtualization? Why is Cloud computing more
Cloud Computing through Virtualization and HPC technologies
Cloud Computing through Virtualization and HPC technologies William Lu, Ph.D. 1 Agenda Cloud Computing & HPC A Case of HPC Implementation Application Performance in VM Summary 2 Cloud Computing & HPC HPC
Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure
TECHNICAL WHITE PAPER Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure A collaboration between Canonical and VMware
High Performance Computing in CST STUDIO SUITE
High Performance Computing in CST STUDIO SUITE Felix Wolfheimer GPU Computing Performance Speedup 18 16 14 12 10 8 6 4 2 0 Promo offer for EUC participants: 25% discount for K40 cards Speedup of Solver
Your Place or Mine? In-House e- Discovery Platform vs. Software as a Service
Your Place or Mine? In-House e-discovery Platform vs. Software as a Service Your Place or Mine? In-House e- Discovery Platform vs. Software as a Service Teresa McMahon Managing Director Navigant Consulting
Best Practices for Virtualised SharePoint
Best Practices for Virtualised SharePoint Brendan Law [email protected] @FlamerNZ Flamer.co.nz/spag/ Nathan Mercer [email protected] @NathanM blogs.technet.com/nmercer/ Agenda Why Virtualise? Hardware
Performance Testing of a Cloud Service
Performance Testing of a Cloud Service Trilesh Bhurtun, Junior Consultant, Capacitas Ltd Capacitas 2012 1 Introduction Objectives Environment Tests and Results Issues Summary Agenda Capacitas 2012 2 1
Hadoop as a Service. VMware vcloud Automation Center & Big Data Extension
Hadoop as a Service VMware vcloud Automation Center & Big Data Extension Table of Contents 1. Introduction... 2 1.1 How it works... 2 2. System Pre-requisites... 2 3. Set up... 2 3.1 Request the Service
Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011
Scalable Data Analysis in R Lee E. Edlefsen Chief Scientist UserR! 2011 1 Introduction Our ability to collect and store data has rapidly been outpacing our ability to analyze it We need scalable data analysis
Contact for all enquiries Phone: +61 2 8006 9730. Email: [email protected]. Page 2. RecordPoint Release Notes V3.8 for SharePoint 2013
Release Notes V3.8 Notice This document contains confidential and trade secret information of RecordPoint Software ( RPS ). RecordPoint Software has prepared this document for use solely with RecordPoint.
Is there any alternative to Exadata X5? March 2015
Is there any alternative to Exadata X5? March 2015 Contents 1 About Benchware Ltd. 2 Licensing 3 Scalability 4 Exadata Specifics 5 Performance 6 Costs 7 Myths 8 Conclusion copyright 2015 by benchware.ch
Cloud Federation to Elastically Increase MapReduce Processing Resources
Cloud Federation to Elastically Increase MapReduce Processing Resources A.Panarello, A.Celesti, M. Villari, M. Fazio and A. Puliafito {apanarello,acelesti, mfazio, mvillari, apuliafito}@unime.it DICIEAMA,
Server Installation Manual 4.4.1
Server Installation Manual 4.4.1 1. Product Information Product: BackupAgent Server Version: 4.4.1 2. Introduction BackupAgent Server has several features. The application is a web application and offers:
Open source Google-style large scale data analysis with Hadoop
Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: [email protected] Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical
IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM
IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1 Scale-out and Cloud
Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise
Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise NEW: Operations Management Suite Microsoft Cloud Public
FAQ. NetApp MAT4Shift. March 2015
i FAQ NetApp MAT4Shift March 2015 TABLE OF CONTENTS 1 General... 3 1.1 Solution Overview...3 What is NetApp MAT4Shift?... 3 What business needs does this solution address?... 3 What is the value of the
SAP HANA virtualized Technology Roadmap. Arne Arnold, SAP HANA Product Management September, 2014
Technology Roadmap Arne Arnold, SAP HANA Product Management September, 2014 High-Level Overview Why care about SAP HANA virtualized Why SAP HANA virtualized offer customers reduced TCO and additional options
Cloud Optimize Your IT
Cloud Optimize Your IT Windows Server 2012 The information contained in this presentation relates to a pre-release product which may be substantially modified before it is commercially released. This pre-release
Virtualisation Cloud Computing at the RAL Tier 1. Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013
Virtualisation Cloud Computing at the RAL Tier 1 Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013 Virtualisation @ RAL Context at RAL Hyper-V Services Platform Scientific Computing Department
PARALLELS CLOUD STORAGE
PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...
Parallels Cloud Server 6.0
Parallels Cloud Server 6.0 Getting Started Guide March 17, 2015 Copyright 1999-2015 Parallels IP Holdings GmbH and its affiliates. All rights reserved. Parallels IP Holdings GmbH Vordergasse 59 8200 Schaffhausen
Het is een kleine stap naar een hybrid cloud
Het is een kleine stap naar een hybrid cloud Isabel Moll-Kranenburg Microsoft 14 jaar IT industrie 4 jaar Microsoft Cloud Private Private Cloud Meeting customers where they are The Microsoft Cloud Computing
Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013
Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device
WINDOWS AZURE AND WINDOWS HPC SERVER
David Chappell March 2012 WINDOWS AZURE AND WINDOWS HPC SERVER HIGH-PERFORMANCE COMPUTING IN THE CLOUD Sponsored by Microsoft Corporation Copyright 2012 Chappell & Associates Contents High-Performance
Facilitating Consistency Check between Specification and Implementation with MapReduce Framework
Facilitating Consistency Check between Specification and Implementation with MapReduce Framework Shigeru KUSAKABE, Yoichi OMORI, and Keijiro ARAKI Grad. School of Information Science and Electrical Engineering,
MongoDB and Couchbase
Benchmarking MongoDB and Couchbase No-SQL Databases Alex Voss Chris Choi University of St Andrews TOP 2 Questions Should a social scientist buy MORE or UPGRADE computers? Which DATABASE(s)? Document Oriented
Hardware Configuration Guide
Hardware Configuration Guide Contents Contents... 1 Annotation... 1 Factors to consider... 2 Machine Count... 2 Data Size... 2 Data Size Total... 2 Daily Backup Data Size... 2 Unique Data Percentage...
Hadoop IST 734 SS CHUNG
Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to
THE ON-DEMAND DATA CENTER NETWORK TRANSFORMATION IN A CHANGING INDUSTRY. Ken Cheng, CTO, Brocade October, 2013
THE ON-DEMAND DATA CENTER NETWORK TRANSFORMATION IN A CHANGING INDUSTRY Ken Cheng, CTO, Brocade October, 2013 Fundamentally we ve had to redesign and challenge the way we ve run distributed infrastructure
BlobSeer: Towards efficient data storage management on large-scale, distributed systems
: Towards efficient data storage management on large-scale, distributed systems Bogdan Nicolae University of Rennes 1, France KerData Team, INRIA Rennes Bretagne-Atlantique PhD Advisors: Gabriel Antoniu
A Novel Cloud Based Elastic Framework for Big Data Preprocessing
School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview
Dell One Identity Manager Scalability and Performance
Dell One Identity Manager Scalability and Performance Scale up and out to ensure simple, effective governance for users. Abstract For years, organizations have had to be able to support user communities
HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014
HPC Cluster Decisions and ANSYS Configuration Best Practices Diana Collier Lead Systems Support Specialist Houston UGM May 2014 1 Agenda Introduction Lead Systems Support Specialist Cluster Decisions Job
Understanding Enterprise NAS
Anjan Dave, Principal Storage Engineer LSI Corporation Author: Anjan Dave, Principal Storage Engineer, LSI Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA
Solution for private cloud computing
The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details Use cases By scientist By HEP experiment System requirements and installation How to get it? 2 What
Emerging Technology for the Next Decade
Emerging Technology for the Next Decade Cloud Computing Keynote Presented by Charles Liang, President & CEO Super Micro Computer, Inc. What is Cloud Computing? Cloud computing is Internet-based computing,
Relational Databases in the Cloud
Contact Information: February 2011 zimory scale White Paper Relational Databases in the Cloud Target audience CIO/CTOs/Architects with medium to large IT installations looking to reduce IT costs by creating
SAP HANA In-Memory Database Sizing Guideline
SAP HANA In-Memory Database Sizing Guideline Version 1.4 August 2013 2 DISCLAIMER Sizing recommendations apply for certified hardware only. Please contact hardware vendor for suitable hardware configuration.
Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com
Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...
Invest in your business with Ubuntu Advantage.
Invest in your business with Ubuntu Advantage. Expert advice. Specialist tools. Dedicated support. Introducing Ubuntu Advantage Contents 02 Introducing Ubuntu Advantage 03 Ubuntu Advantage 04 - Landscape
CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series www.cumulux.com
` CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS Review Business and Technology Series www.cumulux.com Table of Contents Cloud Computing Model...2 Impact on IT Management and
Distributed Text Mining with tm
Distributed Text Mining with tm Stefan Theußl 1 Ingo Feinerer 2 Kurt Hornik 1 Department of Statistics and Mathematics, WU Vienna University of Economics and Business 1 Institute of Information Systems,
Challenges for cloud software engineering
Challenges for cloud software engineering Ian Sommerville St Andrews University Why is cloud software engineering different or is it? What needs to be done to make cloud software engineering easier for
Gladstone Health & Leisure Technical Services
Gladstone Health & Leisure Technical Services Plus2 Environment Server Recommendations Commercial in Confidence Database Server Specifications Database server specifications are based on sizes in use on
Self service for software development tools
Self service for software development tools Michal Husejko, behalf of colleagues in CERN IT/PES CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Self service for software development tools
What Is Microsoft Private Cloud Fast Track?
What Is Microsoft Private Cloud Fast Track? MICROSOFT PRIVATE CLOUD FAST TRACK is a reference architecture for building private clouds that combines Microsoft software, consolidated guidance, and validated
EMC ISILON AND ELEMENTAL SERVER
Configuration Guide EMC ISILON AND ELEMENTAL SERVER Configuration Guide for EMC Isilon Scale-Out NAS and Elemental Server v1.9 EMC Solutions Group Abstract EMC Isilon and Elemental provide best-in-class,
Understanding Neo4j Scalability
Understanding Neo4j Scalability David Montag January 2013 Understanding Neo4j Scalability Scalability means different things to different people. Common traits associated include: 1. Redundancy in the
Cloud-pilot.doc 12-12-2010 SA1 Marcus Hardt, Marcin Plociennik, Ahmad Hammad, Bartek Palak E U F O R I A
Identifier: Date: Activity: Authors: Status: Link: Cloud-pilot.doc 12-12-2010 SA1 Marcus Hardt, Marcin Plociennik, Ahmad Hammad, Bartek Palak E U F O R I A J O I N T A C T I O N ( S A 1, J R A 3 ) F I
PrimaryIO Application Performance Acceleration Date: July 2015 Author: Tony Palmer, Senior Lab Analyst
ESG Lab Spotlight PrimaryIO Application Performance Acceleration Date: July 215 Author: Tony Palmer, Senior Lab Analyst Abstract: PrimaryIO Application Performance Acceleration (APA) is designed to provide
SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform 3.0.5 Reference Architecture
SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform 3.0.5 Reference Architecture 2 This reference architecture is a guideline for deploying Citrix CloudPlatform, powered by Apache CloudStack,
Deploying and Managing Microsoft System Center Virtual Machine Manager
Deploying and Managing Microsoft System Center Virtual Machine Manager Overview About this Course This three-day instructor-led course teaches students how to implement a Microsoft System Center Virtual
SAP BusinessObjects BI4 Sizing What You Need to Know
SAP BusinessObjects BI4 Sizing What You Need to Know Ian Treleaven Senior Portfolio Product Owner, BI Suite P&R, Enterprise Deployment SAP Product Group, Vancouver, Canada Session 0509 Disclaimer This
An HPC Application Deployment Model on Azure Cloud for SMEs
An HPC Application Deployment Model on Azure Cloud for SMEs Fan Ding CLOSER 2013, Aachen, Germany, May 9th,2013 Rechen- und Kommunikationszentrum (RZ) Agenda Motivation Windows Azure Relevant Technology
System requirements. for Installation of LANDESK Service Desk 2016. Clarita-Bernhard-Str. 25 D 81249 Muenchen. Magelan GmbH
System requirements for Installation of LANDESK Service Desk 2016 Created by: Magelan GmbH Magelan GmbH Clarita-Bernhard-Str. 25 D 81249 Muenchen Version Version Date Author Description 1.0 22.11.2012
Package parallel. R-core. May 19, 2015
Package parallel R-core May 19, 2015 1 Introduction Package parallel was first included in R 2.14.0. It builds on the work done for CRAN packages multicore (Urbanek, 2009 present) and snow (Tierney et
Advances in Virtualization In Support of In-Memory Big Data Applications
9/29/15 HPTS 2015 1 Advances in Virtualization In Support of In-Memory Big Data Applications SCALE SIMPLIFY OPTIMIZE EVOLVE Ike Nassi [email protected] 9/29/15 HPTS 2015 2 What is the Problem We
Urbancode Deploy Overview
Urbancode Deploy Overview Continuous delivery challenges facing customers 2 *Data based on UrbanCode customer survey Multi-Platform Application Deployment Automation Visibility and automated control of
3 Ways to build a SaaS Product. Asteor Software Inc Ram Kumar - Director Product Management
3 Ways to build a SaaS Product Asteor Software Inc Ram Kumar - Director Product Management SaaS without Multi-tenancy A separate server instance for each customer Separate Box Separate Shared Hosting Slice
Are You Ready for the Holiday Rush?
Are You Ready for the Holiday Rush? Five Survival Tips Written by Joseph Palumbo, Cloud Usability Team Leader Are You Ready for the Holiday Rush? Five Survival Tips Cover Table of Contents 1. Vertical
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION
DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies
Part 1: Windows Server 2012 R2. [email protected] Datacenter Specialist
Part 1: Windows Server 2012 R2 [email protected] Datacenter Specialist Agenda Trends Windows Server: At the heart of the Cloud OS Customer challenges and opportunities Windows Server 2012 R2 capability
Big Data Analytics and HPC
Big Data Analytics and HPC Matthew J. Denny [email protected] www.mjdenny.com @MatthewJDenny www.mjdenny.com/icpsr Data Science 2015.html July 28, 2015 Overview 1. Overview of High Performance Computing/Big
Amazon EC2 XenApp Scalability Analysis
WHITE PAPER Citrix XenApp Amazon EC2 XenApp Scalability Analysis www.citrix.com Table of Contents Introduction...3 Results Summary...3 Detailed Results...4 Methods of Determining Results...4 Amazon EC2
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next
Hadoop Architecture. Part 1
Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,
Continuous Integration for XML and RDF Data
Continuous Integration for XML and RDF Data Sandro Cirulli Language Technologist Oxford University Press (OUP) 6 June 2015 Table of contents 1. Context 2. Continuous Integration with Jenkins 3. Automatic
Hadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015
A Diablo Technologies Whitepaper Diablo and VMware TM powering SQL Server TM in Virtual SAN TM May 2015 Ricky Trigalo, Director for Virtualization Solutions Architecture, Diablo Technologies Daniel Beveridge,
Hadoop & SAS Data Loader for Hadoop
Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle
JovianDSS Evaluation and Product Training. Presentation updated: October 2015
JovianDSS Evaluation and Product Training JovianDSS High Availability Cluster running in a virtual environment on VMware ESXi Step-by-Step Presentation updated: October 2015 www.open-e.com 1 The aim of
