S3IT: Service and Support for Science IT. Scaling R on cloud infrastructure Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT

Size: px
Start display at page:

Download "S3IT: Service and Support for Science IT. Scaling R on cloud infrastructure Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT"

Transcription

1 S3IT: Service and Support for Science IT Scaling R on cloud infrastructure Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT Zurich,

2 Who am I? Sergio Maffioletti: Cloud and Application specialist Head of the IS/Cloud Services unit Head of the S3IT User support

3 What is S3IT? Connect IT and Science Zentrale Informatik ScienceIT support unit Dedicated support for computations and data analysis SPEED : faster time to solution ACCESS : to competitive infrastructure ENABLE : remove barriers new possibilities

4 Disclaimer What is presented here is *not* an authoritative analysis of how R can be scaled. It is just the result of few years of experience in supporting and helping research groups in improving R performances.

5 What are we going to talk about today? How to scale your R code on a cloud infrastructure 1. What is a could infrastructure 2. What problem are we trying to address 3. Possible scenario for scaling R

6 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources Customization and control of the environment Multi-tenancy

7 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Virtual Machines (VM) Virtual Storage block devices Virtual private network Self-provisioning and Elasticity of resources Customization and control of the environment Multi-tenancy

8 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources End-users can allocate and release resources when needed. Customization and control of the environment Multi-tenancy

9 What is a cloud infrastructure? Infrastructure Cloud Service Multi-Tenancy Compute Storage Network Virtualisation infrastructure Self-provisioning and Elasticity of resources Customization and control of the environment End-users can tailor the research infrastructure to his/her specific needs. Multi-tenancy

10 What problem are we trying to address? You normally run your R scripts on your local workstation At best: 8 cores, 16GB RAM, GB SSD drive This configuration is quite sufficient for most of the small/medium scale analysis problems

11 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply parallel: built on multicore and snow

12 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply Single node, multi-core Relatively easy to use within your script Scaling limited to single node availability parallel: built on multicore and snow

13 How your R script lever the infrastructure R is single-threaded There are several packages for parallel computation mclapply: a parallelized version of lapply parallel: built on multicore and snow Can use the CPUs/cores on a single machine (multicore), or several machines, using MPI (snow) Need to prepare the cluster of resources before initialization Need to apply LoadBalancing to distribute tasks Not widely used in community

14 mclapply example library(multicore) workerfunc <- function(n) return(nˆ2) values <- 1:100 library(parallel) numworkers <- 8 res <- mclapply(values, workerfunc, mc.cores = numworkers) print(unlist(res))

15 using parallel package workerfunc <- function(n) return(nˆ2) values <- 1:100 library(parallel) numworkers <- 8 cl <- makecluster(numworkers, type = "MPI") res <- parlapply(cl, values, workerfunc) stopcluster(cl) mpi.exit() print(unlist(res)) Running: mpirun -n 1 R -slave -f simple_mpi.r

16 Common limitation All make the assumption the infrastructure is available On a cloud infrastructure, provision (and release) of resources is a necessary step

17 How do you scale your R code on a cloud infrastructure? Scale-up Bigger workstation RStudio Scale-out

18 Scale-up: Bigger workstation

19 Scale-up: Bigger workstation Drawbacks: Cannot scale indefinitely (limit is the node size) Higher specs for the node = higher costs

20 Scale-up: RStudio Web-based interface to run R scripts Same single point of access (group, individual user)

21 Scale-up: RStudio Web-based interface to run R scripts Same single point of access (group, individual user) Drawbacks: Still runs on single node

22 Scale-out: GC3Pie Scaling outside of your R script Divide large execution in smaller chunks Run those chunks independently from each other Provision of the infrastructure based on script requirements

23 Let s see an example: gweight Usecase from Business department Run GetWeight function over 3.5M forum entries Each GetWeight takes 2 That s still 75 days on a 64 cores node Each item in the forum could be processed independently from the others

24 gweight workflow Takes initial 3.5M entries input (in.csv format) Create smaller.csv files of size chunk (default 1000) For each chunk file, create a dedicated VM On the VM, run GetWeight function with the specific.csv chunk file Terminate VM if no more.csv files have to be processed Aggregate results at the end into a single large.csv result file

25 Let s see how it work...

26 Conclusions Scaling R effectively on cloud infrastructure requires knowledge on how to split your computation S3IT provides support and tools (e.g. GC3Pie) Get in touch with us: [email protected], Visit our website:

Introducing ScienceCloud

Introducing ScienceCloud Zentrale Informatik Introducing ScienceCloud Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT Zurich, 10.03.2015 What are we going to talk about today? 1. Why are we building ScienceCloud?

More information

Big Data and Parallel Work with R

Big Data and Parallel Work with R Big Data and Parallel Work with R What We'll Cover Data Limits in R Optional Data packages Optional Function packages Going parallel Deciding what to do Data Limits in R Big Data? What is big data? More

More information

Parallel Options for R

Parallel Options for R Parallel Options for R Glenn K. Lockwood SDSC User Services [email protected] Motivation "I just ran an intensive R script [on the supercomputer]. It's not much faster than my own machine." Motivation "I

More information

Scaling up to Production

Scaling up to Production 1 Scaling up to Production Overview Productionize then Scale Building Production Systems Scaling Production Systems Use Case: Scaling a Production Galaxy Instance Infrastructure Advice 2 PRODUCTIONIZE

More information

HPC performance applications on Virtual Clusters

HPC performance applications on Virtual Clusters Panagiotis Kritikakos EPCC, School of Physics & Astronomy, University of Edinburgh, Scotland - UK [email protected] 4 th IC-SCCE, Athens 7 th July 2010 This work investigates the performance of (Java)

More information

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000 Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000 Alexandra Carpen-Amarie Diana Moise Bogdan Nicolae KerData Team, INRIA Outline

More information

SURFsara HPC Cloud Workshop

SURFsara HPC Cloud Workshop SURFsara HPC Cloud Workshop doc.hpccloud.surfsara.nl UvA workshop 2016-01-25 UvA HPC Course Jan 2016 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current

More information

Hardware/Software Guidelines

Hardware/Software Guidelines There are many things to consider when preparing for a TRAVERSE v11 installation. The number of users, application modules and transactional volume are only a few. Reliable performance of the system is

More information

ADAM 5.5. System Requirements

ADAM 5.5. System Requirements ADAM 5.5 System Requirements 1 1. Overview The schema below shows an overview of the ADAM components that will be installed and set up. ADAM Server: hosts the ADAM core components. You must install the

More information

Running R from Amazon's Elastic Compute Cloud

Running R from Amazon's Elastic Compute Cloud Running R on the Running R from Amazon's Elastic Compute Cloud Department of Statistics University of NebraskaLincoln April 30, 2014 Running R on the 1 Introduction 2 3 Running R on the Pre-made AMI Building

More information

Enterprise Application Integration (Middleware)

Enterprise Application Integration (Middleware) Enterprise Application Integration (Middleware) Gustavo Alonso Systems Group Computer Science Department - ETH Zurich [email protected] http://www.systems.inf.ethz.ch/ EAI Course Administration Lecture:

More information

Veeam Cloud Connect. Version 8.0. Administrator Guide

Veeam Cloud Connect. Version 8.0. Administrator Guide Veeam Cloud Connect Version 8.0 Administrator Guide April, 2015 2015 Veeam Software. All rights reserved. All trademarks are the property of their respective owners. No part of this publication may be

More information

BT Ireland and the Cloud

BT Ireland and the Cloud BT Ireland and the Cloud Beyond the Cloud v1 5.9.11 v1 5.9.11 The Promise Ease of Use New Features Agility Reduced Cost v1 5.9.11 It is everything you expect or More expensive than you think Contracts

More information

CLUSTER COMPUTING TODAY

CLUSTER COMPUTING TODAY David Chappell June 2011 CLUSTER COMPUTING TODAY WHAT S CHANGED AND WHY IT MATTERS Sponsored by Microsoft Corporation Copyright 2011 Chappell & Associates One way to make an application run faster is to

More information

SURFsara HPC Cloud Workshop

SURFsara HPC Cloud Workshop SURFsara HPC Cloud Workshop www.cloud.sara.nl Tutorial 2014-06-11 UvA HPC and Big Data Course June 2014 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current

More information

DevOps with Containers. for Microservices

DevOps with Containers. for Microservices DevOps with Containers for Microservices DevOps is a Software Development Method Keywords Communication, collaboration, integration, automation, measurement Goals improved deployment frequency faster time

More information

Openstack. Cloud computing with Openstack. Saverio Proto [email protected]

Openstack. Cloud computing with Openstack. Saverio Proto saverio.proto@switch.ch Openstack Cloud computing with Openstack Saverio Proto [email protected] Lugano, 23/03/2016 Agenda SWITCH role in Openstack and Cloud Computing What is Virtualization? Why is Cloud computing more

More information

Cloud Computing through Virtualization and HPC technologies

Cloud Computing through Virtualization and HPC technologies Cloud Computing through Virtualization and HPC technologies William Lu, Ph.D. 1 Agenda Cloud Computing & HPC A Case of HPC Implementation Application Performance in VM Summary 2 Cloud Computing & HPC HPC

More information

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure TECHNICAL WHITE PAPER Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure A collaboration between Canonical and VMware

More information

High Performance Computing in CST STUDIO SUITE

High Performance Computing in CST STUDIO SUITE High Performance Computing in CST STUDIO SUITE Felix Wolfheimer GPU Computing Performance Speedup 18 16 14 12 10 8 6 4 2 0 Promo offer for EUC participants: 25% discount for K40 cards Speedup of Solver

More information

Your Place or Mine? In-House e- Discovery Platform vs. Software as a Service

Your Place or Mine? In-House e- Discovery Platform vs. Software as a Service Your Place or Mine? In-House e-discovery Platform vs. Software as a Service Your Place or Mine? In-House e- Discovery Platform vs. Software as a Service Teresa McMahon Managing Director Navigant Consulting

More information

Best Practices for Virtualised SharePoint

Best Practices for Virtualised SharePoint Best Practices for Virtualised SharePoint Brendan Law [email protected] @FlamerNZ Flamer.co.nz/spag/ Nathan Mercer [email protected] @NathanM blogs.technet.com/nmercer/ Agenda Why Virtualise? Hardware

More information

Performance Testing of a Cloud Service

Performance Testing of a Cloud Service Performance Testing of a Cloud Service Trilesh Bhurtun, Junior Consultant, Capacitas Ltd Capacitas 2012 1 Introduction Objectives Environment Tests and Results Issues Summary Agenda Capacitas 2012 2 1

More information

Hadoop as a Service. VMware vcloud Automation Center & Big Data Extension

Hadoop as a Service. VMware vcloud Automation Center & Big Data Extension Hadoop as a Service VMware vcloud Automation Center & Big Data Extension Table of Contents 1. Introduction... 2 1.1 How it works... 2 2. System Pre-requisites... 2 3. Set up... 2 3.1 Request the Service

More information

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011 Scalable Data Analysis in R Lee E. Edlefsen Chief Scientist UserR! 2011 1 Introduction Our ability to collect and store data has rapidly been outpacing our ability to analyze it We need scalable data analysis

More information

Contact for all enquiries Phone: +61 2 8006 9730. Email: [email protected]. Page 2. RecordPoint Release Notes V3.8 for SharePoint 2013

Contact for all enquiries Phone: +61 2 8006 9730. Email: info@recordpoint.com.au. Page 2. RecordPoint Release Notes V3.8 for SharePoint 2013 Release Notes V3.8 Notice This document contains confidential and trade secret information of RecordPoint Software ( RPS ). RecordPoint Software has prepared this document for use solely with RecordPoint.

More information

Is there any alternative to Exadata X5? March 2015

Is there any alternative to Exadata X5? March 2015 Is there any alternative to Exadata X5? March 2015 Contents 1 About Benchware Ltd. 2 Licensing 3 Scalability 4 Exadata Specifics 5 Performance 6 Costs 7 Myths 8 Conclusion copyright 2015 by benchware.ch

More information

Cloud Federation to Elastically Increase MapReduce Processing Resources

Cloud Federation to Elastically Increase MapReduce Processing Resources Cloud Federation to Elastically Increase MapReduce Processing Resources A.Panarello, A.Celesti, M. Villari, M. Fazio and A. Puliafito {apanarello,acelesti, mfazio, mvillari, apuliafito}@unime.it DICIEAMA,

More information

Server Installation Manual 4.4.1

Server Installation Manual 4.4.1 Server Installation Manual 4.4.1 1. Product Information Product: BackupAgent Server Version: 4.4.1 2. Introduction BackupAgent Server has several features. The application is a web application and offers:

More information

Open source Google-style large scale data analysis with Hadoop

Open source Google-style large scale data analysis with Hadoop Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: [email protected] Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical

More information

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1 Scale-out and Cloud

More information

Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise

Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise NEW: Operations Management Suite Microsoft Cloud Public

More information

FAQ. NetApp MAT4Shift. March 2015

FAQ. NetApp MAT4Shift. March 2015 i FAQ NetApp MAT4Shift March 2015 TABLE OF CONTENTS 1 General... 3 1.1 Solution Overview...3 What is NetApp MAT4Shift?... 3 What business needs does this solution address?... 3 What is the value of the

More information

SAP HANA virtualized Technology Roadmap. Arne Arnold, SAP HANA Product Management September, 2014

SAP HANA virtualized Technology Roadmap. Arne Arnold, SAP HANA Product Management September, 2014 Technology Roadmap Arne Arnold, SAP HANA Product Management September, 2014 High-Level Overview Why care about SAP HANA virtualized Why SAP HANA virtualized offer customers reduced TCO and additional options

More information

Cloud Optimize Your IT

Cloud Optimize Your IT Cloud Optimize Your IT Windows Server 2012 The information contained in this presentation relates to a pre-release product which may be substantially modified before it is commercially released. This pre-release

More information

Virtualisation Cloud Computing at the RAL Tier 1. Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013

Virtualisation Cloud Computing at the RAL Tier 1. Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013 Virtualisation Cloud Computing at the RAL Tier 1 Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013 Virtualisation @ RAL Context at RAL Hyper-V Services Platform Scientific Computing Department

More information

PARALLELS CLOUD STORAGE

PARALLELS CLOUD STORAGE PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...

More information

Parallels Cloud Server 6.0

Parallels Cloud Server 6.0 Parallels Cloud Server 6.0 Getting Started Guide March 17, 2015 Copyright 1999-2015 Parallels IP Holdings GmbH and its affiliates. All rights reserved. Parallels IP Holdings GmbH Vordergasse 59 8200 Schaffhausen

More information

Het is een kleine stap naar een hybrid cloud

Het is een kleine stap naar een hybrid cloud Het is een kleine stap naar een hybrid cloud Isabel Moll-Kranenburg Microsoft 14 jaar IT industrie 4 jaar Microsoft Cloud Private Private Cloud Meeting customers where they are The Microsoft Cloud Computing

More information

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013 Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device

More information

WINDOWS AZURE AND WINDOWS HPC SERVER

WINDOWS AZURE AND WINDOWS HPC SERVER David Chappell March 2012 WINDOWS AZURE AND WINDOWS HPC SERVER HIGH-PERFORMANCE COMPUTING IN THE CLOUD Sponsored by Microsoft Corporation Copyright 2012 Chappell & Associates Contents High-Performance

More information

Facilitating Consistency Check between Specification and Implementation with MapReduce Framework

Facilitating Consistency Check between Specification and Implementation with MapReduce Framework Facilitating Consistency Check between Specification and Implementation with MapReduce Framework Shigeru KUSAKABE, Yoichi OMORI, and Keijiro ARAKI Grad. School of Information Science and Electrical Engineering,

More information

MongoDB and Couchbase

MongoDB and Couchbase Benchmarking MongoDB and Couchbase No-SQL Databases Alex Voss Chris Choi University of St Andrews TOP 2 Questions Should a social scientist buy MORE or UPGRADE computers? Which DATABASE(s)? Document Oriented

More information

Hardware Configuration Guide

Hardware Configuration Guide Hardware Configuration Guide Contents Contents... 1 Annotation... 1 Factors to consider... 2 Machine Count... 2 Data Size... 2 Data Size Total... 2 Daily Backup Data Size... 2 Unique Data Percentage...

More information

Hadoop IST 734 SS CHUNG

Hadoop IST 734 SS CHUNG Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to

More information

THE ON-DEMAND DATA CENTER NETWORK TRANSFORMATION IN A CHANGING INDUSTRY. Ken Cheng, CTO, Brocade October, 2013

THE ON-DEMAND DATA CENTER NETWORK TRANSFORMATION IN A CHANGING INDUSTRY. Ken Cheng, CTO, Brocade October, 2013 THE ON-DEMAND DATA CENTER NETWORK TRANSFORMATION IN A CHANGING INDUSTRY Ken Cheng, CTO, Brocade October, 2013 Fundamentally we ve had to redesign and challenge the way we ve run distributed infrastructure

More information

BlobSeer: Towards efficient data storage management on large-scale, distributed systems

BlobSeer: Towards efficient data storage management on large-scale, distributed systems : Towards efficient data storage management on large-scale, distributed systems Bogdan Nicolae University of Rennes 1, France KerData Team, INRIA Rennes Bretagne-Atlantique PhD Advisors: Gabriel Antoniu

More information

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

A Novel Cloud Based Elastic Framework for Big Data Preprocessing School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview

More information

Dell One Identity Manager Scalability and Performance

Dell One Identity Manager Scalability and Performance Dell One Identity Manager Scalability and Performance Scale up and out to ensure simple, effective governance for users. Abstract For years, organizations have had to be able to support user communities

More information

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014 HPC Cluster Decisions and ANSYS Configuration Best Practices Diana Collier Lead Systems Support Specialist Houston UGM May 2014 1 Agenda Introduction Lead Systems Support Specialist Cluster Decisions Job

More information

Understanding Enterprise NAS

Understanding Enterprise NAS Anjan Dave, Principal Storage Engineer LSI Corporation Author: Anjan Dave, Principal Storage Engineer, LSI Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA

More information

Solution for private cloud computing

Solution for private cloud computing The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details Use cases By scientist By HEP experiment System requirements and installation How to get it? 2 What

More information

Emerging Technology for the Next Decade

Emerging Technology for the Next Decade Emerging Technology for the Next Decade Cloud Computing Keynote Presented by Charles Liang, President & CEO Super Micro Computer, Inc. What is Cloud Computing? Cloud computing is Internet-based computing,

More information

Relational Databases in the Cloud

Relational Databases in the Cloud Contact Information: February 2011 zimory scale White Paper Relational Databases in the Cloud Target audience CIO/CTOs/Architects with medium to large IT installations looking to reduce IT costs by creating

More information

SAP HANA In-Memory Database Sizing Guideline

SAP HANA In-Memory Database Sizing Guideline SAP HANA In-Memory Database Sizing Guideline Version 1.4 August 2013 2 DISCLAIMER Sizing recommendations apply for certified hardware only. Please contact hardware vendor for suitable hardware configuration.

More information

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...

More information

Invest in your business with Ubuntu Advantage.

Invest in your business with Ubuntu Advantage. Invest in your business with Ubuntu Advantage. Expert advice. Specialist tools. Dedicated support. Introducing Ubuntu Advantage Contents 02 Introducing Ubuntu Advantage 03 Ubuntu Advantage 04 - Landscape

More information

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series www.cumulux.com

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series www.cumulux.com ` CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS Review Business and Technology Series www.cumulux.com Table of Contents Cloud Computing Model...2 Impact on IT Management and

More information

Distributed Text Mining with tm

Distributed Text Mining with tm Distributed Text Mining with tm Stefan Theußl 1 Ingo Feinerer 2 Kurt Hornik 1 Department of Statistics and Mathematics, WU Vienna University of Economics and Business 1 Institute of Information Systems,

More information

Challenges for cloud software engineering

Challenges for cloud software engineering Challenges for cloud software engineering Ian Sommerville St Andrews University Why is cloud software engineering different or is it? What needs to be done to make cloud software engineering easier for

More information

Gladstone Health & Leisure Technical Services

Gladstone Health & Leisure Technical Services Gladstone Health & Leisure Technical Services Plus2 Environment Server Recommendations Commercial in Confidence Database Server Specifications Database server specifications are based on sizes in use on

More information

Self service for software development tools

Self service for software development tools Self service for software development tools Michal Husejko, behalf of colleagues in CERN IT/PES CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Self service for software development tools

More information

What Is Microsoft Private Cloud Fast Track?

What Is Microsoft Private Cloud Fast Track? What Is Microsoft Private Cloud Fast Track? MICROSOFT PRIVATE CLOUD FAST TRACK is a reference architecture for building private clouds that combines Microsoft software, consolidated guidance, and validated

More information

EMC ISILON AND ELEMENTAL SERVER

EMC ISILON AND ELEMENTAL SERVER Configuration Guide EMC ISILON AND ELEMENTAL SERVER Configuration Guide for EMC Isilon Scale-Out NAS and Elemental Server v1.9 EMC Solutions Group Abstract EMC Isilon and Elemental provide best-in-class,

More information

Understanding Neo4j Scalability

Understanding Neo4j Scalability Understanding Neo4j Scalability David Montag January 2013 Understanding Neo4j Scalability Scalability means different things to different people. Common traits associated include: 1. Redundancy in the

More information

Cloud-pilot.doc 12-12-2010 SA1 Marcus Hardt, Marcin Plociennik, Ahmad Hammad, Bartek Palak E U F O R I A

Cloud-pilot.doc 12-12-2010 SA1 Marcus Hardt, Marcin Plociennik, Ahmad Hammad, Bartek Palak E U F O R I A Identifier: Date: Activity: Authors: Status: Link: Cloud-pilot.doc 12-12-2010 SA1 Marcus Hardt, Marcin Plociennik, Ahmad Hammad, Bartek Palak E U F O R I A J O I N T A C T I O N ( S A 1, J R A 3 ) F I

More information

PrimaryIO Application Performance Acceleration Date: July 2015 Author: Tony Palmer, Senior Lab Analyst

PrimaryIO Application Performance Acceleration Date: July 2015 Author: Tony Palmer, Senior Lab Analyst ESG Lab Spotlight PrimaryIO Application Performance Acceleration Date: July 215 Author: Tony Palmer, Senior Lab Analyst Abstract: PrimaryIO Application Performance Acceleration (APA) is designed to provide

More information

SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform 3.0.5 Reference Architecture

SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform 3.0.5 Reference Architecture SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform 3.0.5 Reference Architecture 2 This reference architecture is a guideline for deploying Citrix CloudPlatform, powered by Apache CloudStack,

More information

Deploying and Managing Microsoft System Center Virtual Machine Manager

Deploying and Managing Microsoft System Center Virtual Machine Manager Deploying and Managing Microsoft System Center Virtual Machine Manager Overview About this Course This three-day instructor-led course teaches students how to implement a Microsoft System Center Virtual

More information

SAP BusinessObjects BI4 Sizing What You Need to Know

SAP BusinessObjects BI4 Sizing What You Need to Know SAP BusinessObjects BI4 Sizing What You Need to Know Ian Treleaven Senior Portfolio Product Owner, BI Suite P&R, Enterprise Deployment SAP Product Group, Vancouver, Canada Session 0509 Disclaimer This

More information

An HPC Application Deployment Model on Azure Cloud for SMEs

An HPC Application Deployment Model on Azure Cloud for SMEs An HPC Application Deployment Model on Azure Cloud for SMEs Fan Ding CLOSER 2013, Aachen, Germany, May 9th,2013 Rechen- und Kommunikationszentrum (RZ) Agenda Motivation Windows Azure Relevant Technology

More information

System requirements. for Installation of LANDESK Service Desk 2016. Clarita-Bernhard-Str. 25 D 81249 Muenchen. Magelan GmbH

System requirements. for Installation of LANDESK Service Desk 2016. Clarita-Bernhard-Str. 25 D 81249 Muenchen. Magelan GmbH System requirements for Installation of LANDESK Service Desk 2016 Created by: Magelan GmbH Magelan GmbH Clarita-Bernhard-Str. 25 D 81249 Muenchen Version Version Date Author Description 1.0 22.11.2012

More information

Package parallel. R-core. May 19, 2015

Package parallel. R-core. May 19, 2015 Package parallel R-core May 19, 2015 1 Introduction Package parallel was first included in R 2.14.0. It builds on the work done for CRAN packages multicore (Urbanek, 2009 present) and snow (Tierney et

More information

Advances in Virtualization In Support of In-Memory Big Data Applications

Advances in Virtualization In Support of In-Memory Big Data Applications 9/29/15 HPTS 2015 1 Advances in Virtualization In Support of In-Memory Big Data Applications SCALE SIMPLIFY OPTIMIZE EVOLVE Ike Nassi [email protected] 9/29/15 HPTS 2015 2 What is the Problem We

More information

Urbancode Deploy Overview

Urbancode Deploy Overview Urbancode Deploy Overview Continuous delivery challenges facing customers 2 *Data based on UrbanCode customer survey Multi-Platform Application Deployment Automation Visibility and automated control of

More information

3 Ways to build a SaaS Product. Asteor Software Inc Ram Kumar - Director Product Management

3 Ways to build a SaaS Product. Asteor Software Inc Ram Kumar - Director Product Management 3 Ways to build a SaaS Product Asteor Software Inc Ram Kumar - Director Product Management SaaS without Multi-tenancy A separate server instance for each customer Separate Box Separate Shared Hosting Slice

More information

Are You Ready for the Holiday Rush?

Are You Ready for the Holiday Rush? Are You Ready for the Holiday Rush? Five Survival Tips Written by Joseph Palumbo, Cloud Usability Team Leader Are You Ready for the Holiday Rush? Five Survival Tips Cover Table of Contents 1. Vertical

More information

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION A DIABLO WHITE PAPER AUGUST 2014 Ricky Trigalo Director of Business Development Virtualization, Diablo Technologies

More information

Part 1: Windows Server 2012 R2. [email protected] Datacenter Specialist

Part 1: Windows Server 2012 R2. Clive.watson@Microsoft.com Datacenter Specialist Part 1: Windows Server 2012 R2 [email protected] Datacenter Specialist Agenda Trends Windows Server: At the heart of the Cloud OS Customer challenges and opportunities Windows Server 2012 R2 capability

More information

Big Data Analytics and HPC

Big Data Analytics and HPC Big Data Analytics and HPC Matthew J. Denny [email protected] www.mjdenny.com @MatthewJDenny www.mjdenny.com/icpsr Data Science 2015.html July 28, 2015 Overview 1. Overview of High Performance Computing/Big

More information

Amazon EC2 XenApp Scalability Analysis

Amazon EC2 XenApp Scalability Analysis WHITE PAPER Citrix XenApp Amazon EC2 XenApp Scalability Analysis www.citrix.com Table of Contents Introduction...3 Results Summary...3 Detailed Results...4 Methods of Determining Results...4 Amazon EC2

More information

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next

More information

Hadoop Architecture. Part 1

Hadoop Architecture. Part 1 Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,

More information

Continuous Integration for XML and RDF Data

Continuous Integration for XML and RDF Data Continuous Integration for XML and RDF Data Sandro Cirulli Language Technologist Oxford University Press (OUP) 6 June 2015 Table of contents 1. Context 2. Continuous Integration with Jenkins 3. Automatic

More information

Hadoop: Embracing future hardware

Hadoop: Embracing future hardware Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop

More information

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015 A Diablo Technologies Whitepaper Diablo and VMware TM powering SQL Server TM in Virtual SAN TM May 2015 Ricky Trigalo, Director for Virtualization Solutions Architecture, Diablo Technologies Daniel Beveridge,

More information

Hadoop & SAS Data Loader for Hadoop

Hadoop & SAS Data Loader for Hadoop Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle

More information

JovianDSS Evaluation and Product Training. Presentation updated: October 2015

JovianDSS Evaluation and Product Training. Presentation updated: October 2015 JovianDSS Evaluation and Product Training JovianDSS High Availability Cluster running in a virtual environment on VMware ESXi Step-by-Step Presentation updated: October 2015 www.open-e.com 1 The aim of

More information