How To Calculate Estimatepi On An Ec2 Cluster
|
|
|
- Maximillian Sharp
- 5 years ago
- Views:
Transcription
1 deathstar seamless cloud computing for R Whit Armstrong [email protected] KLS Diversified Asset Management May 11, 2012 deathstar seamless cloud computing for R 1 / 27
2 What is deathstar? deathstar is a dead simple way to run lapply across EC2 nodes or local compute nodes deathstar uses ZMQ (via the rzmq package) to deliver both data and code over the wire deathstar consists of two components: a daemon and an R package simplicity is the key theme deathstar seamless cloud computing for R 2 / 27
3 deathstar visual user user workstation node0,node1 local or cloud machines n0 workers, n1 workers deathstar workers deathstar seamless cloud computing for R 3 / 27
4 deathstar daemon details deathstar is a daemon that listens on two ports the exec port listens for incoming work the status port fulfills queries on the current workload capacity of the server (this is what allows dynamic job allocation) deathstar seamless cloud computing for R 4 / 27
5 deathstar queue device (about 100 lines of c++) void deathstar(zmq::socket_t& frontend, zmq::socket_t& backend, zmq::socket_t& worker_signal, zmq::socket_t& status) { int number_of_workers(0); // loosely copied from lazy pirate (ZMQ guide) while (1) { zmq_pollitem_t items [] = { { worker_signal, 0, ZMQ_POLLIN, 0, { status, 0, ZMQ_POLLIN, 0, { backend, 0, ZMQ_POLLIN, 0, { frontend, 0, ZMQ_POLLIN, 0 ; // frontend only if we have available workers // otherwise poll both worker_signal and backend int rc = zmq_poll(items, number_of_workers? 4 : 3, -1); // Interrupted if(rc == -1) { break; // worker_signal -- worker came online if(items[0].revents & ZMQ_POLLIN) { process_worker_signal(worker_signal, number_of_workers); // status -- client status request if(items[1].revents & ZMQ_POLLIN) { process_status_request(status, number_of_workers); // backend -- send data from worker to client if(items[2].revents & ZMQ_POLLIN) { process_backend(frontend, backend); // frontend -- send data from client to worker if (items[3].revents & ZMQ_POLLIN) { process_frontend(frontend, backend, number_of_workers); deathstar seamless cloud computing for R 5 / 27
6 deathstar worker (25 SLOC of R) #!/usr/bin/env Rscript library(rzmq) worker.id <- paste(sys.info()["nodename"],sys.getpid(),sep=":") cmd.args <- commandargs(trailingonly=true) print(cmd.args) work.endpoint <- cmd.args[1] ready.endpoint <- cmd.args[2] log.file <- cmd.args[3] sink(log.file) context = init.context() ready.socket = init.socket(context,"zmq_push") work.socket = init.socket(context,"zmq_rep") connect.socket(ready.socket,ready.endpoint) connect.socket(work.socket,work.endpoint) while(1) { ## send control message to indicate worker is up send.null.msg(ready.socket) ## wait for work msg = receive.socket(work.socket); index <- msg$index fun <- msg$fun args <- msg$args print(system.time(result <- try(do.call(fun,args),silent=true))) send.socket(work.socket,list(index=index,result=result,node=worker.id)); print(gc(verbose=true)) deathstar seamless cloud computing for R 6 / 27
7 deathstar package exposes zmq.cluster.lapply code is a little too long for a slide, but does threetwo basic things: check node capacity, if capacity > 0, submit job collect jobs node life cycle monitoring (future feature) So, what can it do? deathstar seamless cloud computing for R 7 / 27
8 Stochastic estimation of Pi Borrowed from JD Long: or estimating-pi-with-monte-carlo-methods estimatepi <- function(seed,draws) { set.seed(seed) r <-.5 x <- runif(draws, min=-r, max=r) y <- runif(draws, min=-r, max=r) incircle <- ifelse( (x^2 + y^2)^.5 < r, 1, 0) sum(incircle) / length(incircle) * 4 deathstar seamless cloud computing for R 8 / 27
9 Basic lapply example estimatepi require(deathstar) cluster <- c("krypton","xenon","node1","mongodb","research") pi.local.runtime <- system.time(pi.local <- zmq.cluster.lapply(cluster=cluster,1:1e4,estimatepi,draws=1e5)) print(mean(unlist(pi.local))) ## [1] print(pi.local.runtime) ## user system elapsed ## print(node.summary(attr(pi.local,"execution.report"))) ## node jobs.completed ## 1 krypton 2206 ## 2 mongodb 2191 ## 3 node ## 4 research 1357 ## 5 xenon 2648 deathstar seamless cloud computing for R 9 / 27
10 Simple beta calculation calc.beta <- function(ticker,mkt,n.years) { require(kls) x <- get.bbg(ticker); x <- tail(x,250*n.years) ticker <- gsub(" ",".",ticker) x.diff <- diff(x,1) * 100 reg.data <- as.data.frame(cbind(x.diff,mkt)) colnames(reg.data) <- c("asofdate",ticker,"spx") formula <- paste(ticker,"spx",sep=" ~ ") coef(lm(formula,data=reg.data)) deathstar seamless cloud computing for R 10 / 27
11 Simple beta calculation setup require(kls) spx <- get.bbg("spxt Index") spx.ret <- (spx/lag(spx, 1) - 1) * 100 tickers <- c("usgg30yr Index", "USGG10YR Index", "USGG5YR Index", "USGG2YR Index", "USGG3YR Index", "USSW10 Curncy", "USSW5 Curncy", "USSW2 Curncy", "TY1 Comdty", "CN1 Comdty", "RX1 Comdty", "G 1 Comdty", "JB1 Comdty", "XM1 Comdty") cluster <- c("krypton", "research", "mongodb") deathstar seamless cloud computing for R 11 / 27
12 Simple beta calculation results require(deathstar) beta <- zmq.cluster.lapply(cluster = cluster, tickers, calc.beta, mkt = spx.ret, n.years = 4) beta <- do.call(rbind, beta) rownames(beta) <- tickers print(beta) ## (Intercept) SPX ## USGG30YR Index ## USGG10YR Index ## USGG5YR Index ## USGG2YR Index ## USGG3YR Index ## USSW10 Curncy ## USSW5 Curncy ## USSW2 Curncy ## TY1 Comdty ## CN1 Comdty ## RX1 Comdty ## G 1 Comdty ## JB1 Comdty ## XM1 Comdty deathstar seamless cloud computing for R 12 / 27
13 So, what have we learned? deathstar looks pretty much like lapply, or parlapply it s pretty fast not much configuration deathstar seamless cloud computing for R 13 / 27
14 into the cloud! Basic steps to customize your EC2 instance. The deathstar daemon can be downloaded here: 1-1_amd64.deb ## launch EC2 instance from web console ## assume instance public dns is: ec compute-1.amazonaws.com ## from local workstation scp rzmq_0.6.6.tar.gz \ ec compute-1.amazonaws.com:/home/ubuntu scp ~/.s3cfg \ ec compute-1.amazonaws.com:/var/lib/deathstar ## from EC2 node sudo apt-get install r-base-core r-base-dev libzmq-dev s3cmd sudo R CMD INSTALL rzmq_0.6.6.tar.gz sudo R CMD INSTALL AWS.tools_0.0.6.tar.gz sudo dpkg -i deathstar_ _amd64.deb ## from local workstation ## check that the right ports are open (6000,6001) ## and AWS firewall is configured properly nmap ec compute-1.amazonaws.com deathstar seamless cloud computing for R 14 / 27
15 a little more aggressive estimatepi on EC2 c1.xlarge is an 8 core machine with 7 GB of RAM. require(deathstar) require(aws.tools) ## start the EC2 cluster cluster <- startcluster(ami = "ami-1c05a075", key = "kls-ec2", instance.count = 4, instance.type = "c1.xlarge") nodes <- get.nodes(cluster) pi.ec2.runtime <- system.time(pi.ec2 <- zmq.cluster.lapply(cluster = nodes, 1:10000, estimatepi, draws = 1e+05)) deathstar seamless cloud computing for R 15 / 27
16 estimatepi on EC2 workload distribution print(mean(unlist(pi.ec2))) ## [1] print(pi.ec2.runtime) ## user system elapsed ## print(node.summary(attr(pi.ec2, "execution.report"))) ## node jobs.completed ## 1 ip ## 2 ip ## 3 ip ## 4 ip deathstar seamless cloud computing for R 16 / 27
17 mixed mode use local + cloud resources ## the EC2 cluster is still running from the last example cloud.nodes <- get.nodes(cluster) nodes <- c(c("krypton", "xenon", "node1", "mongodb", "research"), cloud.nodes) pi.mixed.runtime <- system.time(pi.mixed <- zmq.cluster.lapply(cluster = nodes, 1:10000, estimatepi, draws = 1e+05)) ## turn the cluster off terminatecluster(cluster) ## [,1] [,2] [,3] [,4] ## [1,] "INSTANCE" "i-d35d82b5" "running" "shutting-down" ## [2,] "INSTANCE" "i-d15d82b7" "running" "shutting-down" ## [3,] "INSTANCE" "i-d75d82b1" "running" "shutting-down" ## [4,] "INSTANCE" "i-d55d82b3" "running" "shutting-down" deathstar seamless cloud computing for R 17 / 27
18 workload distribution for local + cloud print(mean(unlist(pi.mixed))) ## [1] print(pi.mixed.runtime) ## user system elapsed ## print(node.summary(attr(pi.mixed, "execution.report"))) ## node jobs.completed ## 1 ip ## 2 ip ## 3 ip ## 4 ip ## 5 krypton 1470 ## 6 mongodb 1648 ## 7 node ## 8 research 940 ## 9 xenon 1613 deathstar seamless cloud computing for R 18 / 27
19 distributed memory regression Borrowing from Thomas Lumley s presentation 1, one can calculate regression coefficients in a distributed memory environment via these steps: Compute X T X and X T y in chunks aggregate chunks reduce by ˆβ = (X T X ) 1 X T y 1 deathstar seamless cloud computing for R 19 / 27
20 distributed memory regression data Airline on-time performance The data: The data consists of flight arrival and departure details for all commercial flights within the USA, from October 1987 to April This is a large dataset: there are nearly 120 million records in total, and takes up 1.6 gigabytes of space compressed and 12 gigabytes when uncompressed. To make sure that you re not overwhelmed by the size of the data, we ve provide two brief introductions to some useful tools: linux command line tools and sqlite, a simple sql database. deathstar seamless cloud computing for R 20 / 27
21 distributed memory regression setup distributed.reg <- function(bucket, nodes, y.transform, x.transform) { calc.xtx.xty <- function(chnk.name, y.transform, x.transform) { require(aws.tools) raw.data <- s3.get(chnk.name) y <- y.transform(raw.data) X <- x.transform(raw.data) bad.mask <- is.na(y) apply(is.na(x), 1, any) y <- y[!bad.mask] X <- X[!bad.mask, ] list(xtx = t(x) %*% X, xty = t(x) %*% y) chunks <- s3.ls(bucket)[, "bucket"] ans <- zmq.cluster.lapply(cluster = nodes, chunks, calc.xtx.xty, y.transform = y.transform, x.transform = x.transform) exe.rpt <- attr(ans, "execution.report") xtx <- Reduce("+", lapply(ans, "[[", "xtx")) xty <- Reduce("+", lapply(ans, "[[", "xty")) ans <- solve(xtx) %*% xty attr(ans, "execution.report") <- exe.rpt ans deathstar seamless cloud computing for R 21 / 27
22 distributed memory regression more setup What are we estimating? We supply get.y and get.x which are applied to the each chunk. get.y returns the log of the departure delay in minutes. get.x returns a matrix of the departure time in hours and the log(distance) of the flight. require(aws.tools) get.y <- function(x) { log(ifelse(!is.na(x[, "DepDelay"]) & X[, "DepDelay"] > 0, X[, "DepDelay"], 1e-06)) get.x <- function(x) { as.matrix(cbind(round(x[, "DepTime"]/100, 0) + X[, "DepTime"]%%100/60, log(ifelse(x[, "Distance"] > 0, X[, "Distance"], 0.1)))) deathstar seamless cloud computing for R 22 / 27
23 distributed memory regression local execution ## research is an 8 core, 64GB server, 64/8 ~ 8GB per worker ## mongo is a 12 core, 32GB server, 32/12 ~ 2.6GB per worker nodes <- c("research", "mongodb", "krypton") coefs.local.runtime <- system.time(coefs.local <- distributed.reg(bucket = "s3://airline.data", nodes, y.transform = get.y, x.transform = get.x)) print(as.vector(coefs.local)) ## [1] print(coefs.local.runtime) ## user system elapsed ## print(node.summary(attr(coefs.local, "execution.report"))) ## node jobs.completed ## 1 krypton 1 ## 2 mongodb 12 ## 3 research 8 deathstar seamless cloud computing for R 23 / 27
24 distributed memory regression cloud execution Make sure you pick an instance type that provides adequate RAM per core. m1.large is a 2 core machine with 7.5 GB of RAM 3.75GB per core (which is more than we need for this chunk size). require(aws.tools) cluster <- startcluster(ami = "ami-1c05a075", key = "kls-ec2", instance.count = 11, instance.type = "m1.large") nodes <- get.nodes(cluster) coefs.ec2.runtime <- system.time(coefs.ec2 <- distributed.reg(bucket = "s3://airline.data", nodes, y.transform = get.y, x.transform = get.x)) ## turn the cluster off res <- terminatecluster(cluster) print(as.vector(coefs.ec2)) ## [1] print(coefs.ec2.runtime) ## user system elapsed ## print(node.summary(attr(coefs.ec2, "execution.report"))) ## node jobs.completed ## 1 ip ## 2 ip ## 3 ip ## 4 ip ## 5 ip ## 6 ip ## 7 ip ## 8 ip ## 9 ip ## 10 ip ## 11 ip deathstar seamless cloud computing for R 24 / 27
25 Final points consuming cloud resources should not require arduous configuration steps (in contrast to MPI, SLURM, Sun Grid Engine, etc.) deathstar allows cloud access with minimal headache maximum simplicity deathstar seamless cloud computing for R 25 / 27
26 Conclusion The next generation of R programmers needs something better than MPI... Please help me continue to build that tool. deathstar seamless cloud computing for R 26 / 27
27 Thanks! Many people contributed ideas and helped debug work in progress as the package was being developed. Kurt Hornik for putting up with my packaging. John Laing for initial debugging. Gyan Sinha for ideas and initial testing. Prof Brian Ripley for just being himself. My wife for enduring many solitary hours while I built these packages. deathstar seamless cloud computing for R 27 / 27
Alinto Mail Server Pro
Alinto Mail Server Pro Installation Guide Alinto Version 2.0.1 Index 1. Introduction....................................................................................... 1 2. Prerequisites......................................................................................
FREE computing using Amazon EC2
FREE computing using Amazon EC2 Seong-Hwan Jun 1 1 Department of Statistics Univ of British Columbia Nov 1st, 2012 / Student seminar Outline Basics of servers Amazon EC2 Setup R on an EC2 instance Stat
CSE 344 Introduction to Data Management. Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei
CSE 344 Introduction to Data Management Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei Homework 8 Big Data analysis on billion triple dataset using Amazon Web Service (AWS) Billion Triple Set: contains
dbx SN Azure Setup Guide
dbx SN Azure Setup Guide Rev 1.0 Oct 2014 XtremeData, Inc. 999 Plaza Dr., Ste. 570 Schaumburg, IL 60173 www.xtremedata.com Overview... 3 Virtual machine setup... 3 Step 1: Launch Virtual machine (node)...
Amazon Elastic Compute Cloud Getting Started Guide. My experience
Amazon Elastic Compute Cloud Getting Started Guide My experience Prepare Cell Phone Credit Card Register & Activate Pricing(Singapore) Region Amazon EC2 running Linux(SUSE Linux Windows Windows with SQL
INSTALLING KAAZING WEBSOCKET GATEWAY - HTML5 EDITION ON AN AMAZON EC2 CLOUD SERVER
INSTALLING KAAZING WEBSOCKET GATEWAY - HTML5 EDITION ON AN AMAZON EC2 CLOUD SERVER A TECHNICAL WHITEPAPER Copyright 2012 Kaazing Corporation. All rights reserved. kaazing.com Executive Overview This document
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
GreenSQL AWS Deployment
GreenSQL AWS Deployment Table of Contents 2 Table of Contents... 3 from Amazon's Marketplace... 3 from Amazon's EC2 Console... 3 Amazon Launch Wizard walkthrough... 4 Post-installation procedures... 7
SUSE Manager in the Public Cloud. SUSE Manager Server in the Public Cloud
SUSE Manager in the Public Cloud SUSE Manager Server in the Public Cloud Contents 1 Instance Requirements... 2 2 Setup... 3 3 Registration of Cloned Systems... 6 SUSE Manager delivers best-in-class Linux
VX 9000E WiNG Express Manager INSTALLATION GUIDE
VX 9000E WiNG Express Manager INSTALLATION GUIDE 2 VX 9000E WiNG Express Manager Service Information If you have a problem with your equipment, contact support for your region. Support and issue resolution
Introduction to analyzing big data using Amazon Web Services
Introduction to analyzing big data using Amazon Web Services This tutorial accompanies the BARC seminar given at Whitehead on January 31, 2013. It contains instructions for: 1. Getting started with Amazon
USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29. Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB
USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29 Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB Table of Contents UNIT 1: Lab description... 3 Pre-requisites:... 3 UNIT 2: Launching an instance on EC2...
Accessing bigger datasets in R using SQLite and dplyr
Accessing bigger datasets in R using SQLite and dplyr Amherst College, Amherst, MA, USA March 24, 2015 [email protected] Thanks to Revolution Analytics for their financial support to the Five College
Citrix XenServer Backups with Xen & Now by SEP
Citrix Backups with Xen & Now by SEP [email protected] www.sepusa.com Table of Contents INTRODUCTION AND OVERVIEW... 3 CITRIX XENDESKTOP ENVIRONMENT... 4 CITRIX DESKTOP DELIVERY CONTROLLER BACKUP... 5 CITRIX
Novell File Reporter 2.5 Who Has What?
Novell File Reporter 2.5 Who Has What? Richard Cabana Senior Systems Engineer File Access & Mgmt Solution Principal Attachmate Novell North America [email protected] Joe Marton Senior Systems Engineer
Citrix XenServer Backups with SEP sesam
Citrix XenServer Backups with SEP sesam Contents Introduction and Overview...2 XenServer Backup Methods...2 Offline Backup... 3 Online Live Memory Backup... 3 Online Quiesced Backup... 4 Online Normal
VOL. 5, NO. 2, August 2015 ISSN 2225-7217 ARPN Journal of Systems and Software 2009-2015 AJSS Journal. All rights reserved
Big Data Analysis of Airline Data Set using Hive Nillohit Bhattacharya, 2 Jongwook Woo Grad Student, 2 Prof., Department of Computer Information Systems, California State University Los Angeles nbhatta2
DVS-100 Installation Guide
DVS-100 Installation Guide DVS-100 can be installed on any system running the Ubuntu 14.04 64 bit Linux operating system, the guide below covers some common installation scenarios. Contents System resource
ELIXIR LOAD BALANCER 2
ELIXIR LOAD BALANCER 2 Overview Elixir Load Balancer for Elixir Repertoire Server 7.2.2 or greater provides software solution for load balancing of Elixir Repertoire Servers. As a pure Java based software
Guide to the LBaaS plugin ver. 1.0.2 for Fuel
Guide to the LBaaS plugin ver. 1.0.2 for Fuel Load Balancing plugin for Fuel LBaaS (Load Balancing as a Service) is currently an advanced service of Neutron that provides load balancing for Neutron multi
1 Basic Configuration of Cisco 2600 Router. Basic Configuration Cisco 2600 Router
1 Basic Configuration of Cisco 2600 Router Basic Configuration Cisco 2600 Router I decided to incorporate the Cisco 2600 into my previously designed network. This would give me two seperate broadcast domains
References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline
References Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
Introduction to Database Systems CSE 444
Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon References Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of
Getting Started with SandStorm NoSQL Benchmark
Getting Started with SandStorm NoSQL Benchmark SandStorm is an enterprise performance testing tool for web, mobile, cloud and big data applications. It provides a framework for benchmarking NoSQL, Hadoop,
Citrix XenServer Backups with Xen & Now by SEP WWW.SEPUSA.COM
Citrix XenServer Backups with Xen & Now by SEP WWW.SEPUSA.COM Contents INTRODUCTION AND OVERVIEW... 3 XEN SERVER BACKUP METHODS... 3 OFFLINE BACKUP... 4 ONLINE LIVE MEMORY BACKUP... 4 ONLINE QUIESCED BACKUP...
INUVIKA OVD INSTALLING INUVIKA OVD ON UBUNTU 14.04 (TRUSTY TAHR)
INUVIKA OVD INSTALLING INUVIKA OVD ON UBUNTU 14.04 (TRUSTY TAHR) Mathieu SCHIRES Version: 0.9.1 Published December 24, 2014 http://www.inuvika.com Contents 1 Prerequisites: Ubuntu 14.04 (Trusty Tahr) 3
Resource Sizing: Spotfire for AWS
Resource Sizing: for AWS With TIBCO for AWS, you can have the best in analytics software available at your fingertips in just a few clicks. On a single Amazon Machine Image (AMI), you get a multi-user
DVS-100 Installation Guide
DVS-100 Installation Guide DVS-100 can be installed on any system running the Ubuntu 14.04 64 bit Linux operating system, the guide below covers some common installation scenarios. Contents System resource
Maintaining Non-Stop Services with Multi Layer Monitoring
Maintaining Non-Stop Services with Multi Layer Monitoring Lahav Savir System Architect and CEO of Emind Systems [email protected] www.emindsys.com The approach Non-stop applications can t leave on their
Every Silver Lining Has a Vault in the Cloud
Irvin Hayes Jr. Autodesk, Inc. PL6015-P Don t worry about acquiring hardware and additional personnel in order to manage your Vault software installation. Learn how to spin up a hosted server instance
Using Amazon EMR and Hunk to explore, analyze and visualize machine data
Using Amazon EMR and Hunk to explore, analyze and visualize machine data Machine data can take many forms and comes from a variety of sources; system logs, application logs, service and system metrics,
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document
ULTEO OPEN VIRTUAL DESKTOP UBUNTU 12.04 (PRECISE PANGOLIN) SUPPORT
ULTEO OPEN VIRTUAL DESKTOP V4.0.2 UBUNTU 12.04 (PRECISE PANGOLIN) SUPPORT Contents 1 Prerequisites: Ubuntu 12.04 (Precise Pangolin) 3 1.1 System Requirements.............................. 3 1.2 sudo.........................................
Server Installation Manual 4.4.1
Server Installation Manual 4.4.1 1. Product Information Product: BackupAgent Server Version: 4.4.1 2. Introduction BackupAgent Server has several features. The application is a web application and offers:
Building a Private Cloud Cloud Infrastructure Using Opensource
Cloud Infrastructure Using Opensource with Ubuntu Server 10.04 Enterprise Cloud (Eucalyptus) OSCON (Note: Special thanks to Jim Beasley, my lead Cloud Ninja, for putting this document together!) Introduction
Your Best Next Business Solution Big Data In R 24/3/2010
Your Best Next Business Solution Big Data In R 24/3/2010 Big Data In R R Works on RAM Causing Scalability issues Maximum length of an object is 2^31-1 Some packages developed to help overcome this problem
Ulteo Open Virtual Desktop Installation
Ulteo Open Virtual Desktop Installation Copyright 2008 Ulteo SAS - CONTENTS CONTENTS Contents 1 Prerequisites 2 1.1 Installation of MySQL....................................... 2 2 Session Manager (sm.ulteo.com)
Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study
Creating Value Delivering Solutions Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study Chris Zajac, NJDOT Bud Luo, Ph.D., Michael Baker Jr., Inc. Overview
Citrix XenDesktop Backups with Xen & Now by SEP
Citrix XenDesktop Backups with Xen & Now by SEP WWW.SEPUSA.COM Table of Contents INTRODUCTIONANDOVERVIEW...3 CITRIXXENDESKTOPENVIRONMENT...4 CITRIXDESKTOPDELIVERYCONTROLLERBACKUP...5 CITRIXLICENSESERVERBACKUP...5
Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013
Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device
Amazon Web Services. Elastic Compute Cloud (EC2) and more...
Amazon Web Services Elastic Compute Cloud (EC2) and more... I don t work for Amazon I do however, have a small research grant from Amazon (in AWS$) Portions of this presentation are reproduced from slides
ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri
ArcGIS for Server in the Amazon Cloud Michele Lundeen Esri What we will cover ArcGIS for Server in the Amazon Cloud Why How Extras Why do you need ArcGIS Server? Some examples Publish - Dynamic Map Services
ST 810, Advanced computing
ST 810, Advanced computing Eric B. Laber & Hua Zhou Department of Statistics, North Carolina State University January 30, 2013 Supercomputers are expensive. Eric B. Laber, 2011, while browsing the internet.
Drobo How-To Guide. Cloud Storage Using Amazon Storage Gateway with Drobo iscsi SAN
The Amazon Web Services (AWS) Storage Gateway uses an on-premises virtual appliance to replicate a portion of your local Drobo iscsi SAN (Drobo B1200i, left below, and Drobo B800i, right below) to cloudbased
Preinstallation Requirements Guide
Preinstallation Requirements Guide Synergy 3.4.9 June 2015 Synergy 2015 TOC 1: Introduction 4 Synergy platform modules 4 Synergy install procedure - your responsibilities 4 Further information about Synergy
Alfresco Enterprise on AWS: Reference Architecture
Alfresco Enterprise on AWS: Reference Architecture October 2013 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 13 Abstract Amazon Web Services (AWS)
Aneka Dynamic Provisioning
MANJRASOFT PTY LTD Aneka Aneka 2.0 Manjrasoft 10/22/2010 This document describes the dynamic provisioning features implemented in Aneka and how it is possible to leverage dynamic resources for scaling
Assignment # 1 (Cloud Computing Security)
Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual
Local Caching Servers (LCS): User Manual
Local Caching Servers (LCS): User Manual Table of Contents Local Caching Servers... 1 Supported Browsers... 1 Getting Help... 1 System Requirements... 2 Macintosh... 2 Windows... 2 Linux... 2 Downloading
Getting Started with HC SharePoint Module
Getting Started with HC SharePoint Module HOSTING CONTROLLER WWW.HOSTINGCONROLLER.COM HOSTING CONTROLLER Contents Introduction... 1 Minimum System Requirements for SharePoint 2013... 1 Hardware Requirements
FortiGate Amazon Machine Image (AMI) Selection Guide for Amazon EC2
FortiGate Amazon Machine Image (AMI) Selection Guide for Amazon EC2 New Place, Same Feel Secure Your AWS Cloud with Fortinet Fortinet s Amazon Machine Image (AMI) and subscription based portfolio offer
Server Installation Procedure - Load Balanced Environment
Server Installation Procedure - Load Balanced Environment 1. Introduction This document explains a step-by-step procedure in case BackupAgent BV performs an installation of the BackupAgent Server Software
Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk
Benchmarking Couchbase Server for Interactive Applications By Alexey Diomin and Kirill Grigorchuk Contents 1. Introduction... 3 2. A brief overview of Cassandra, MongoDB, and Couchbase... 3 3. Key criteria
How to Run Spark Application
How to Run Spark Application Junghoon Kang Contents 1 Intro 2 2 How to Install Spark on a Local Machine? 2 2.1 On Ubuntu 14.04.................................... 2 3 How to Run Spark Application on a
Aqua Connect Load Balancer User Manual (Mac)
Aqua Connect Load Balancer User Manual (Mac) Table of Contents About Aqua Connect Load Balancer... 3 System Requirements... 4 Hardware... 4 Software... 4 Installing the Load Balancer... 5 Configuration...
Pertino HA Cluster Deployment: Enabling a Multi- Tier Web Application Using Amazon EC2 and Google CE. A Pertino Deployment Guide
Pertino HA Cluster Deployment: Enabling a Multi- Tier Web Application Using Amazon EC2 and Google CE A Pertino Deployment Guide 1 Table of Contents Abstract... 2 Introduction... 3 Before you get Started...
Running Knn Spark on EC2 Documentation
Pseudo code Running Knn Spark on EC2 Documentation Preparing to use Amazon AWS First, open a Spark launcher instance. Open a m3.medium account with all default settings. Step 1: Login to the AWS console.
KeyControl Installation on Amazon Web Services
KeyControl Installation on Amazon Web Services Contents Introduction Deploying an initial KeyControl Server Deploying an Elastic Load Balancer (ELB) Adding a KeyControl node to a cluster in the same availability
Exchange Migration Guide
IceWarp Unified Communications Exchange Migration Guide Version 11.3 Exchange Migration Guide 2 Exchange Migration Guide This document will guide you through a process of migration from Microsoft Exchange
CycleServer Grid Engine Support Install Guide. version 1.25
CycleServer Grid Engine Support Install Guide version 1.25 Contents CycleServer Grid Engine Guide 1 Administration 1 Requirements 1 Installation 1 Monitoring Additional OGS/SGE/etc Clusters 3 Monitoring
TECHNOLOGY WHITE PAPER Jun 2012
TECHNOLOGY WHITE PAPER Jun 2012 Technology Stack C# Windows Server 2008 PHP Amazon Web Services (AWS) Route 53 Elastic Load Balancing (ELB) Elastic Compute Cloud (EC2) Amazon RDS Amazon S3 Elasticache
Consolidated Monitoring, Analysis and Automated Remediation For Hybrid IT Infrastructures. Goliath Performance Monitor Installation Guide v11.
Consolidated Monitoring, Analysis and Automated Remediation For Hybrid IT Infrastructures Goliath Performance Monitor Installation Guide v11.5 (v11.5) Document Date: March 2015 www.goliathtechnologies.com
PostgreSQL Performance Characteristics on Joyent and Amazon EC2
OVERVIEW In today's big data world, high performance databases are not only required but are a major part of any critical business function. With the advent of mobile devices, users are consuming data
SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX
White Paper SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX Abstract This white paper explains the benefits to the extended enterprise of the on-
19.10.11. Amazon Elastic Beanstalk
19.10.11 Amazon Elastic Beanstalk A Short History of AWS Amazon started as an ECommerce startup Original architecture was restructured to be more scalable and easier to maintain Competitive pressure for
Table of Contents Introduction and System Requirements 9 Installing VMware Server 35
Table of Contents Introduction and System Requirements 9 VMware Server: Product Overview 10 Features in VMware Server 11 Support for 64-bit Guest Operating Systems 11 Two-Way Virtual SMP (Experimental
Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data
Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data 1 Introduction SAP HANA is the leading OLTP and OLAP platform delivering instant access and critical business insight
Nexio Connectus with Nexio G-Scribe
Nexio Connectus with Nexio G-Scribe 2.1.2 3/20/2014 Edition: A 2.1.2 Publication Information 2014 Imagine Communications. Proprietary and Confidential. Imagine Communications considers this document and
Automation Engine 14. Troubleshooting
4 Troubleshooting 2-205 Contents. Troubleshooting the Server... 3. Checking the Databases... 3.2 Checking the Containers...4.3 Checking Disks...4.4.5.6.7 Checking the Network...5 Checking System Health...
How To Install Linux Titan
Linux Titan Distribution Presented By: Adham Helal Amgad Madkour Ayman El Sayed Emad Zakaria What Is a Linux Distribution? What is a Linux Distribution? The distribution contains groups of packages and
How To - Implement Clientless Single Sign On Authentication with Active Directory
How To Implement Clientless Single Sign On in Single Active Directory Domain Controller Environment How To - Implement Clientless Single Sign On Authentication with Active Directory Applicable Version:
FleSSR Project: Installing Eucalyptus Open Source Cloud Solution at Oxford e- Research Centre
FleSSR Project: Installing Eucalyptus Open Source Cloud Solution at Oxford e- Research Centre Matteo Turilli, David Wallom Eucalyptus is available in two versions: open source and enterprise. Within this
Using ZeBeDee with Firebird to Encrypt and Compress Network Traffic
Using ZeBeDee with Firebird to Encrypt and Compress Network Traffic Table of Contents 1. Introduction 2. What is ZeBeDee? 3. ZeBeDee Configuration to Work with Firebird Configuration File ZeBeDee Server
Contents Set up Cassandra Cluster using Datastax Community Edition on Amazon EC2 Installing OpsCenter on Amazon AMI References Contact
Contents Set up Cassandra Cluster using Datastax Community Edition on Amazon EC2... 2 Launce Amazon micro-instances... 2 Install JDK 7... 7 Install Cassandra... 8 Configure cassandra.yaml file... 8 Start
Installation documentation for Ulteo Open Virtual Desktop
Installation documentation for Ulteo Open Virtual Desktop Copyright 2008 Ulteo SAS - 1 PREREQUISITES CONTENTS Contents 1 Prerequisites 1 1.1 Installation of MySQL.......................................
Cloud3DView: Gamifying Data Center Management
Cloud3DView: Gamifying Data Center Management Yonggang Wen Assistant Professor School of Computer Engineering Nanyang Technological University [email protected] November 26, 2013 School of Computer Engineering
Immersion Day. Creating an Elastic Load Balancer. Rev 2015-01
Rev 2015-01 Table of Contents Overview...3 Launch a Second Web Server...4 Create an ELB...6 Copyright 2015, Amazon Web Services, All Rights Reserved Page 2 Overview This lab will walk the user through
Gladinet Cloud Enterprise
GLADINET, INC Gladinet Cloud Enterprise Multi-Site Deployment Guide Gladinet, Inc. 9/27/2013 This document discusses the technologies behind Gladinet Cloud Enterprise Copyright 2013 Gladinet, Inc. Table
Hadoop Setup. 1 Cluster
In order to use HadoopUnit (described in Sect. 3.3.3), a Hadoop cluster needs to be setup. This cluster can be setup manually with physical machines in a local environment, or in the cloud. Creating a
OpenAdmin Tool for Informix (OAT) October 2012
OpenAdmin Tool for Informix (OAT) October 2012 What is OpenAdmin Tool for Informix? OAT is a web-based administration tool for the IBM Informix database server A single OAT installation can administer
PATROL Console Server and RTserver Getting Started
PATROL Console Server and RTserver Getting Started Supporting PATROL Console Server 7.5.00 RTserver 6.6.00 February 14, 2005 Contacting BMC Software You can access the BMC Software website at http://www.bmc.com.
Syncplicity On-Premise Storage Connector
Syncplicity On-Premise Storage Connector Implementation Guide Abstract This document explains how to install and configure the Syncplicity On-Premise Storage Connector. In addition, it also describes how
Manjrasoft Market Oriented Cloud Computing Platform
Manjrasoft Market Oriented Cloud Computing Platform Aneka Aneka is a market oriented Cloud development and management platform with rapid application development and workload distribution capabilities.
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...
Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using
Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using Amazon Web Services rather than setting up a physical server
TECHNOLOGY WHITE PAPER Jan 2016
TECHNOLOGY WHITE PAPER Jan 2016 Technology Stack C# PHP Amazon Web Services (AWS) Route 53 Elastic Load Balancing (ELB) Elastic Compute Cloud (EC2) Amazon RDS Amazon S3 Elasticache CloudWatch Paypal Overview
Quick Setup Guide. 2 System requirements and licensing. 2011 Kerio Technologies s.r.o. All rights reserved.
Kerio Control VMware Virtual Appliance Quick Setup Guide 2011 Kerio Technologies s.r.o. All rights reserved. This document provides detailed description on installation and basic configuration of the Kerio
How AWS Pricing Works May 2015
How AWS Pricing Works May 2015 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 15 Table of Contents Table of Contents... 2 Abstract... 3 Introduction...
Cloudera Manager Training: Hands-On Exercises
201408 Cloudera Manager Training: Hands-On Exercises General Notes... 2 In- Class Preparation: Accessing Your Cluster... 3 Self- Study Preparation: Creating Your Cluster... 4 Hands- On Exercise: Working
Best Practices for Amazon EMR
Best Practices for Amazon EMR Parviz Deyhim August 2013 (Please consult for the latest version of this paper) Page 1 of 38 Table of Contents Abstract... 3 Introduction...
Why do I need a pen test lab? Requirements. Virtual Machine Downloads
Why do I need a pen test lab? Hacking and or scanning machines without consent is against the law in most countries To become an effective penetration tester or ethical hacker you need to practice to enhance
Volume SYSLOG JUNCTION. User s Guide. User s Guide
Volume 1 SYSLOG JUNCTION User s Guide User s Guide SYSLOG JUNCTION USER S GUIDE Introduction I n simple terms, Syslog junction is a log viewer with graphing capabilities. It can receive syslog messages
CDH installation & Application Test Report
CDH installation & Application Test Report He Shouchun (SCUID: 00001008350, Email: [email protected]) Chapter 1. Prepare the virtual machine... 2 1.1 Download virtual machine software... 2 1.2 Plan the guest
Cloud Bursting with SLURM and Bright Cluster Manager. Martijn de Vries CTO
Cloud Bursting with SLURM and Bright Cluster Manager Martijn de Vries CTO Architecture CMDaemon 2 Management Interfaces Graphical User Interface (GUI) Offers administrator full cluster control Standalone
Amazon EC2 Product Details Page 1 of 5
Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of
