Setting Up the ARC Activity-Based Model in the Cloud: Lessons to Date. Ben Stabler, PB. Guy Rousseau, ARC. Matthew Martimo, Citilabs

Size: px
Start display at page:

Download "Setting Up the ARC Activity-Based Model in the Cloud: Lessons to Date. Ben Stabler, PB. Guy Rousseau, ARC. Matthew Martimo, Citilabs"

Transcription

1 Setting Up the ARC Activity-Based Model in the Cloud: Lessons to Date Ben Stabler, PB Guy Rousseau, ARC Matthew Martimo, Citilabs November 16, 2011 Topic Area: Innovations in Computation Introduction The purpose of this paper is to describe our efforts to migrate the Atlanta Regional Commission (ARC) Activity-Based Model (ABM) modeling system to the cloud, and to share lessons learned. While the existing ABM runs well on ARC s modeling cluster, there are times when the cluster is busy and a run of the ABM is needed. Running the model in the cloud represents a potential solution to this problem. This paper begins with a brief overview of cloud computing, followed by a short review of the ARC ABM. The paper then focuses on options for cloud-based modeling, setting up the ARC cloudbased ABM, problems encountered and solved, and resulting run times as compared with the existing ARC modeling cluster. This paper concludes with a discussion of our in-progress efforts to fine tune the ARC cloud-based ABM so it can be run efficiently for multiple runs by multiple users at the same time. Cloud Computing Cloud computing is the on-demand use of remote computer resources. Examples include computing solutions such as Gmail, Survey Monkey, and Google Docs, where the computing solution is run entirely by remote machines. Other examples of cloud computing are the Amazon Elastic Compute Cloud (EC2) 1, a cloud computing service for software developers, and Citilabs Cube Cloud Services, which uses Amazon EC2. The key features of cloud computing are scalability, a fee structure that usually is ondemand or subscription-based, and support for multiple user/instances. Scalability is the ability for the implementation to scale up or down by adding or subtracting more computing resources such as more processors in order to solve different size problems or to solve problems faster. For the ARC ABM, the model scales well by the number of households, since most of the sub-models are solved independently by household. Since the cloud is remote, and is often not owned by the user, the user rents the resources on-demand or via a subscription-based fee structure and therefore only pays for what they use. For the ARC ABM, this means paying for computing time on multiple computing cores and also for data transfer. A cloud solution is often setup to support 1

2 multiple users/instances, so multiple users can run multiple model runs by interacting with a client user interface that coordinates with a cluster of computers (i.e. a cloud). For the ARC ABM, this means being able to run multiple simultaneous runs as opposed to one run at a time on the existing ARC modeling cluster. ARC ABM Review The ARC ABM is based on the CT-RAMP (Coordinated Travel-Regional Activity-Based Modeling Platform) family of ABMs developed, or being developed, in Columbus, Atlanta, the San Francisco Bay Area, San Diego, Phoenix, Chicago, Miami, and other regions. Various components of the modeling system have been presented at TRB, TRB Planning Applications, and the previous ITM conferences 234. The model includes explicit intra-household interactions, a continuous temporal dimension (in hourly time periods), and integration of location, time-of-day, and mode choice models. The ARC region is split into over 2000 zones (with over 6000 transit subzones), has two time periods (AM peak and off peak) for network level-of-service matrices, and has a population size of about 1.7 million households in 2005 and 2.7 million in As presented at the previous ITM conference in , the ARC ABM is implemented in Java and Cube. Java is responsible for the internal demand models such as workplace location, tour generation, tour mode choice, stop location choice, and trip mode choice. Cube is used for networks, assignments and skimming, overall model running, and ancillary models such as the external model and truck model. The model uses the Java Parallel Processing Framework 6 and Cube Cluster 7 to thread and distribute work across multiple machines. The base year model runs 3 feedback loops in about 16 hours on the ARC modeling cluster using three Windows 64bit machines with 8 processors and 32 GB of RAM each. This exact same model was transferred to the cloud for testing. Setting Up the ARC Cloud Based ABM There are two basic approaches to a cloud-based modeling system. The first is a more open system that allows the user to configure and use custom remote machines. An example is renting a few instances (i.e. remote machines) from Amazon EC2 and then configuring them as needed. The Amazon EC2 system is designed for software developers so it is not very easy to use, but it is very flexible. The second approach is a more closed system that is configured ahead of time and includes a client user interface that sits between the user and the remote machines in order to handle setting up the instances, adding instances, starting the model runs, ensuring the runs complete, and getting results. An example is Cube Cloud Services, which wraps a web-based user interface around Amazon EC2 and allows the user to upload files, run models, and get results. The advantage of this approach is ease-ofuse and true scalability, while the disadvantage is a lack of flexibility since the user interface dictates what is possible. We tested both approaches. 2

3 The first step to setting up the cloud is to select and configure the machine instances. There are a number of instance options available from Amazon, including a machine that is similar to those in ARC s modeling cluster: High-Memory Quadruple Extra Large Instances (68GB RAM, 8 virtual cores, 1690GB HD, Windows 64bit) The user also needs to select either on-demand instance pricing or reserved instance pricing. On-demand pricing is simply pay by the hour pricing. Reserve pricing consists of a one-time fee for one year or three years and then a lower per hour use fee than ondemand pricing. For the ARC ABM base model run, the computing cost is approximately $120, which is 16 hours x $2.48/hr x 3 instances. If the $24,000 three year reserve pricing fee is paid, then the same model run costs $46. The model would need to be run about 324 times over the three year period to justify the reserve pricing option. The next step was to configure the instances and upload the model files. The cloud machines were configured through Amazon s AWS Management Console 8 and remote desktop to install software such as Java, which is required by the ARC ABM. Citilabs installed a keyless version of Cube on the machine instances for testing purposes. Finally, the ARC ABM specific software and input files were copied to the machines via remote desktop and the cluster configured for running the model. After setting up the cloud-based cluster, a small sample of households was run through the model to ensure the model setup worked. Not surprisingly, the first run failed. Review of the model run revealed that a low-level C DLL used by the model for reading Cube matrices had a small bug related to referencing objects in memory after the objects had been released. This issue only came up in the virtualized cloud-based computing environment since memory is reclaimed much more efficiently than in actual computing environments such as traditional desktop and server configurations. The code was fixed, the DLL re-compiled, and the model successfully run. Approach One Results Two model runs were completed with the first approach. The first is a three machine run with six feedback loops (due to a tighter overall model convergence criteria than previously used) and the second is a six machine run with six feedback loops. As shown in the table below, the three instance run completed in a comparable amount of time to the ARC modeling cluster (17 hours for three iterations as compared to about 16 hours for the ARC run). The increase in computing instances for the second run resulted in about a 35 percent reduction in run times. 3

4 Table 1 Cloud ABM Run Times Iteration Run Times (hh:mm) Reduction 3 machine 6 machine 1 3:11 3:03 4% 2 6:11 2:50 54% 3 7:38 4:07 46% 4 7:16 4:37 36% 5 7:32 5:04 33% 6 11:37 7:30 35% Total Java Run Time 43:25 27:11 37% Total Model Run Time 48:10 31:25 35% There are a few details to note in the model run time comparisons. The Java run time is the run time for the CT-RAMP ABM model components, while the model run time minus the Java run time is the run time for everything else. The second item to note is the first iteration six machine run did not scale well, which was not expected, and is currently being investigated. Overall, these results show that the cloud-based ABM produces reasonable run times and has reasonable scalability. After completing the model run, the outputs were zipped up and posted to ftp for download. Amazon charges $0.10 per GB for data transferred into the cloud and about $ $0.15 per GB for data transferred out. The 15 GB of outputs cost about $2.25 to transfer out of the cloud. Approach Two Improved Scalability The cloud-based ABM tested above used Amazon EC2 machines via remote desktop, which is very similar to how the model is run at ARC by agency staff and consultants. As a result, the cloud cluster can only be used for one run at a time, and has no cluster management user interface, which is useful for letting users know that the cluster is busy for example. The approach two configuration solves these problems and allows ARC s member agencies to run the model (and pay for it) as well. It is not yet known though what the pricing model will be. Setting up the ARC ABM with Cube Cloud Services required some improvements to the CT-RAMP to make the configuration of the modeling cluster for distributed computing more generic and scalable. This included scripting to make the starting and stopping of 4

5 remote instance machines that are used by CT-RAMP to solve sub-models by household. Instead of being hard-wired to three or four machines, the setup was made more flexible so the user is able to specify at run time how many instances (machines) to use. Citilabs product handled starting machine instances, starting the model run, copying output files, and shutting down instances. Approach Two Results A series of model runs were completed with approach two. As shown in the table and illustration below, the increase in computing instances resulted in significant non-linear reductions in the CT-RAMP run times. Doubling the number of cores from 32 to 64 reduced the run time by 37 percent. Doubling it again reduced the run time relative to the 32 core run by 55 percent. The 256 and 512 core runs show little improvement beyond the 128 core run. Based on this, it appears the computing power sweet spot is somewhere around 128 cores. Table 2 Cloud ABM Run Times Approach Two Iteration HH Sample Rate Cores (Machines) & Run Times (4) (8) (16) (32) (64) 1 25% 2:03 1:29 1:06 1:02 1: % 2:51 2:02 1:27 1:18 1: % 4:06 2:42 1:59 1:41 1: % 5:35 3:27 2:20 2:12 2: % 6:05 3:16 2:22 2:09 2:10 Total Java Run Time 20:40 12:56 9:14 8:22 8:47 Run Time Reduction - 37% 55% 60% 58% 5

6 Figure 1 ARC ABM Cloud Run Times There are a couple of interesting findings from this test. The first is that there are dimensioning returns on additional computing power. One of the likely bottlenecks in the process is the household data manager, which stores all the households in memory and handles a significant amount of I/O related to passing data to/from the worker machines. With so many worker machines, the household manager becomes a bottleneck. Some solutions to this problem are to increase the household job size in order to reduce communication, move the household data manager to a dedicated machine, and to add additional household data managers, each responsible for a subset of households. A second finding is that larger core configurations can better handle additional work, since the 128 core setup runs relatively faster for larger sample rates than the 32 core setup. 6

7 Conclusions and Next Steps The purpose of this effort was to transfer the ARC ABM into the cloud to better understand run times and scalability. Overall, the effort was quite successful, as the configuration and running of the model went largely as expected. The run times were comparable to the ARC cluster runs and the model scaled well with the addition of computing cores, although dimensioning returns were observed. Some work was required to make the configuration of the cluster more generic and some additional work is planned to fine-tune the cloud-based model setup. This in-progress effort has also assisted ARC in determining how to best provide future access to its ABM for planning partners and stakeholders. Currently, it appears unrealistic to expect all potential users (municipalities, counties, GDOT, consultants) of the model to invest in the computing power required to run the ABM in a short amount of time. Thus, cloud computing, depending on the cost, offers the alternative of accessing the ARC ABM on an as-needed and pay-as-you go basis, to any eventual ABM user. Given today s needs for quick answers to complex policy questions from decision-makers, the cloud-based ABM allows for virtually unlimited model runs within reasonable timelines, and relieves the burden placed upon ARC s server infrastructure, in addition to providing a viable option for model runs when servers are down or busy. 7

8 References Parsons Brinckerhoff. (2009). Activity Based Travel Model Specifications: Coordinated Travel Regional Activity Based Modeling Platform (CT RAMP) for the Atlanta Region and the San Francisco Bay Area. 3 Vovsha, P., Freedman, J., Gupta, S., Sun, W., Livshits, V., (2010). Workplace Choice Model: Insights into Spatial Patterns of Commuting in Three Metropolitan Regions. Innovations in Travel Modeling, Tempe, Arizona. 4 Vovsha, P., Freedman, J., Sun, W., Livshits, V., (2011). Activity Based Models in Practice: CT RAMP Experience. Transportation Research Board, Washington, DC. 5 Stabler, B., Hicks, J., Rousseau, G., Nicholson, J, Simons, C., Freedman, J., Purvis, C., Ory, D. (2010). Computation Challenges of Implementing the Atlanta Regional Commission Activity Based Modeling System. Innovations in Travel Modeling, Tempe, Arizona. 6 Java Parallel Processing Framework, cluster 8 8

Cloud Computing and Amazon Web Services

Cloud Computing and Amazon Web Services Cloud Computing and Amazon Web Services Gary A. McGilvary edinburgh data.intensive research 1 OUTLINE 1. An Overview of Cloud Computing 2. Amazon Web Services 3. Amazon EC2 Tutorial 4. Conclusions 2 CLOUD

More information

Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using

Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using Amazon Web Services rather than setting up a physical server

More information

CloudFTP: A free Storage Cloud

CloudFTP: A free Storage Cloud CloudFTP: A free Storage Cloud ABSTRACT: The cloud computing is growing rapidly for it offers on-demand computing power and capacity. The power of cloud enables dynamic scalability of applications facing

More information

An Esri White Paper January 2011 Estimating the Cost of a GIS in the Amazon Cloud

An Esri White Paper January 2011 Estimating the Cost of a GIS in the Amazon Cloud An Esri White Paper January 2011 Estimating the Cost of a GIS in the Amazon Cloud Esri, 380 New York St., Redlands, CA 92373-8100 USA TEL 909-793-2853 FAX 909-793-5953 E-MAIL [email protected] WEB esri.com

More information

An Introduction to Cloud Computing Concepts

An Introduction to Cloud Computing Concepts Software Engineering Competence Center TUTORIAL An Introduction to Cloud Computing Concepts Practical Steps for Using Amazon EC2 IaaS Technology Ahmed Mohamed Gamaleldin Senior R&D Engineer-SECC [email protected]

More information

Cloud Computing. Adam Barker

Cloud Computing. Adam Barker Cloud Computing Adam Barker 1 Overview Introduction to Cloud computing Enabling technologies Different types of cloud: IaaS, PaaS and SaaS Cloud terminology Interacting with a cloud: management consoles

More information

Cloud Computing For Bioinformatics

Cloud Computing For Bioinformatics Cloud Computing For Bioinformatics Cloud Computing: what is it? Cloud Computing is a distributed infrastructure where resources, software, and data are provided in an on-demand fashion. Cloud Computing

More information

Data Centers and Cloud Computing

Data Centers and Cloud Computing Data Centers and Cloud Computing CS377 Guest Lecture Tian Guo 1 Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Case Study: Amazon EC2 2 Data Centers

More information

Technical Aspects to GIS in the Cloud

Technical Aspects to GIS in the Cloud Technical Aspects to GIS in the Cloud Nathan Watermeier State GIS Coordinator State of Nebraska OCIO Michael Schonlau GIS Manager Douglas County 2013 NSGIC Annual Meeting October 30, 2013 NebraskaMAP.gov

More information

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/

More information

Estimating the Cost of a GIS in the Amazon Cloud. An Esri White Paper August 2012

Estimating the Cost of a GIS in the Amazon Cloud. An Esri White Paper August 2012 Estimating the Cost of a GIS in the Amazon Cloud An Esri White Paper August 2012 Copyright 2012 Esri All rights reserved. Printed in the United States of America. The information contained in this document

More information

The world s most popular transportation modeling suite

The world s most popular transportation modeling suite technical brochure of cube The world s most popular transportation modeling suite Cube is the most widely used and most complete transportation analysis system in the world. With Cube 5, Citilabs integrates

More information

Migration Scenario: Migrating Batch Processes to the AWS Cloud

Migration Scenario: Migrating Batch Processes to the AWS Cloud Migration Scenario: Migrating Batch Processes to the AWS Cloud Produce Ingest Process Store Manage Distribute Asset Creation Data Ingestor Metadata Ingestor (Manual) Transcoder Encoder Asset Store Catalog

More information

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project Description of Application The Spatial Data Warehouse project at the USGS/EROS distributes services and data in support of The National

More information

Amazon Hosted ESRI GeoPortal Server. GeoCloud Project Report

Amazon Hosted ESRI GeoPortal Server. GeoCloud Project Report Amazon Hosted ESRI GeoPortal Server GeoCloud Project Report Description of Application Operating Organization The USDA participated in the FY 2011 Federal Geographic Data Committee (FGDC) GeoCloud Sandbox

More information

GeoCloud Project Report GEOSS Clearinghouse

GeoCloud Project Report GEOSS Clearinghouse GeoCloud Project Report GEOSS Clearinghouse Qunying Huang, Doug Nebert, Chaowei Yang, Kai Liu 2011.12.06 Description of Application GEOSS clearinghouse is a FGDC, GEO, and NASA project that connects directly

More information

Tool - 1: Health Center

Tool - 1: Health Center Tool - 1: Health Center Joseph Amrith Raj http://facebook.com/webspherelibrary 2 Tool - 1: Health Center Table of Contents WebSphere Application Server Troubleshooting... Error! Bookmark not defined. About

More information

Introduction to Engineering Using Robotics Experiments Lecture 18 Cloud Computing

Introduction to Engineering Using Robotics Experiments Lecture 18 Cloud Computing Introduction to Engineering Using Robotics Experiments Lecture 18 Cloud Computing Yinong Chen 2 Big Data Big Data Technologies Cloud Computing Service and Web-Based Computing Applications Industry Control

More information

Amazon EC2 XenApp Scalability Analysis

Amazon EC2 XenApp Scalability Analysis WHITE PAPER Citrix XenApp Amazon EC2 XenApp Scalability Analysis www.citrix.com Table of Contents Introduction...3 Results Summary...3 Detailed Results...4 Methods of Determining Results...4 Amazon EC2

More information

ST 810, Advanced computing

ST 810, Advanced computing ST 810, Advanced computing Eric B. Laber & Hua Zhou Department of Statistics, North Carolina State University January 30, 2013 Supercomputers are expensive. Eric B. Laber, 2011, while browsing the internet.

More information

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity P3 InfoTech Solutions Pvt. Ltd http://www.p3infotech.in July 2013 Created by P3 InfoTech Solutions Pvt. Ltd., http://p3infotech.in 1 Web Application Deployment in the Cloud Using Amazon Web Services From

More information

WILLAMALANE PARK AND RECREATION DISTRICT. Springfield, Oregon GIS ASSET MANAGEMENT IN A PARK AND RECREATION DISTRICT ESRI USER CONFERENCE JULY 2014

WILLAMALANE PARK AND RECREATION DISTRICT. Springfield, Oregon GIS ASSET MANAGEMENT IN A PARK AND RECREATION DISTRICT ESRI USER CONFERENCE JULY 2014 WILLAMALANE PARK AND RECREATION DISTRICT Springfield, Oregon GIS ASSET MANAGEMENT IN A PARK AND RECREATION DISTRICT ESRI USER CONFERENCE JULY 2014 SESSION GOALS Share our process of implementing and developing

More information

Chapter 19 Cloud Computing for Multimedia Services

Chapter 19 Cloud Computing for Multimedia Services Chapter 19 Cloud Computing for Multimedia Services 19.1 Cloud Computing Overview 19.2 Multimedia Cloud Computing 19.3 Cloud-Assisted Media Sharing 19.4 Computation Offloading for Multimedia Services 19.5

More information

jbase 5 Install on Amazon AWS a Primer

jbase 5 Install on Amazon AWS a Primer jbase 5 Install on Amazon AWS a Primer Revision 1.0 August 2012 1 jbase 5 Install on Amazon AWS This document contains proprietary information that is protected by copyright. No part of this document may

More information

Visualisation in the Google Cloud

Visualisation in the Google Cloud Visualisation in the Google Cloud by Kieran Barker, 1 School of Computing, Faculty of Engineering ABSTRACT Providing software as a service is an emerging trend in the computing world. This paper explores

More information

Amazon Cloud Storage Options

Amazon Cloud Storage Options Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object

More information

SETUP AND MANAGEMENT OF MODELS IN CUBE CLOUD. Colby M. Brown, AICP PTP

SETUP AND MANAGEMENT OF MODELS IN CUBE CLOUD. Colby M. Brown, AICP PTP SETUP AND MANAGEMENT OF MODELS IN CUBE CLOUD Colby M. Brown, AICP PTP 1 CUBE CLOUD INTRODUCING CUBE CLOUD Develop the Model with Cube in the Desktop Environment Publish the Model from Cube to the Cube

More information

CLOUD PERFORMANCE TESTING - KEY CONSIDERATIONS (COMPLETE ANALYSIS USING RETAIL APPLICATION TEST DATA)

CLOUD PERFORMANCE TESTING - KEY CONSIDERATIONS (COMPLETE ANALYSIS USING RETAIL APPLICATION TEST DATA) CLOUD PERFORMANCE TESTING - KEY CONSIDERATIONS (COMPLETE ANALYSIS USING RETAIL APPLICATION TEST DATA) Abhijeet Padwal Performance engineering group Persistent Systems, Pune email: [email protected]

More information

Hosting Blackbaud Software in the Cloud

Hosting Blackbaud Software in the Cloud Software and services are moving off corporate and organizational networks and into the cloud. The trends are clear and the pace is accelerating. But what exactly is the cloud? Can client-server software

More information

How cloud computing can transform your business landscape

How cloud computing can transform your business landscape How cloud computing can transform your business landscape Introduction It seems like everyone is talking about the cloud. Cloud computing and cloud services are the new buzz words for what s really a not

More information

FREE computing using Amazon EC2

FREE computing using Amazon EC2 FREE computing using Amazon EC2 Seong-Hwan Jun 1 1 Department of Statistics Univ of British Columbia Nov 1st, 2012 / Student seminar Outline Basics of servers Amazon EC2 Setup R on an EC2 instance Stat

More information

Installing and Configuring Windows Server 2008. Module Overview 14/05/2013. Lesson 1: Planning Windows Server 2008 Installation.

Installing and Configuring Windows Server 2008. Module Overview 14/05/2013. Lesson 1: Planning Windows Server 2008 Installation. Installing and Configuring Windows Server 2008 Tom Brett Module Overview Planning Windows Server 2008 Installations Performing a Windows Server 2008 Installation Configuring Windows Server 2008 Following

More information

WHITE PAPER SETTING UP AND USING ESTATE MASTER ON THE CLOUD INTRODUCTION

WHITE PAPER SETTING UP AND USING ESTATE MASTER ON THE CLOUD INTRODUCTION WHITE PAPER SETTING UP AND USING ESTATE MASTER ON THE CLOUD INTRODUCTION Cloud Computing can provide great flexibility for the Estate Master user. You can access your feasibilities, manage you projects

More information

Build Your Own Performance Test Lab in the Cloud. Leslie Segal Testware Associate, Inc. [email protected]

Build Your Own Performance Test Lab in the Cloud. Leslie Segal Testware Associate, Inc. leslie.segal@testwareinc.com Build Your Own Performance Test Lab in the Cloud Leslie Segal Testware Associate, Inc. [email protected] Problem(s) I was trying to Solve Need more VUs Don t have Tool Times up reschedule test

More information

Using ArcGIS for Server in the Amazon Cloud

Using ArcGIS for Server in the Amazon Cloud Federal GIS Conference February 9 10, 2015 Washington, DC Using ArcGIS for Server in the Amazon Cloud Bonnie Stayer, Esri Amy Ramsdell, Blue Raster Session Outline AWS Overview ArcGIS in AWS Cloud Builder

More information

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344 Where We Are Introduction to Data Management CSE 344 Lecture 25: DBMS-as-a-service and NoSQL We learned quite a bit about data management see course calendar Three topics left: DBMS-as-a-service and NoSQL

More information

Building your Big Data Architecture on Amazon Web Services

Building your Big Data Architecture on Amazon Web Services Building your Big Data Architecture on Amazon Web Services Abhishek Sinha @abysinha [email protected] AWS Services Deployment & Administration Application Services Compute Storage Database Networking

More information

Using SUSE Studio to Build and Deploy Applications on Amazon EC2. Guide. Solution Guide Cloud Computing. www.suse.com

Using SUSE Studio to Build and Deploy Applications on Amazon EC2. Guide. Solution Guide Cloud Computing. www.suse.com Using SUSE Studio to Build and Deploy Applications on Amazon EC2 Guide Solution Guide Cloud Computing Cloud Computing Solution Guide Using SUSE Studio to Build and Deploy Applications on Amazon EC2 Quickly

More information

LabStats 5 System Requirements

LabStats 5 System Requirements LabStats Tel: 877-299-6241 255 B St, Suite 201 Fax: 208-473-2989 Idaho Falls, ID 83402 LabStats 5 System Requirements Server Component Virtual Servers: There is a limit to the resources available to virtual

More information

The Easiest Way to Run Spark Jobs. How-To Guide

The Easiest Way to Run Spark Jobs. How-To Guide The Easiest Way to Run Spark Jobs How-To Guide The Easiest Way to Run Spark Jobs Recently, Databricks added a new feature, Jobs, to our cloud service. You can find a detailed overview of this feature in

More information

How To Test A Web Server

How To Test A Web Server Performance and Load Testing Part 1 Performance & Load Testing Basics Performance & Load Testing Basics Introduction to Performance Testing Difference between Performance, Load and Stress Testing Why Performance

More information

Getting Familiar with Cloud Terminology. Cloud Dictionary

Getting Familiar with Cloud Terminology. Cloud Dictionary Getting Familiar with Cloud Terminology Cloud computing is a hot topic in today s IT industry. However, the technology brings with it new terminology that can be confusing. Although you don t have to know

More information

Cloud Computing for Education Workshop

Cloud Computing for Education Workshop Cloud Computing for Education Workshop 2012 Copyright REZA CURTMOLA, NJIT What Have We Learned So Far? We have learned several cloud applications (Dropbox, Google Docs, Piazza, etc.) Dropbox SugarSync

More information

CSE 344 Introduction to Data Management. Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei

CSE 344 Introduction to Data Management. Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei CSE 344 Introduction to Data Management Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei Homework 8 Big Data analysis on billion triple dataset using Amazon Web Service (AWS) Billion Triple Set: contains

More information

How to Ingest Data into Google BigQuery using Talend for Big Data. A Technical Solution Paper from Saama Technologies, Inc.

How to Ingest Data into Google BigQuery using Talend for Big Data. A Technical Solution Paper from Saama Technologies, Inc. How to Ingest Data into Google BigQuery using Talend for Big Data A Technical Solution Paper from Saama Technologies, Inc. July 30, 2013 Table of Contents Intended Audience What you will Learn Background

More information

Outline. What is cloud computing? History Cloud service models Cloud deployment forms Advantages/disadvantages

Outline. What is cloud computing? History Cloud service models Cloud deployment forms Advantages/disadvantages Ivan Zapevalov 2 Outline What is cloud computing? History Cloud service models Cloud deployment forms Advantages/disadvantages 3 What is cloud computing? 4 What is cloud computing? Cloud computing is the

More information

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts Part V Applications Cloud Computing: General concepts Copyright K.Goseva 2010 CS 736 Software Performance Engineering Slide 1 What is cloud computing? SaaS: Software as a Service Cloud: Datacenters hardware

More information

CHAPTER 2 BACKGROUND AND OBJECTIVE OF PRESENT WORK

CHAPTER 2 BACKGROUND AND OBJECTIVE OF PRESENT WORK CHAPTER 2 BACKGROUND AND OBJECTIVE OF PRESENT WORK 2.1 Background Today middleware technology is not implemented only in banking and payment system even this is the most important point in the field of

More information

How To Set Up Wiremock In Anhtml.Com On A Testnet On A Linux Server On A Microsoft Powerbook 2.5 (Powerbook) On A Powerbook 1.5 On A Macbook 2 (Powerbooks)

How To Set Up Wiremock In Anhtml.Com On A Testnet On A Linux Server On A Microsoft Powerbook 2.5 (Powerbook) On A Powerbook 1.5 On A Macbook 2 (Powerbooks) The Journey of Testing with Stubs and Proxies in AWS Lucy Chang [email protected] Abstract Intuit, a leader in small business and accountants software, is a strong AWS(Amazon Web Services) partner

More information

In a dynamic economic environment, your company s survival

In a dynamic economic environment, your company s survival Chapter 1 Cloud Computing Defined In This Chapter Examining the reasons for cloud Understanding cloud types Defining the elements of cloud computing Comparing private and public clouds In a dynamic economic

More information

Cloud computing. Examples

Cloud computing. Examples Cloud computing Cloud computing Web Systems and Algorithms Cloud Computing Chris Brooks Department of Computer Science University of San Francisco What is cloud computing? What separates it from: grid

More information

Cloud Computing. Following the American Psychological Association s Guidelines. Dustin Self. The University of North Texas

Cloud Computing. Following the American Psychological Association s Guidelines. Dustin Self. The University of North Texas Running Head: CLOUD COMPUTING 1 Cloud Computing Following the American Psychological Association s Guidelines Dustin Self The University of North Texas 2 Cloud Computing by Dustin Self ABSTRACT According

More information

Continuous Integration (CI) for Mobile Applications

Continuous Integration (CI) for Mobile Applications Continuous Integration (CI) for Mobile Applications Author: Guy Arieli, CTO, Experitest Table of Contents: What Continuous Integration Adds to the Mobile Development Process 2 What is Continuous Integration?

More information

Amazon Elastic Compute Cloud Getting Started Guide. My experience

Amazon Elastic Compute Cloud Getting Started Guide. My experience Amazon Elastic Compute Cloud Getting Started Guide My experience Prepare Cell Phone Credit Card Register & Activate Pricing(Singapore) Region Amazon EC2 running Linux(SUSE Linux Windows Windows with SQL

More information

Last time. Today. IaaS Providers. Amazon Web Services, overview

Last time. Today. IaaS Providers. Amazon Web Services, overview Last time General overview, motivation, expected outcomes, other formalities, etc. Please register for course Online (if possible), or talk to Yvonne@CS Course evaluation forgotten Please assign one volunteer

More information

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011 Scalable Data Analysis in R Lee E. Edlefsen Chief Scientist UserR! 2011 1 Introduction Our ability to collect and store data has rapidly been outpacing our ability to analyze it We need scalable data analysis

More information

DISTRIBUTED SYSTEMS AND CLOUD COMPUTING. A Comparative Study

DISTRIBUTED SYSTEMS AND CLOUD COMPUTING. A Comparative Study DISTRIBUTED SYSTEMS AND CLOUD COMPUTING A Comparative Study Geographically distributed resources, such as storage devices, data sources, and computing power, are interconnected as a single, unified resource

More information

owncloud Enterprise Edition on IBM Infrastructure

owncloud Enterprise Edition on IBM Infrastructure owncloud Enterprise Edition on IBM Infrastructure A Performance and Sizing Study for Large User Number Scenarios Dr. Oliver Oberst IBM Frank Karlitschek owncloud Page 1 of 10 Introduction One aspect of

More information

ArcGIS for Server: In the Cloud

ArcGIS for Server: In the Cloud DevSummit DC February 11, 2015 Washington, DC ArcGIS for Server: In the Cloud Bonnie Stayer, Esri Session Outline Cloud Overview - Benefits - Types of clouds ArcGIS in AWS - Cloud Builder - Maintenance

More information

MyCloudLab: An Interactive Web-based Management System for Cloud Computing Administration

MyCloudLab: An Interactive Web-based Management System for Cloud Computing Administration MyCloudLab: An Interactive Web-based Management System for Cloud Computing Administration Hoi-Wan Chan 1, Min Xu 2, Chung-Pan Tang 1, Patrick P. C. Lee 1 & Tsz-Yeung Wong 1, 1 Department of Computer Science

More information

White Paper. Cloud Native Advantage: Multi-Tenant, Shared Container PaaS. http://wso2.com Version 1.1 (June 19, 2012)

White Paper. Cloud Native Advantage: Multi-Tenant, Shared Container PaaS. http://wso2.com Version 1.1 (June 19, 2012) Cloud Native Advantage: Multi-Tenant, Shared Container PaaS Version 1.1 (June 19, 2012) Table of Contents PaaS Container Partitioning Strategies... 03 Container Tenancy... 04 Multi-tenant Shared Container...

More information

The Theory And Practice of Testing Software Applications For Cloud Computing. Mark Grechanik University of Illinois at Chicago

The Theory And Practice of Testing Software Applications For Cloud Computing. Mark Grechanik University of Illinois at Chicago The Theory And Practice of Testing Software Applications For Cloud Computing Mark Grechanik University of Illinois at Chicago Cloud Computing Is Everywhere Global spending on public cloud services estimated

More information

WELCOME TO CITUS CLOUD LOAD TEST

WELCOME TO CITUS CLOUD LOAD TEST USER S GUIDE CONTENTS Contents... 2 Chapter 1: Welcome to Citus Cloud Load Test... 3 1. What is Citus Cloud Load Test?... 3 2. Why Citus Cloud Load Test?... 3 3. Before using this guide... 3 Chapter 2:

More information

Cloud Computing INTRODUCTION

Cloud Computing INTRODUCTION Cloud Computing INTRODUCTION Cloud computing is where software applications, processing power, data and potentially even artificial intelligence are accessed over the internet. or in simple words any situation

More information

Description of Application

Description of Application Description of Application Operating Organization: Coeur d Alene Tribe, Plummer, Idaho Community of Interest: U.S. Indian tribes and their governments; rural governments OS and software requirements: Microsoft

More information

The Cloud Hosting Revolution: Learn How to Cut Costs and Eliminate Downtime with GlowHost's Cloud Hosting Services

The Cloud Hosting Revolution: Learn How to Cut Costs and Eliminate Downtime with GlowHost's Cloud Hosting Services The Cloud Hosting Revolution: Learn How to Cut Costs and Eliminate Downtime with GlowHost's Cloud Hosting Services For years, companies have struggled to find an affordable and effective method of building

More information

Leveraging Public Clouds to Ensure Data Availability

Leveraging Public Clouds to Ensure Data Availability Systems Engineering at MITRE CLOUD COMPUTING SERIES Leveraging Public Clouds to Ensure Data Availability Toby Cabot Lawrence Pizette The MITRE Corporation manages federally funded research and development

More information

The Cloud at Crawford. Evaluating the pros and cons of cloud computing and its use in claims management

The Cloud at Crawford. Evaluating the pros and cons of cloud computing and its use in claims management The Cloud at Crawford Evaluating the pros and cons of cloud computing and its use in claims management The Cloud at Crawford Wikipedia defines cloud computing as Internet-based computing, whereby shared

More information

Cloud Computing demystified! ISACA-IIA Joint Meeting Dec 9, 2014 By: Juman Doleh-Alomary Office of Internal Audit [email protected]

Cloud Computing demystified! ISACA-IIA Joint Meeting Dec 9, 2014 By: Juman Doleh-Alomary Office of Internal Audit jdoleh@wayne.edu Cloud Computing demystified! ISACA-IIA Joint Meeting Dec 9, 2014 By: Juman Doleh-Alomary Office of Internal Audit [email protected] 2 If cloud computing is so simple, then what s the big deal? What is the

More information

The Cost of the Cloud. Steve Saporta CTO, SwipeToSpin Mar 20, 2015

The Cost of the Cloud. Steve Saporta CTO, SwipeToSpin Mar 20, 2015 The Cost of the Cloud Steve Saporta CTO, SwipeToSpin Mar 20, 2015 The SwipeToSpin product SpinCar 360 WalkAround JPEG images HTML JavaScript CSS WA for short Creating a WA 1. Download and parse CSV file

More information

Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition

Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition Liferay Portal Performance Benchmark Study of Liferay Portal Enterprise Edition Table of Contents Executive Summary... 3 Test Scenarios... 4 Benchmark Configuration and Methodology... 5 Environment Configuration...

More information

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...

More information

Alfresco Enterprise on AWS: Reference Architecture

Alfresco Enterprise on AWS: Reference Architecture Alfresco Enterprise on AWS: Reference Architecture October 2013 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 13 Abstract Amazon Web Services (AWS)

More information

Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk

Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk Understanding ArcGIS Deployments in Public and Private Cloud Marwa Mabrouk Agenda Back to Basics What are people doing? New Features Using ArcGIS in the Cloud - Private Cloud - Public Cloud Technical Demos

More information

Zadara Storage Cloud A whitepaper. @ZadaraStorage

Zadara Storage Cloud A whitepaper. @ZadaraStorage Zadara Storage Cloud A whitepaper @ZadaraStorage Zadara delivers two solutions to its customers: On- premises storage arrays Storage as a service from 31 locations globally (and counting) Some Zadara customers

More information

A Web Base Information System Using Cloud Computing

A Web Base Information System Using Cloud Computing A Web Base Information System Using Cloud Computing Zainab Murtadha, Mohammad Amin Roshanasan Abstract: Cloud Computing is the new field that was invented and developed during a period not so long ago.

More information

InstaFile. Complete Document management System

InstaFile. Complete Document management System InstaFile Complete Document management System Index : About InstaFile 1.1 What is InstaFile 1.2 How does it work 1.3 Where you can use InstaFile 1.4 Why only InstaFile InstaFile features and benefits Start

More information

01/02/2012. Meet CloudOne. James Pietrocarlo Vice President of Business Development CloudOne. [email protected].

01/02/2012. Meet CloudOne. James Pietrocarlo Vice President of Business Development CloudOne. james.pietrocarlo@oncloudone.net. Meet CloudOne James Pietrocarlo Vice President of Business Development CloudOne [email protected] net 2 1 Who is CloudOne? A unique IBM Premier Business Partner created specifically to help

More information

Cloud Computing Deja Vu

Cloud Computing Deja Vu Cloud Computing Deja Vu Hengming Zou, Ph.D. Shanghai Jiao Tong University Time Is Late An auto company need to know the price of front bumper And want it right now So it reconfigures some server to run

More information

References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline

References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline References Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of

More information

Introduction to Database Systems CSE 444

Introduction to Database Systems CSE 444 Introduction to Database Systems CSE 444 Lecture 24: Databases as a Service YongChul Kwon References Amazon SimpleDB Website Part of the Amazon Web services Google App Engine Datastore Website Part of

More information

Deployment Options for Microsoft Hyper-V Server

Deployment Options for Microsoft Hyper-V Server CA ARCserve Replication and CA ARCserve High Availability r16 CA ARCserve Replication and CA ARCserve High Availability Deployment Options for Microsoft Hyper-V Server TYPICALLY, IT COST REDUCTION INITIATIVES

More information

Performance Optimization Guide

Performance Optimization Guide Performance Optimization Guide Publication Date: July 06, 2016 Copyright Metalogix International GmbH, 2001-2016. All Rights Reserved. This software is protected by copyright law and international treaties.

More information

Cloud Computing 159.735. Submitted By : Fahim Ilyas (08497461) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

Cloud Computing 159.735. Submitted By : Fahim Ilyas (08497461) Submitted To : Martin Johnson Submitted On: 31 st May, 2009 Cloud Computing 159.735 Submitted By : Fahim Ilyas (08497461) Submitted To : Martin Johnson Submitted On: 31 st May, 2009 Table of Contents Introduction... 3 What is Cloud Computing?... 3 Key Characteristics...

More information

ediscovery and Search of Enterprise Data in the Cloud

ediscovery and Search of Enterprise Data in the Cloud ediscovery and Search of Enterprise Data in the Cloud From Hype to Reality By John Patzakis & Eric Klotzko ediscovery and Search of Enterprise Data in the Cloud: From Hype to Reality Despite the enormous

More information

AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh

AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh CAN YOUR S ERVICE S URVIVE? CAN YOUR S ERVICE S URVIVE? CAN YOUR SERVICE SURVIVE? Datacenter loss of connectivity Flood Tornado

More information

Session 3. the Cloud Stack, SaaS, PaaS, IaaS

Session 3. the Cloud Stack, SaaS, PaaS, IaaS Session 3. the Cloud Stack, SaaS, PaaS, IaaS The service models resemble a cascading architecture where services on a higher level, as identified by Weinhardt et.al. (2009); encapsulate functionality from

More information

Virtualization and Cloud Computing

Virtualization and Cloud Computing Written by Zakir Hossain, CS Graduate (OSU) CEO, Data Group Fed Certifications: PFA (Programming Foreign Assistance), COR (Contracting Officer), AOR (Assistance Officer) Oracle Certifications: OCP (Oracle

More information

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar OpenNebula Open Souce Solution for DC Virtualization C12G Labs Online Webinar What is OpenNebula? Multi-tenancy, Elasticity and Automatic Provision on Virtualized Environments I m using virtualization/cloud,

More information