Best Practices for Sharing Imagery using Amazon Web Services. Peter Becker



Similar documents
Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell

Alfresco Enterprise on AWS: Reference Architecture

ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri

Big data, big deal? Presented by: David Stanley, CTO PCI Geomatics

Managing Lidar (and other point cloud) Data. Lindsay Weitz Cody Benkelman

Every Silver Lining Has a Vault in the Cloud

ArcGIS for Server in the Cloud

Enterprise GIS Architecture Deployment Options. Andrew Sakowicz

Administrative Issues

Technical Aspects to GIS in the Cloud

SMB in the Cloud David Disseldorp

Storage Solutions in the AWS Cloud. Miles Ward Enterprise Solutions Architect

Using ArcGIS for Server in the Amazon Cloud

Description of Application

Using ArcGIS for Server in the Amazon Cloud

Backup and Recovery of SAP Systems on Windows / SQL Server

SERVER CLOUD DISASTER RECOVERY. User Manual

GeoCloud Project Report GEOSS Clearinghouse

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

TECHNOLOGY WHITE PAPER Jun 2012

ArcGIS 10.3 Server on Amazon Web Services

PostgreSQL Performance Characteristics on Joyent and Amazon EC2

Deploying ArcGIS for Server Using Esri Managed Services

Zend Server Amazon AMI Quick Start Guide

Advanced Image Management using the Mosaic Dataset

ArcGIS for Server: In the Cloud

Oracle Applications and Cloud Computing - Future Direction

Amazon Elastic Beanstalk

National Center for Education Statistics. Amazon Hosted ESRI ArcGIS Servers Project Final Report

Enterprise Image Management. An Esri White Paper November 2015

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project

Enterprise Applications on AWS

Online Backup Guide for the Amazon Cloud: How to Setup your Online Backup Service using Vembu StoreGrid Backup Virtual Appliance on the Amazon Cloud

Deploying ArcGIS for Server Using Managed Services

Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup

Using DSC with Visual Studio Release Management

Chapter 9 PUBLIC CLOUD LABORATORY. Sucha Smanchat, PhD. Faculty of Information Technology. King Mongkut s University of Technology North Bangkok

EEDC. Scalability Study of web apps in AWS. Execution Environments for Distributed Computing

Intro to AWS: Storage Services

Media Exchange really puts the power in the hands of our creative users, enabling them to collaborate globally regardless of location and file size.

Introduction to Cloud Computing

TECHNOLOGY WHITE PAPER Jan 2016

Managing a local Galaxy Instance. Anushka Brownley / Adam Kraut BioTeam Inc.

SharePoint 2013 on Windows Azure Infrastructure David Aiken & Dan Wesley Version 1.0

Amazon Web Services Student Tutorial

Assignment # 1 (Cloud Computing Security)

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Infrastructure Clouds for Science and Education: Platform Tools

Investigating Private Cloud Storage Deployment using Cumulus, Walrus, and OpenStack/Swift

How To Image A Single Vm For Forensic Analysis On Vmwarehouse.Com

Flight Workflow User's Guide. Release

Server Installation Manual 4.4.1

WEBAPP PATTERN FOR APACHE TOMCAT - USER GUIDE

System Administration Training Guide. S100 Installation and Site Management

Deploy Your First CF App on Azure with Template and Service Broker. Thomas Shao, Rita Zhang, Bin Xia Microsoft Azure Team

A Study on Architecture of Private Cloud Based on Virtual Technology

Estimating the Cost of a GIS in the Amazon Cloud. An Esri White Paper August 2012

Eucalyptus User Console Guide

Cloud Computing with Amazon Web Services and the DevOps Methodology.

Deploying ArcGIS for Server using Managed Services

The Cost of the Cloud. Steve Saporta CTO, SwipeToSpin Mar 20, 2015

Migration Scenario: Migrating Batch Processes to the AWS Cloud

GIS and the Cloud. Richard Cantwell

Amazon EC2 Product Details Page 1 of 5

Using SUSE Studio to Build and Deploy Applications on Amazon EC2. Guide. Solution Guide Cloud Computing.

Developing Microsoft Azure Solutions 20532A; 5 days

Amazon Cloud Storage Options

Server application Client application Quick remote support application. Server application

In order to upload a VM you need to have a VM image in one of the following formats:

APP DEVELOPMENT ON THE CLOUD MADE EASY WITH PAAS

This works very well for situations where all computers are within the same LAN and can access both the SQL server and the network shares.

Load Balancing Advanced Operation Manual

How To Use Kentico+ On A Pc Or Mac Or Macbook

Amazon Web Services Demo Tech Exchange. Slides:

Storage Options in the AWS Cloud

WELCOME TO CITUS CLOUD LOAD TEST

Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud

Building your Server for High Availability and Disaster Recovery. Witt Mathot Danny Krouk

If you do NOT use applications based on Amazon Web Services raise your hand.

Implementing Microsoft Windows Server Failover Clustering (WSFC) and SQL Server 2012 AlwaysOn Availability Groups in the AWS Cloud

Automated Application Provisioning for Cloud

MANAGE YOUR AMAZON AWS ASSETS USING BOTO

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

RSA Security Analytics Virtual Appliance Setup Guide

VMUnify EC2 Gateway Guide

Experian Secure Transport Service

CloudFTP: A free Storage Cloud

RDS Migration Tool Customer FAQ Updated 7/23/2015

Postgres Plus Cloud Database!

Managing Imagery and Raster Data in ArcGIS

NAS 254 Cloud Backup. Use Cloud Backup to backup your data to Amazon S3 A S U S T O R C O L L E G E

How To Choose Cloud Computing

Storing and Processing Sensor Networks Data in Public Clouds

Deployment Guide: Unidesk and Hyper- V

Web Conferencing Version 8.3 Troubleshooting Guide

PassTest. Bessere Qualität, bessere Dienstleistungen!

Installation Guide on Cloud Platform

Developing Microsoft Azure Solutions

Data Pipeline with Kafka

Using and Contributing Virtual Machines to VM Depot

Transcription:

Best Practices for Sharing Imagery using Amazon Web Services Peter Becker

Objectives Making Imagery Accessible Store massive volumes of imagery on inexpensive cloud storage Use elastic compute for image services www.esri.com/landsatonaws Focus on Imagery - Large in Volume - Static - Basis for visualization and Analytics Note: Implementation of these workflows currently require additional tools and a component that is not publically available. Details for access will be provided at the end

What s Changed Simple/ Object Store NAS NFS Http Standalone Direct Access Elasticity Enterprise Cloud

Amazon Storage Options EBS (Elastic Block Storage) 1TB/Disk ($100/TB/Month) Ephemeral (80GB) $0 (inc. with EC2) Fileshare NFS NFS

Amazon Storage Options Ephemeral (80GB) $0 (inc. with EC2) EBS (Elastic Block Storage) 1TB/Disk ($100/TB/Month) S3 Simple Storage $30/TB/Month 99.999999999% Durability 99.99% Reliability/year 10-50MB/s NFS Http

Azure Storage Options Temporary Disk (80GB) Azure Files 5TB, max 60MBps/Share NFS+REST Blobs REST Azure Blobs 500TB/container max 60MBps/Blob

Optimizations Use Inexpensive Scalable Storage - Minimize Number of requests - Minimize Size of requests - Minimize Repeat requests Enable Elasticity Enable Caching Object Store Http Suitable data structure: - Enable access in single request - - - Data Tiles Driver Driver Driver Enable http access Cache to ephemeral/temporary storage

File Format Considerations / Optimization Tiling of imagery Enables partial access Compression Reduce storage and transfer Weigh against additional compute Data access complexity Reduce subsequent requests Bad Good Optimum Striped TIF Tiled TIF MRF separ Cache Locally Non Optimum Access Pyramid Enables partial Access Pyramid Enable separate caching of metadata and index

MRF (Meta Raster) Format NASA designed format Split raster into three (or more) files - MRF Includes links to and DataFile - Simple index to tiles - DataFile Data file - Auxiliary files e.g. Aux.xml Open Source Implemented in GDAL Multiple compression options Multiple modes: Static, Split, Cloned, Caching Datafile Pyramid MRF file is seen as a Raster Dataset in ArcGIS MRF file and index is small and can easily be on a local file system Data can be on slower storage

Storage Formats & Compression Advantages/Disadvantages Raw/Striped TIF Sequential access / No compression Tiled GeoTIF Tiled Access - Compression Options - None - Lossy JPEG.....Fast Lossy compression 8 & 12bit - Lossless Deflate/LZW Generic lossless compression JPEG2000 Tiled Access - Compression Options - Lossy.Computationally expensive - Lossless Computationally expensive MRF Tiled Access - Compression Options - As geotif JPEG,Deflate - PNG Generic lossless - LERC Controlled Lossy Very Fast Optimum for higher bit depth and catagorical

Converting Data ArcGIS CopyRasters, GDAL_Translate/GDALaddo - Convert rasters, but don t copy associated metadata OptimizeRasters - Python Script (Calls ArcGIS Version of GDAL) - From any GDAL supported format to MRF or TiledTIF - Parameters to define compression, tilesize etc - Currently command line tool

Getting Imagery to and from the Cloud AWS command line CloudBerry, FileZilla,. OptimizeRasters - Copy and conversion in parallel - Option to creates Cloned MRF, Caching MRF - Check success and retries if necessary Import/Export - Ship hard drives for large projects Note: File Names on S3 are case sensitive

Demo Conversion and Uploading Using OptimizeRasters

Accessing Cloud Imagery

ArcGIS Prerequisites ArcGIS typically access files through a file system Cannot access files in S3 directly Use MRF Driver to point to MRF file that points to data on S3 MRF Driver is custom GDAL Driver for ArcGIS Available in 10.3.1* (with issues) Best to download See later Note: Following can be used to Optimize Enterprise implementations as well Works with data on S3, but also from any local storage

ArcGIS MRF Driver Special MRF driver for ArcGIS Incorporated into ArcGIS 10.3.1* Desktop & Server 4 Modes StaticMRF Datafile One Directory

ArcGIS MRF Driver Special MRF driver for ArcGIS Incorporated into ArcGIS 10.3.1* Desktop & Server 4 Modes StaticMRF SplitMRF Datafile Fast Expensive Storage Datafile Slow Cheap Storage

ArcGIS MRF Driver Special MRF driver for ArcGIS Incorporated into ArcGIS 10.3.1* Desktop & Server 4 Modes StaticMRF SplitMRF ClonedMRF Datafile Cache Fast Expensive Storage e.g. Ephemeral VSI Curl / Http Datafile Datafile Slow Cheap Storage e.g. S3

ArcGIS MRF Driver Special MRF driver for ArcGIS Incorporated into ArcGIS 10.3.1* Desktop & Server 4 Modes StaticMRF SplitMRF ClonedMRF CachingMRF Cached File Datafile Cache Cache Datafile VSI Curl / Http Datafile VSI Curl / Http XXX File Any GDAL Readable file

Creating Local MRF Files that Point to Source Use OptimizeRasters - Creates MRF files on local file system (Ephemeral or EBS) - MRF points to and Data Files - Copies Auxiliary files for RasterType - Note MRF files can have any extension (incl..tif) Auxiliary Files Auxiliary Files Cache Cache OptimizeRastes If source is MRF Use ClonedMRF - Adds caching optimization If source is other format Use CachingMRF - Enables source to be any Image on S3 (or SAN etc) Define Cache location as Ephemeral Datafile Auxiliary Files XXX File Auxiliary Files

Demo Creating and Using CachingMRF

Creating and Using CachingMRF

Creating Mosaic Dataset Use Standard Workflow - Create Mosaic Dataset - Add Rasters - Define and Build Overviews - Set Properties - Perform QC - Publish More Advance Workflow - Create Source Mosaic Dataset for each project - As for Standard Workflow - Create Derived Mosaic Dataset For more details see: Image Management Workflows - Resource Center landing page http://esriurl.com/6005 - Guidebook in Help System http://esriurl.com/6007 - ArcGIS Online Group http://esriurl.com/6539

Handling Overviews Create Overviews on Ephemeral or EBS Use OptimizeRasters to copy to S3 Delete Overviews Use Optimize Rasters to copy back MRF files Repair paths (if necessary) (If very large can generate on EBS and copy MRF back to Ephemeral)

Publishing Image Services For initial testing use Standard Workflow - Right click Publish as Image Service Publishing can be automated

Configuring ArcGIS Server for Imagery in AWS Use AutoScaling Group Use Silo configuration of ArcGIS Server (each server is its own site) Load Balancer User Regular Amazon Elastic Load Balancer (ELB) - Turn On Sticky session on using Duration-Based Session Stickiness - Listen on ports 80 (http) & 443 (https) - Install SSL certification on port 443 - Forward Port 80 (ELB) -> 6080 (ArcGIS Server) - Forward Port 443 (EBL) -> 6443 (ArcGIS Server) On ArcGIS server turn on http & https Create Master Server Create AMI of Master Server Config AMI to Autoscaling group Scale out and scale in based on CPU Load (typically CPU Load >80% or <20% for 5min) Set Min and Max Instances

Publish Elastic Image Services Prepare Base - Delete all Cache files - ZIP MD and MRF files - Store on S3 For each machine - Use Deployer Script / Cloud Formation Template to Start up - Pull Mosaic Datasets, MRF overviews, MRF source rasters from S3 in a zip file. Above can be automated with Cloud Formation templates available from Professional Services

Advanced Techniques S3 Security - Use ACL to define Public Read - Else use Bucket Policy to define which IPs etc. can read Embed MRF files into Mosaic Dataset - MRF files fully encapsulated in Mosaic Dataset PostgreSQL - Multiple machines from same Mosaic Dataset (dynamic updates)

Getting Access Currently the scripts are in Prototype state To get access to OptimzeRasters, MRF Drivers, PPT and doc: - Send email to - ImageManagementWorkflows@esri.com - Provide GitHub ID - Name & email - Brief explanation of project goals

Thank you Please fill out the session survey in your mobile app Select Note: Incorrect time [Best Practices for Sharing Imagery using Amazon Web Services] in the Mobile App - Use the Search Feature to quickly find this title Click Technical Workshop Survey Answer a few short questions and enter any comments