Benchmarking OPeNDAP services for modern ESM data workloads



Similar documents
Maximising the utility of OpeNDAP datasets through the NetCDF4 API

AppDynamics Lite Performance Benchmark. For KonaKart E-commerce Server (Tomcat/JSP/Struts)

EVALUATING NETWORK BUFFER SIZE REQUIREMENTS

Performing Load Capacity Test for Web Applications

Infor Web UI Sizing and Deployment for a Thin Client Solution

JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers

Microsoft Exchange Server 2003 Deployment Considerations

Shoal: IaaS Cloud Cache Publisher

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

SOLUTION BRIEF: SLCM R12.8 PERFORMANCE TEST RESULTS JANUARY, Submit and Approval Phase Results

COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service

IT Business Management System Requirements Guide

Performance of VMware vcenter (VC) Operations in a ROBO Environment TECHNICAL WHITE PAPER

VI Performance Monitoring

Transparent Optimization of Grid Server Selection with Real-Time Passive Network Measurements. Marcia Zangrilli and Bruce Lowekamp

SuperGIS Server 3 Website Performance and Stress Test Report

ESPRESSO: An Encryption as a Service for Cloud Storage Systems

USING VIRTUAL MACHINE REPLICATION FOR DYNAMIC CONFIGURATION OF MULTI-TIER INTERNET SERVICES

Data Centers and Cloud Computing

Detailed Lab Report DR101115D. Citrix XenDesktop 4 vs. VMware View 4 using Citrix Branch Repeater and Riverbed Steelhead

NetCDF and HDF Data in ArcGIS

OpenDAP configuration course

The ORIENTGATE data platform

1000Mbps Ethernet Performance Test Report

DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering

Legal Notices Introduction... 3

Performance Analysis of Lucene Index on HBase Environment

Data Centers and Cloud Computing. Data Centers

Brainlab Node TM Technical Specifications

Microsoft Dynamics AX 2012 System Requirements. Microsoft Corporation Published: November 2011

InterScan Web Security Virtual Appliance

CHAPTER FIVE RESULT ANALYSIS

W H I T E P A P E R : T E C H N I C A L. Understanding and Configuring Symantec Endpoint Protection Group Update Providers

SMART Bridgit software

Muse Server Sizing. 18 June Document Version Muse

Zeus Traffic Manager VA Performance on vsphere 4

Dragon NaturallySpeaking and citrix. A White Paper from Nuance Communications March 2009

Knowledge Article Performance Comparison: BMC Remedy ITSM Incident Management version Vs on Windows

Merge CADstream. For IT Professionals. Merge CADstream,

Installation Guide for Citrix XenServer 5.5

Understanding the Benefits of IBM SPSS Statistics Server

Pivot3 Reference Architecture for VMware View Version 1.03

CMIP6 Data Management at DKRZ

STI Hardware Specifications for PCs

AgencyPortal v5.1 Performance Test Summary Table of Contents

HPSA Agent Characterization

Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition

Client-aware Cloud Storage

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices

Deploying in a Distributed Environment

High Performance Data-Transfers in Grid Environment using GridFTP over InfiniBand

The Monitis Monitoring Agent ver. 1.2

DELL TM PowerEdge TM T Mailbox Resiliency Exchange 2010 Storage Solution

GraySort and MinuteSort at Yahoo on Hadoop 0.23

Evaluation of Open Source Data Cleaning Tools: Open Refine and Data Wrangler

Performance test report

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

CEDA Storage. Dr Matt Pritchard. Centre for Environmental Data Archival (CEDA)

Model examples Store and provide Challenges WCS and OPeNDAP Recommendations. WCS versus OPeNDAP. Making model results available through the internet.

Remote Desktop Software Benchmark

WHITE PAPER 1

inforouter V8.0 Server & Client Requirements

Performance Analysis of Web based Applications on Single and Multi Core Servers

Performance Test Results Report for the Sled player

Igor Seletskiy. CEO, CloudLinux

owncloud Enterprise Edition on IBM Infrastructure

Delivering Quality in Software Performance and Scalability Testing

Marvell DragonFly Virtual Storage Accelerator Performance Benchmarks

Microsoft Dynamics AX 2012 System Requirements. Microsoft Corporation Published: August 2011

Using. Microsoft Virtual PC. Page 1

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Ahsay Online Backup Suite v5.0. Whitepaper Backup speed

Performance Analysis and Capacity Planning Whitepaper

vrealize Business System Requirements Guide

VMWARE WHITE PAPER 1

RSM Web Gateway RSM Web Client INSTALLATION AND ADMINISTRATION GUIDE

NetIQ Access Manager 4.1

farmerswife Contents Hourline Display Lists 1.1 Server Application 1.2 Client Application farmerswife.com

Features of The Grinder 3

Interactive Data Visualization with Focus on Climate Research

Microsoft Dynamics AX 2012 System Requirements. Microsoft Corporation Published: March 2012

Dragon Medical Enterprise Network Edition Technical Note: Requirements for DMENE Networks with virtual servers

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

Performance of Enterprise Java Applications on VMware vsphere 4.1 and SpringSource tc Server

SAP BusinessObjects BI4 Sizing What You Need to Know

Storage benchmarking cookbook

Comtrol Corporation: Latency Performance Testing of Network Device Servers

Business white paper. HP Process Automation. Version 7.0. Server performance

A vexpert Load Test of the KEMP VLM-5000

JASMIN Cloud ESGF and UV- CDAT Conference December 2014 STFC / Stephen Kill

Fixed Price Website Load Testing

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

wu.cloud: Insights Gained from Operating a Private Cloud System

Transcription:

Benchmarking OPeNDAP services for modern ESM data workloads Stephen Pascoe (Stephen.Pascoe@stfc.ac.uk) Richard Wilkinson (Tessella plc. Abingdon, UK) Phil Kershaw (Philip.Kershaw@stfc.ac.uk) 1

BADC: British Atmospheric Data Centre NEODC: Near-Earth Observation Data Centre IPCC-DDC: IPCC Data Distribution Centre 2

Opportunities for the NetCDF/OPeNDAP Platform Local Local Files Files WAN WAN ESGF Security Security ESGF LAN ESGF Security Security ESGF NetCDF-API NetCDF-API Script Script High-Level Tools Tools High-Level Application Application Web Web App. App. Remote Remote Files Files OPeNDAP OPeNDAP Virtual Virtual Datasets Datasets Remote Remote Files Files OPeNDAP OPeNDAP Virtual Virtual Datasets Datasets 3

Key Questions What performance can we expect? Can OPeNDAP servers cope? How can we improve? 4

dapbench OPeNDAP Test Framework http://github.com/stephenpascoe/dapbench Intercept and records OPeNDAP requests from clients Load-test OPeNDAP servers Jython NetCDF abstraction layer Verifying ESG security constraints Using: Python, Jython, Grinder, Pandas, Matplotlib, Nmon 5

Client Tests dapbench proxy Client Client Capture Test Clients CDAT (cdat-lite 6.0-alpha-3) CDO 1.4.7 Test Dataset 4D temperature field shape [120,4,144,192] time/level/lat/lon 51MB NetCDF file. Test Requests 1. Take a 45ºx45º subset of entire field 2. Regrid entire field to 100x100 lat/lon resolution 6

Client Results Test Subset Tool CDO CDAT Regrid CDO CDAT DODS Requests Download Size (Mb) 481 17281 481 69121 Comment 50 1 + time*level requests 1.5 1 + time*level*lat requests 50 1 + time*level requests 50 1 + time*level*lat requests CDO downloads too much and with sub-optimal number of requests. CDAT downloads just enough but with very sub-optimal number of requests. CDAT iterates over all outer dimensions. 7

ESGF Security testing $ dapbench-thredds <cmd> <options> Detect OPeNDAP URLs from THREDDS URL or URL list Test access to all responses.das,.dods,.dds,.ascii Confirms authz redirect correctness Useful for C.I. Testing 8

Tests Test Framework Grinder load-testing framework NetCDF-Java API Jython NetCDF abstraction layer Nmon server monitoring Test Dataset 30 files of 4D temperature field shape [120,4,144,192] time/level/lat/lon 600MB per NetCDF file. Test Requests 1. Download a single file (non-dap response) 2. Request all of a random file in n slices, n = [15, 30, 60, 120, 240, 720, 1440] 3. Continuously reqest random subsets in m threads, m = [1,.. 40] 9

Test System Measured Bandwidth: 930Mbps (iperf) Client Client DELL optiplex 980. Intel i5 4-cores @ 3.2GHz 1Gbps NIC NetCDF 4.1.2-beta2, HDF51.8.4-patch1 LAN Xen Virtual Machine Host: DELL PowerEdge 2950 III 2x quad-core CPU, 24GB RAM Guest VM: Paravirtualised OpenSuSE 11.0 2 CPU, 8GB RAM, 4GB swap Test s THREDDS Data : 3.17.3.1 Pydap: 3.0.rc.15. netcdf4-python-0.9.3. Platforms 64-bit Java SDK 1.6.0-13, Tomcat-6.0.20. JAVA_OPTS=-Xms1000M -Xmn500M -Xmx1500M Apache-2.2.8 (prefork), mod_wsgi-3.3 10

Simulating WAN performance $ tc qdisc add dev eth0 root netem delay <n>ms Host Latency (ms) CEDA PCMDI <0.5 160 DKRZ 40 EGU >1000 Ha ha ha! 11

Results: ramp slices 12

Results: Sequential Download 60 slices 30 slices 15 slices Direct download Including apache 13

Results: parallel requests Select Random File Select random time slices 4 Processes 60 Threads/Process 14

Results: load 0ms 20ms 15

How do we save clients from themselves? 16

DAP Response tiling / chunking WMS web-clients use tiling to improve client & server performance Browser Browser Cache Cache Proxy Proxy Cache Cache Tiled WMS Tile Tile Cache Cache WMS Tiles Tiles 17

Possible Future OPeNDAP OPeNDAP Client Client NetCDF-API Cache Cache Cache Proxy Proxy Cache Cache 4 Tiled OPeNDAP? 1 5 2 Tile Tile Cache Cache OPeNDAP 3 Tiles Tiles pydap.wsgi.cache 18

Conclusions High-level clients need optimising to use OPeNDAP effectively OPeNDAP servers can compete with whole-file download or we need inteligent proxies / caches If you minimise the number of requests implementations broadly competitive Effective caching needs to include clients and/or client-side proxies 19

Thanks Stephen.Pascoe@stfc.ac.uk Twitter: @spascoe G+: Stephen Pascoe http://github.com/stephenpascoe/dapbench 20