VAPOR: A tool for interactive visualization of massive earth-science datasets

Similar documents
Understanding WRF-ARW Datasets with Interactive 3D Visualization

An Integrated Simulation and Visualization Framework for Tracking Cyclone Aila

A Multiresolution Approach to Large Data Visualization

International Workshop on VISUALIZATION OF HIGH-RESOLUTION 3D TURBULENT FLOWS. Friday June 8 th 2007 Ecole Normale Supérieure, Paris Salle Dussane

A Prototype Discovery Environment for Analyzing and Visualizing Terascale Turbulent Fluid Flow Simulations

Visualization with ParaView. Greg Johnson

VisCMD: Visualizing Cloud Modeling Data

The THREDDS Data Repository: for Long Term Data Storage and Access

Hank Childs, University of Oregon

Elements of Scalable Data Analysis and Visualization

Ultra-Scale Visualization: Research and Education

Visualization with ParaView

Interactive Visualization of Magnetic Fields

In-Situ Bitmaps Generation and Efficient Data Analysis based on Bitmaps. Yu Su, Yi Wang, Gagan Agrawal The Ohio State University

Visualization and Data Analysis

NEXT-GENERATION. EXTREME Scale. Visualization Technologies: Enabling Discoveries at ULTRA-SCALE VISUALIZATION

The Design and Implement of Ultra-scale Data Parallel. In-situ Visualization System

13.2 THE INTEGRATED DATA VIEWER A WEB-ENABLED APPLICATION FOR SCIENTIFIC ANALYSIS AND VISUALIZATION

HPC & Visualization. Visualization and High-Performance Computing

CITRIS Founding Corporate Members Meeting

An Analytical Framework for Particle and Volume Data of Large-Scale Combustion Simulations. Franz Sauer 1, Hongfeng Yu 2, Kwan-Liu Ma 1

Visualization and Astronomy

VisIVO, a VO-Enabled tool for Scientific Visualization and Data Analysis: Overview and Demo

Why are we teaching you VisIt?

Project Summary. Project Description

CONVERGE Features, Capabilities and Applications

Application Frameworks for High Performance and Grid Computing

Very High Resolution Arctic System Reanalysis for

Parallel Simplification of Large Meshes on PC Clusters

Parallel Analysis and Visualization on Cray Compute Node Linux

Parallel Large-Scale Visualization

VisIt Visualization Tool

Real-time Processing and Visualization of Massive Air-Traffic Data in Digital Landscapes

ENABLING TOOLS AND TECHNIQUES FOR THE OPTIMIZATION OF THE HERACLES SIMULATION PROGRAM

Data Visualization. Principles and Practice. Second Edition. Alexandru Telea

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms

Coupling micro-scale CFD simulations to meso-scale models

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)

Distributed Computing. Mark Govett Global Systems Division

Interactive Level-Set Segmentation on the GPU

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1)

The Scientific Data Mining Process

Computational Fluid Dynamics. Department of Aerospace Engineering, IIT Bombay

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

SGI HPC Systems Help Fuel Manufacturing Rebirth

Visualization Infrastructure and Services at the MPCDF

James Ahrens, Berk Geveci, Charles Law. Technical Report

walberla: A software framework for CFD applications on Compute Cores

In-Situ Processing and Visualization for Ultrascale Simulations

DIY Parallel Data Analysis

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

Visualisatie BMT. Introduction, visualization, visualization pipeline. Arjan Kok Huub van de Wetering

Visualizing and Analyzing Massive Astronomical Datasets with Partiview

Facts about Visualization Pipelines, applicable to VisIt and ParaView

CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES

Data Centric Systems (DCS)

Introduction to Flow Visualization

Interactive Level-Set Deformation On the GPU

How is EnSight Uniquely Suited to FLOW-3D Data?

EUFORIA: Grid and High Performance Computing at the Service of Fusion Modelling

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

THE RESEARCH OF DEM DATA COMPRESSING ARITHMETIC IN HELICOPTER NAVIGATION SYSTEM

1. Why Scientific Visualization is a Good Application of Virtual Reality

High-Performance Visualization of Geographic Data

Overview of HPC Resources at Vanderbilt

The Ultra-scale Visualization Climate Data Analysis Tools (UV-CDAT): A Vision for Large-Scale Climate Data

Data analysis and visualization topics

Multiphysics Software Applications in Reverse Engineering

Norwegian Satellite Earth Observation Database for Marine and Polar Research USE CASES

Hue Streams. Seismic Compression Technology. Years of my life were wasted waiting for data loading and copying

Introduction to Visualization with VTK and ParaView

VisIVO, an open source, interoperable visualization tool for the Virtual Observatory

IS-ENES/PrACE Meeting EC-EARTH 3. A High-resolution Configuration

Kinetic effects in the turbulent solar wind: capturing ion physics with a Vlasov code

IBM Deep Computing Visualization Offering

The Construction of Seismic and Geological Studies' Cloud Platform Using Desktop Cloud Visualization Technology

SECONDARY STORAGE TERRAIN VISUALIZATION IN A CLIENT-SERVER ENVIRONMENT: A SURVEY

Large-Data Software Defined Visualization on CPUs

Medical Image Processing on the GPU. Past, Present and Future. Anders Eklund, PhD Virginia Tech Carilion Research Institute

Big Data and Cloud Computing for GHRSST

Big Data in HPC Applications and Programming Abstractions. Saba Sehrish Oct 3, 2012

Big Data Challenges in Bioinformatics

Parallel Hierarchical Visualization of Large Time-Varying 3D Vector Fields

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief

Large Vector-Field Visualization, Theory and Practice: Large Data and Parallel Visualization Hank Childs + D. Pugmire, D. Camp, C. Garth, G.

LA-UR- Title: Author(s): Intended for: Approved for public release; distribution is unlimited.

Use of OpenFoam in a CFD analysis of a finger type slug catcher. Dynaflow Conference 2011 January , Rotterdam, the Netherlands

DELPH v3.0. seabed mapping software suite

Graphics. Computer Animation 고려대학교 컴퓨터 그래픽스 연구실. kucg.korea.ac.kr 1

Data Requirements from NERSC Requirements Reviews

Amazing renderings of 3D data... in minutes.

Parallel Visualization for GIS Applications

Part I Courses Syllabus

MEng, BSc Applied Computer Science

Schedule WRF model executions in parallel computing environments using Python

Petascale Visualization: Approaches and Initial Results

NVIDIA IndeX. Whitepaper. Document version June 2013

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

Transcription:

VAPOR: A tool for interactive visualization of massive earth-science datasets Alan Norton and John Clyne National Center for Atmospheric Research Boulder, CO USA Presentation at AGU Dec. 13, 2010 This work is funded in part through U.S. National Science Foundation grants 03-25934 and 09-06379, and through a TeraGrid GIG award

Introduction: Outline Problems with visualizing and analyzing massive data VAPOR project goals Overview of VAPOR s capabilities Support for browsing massive data Basic visualization features Unique visualization capabilities Three live demos: Geo-referencing, tracking Hurricane Katrina in WRF output Identification and analysis of P. Mininni s Current Roll in a large MHD dataset Identification and tracking of vortices in a high-resolution hurricane dataset (Y. Chen)

Typical Analysis/Vis Workflow Supercomputing Temp Disk Archive Offline processing: Analysis and Visualization Analysis Repository

Archival is not keeping up Supercomputer sustained computation rate is doubling every 12-15 months Archive storage capacity is doubling every 25-26 months Fraction of data saved for analysis halves every ~2 years.

Visualization and Analysis are limited by I/O Performance improvements for I/O 1977-2005, compared with computation rate improvements Source: DARPA HPCS I/O presentation

Problems with Petascale Analysis/Vis Workflow Insufficient capacity, speed Only infrequent archival Supercomputing Only for small samples, statistics Offline processing: Temp Disk Analysis and Visualization Archive Analysis Repository Takes days or weeks Insufficient speed For interactivity

Implications for Visualization and Analysis of Petascale computations Two serious problems: Smaller portion of data is available for analysis because of limited storage and archive capacity. Analysis and visualization of the available data becomes non-interactive due to limited I/O rates Result: Loss of scientific productivity [Numerical] models that can currently be run on typical supercomputing platforms produce data in amounts that make storage expensive, movement cumbersome, visualization difficult, and detailed analysis impossible. The result is a significantly reduced scientific return from the nation's largest computational efforts. Mark Rast University of Colorado, LASP

VAPOR project overview The VAPOR project is intended to address the problem of datasets that are becoming too big to analyze and visualize interactively VAPOR is the Visualization and Analysis Platform for Oceanic, atmospheric and solar Research Goal: Enable scientists to interactively analyze and visualize massive datasets resulting from fluid dynamics simulation Domain focus: 2D and 3D, gridded, time-varying turbulence datasets, especially earth-science simulation output. Essential features: Multi-resolution data representation for accelerated data access Exploits GPU for fast rendering Interactive user interface for scientific visual data exploration Available (free) on Mac, Windows, Linux

Wavelet transforms for 3D multiresolution data representation Some wavelet properties: Data can be accessed at desired resolution and compression level Lossless or Lossy (up to 500:1 compression) Numerically efficient (O(n)) Forward and inverse transform No additional storage cost

Wavelet compression enables 100:1 or better compression with minimal degradation original 64-fold averaged with Haar 64-fold compress biorth spline Smyth: salt sheet boundary simulation

Petascale workflow using wavelet compression Supercomputing Data would be interactively analyzed and visualized, during and after simulation. Intermediate times available in compressed form Infrequent full saves; Frequent compressed saves Monitor waveletcompressed results Interactive Analysis and Visualization Temp Disk Archive Rapid retrieval of requested data Remote visualization sessions supported Wavelet Repository

VAPOR capabilities (latest version: 2.0) All tools perform interactively, exploiting multiresolution representation Wavelet compression enables up to 500:1 reduction of I/O reads GPU-accelerated interactive graphics Python calculation of derived variables Flow integration Streamlines, particle traces Field line advection Image-based flow visualization Data probing and contour planes WRF-ARW terrain-following grids Direct import of WRF output files Geo-referenced image support Smyth, salt sheet boundary simulation Mininni, Current roll

How VAPOR differs from other visualization platforms Multi-resolution data representation with compression To enable interactive display and analysis of peta-scale datasets Python and NumPy embedded support Intended to be used by scientists, not visualization engineers Requirements defined by a steering committee of scientists Narrow focus: turbulence simulation on gridded domains Not built on existing visualization libraries (e.g. VTK) Emphasis on desktop and laptop platforms; no distributed implementation

Demonstrations Visualization of WRF-ARW output: Hurricane Katrina with geo-referenced terrain images Identification and analysis of Current Roll in a 1536 3 MHD simulation (Pablo Mininni) Interactive browsing by controlling refinement level and compression Flow seed positioning for magnetic field lines Magnetic field line advection Identification and tracking of vortices in eye-wall of highresolution hurricane simulation (Yongsheng Chen) Image-based flow visualization identifies vortices Field-line advection tracks them

Visualize and track hurricane Katrina Visualization of WRF-ARW simulation with moving nest Apply terrain image and county maps obtained from Web Mapping Services Geo-referencing provides spatial context for streamlines, volume rendering, isosurfaces.

Visualization Discovery Example Small scale structures in MHD turbulence with high Reynolds number Data from Pablo Mininni, NCAR 1536x1536x1536 volume, 16 variables (216 GB per timestep) Scientific goal: understand MHD flow dynamics at high resolution and high Reynolds no. Analysis and visualization performed with VAPOR and IDL Resulted in discovery of intertwining current sheets ( current rolls )

Multi-resolution data browsing Wavelet data representation supports control of data resolution as well as compression level Interactively visualize full data at low resolution, high compression Zoom in, increase resolution, reduce compression for detailed understanding P. Mininni, current roll

Using multiple time-steps: Track evolving structures with field line advection Animates field lines in velocity field Useful for tracking evolution of geometric structures (e.g. vorticity field lines in tornado) Based on algorithm proposed by Aake Nordlund Data provided by P. Mininni

High-res hurricane analysis demo Visualization was used to understand the nature of increased turbulence along eye-wall Unsteady flow shows overall wind dynamics VAPOR s IBFV tool can identify horizontally oriented transient vortex tubes near the ocean surface Using VAPOR s field line advection, these vortices can be tracked and animated over time Results of Yongsheng Chen

VAPOR plans VAPOR s steering committee and other users help prioritize features, with releases every 9-12 months Vapor 2.0.0 was just released Some high priority features for upcoming releases: Parallel conversion of data during simulation Animation control Extensible architecture Iso-Lines Support for ocean modeling

Summary VAPOR is designed to enable interactive visualization and analysis of massive datasets by exploiting the wavelet multi-scale representation. VAPOR supports a variety of useful interactive techniques for investigating and visualizing data, based on needs expressed by scientific users. Recent improvements to VAPOR are designed to enable interactive access to petabyte datasets and support anticipated peta-scale workflows.

Version 2.0 just released available on Website VAPOR Availability Runs on Linux, Windows, Mac System requirements: a modern (nvidia or ATI) graphics card (available for about $200) ~1GB of memory Executables, documentation available (free) at http://www.vapor.ucar.edu/ Source code, feature requests, etc. at http://sourceforge.net/projects/vapor Contact: vapor@ucar.edu

Questions?

Steering Committee Nic Brummell - CU Yuhong Fan - NCAR, HAO Aimé Fournier NCAR, IMAGe Pablo Mininni, NCAR, IMAGe Aake Nordlund, University of Copenhagen Helene Politano - Observatoire de la Cote d'azur Yannick Ponty - Observatoire de la Cote d'azur Annick Pouquet - NCAR, ESSL Mark Rast - CU Duane Rosenberg - NCAR, IMAGe Matthias Rempel - NCAR, HAO Geoff Vasil, CU Thara Phrabhakaran, IITM Gerry Creager, TAMU Leigh Orf, Central Mich. U. Acknowledgements WRF consultation Wei Wang NCAR, MMM Cindy Bruyere NCAR, MMM Yongsheng Chen-NCAR,MMM Wei Huang NCAR, CISL Developers John Clyne NCAR, CISL Alan Norton NCAR, CISL Kenny Gruchalla CU Victor Snyder CSM Rick Brownrigg NCAR, CISL Pam Gillman NCAR, CISL Research Collaborators Kwan-Liu Ma, U.C. Davis Hiroshi Akiba, U.C. Davis Han-Wei Shen, Ohio State Liya Li, Ohio State Systems Support Joey Mendoza, NCAR, CISL