How To Visualize At The Dlr



Similar documents
Scalable Distributed Schur Complement Solvers for Internal and External Flow Computations on Many-Core Architectures

Automation of Aircraft Pre-Design with Chameleon

Parallel Large-Scale Visualization

Petascale Visualization: Approaches and Initial Results

Advances and Work in Progress in Aerospace Predesign Data Exchange, Validation and Software Integration at the German Aerospace Center

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

OpenMP Programming on ScaleMP

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Parallel Analysis and Visualization on Cray Compute Node Linux

Faculty of Computer Science Computer Graphics Group. Final Diploma Examination

Volume visualization I Elvins

Interactive Data Visualization with Focus on Climate Research

HPC technology and future architecture

Parallel Simplification of Large Meshes on PC Clusters

High Performance Computing in CST STUDIO SUITE

RepoGuard Validation Framework for Version Control Systems

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS

Andreas Schreiber, Michael Meinel, Tobias Schlauch

Performance Characteristics of Large SMP Machines

Facts about Visualization Pipelines, applicable to VisIt and ParaView

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect

Wind-Tunnel Simulation using TAU on a PC-Cluster: Resources and Performance Stefan Melber-Wilkending / DLR Braunschweig

Clusters: Mainstream Technology for CAE

Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015

Kriterien für ein PetaFlop System

YALES2 porting on the Xeon- Phi Early results

The DLR Satellite Station in Inuvik a Milestone for the Canadian-German Cooperation in Earth Observation

GPU Architecture. Michael Doggett ATI

CHAPTER FIVE RESULT ANALYSIS

The Scientific Data Mining Process

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

VisIt Visualization Tool

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Interactive Motion Simulators

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

How To Make A Video From A Computer To A Phone Or Tablet Or Ipad Or Ipa Or Ipam Or Ipan Or Ipar Or Ipra Or Ipro Or Ipor Or Ipo Or Ipom Or Ipr Or Ipon

Radar Image Processing with Clusters of Computers

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar

Recent Advances and Future Trends in Graphics Hardware. Michael Doggett Architect November 23, 2005

Very special thanks to Wolfgang Gentzsch and Burak Yenier for making the UberCloud HPC Experiment possible.

Parallel Programming Survey

Building Clusters for Gromacs and other HPC applications

HPC & Visualization. Visualization and High-Performance Computing

Collaborative modelling and concurrent scientific data analysis:

Aeronautical Testing Service, Inc th DR NE Arlington, WA USA. CFD and Wind Tunnel Testing: Complimentary Methods for Aircraft Design

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

IBM Deep Computing Visualization Offering

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Recommended hardware system configurations for ANSYS users

Computational Modeling of Wind Turbines in OpenFOAM

Introduction to DISC and Hadoop

Cray: Enabling Real-Time Discovery in Big Data

Using the Windows Cluster

SECONDARY STORAGE TERRAIN VISUALIZATION IN A CLIENT-SERVER ENVIRONMENT: A SURVEY

Chapter 7. Using Hadoop Cluster and MapReduce

The Design and Implement of Ultra-scale Data Parallel. In-situ Visualization System

Hybrid Software Architectures for Big

Computational Fluid Dynamics - CFD Francisco Palacios & Markus Widhalm fpalacios@gmail.com, markus.widhalm@dlr.de Basque Center for Applied

Computational Fluid Dynamics. Department of Aerospace Engineering, IIT Bombay

Visualization with ParaView

OMIQ srl. Advanced Post-Processing of Flow3D Calculations using Ensight. Federico Monterosso. Ingegneria di componenti e sistemi industriali

Customer Training Material. Lecture 2. Introduction to. Methodology ANSYS FLUENT. ANSYS, Inc. Proprietary 2010 ANSYS, Inc. All rights reserved.

Numerical simulation of maneuvering combat aircraft

Visualization with ParaView. Greg Johnson

Deploying and managing a Visualization Onera

VisIVO, a VO-Enabled tool for Scientific Visualization and Data Analysis: Overview and Demo

O.F.Wind Wind Site Assessment Simulation in complex terrain based on OpenFOAM. Darmstadt,

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

How To Compare Amazon Ec2 To A Supercomputer For Scientific Applications

Bringing Big Data Modelling into the Hands of Domain Experts

Andreas Schreiber

Turbomachinery CFD on many-core platforms experiences and strategies

TIME-ACCURATE SIMULATION OF THE FLOW AROUND THE COMPLETE BO105 WIND TUNNEL MODEL

Computational Fluid Dynamics

Study of Large-Scale Data Visualization

Open Source CFD Solver - OpenFOAM

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

On Demand Satellite Image Processing

CCTech TM. ICEM-CFD & FLUENT Software Training. Course Brochure. Simulation is The Future

Purchase of High Performance Computing (HPC) Central Compute Resources by Northwestern Researchers

Visualization of Adaptive Mesh Refinement Data with VisIt

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)

HPC Cloud. Focus on your research. Floris Sluiter Project leader SARA

Data Centric Systems (DCS)

Parallel Visualization for GIS Applications

Data Visualization Study at Earth Simulator Center

Transcription:

Interactive Visualization of Large Simulation Datasets Andreas Gerndt, Rolf Hempel, Robin Wolff DLR Simulation and Software Technology EuroMPI 2010 Conference, Stuttgart, Sept. 13-15, 2010 Folie 1

Introduction Folie 2

DLR German Aerospace Center Research on Aeronautics Space Transport Energy Space Agency Project Management Agency Folie 3

Locations and employees 6500 employees across 29 research institutes and facilities at 13 sites. Offices in Brussels, Paris and Washington. Hamburg Bremen Trauen Berlin Braunschweig Dortmund Goettingen Koeln Bonn Neustrelitz Lampoldshausen Stuttgart Oberpfaffenhofen Weilheim Folie 4

Visualization at DLR Applications in Experiment and Simulation Earth observation National data repository Technology development Left: SAR image of Cotopaxi volcano Atmospheric and climate research Planetary science Several space missions, e.g. Mars Express High-resolution Stereo Camera Multi-spectral data 10 100 meters / pixel Folie 5

Visualization at DLR Applications in Experiment and Simulation Simulators for cars, aircraft, helicopters, Left: Dynamic car simulator Research into driver assistance systems Visualization of external view Robotics / Mechatronics Spacecraft design On-orbit satellite servicing Astronaut training Folie 6

Visualization at DLR Applications in Experiment and Simulation Aerodynamics CFD and experiment Turbines, airplanes, helicopters, spacecraft Development of CFD codes (TAU, TRACE) Experiments used to validate simulations Prominent DLR institutes specialized in CFD Inst. for aerodynamics and flow technology Institute for propulsion technology Folie 7

Visualization at DLR DLR Simulation and Software Technology Research on software / computer science topics Co-operations with engineering institutes in advanced software projects Main topics include HPC and Visualization Visualization projects with DLR institutes on CFD around airplanes / in turbomachines planetary surface data on-orbit servicing of satellites Folie 8

Objectives of Virtual Reality (VR) Research Important aspects Immersion Interactivity 3D Visualization Multi-modality Interactive CFD data exploration in a virtual environment Folie 9

Overview Problem Description Pipeline Distribution / Parallelization Framework Large-scale Dataset I/O Multi-Resolution and Data Streaming Co-Execution of Simulation and Post-Processing Folie 10

Problem Description Folie 11

Problem Description What is a large simulation dataset? CFD calculations at the Institute for aerodynamics and flow technology with the TAU code Solution of the Raynolds-averaged Navier-Stokes equations HPC system: C 2 A 2 S 2 E (see next slide) Simulation of the air flow around a complete transport airplane configuration 300-400 / 2000 mio. grid points (without / with acoustics) Test case: Slat track noise (LES) 80 mio. pnts., 6.7 GB, 10,000 time steps 19 minutes / time step on 2048 cores Folie 12

Problem Description The C 2 A 2 S 2 E Supercomputer C 2 A 2 S 2 E Cluster (SUN) 16 compute racks, 48 blades each 2 AMD Opteron quad core processors, 1.9 GHz (Barcelona) 768 nodes, 6144 cores 46.6 TFlop/s Peak performance TAU 1 core: 1 GFlop/s all cores: 3 TFlop/s 12288 GB Main memory 16 GB per node (758 nodes) 32 GB per node ( 10 nodes) SUN Mega Switch Parallel file system (GPFS) High speed link to Airbus Bremen (100 Mbit/s) Dedicated to aircraft research 2010 performance update by factor 3 Folie 13

Problem Description First Experiences with Virtual Reality 2007: VR system set up at Institute for aerodynamics and flow technology Powerwall 2.7m x 2.0m 2 projectors IR tracking system 2 dual-core Opteron processors Visualization software: Ensight Gold 8.2 Problems System cannot handle full-resolution datasets Visualization is too slow Missing interaction metaphors System mostly used for Colorful pictures for visitors Folie 14

Problem Description CFD Simulation of a Turbo-Engine Compressor Section Four-stage compressor section Simulation of the transition point between laminar and turbulent flow instationary problem Computational domain split into blocks (colored) 123 Million mesh points National Supercomputer HLRB-II (SGI Altix 4700) at the Leibnitz- Rechenzentrum in Munich In total 15 Terabytes generated Compute time: 210,000 CPUh Example is two years old Today: Intention to run a problem three times as large Folie 15

Pipeline Distribution Folie 16

Pipeline Distribution Visualization Pipeline Visual analysis of simulation data is organized as a post-processing pipeline Pipeline input: large-scale grid-based CFD dataset Filtering stage processes the data Produces more manageable data sizes (cutting, clipping, converting, merging, re-sampling, ) Extracts features of interest (isosurfaces, particle tracing, vortices, topology information, ) May contain multiple processing steps and can be organized as a data flow network (multiple inputs and outputs) Mapping Creates visualization objects Rendering Dataset Filtering Extracted Data Mapping Visual Primitive Rendering Image Data Folie 17

Pipeline Distribution Data Parallelization Several parallelization approaches possible By nature of the post-processing pipeline, heavy work is usually located at the beginning of the pipeline Data parallelization is the most efficient approach in most cases Communication costs are reduced to a minimum Folie 18

Pipeline Distribution Data Parallelization Determine heavy work and parallelize these pipeline sections Split data and allocate sections to parallel processes Combine partial results and execute remaining steps Folie 19

Pipeline Distribution Distributed Post-Processing Framework Distributed post-processing framework Viracocha Uses data parallelization approach Has been developed in co-operation with the VR Group at Aachen University (RWTH) Message-passing best choice for parallel post-processing TCP/IP for communication in heterogeneous hardware systems ViSTA FlowLib Viracocha VTK VTK Extraction Manager Scheduler Worker Worker TCP/IP Message Passing Folie 20

Pipeline Distribution Distributed Post-Processing Framework Generic Viracocha layer architecture Application-independent Easy integration of new functions Parallel concept (scheduler, workers) stays untouched Visualization Toolkit (VTK) is used for most of the basic postprocessing algorithms Algorithms from own research added to Toolkits ViSTA FlowLib Viracocha VTK VTK Extraction Manager Scheduler Worker Worker TCP/IP Message Passing Folie 21

Pipeline Distribution Distributed Post-Processing Framework ViSTA FlowLib is used as frontend Optimized for visualization of unsteady simulation data Interactive visualization using Virtual Reality technologies Real-time interaction by decoupling heavy data handling and post-processing work More features can be requested from backend without disrupting interactive exploration ViSTA FlowLib Viracocha VTK VTK Extraction Manager Scheduler Worker Worker TCP/IP Message Passing Folie 22

Large-scale Dataset I/O Folie 23

Large-scale Dataset I/O Viracocha Data Management System Concurrent data management handles I/O load Caching (primary / secondary) Prefetching Optimized loading strategies Data services Ask scheduler for next data Load balancing Data Data Data ViSTA FlowLib Server Proxy Viracocha Proxy Extraction Manager Scheduler Worker Worker TCP/IP Message Passing Folie 24

Large-scale Dataset I/O Domain Decomposition Multi-block datasets Can easily be distributed among several processes One block per time level = one file on disk Sufficient for cell-based (local) algorithms Unstructured datasets Make use of TAU s partitioner Online Monitoring: no I/O required Domain decomposition for a simulation using TAU Blocks of a Propfan dataset, block-wise colored Folie 25

Large-scale Dataset I/O Loading on Demand Global post-processing algorithms E.g. particle integration Only block containing current particle position needs to be loaded into main memory Metadata about block connectivity helps to identify next block when particle leaves current block Load on demand considerably reduces I/O load Interactive particle seeding in an immersive environment Folie 26

Multi-Resolution and Data Streaming Folie 27

Multi-Resolution and Data Streaming Data Streaming Decouple frontend and backend Improvement of interactivity within virtual environments However: Extraction still needs time Solution: Data streaming Motivation: Progressive JPG >>> Immediate feedback Fast impression of the final result Exploit first approximate results Quick start of data exploration Possible to abort running comp. and restart it with new parameters Problem Additional communication and more computation time Appropriate CFD algorithms Progressive approaches hardly available Source: http://wp.netscape.com/eng/mozilla/2.0/relnotes/demo/pjpegdemo.html Folie 28

Multi-Resolution and Data Streaming Data Streaming Multiple streaming steps of Lambda-2 vortex extraction computed by 4 workers and then streamed independently to the visualization frontend Folie 29

Multi-Resolution and Data Streaming Progressive Streaming Analytical cut function Intersections on cell edges have to be found Geometry of cut mesh unknown in advance Fast Sampling Geometry is already known everywhere in advance Cut surface can be visualized immediately Interactivity! Only scalars sampled at vertices are transmitted Computed in parallel on backend Triangulation parameters of a cut cylinder (right), resulting cut through the propfan (top) Folie 30

Multi-Resolution and Data Streaming Progressive Streaming Progressive approach Realized as surface sub-division Dyadic split: classical approach for triangle meshes Sqrt-3 refinement Fewer points to add Slower refinement Smoother streaming Sqrt-3 refinement (Level 1) 3 3 Sqrt-3 refinement steps with special boundary treatment Sqrt-3 refinement (Level 2), initial grid (red) Folie 31

T11 T16 Multi-Resolution and Data Streaming Multi-Resolution Geometry Adaptive rendering Cache multi-level streaming data for Level-of-Detail switch Increase frame rate by using different LODs selected by priorities E.g. context geometry vs. extraction data Handled by multi-resolution manager Selective refinement Get viewpoint of the user Close parts of objects are rendered with higher resolution Requires adaptive meshes Main advantage of Sqrt-3 data structure Approach can also be applied for polylines (e.g. particle trajectories) T4 T1 T2 T3 T4 T2 T1 T3 T1 T2 T3 T4 T4 T12 T1 T5 T6 T2 T7 T8 T3 T9 T10 T4 T11 T12 T14 T8 T6 T13 T15 T3 T1 T10 T6 T13 T14 T11 T15 T16 T2 T7 T5 T9 Sqrt-3 selective refinement Folie 32

Multi-Resolution and Data Streaming Multi-Resolution Geometry View-dependent optimized view of propfan blades, original with 340,000 triangles (left), view-dependent optimized with 8,532 triangles (mid), and wireframe presentation of the entire view-dependent optimized model (right) Folie 33

Co-Execution of Simulation and Post-Processing Folie 34

Co-Execution of Simulation and Post-Processing Online Monitoring and Computational Steering Goal: Interact with the simulation during the run Approach: Interactive post-processing Integrate steering module Attach streaming visualization pipeline traditional approach Challenges: Steer simulation and maintain data coherence Integration of interactive postprocessing monitoring & steering approach Simulation Simulation Visualization post-processing Storage post-processing Streaming Monitoring Steering Visualization Visualization Folie 35

Co-Execution of Simulation and Post-Processing Online Monitoring and Computational Steering First results of prototype Mesh adaption while simulation runs Processing done in parallel (shown by color) original mesh refined mesh Folie 36

Co-Execution of Simulation and Post-Processing Distributed Post-Processing Infrastructure Goals: Exploit parallelism using various techniques and resources Reduce amount of data communicated across pipeline Rendering in parallel on GPGPU cluster (integrated in HPC environment) Light-weight frontend sufficient Backend Frontend Simulation Dataset Filtering Extracted Data Mapping Visual Primitive Rendering Image Data Storage HPC- Resources Render Cluster Displays Data-Parallelism Task A Task D Task G Task B Task E Task H Task C Task F Task I Task-Parallelism Parallel Rendering Folie 37

Co-Execution of Simulation and Post-Processing Distributed Post-Processing Infrastructure Planned Infrastructure at DLR: Infiniband > 10Gbit/s Intranet Ethernet 100/1000Mbit 155Mbit DFN-Network Ethernet Router Ethernet Switch SfR Ethernet Switch GPU Compute/Render Nodes DFN- Network Login Node Remote Locations Teamserver SC-BS Infiniband Switch SC C 2 A 2 S 2 E Cluster Workstations SC-BS Infiniband Switch AS Anton Cluster Optional Infiniband Connection 200TB GPFS AS 500TB GPFS CASE 100TB GPFS Anton VR-Lab SC-BS +50TB SC Folie 38

Summary Growing importance of visualization in large-scale numerical simulation Today VR techniques mainly used for pretty pictures Fast processing of large datasets required for interactive visualization HPC (parallel execution) for first stages in visualization pipeline Close integration of application and postprocessing requires dedicated hardware / software infrastructure Pure batch execution model for HPC no longer feasible Folie 39