Parallel Large-Scale Visualization

Similar documents
Visualization with ParaView. Greg Johnson

Cloud Computing through Virtualization and HPC technologies

Introduction to Paraview. H.D.Rajesh

Why are we teaching you VisIt?

MapReduce and Hadoop. Aaron Birkland Cornell Center for Advanced Computing. January 2012

Large-Data Software Defined Visualization on CPUs

Overview of HPC systems and software available within

Parallel Analysis and Visualization on Cray Compute Node Linux

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

Petascale Visualization: Approaches and Initial Results

Visualization with ParaView

Data Mining with Hadoop at TACC

Introduction to Scientific Data Visualization

How To Build A Supermicro Computer With A 32 Core Power Core (Powerpc) And A 32-Core (Powerpc) (Powerpowerpter) (I386) (Amd) (Microcore) (Supermicro) (

VisIt Visualization Tool

Stream Processing on GPUs Using Distributed Multimedia Middleware

Facts about Visualization Pipelines, applicable to VisIt and ParaView

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect

GPU File System Encryption Kartik Kulkarni and Eugene Linkov

High Performance Computing in CST STUDIO SUITE

Remote & Collaborative Visualization. Texas Advanced Compu1ng Center

ArcGIS Pro: Virtualizing in Citrix XenApp and XenDesktop. Emily Apsey Performance Engineer

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

PLANNING FOR DENSITY AND PERFORMANCE IN VDI WITH NVIDIA GRID JASON SOUTHERN SENIOR SOLUTIONS ARCHITECT FOR NVIDIA GRID

CORRIGENDUM TO TENDER FOR HIGH PERFORMANCE SERVER

SR-IOV In High Performance Computing

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

Windows Server Performance Monitoring

FLOW-3D Performance Benchmark and Profiling. September 2012

HPC Update: Engagement Model

Tableau Server Scalability Explained

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

Estonian Scientific Computing Infrastructure (ETAIS)

The Design and Implement of Ultra-scale Data Parallel. In-situ Visualization System

Qualified Apple Mac Workstations for Avid Media Composer v5.0.x

Purchase of High Performance Computing (HPC) Central Compute Resources by Northwestern Researchers

Introduction to Visualization with VTK and ParaView

A Study on the Scalability of Hybrid LS-DYNA on Multicore Architectures

Hadoop on the Gordon Data Intensive Cluster

Overview and Introduction to Scientific Visualization

Visualization Infrastructure and Services at the MPCDF

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU

CHAPTER FIVE RESULT ANALYSIS

In-situ Visualization

HPC Wales Skills Academy Course Catalogue 2015

New Storage System Solutions

RDMA over Ethernet - A Preliminary Study

2009 Oracle Corporation 1

Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca

White Paper. Recording Server Virtualization

An Introduction to Parallel Computing/ Programming

Parallel Computing with Mathematica UVACSE Short Course

Cornell University Center for Advanced Computing

Bringing Big Data Modelling into the Hands of Domain Experts

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Business white paper. HP Process Automation. Version 7.0. Server performance

A Hybrid Visualization System for Molecular Models

Virtualization of ArcGIS Pro. An Esri White Paper December 2015

Parallel Programming Survey

Clusters: Mainstream Technology for CAE

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Introduction to the NI Real-Time Hypervisor

Tableau Server 7.0 scalability

Configuration Maximums

Introduction. Need for ever-increasing storage scalability. Arista and Panasas provide a unique Cloud Storage solution

Using WestGrid. Patrick Mann, Manager, Technical Operations Jan.15, 2014

Parallel Firewalls on General-Purpose Graphics Processing Units

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise

Several tips on how to choose a suitable computer

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

Deploying and managing a Visualization Onera

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

MICROSOFT HYPER-V SCALABILITY WITH EMC SYMMETRIX VMAX

An Alternative Storage Solution for MapReduce. Eric Lomascolo Director, Solutions Marketing

Brainlab Node TM Technical Specifications

ParFUM: A Parallel Framework for Unstructured Meshes. Aaron Becker, Isaac Dooley, Terry Wilmarth, Sayantan Chakravorty Charm++ Workshop 2008

Building Clusters for Gromacs and other HPC applications

INSTALLATION GUIDE ENTERPRISE DYNAMICS 9.0

1 DCSC/AU: HUGE. DeIC Sekretariat /RB. Bilag 1. DeIC (DCSC) Scientific Computing Installations

ARCHITECTING COST-EFFECTIVE, SCALABLE ORACLE DATA WAREHOUSES

Configuration Maximums VMware vsphere 4.0

Simulation Platform Overview

Data Movement and Storage. Drew Dolgert and previous contributors

System Requirements Table of contents

In-situ Visualization: State-of-the-art and Some Use Cases

Understanding the Benefits of IBM SPSS Statistics Server

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

James Ahrens, Berk Geveci, Charles Law. Technical Report

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

Trends in High-Performance Computing for Power Grid Applications

Transcription:

Parallel Large-Scale Visualization Aaron Birkland Cornell Center for Advanced Computing Data Analysis on Ranger January 2012

Parallel Visualization Why? Performance Processing may be too slow on one CPU Interactive visualization requires real-time frame rates Use lots of CPUs Shared-memory/multicore or distributed Data may be too big for available node Virtual memory works, but paging is slow Use lots of nodes to increase physical memory size Big shared-memory/multicore scaling is costly ($/CPU) Increase interactivity or feasibility 1/20/2012 2

Memory Utilization Some visualization techniques cause memory use to skyrocket! 1/20/2012 3

Memory Utilization: Regular Grids Specified by: (x,y,z) origin (nx, ny, nz) counts Data array Requires very little memory 1/20/2012 4

Memory Utilization: Regular Grids Chop off corner -> need an unstructured grid to represent data points Specified by Explicit list of vertices Explicit list of triangles Memory use can go up many times 1/20/2012 5

Memory Utilization: examples Mummy.vtk: Structured Grid (128x128x128) 2MB raw data Contour: 7MB Polygonal Mesh Slice of Contour:.1MB Tetrahedralize: 520MB!! Unstructured Grid Data points -> Tetrahedrons 1/20/2012 6

Visualization scales with HPC Large data produced by large simulations require large visualization machines and produce large visualization results Data and all derivations in memory, cumulative! High Res HPC data Visualized data 1x 10x-100x + Low Res 1/20/2012 7

TACC Parallel Visualization Systems Spur, Longhorn 1/20/2012 8

TACC Parallel Visualization Systems Spur 8 nodes, 1 TB total aggregate memory 16 cores per node (128 total) 4 GPUs per node (32 total) DDR (dual data rate) Infiniband interconnect Longhorn 256 nodes, 14.5 TB total aggregate memory! 8 cores per node (2048 total) 2 GPUs per node (512 total) QDR (quad data rate) Infiniband interconnect 1/20/2012 9

Longhorn Configuration 256 Dell Quad Core Intel Nehalem Nodes 240 Nodes Dual socket, quad core per socket: 8 cores/node 48 GB shared memory/node (6 GB/core) 73 GB Local Disk 2 Nvidia GPUs/node (FX 5800-4GB RAM) 16 Nodes Dual socket, quad core per socket: 8 cores/node 144 GB shared memory/node (18 GB/core) 73 GB Local Disk 2 Nvidia GPUs/node (FX 5800 4GB RAM) Direct Connection to Ranger s Lustre Parallel File System 10G Connection to 210 TB Local Lustre Parallel File System 1/20/2012 10

Processes Parallel Visualization: Task Parallelism Divide the overall workflow into tasks that can happen independently and, hence, concurrently Usually does not scale well in practice Timesteps 1 2 3 4 5 1 Read file 1 Isosurface 1 Cut Plane 1 2 Read file 2 Streamlines 2 Render 3 Read file 3 Triangulate 3 Decimate 3 Glyph 3 1/20/2012 11

Processes Parallel Visualization: Pipeline Parallelism Useful when processes have different/specialized resources Bottlenecks if one stage is particularly slow Timesteps 1 2 3 4 5 1 Read file 1 Read file 2 Read File 3 2 Isosurface 1 Isosurface 2 Isosurface 3 3 Render 1 Render 2 Render 3 1/20/2012 12

Processes Parallel Algorithms: Data Parallelism Data parallelism Data set is partitioned among the processes and all processes execute same operations on the data. Scales well as long as the data and operations can be decomposed. Timesteps 1 2 3 1 Read partition 1 Isosurface partition 1 Render partition 1 2 Read partition 2 Isosurface partition 2 Render partition 2 3 Read partition 3 Isosurface partition 2 Render partition 3 1/20/2012 13

Parallel Algorithms: What doesn t work Streamlines! Not data-parallel Partial streamlines must be passed from processor to processor as the streamline moves from partition to partition No more parallelism available than the number of streamlines! If >1 streamlines pass through the same partition, you may not even get that 1/20/2012 14

Parallel Data Management Data must be distributed across parallel processes to take advantage of resources Explicit Parallel formats use separate files for partitions Implicit parallel formats have a structure where data partitions can be deduced from file structure.vtk legacy, silo, raw Non-parallel formats need to be read serially and distributed in order to be used in parallel Overhead! Vtk xml formats (.vtu,.vti, etc) 1/20/2012 15

Parallel Data Management Read the manual! Vis software has varying support for file formats True parallel I/O may not be implemented for some formats Vis software will try to hide it s failings Example: ParaView (from FAQ) Currently there are only a few readers that truly work in parallel: VTK files (not legacy), partitioned legacy VTK files, ParaView data files, HDF5 files, EnSight master server files, and raw (binary) files can be read in parallel. For demonstration purposes, ParaView will distribute pieces of a data set when the reader cannot. Unfortunatley, this is an inefficient process. 1/20/2012 16

Rendering Many graphics primitives spread out over nodes Rendering solutions 1. Gather triangles onto one node, render there Best when there s not a lot of data to render 2. Render triangles in place, gather and Z-composite the results Best when there is a lot of data to render Overhead is almost independent of data size VisIt and ParaView both do it both ways User controls threshold, but both apps aim for reasonable defaults 1/20/2012 17