Overview and Introduction to Scientific Visualization



Similar documents
Introduction to Scientific Data Visualization

Parallel Large-Scale Visualization

Remote & Collaborative Visualization. Texas Advanced Compu1ng Center

Data Mining with Hadoop at TACC

Visualization with ParaView. Greg Johnson

Large-Data Software Defined Visualization on CPUs

Visualisatie BMT. Introduction, visualization, visualization pipeline. Arjan Kok Huub van de Wetering

Why are we teaching you VisIt?

Visualization with ParaView

Interactive Data Visualization with Focus on Climate Research

Parallel Analysis and Visualization on Cray Compute Node Linux

Visualization of Adaptive Mesh Refinement Data with VisIt

VisIt Visualization Tool

Interactive simulation of an ash cloud of the volcano Grímsvötn

High Performance Computing in CST STUDIO SUITE

The Visualization Pipeline

Introduction to Visualization with VTK and ParaView

How is EnSight Uniquely Suited to FLOW-3D Data?

Data Visualization. Principles and Practice. Second Edition. Alexandru Telea

NVIDIA IndeX. Whitepaper. Document version June 2013

Facts about Visualization Pipelines, applicable to VisIt and ParaView

GPU Renderfarm with Integrated Asset Management & Production System (AMPS)

Simulation Platform Overview

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect

Scientific Visualization with ParaView

VisIt: A Tool for Visualizing and Analyzing Very Large Data. Hank Childs, Lawrence Berkeley National Laboratory December 13, 2010

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Computer Applications in Textile Engineering. Computer Applications in Textile Engineering

CHAPTER FIVE RESULT ANALYSIS

Performance Characteristics of Large SMP Machines

COSC 6344 Visualization

Data Centric Interactive Visualization of Very Large Data

Data Visualization for Atomistic/Molecular Simulations. Douglas E. Spearot University of Arkansas

Hue Streams. Seismic Compression Technology. Years of my life were wasted waiting for data loading and copying

Jean-Pierre Panziera Teratec 2011

MEng, BSc Applied Computer Science

An Integrated Simulation and Visualization Framework for Tracking Cyclone Aila

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age

HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK

How To Visualize At The Dlr

GPU Computing with CUDA Lecture 2 - CUDA Memories. Christopher Cooper Boston University August, 2011 UTFSM, Valparaíso, Chile

Volume visualization I Elvins

Visualization Plugin for ParaView

Overview Motivation and applications Challenges. Dynamic Volume Computation and Visualization on the GPU. GPU feature requests Conclusions

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1)

The Ultra-scale Visualization Climate Data Analysis Tools (UV-CDAT): A Vision for Large-Scale Climate Data

Comp 410/510. Computer Graphics Spring Introduction to Graphics Systems

MEng, BSc Computer Science with Artificial Intelligence

VisCMD: Visualizing Cloud Modeling Data

Interactive Level-Set Deformation On the GPU

A Short Introduction to Computer Graphics

Visualization Process. Alark Joshi

Parallel Computing with MATLAB

Petascale Visualization: Approaches and Initial Results

Curves and Surfaces. Goals. How do we draw surfaces? How do we specify a surface? How do we approximate a surface?

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

Writing Applications for the GPU Using the RapidMind Development Platform

Data Centric Systems (DCS)

Estonian Scientific Computing Infrastructure (ETAIS)

Visualization Infrastructure and Services at the MPCDF

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms

Interactive Level-Set Segmentation on the GPU

The Scientific Data Mining Process

Interactive Visualization of Magnetic Fields

Lecture Notes, CEng 477

Study of Large-Scale Data Visualization

How To Use Hadoop For Gis

Technical Report. An Interactive Iso-Surface Based 3D Weather Radar Data Visualization Package Using VisAD for WDSSII

Dell High-Performance Computing Clusters and Reservoir Simulation Research at UT Austin.

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

AN EFFECT OF GRID QUALITY ON THE RESULTS OF NUMERICAL SIMULATIONS OF THE FLUID FLOW FIELD IN AN AGITATED VESSEL

1. INTRODUCTION Graphics 2

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

Introduction to Paraview. H.D.Rajesh

Efficient Storage, Compression and Transmission

Outline. Fundamentals. Rendering (of 3D data) Data mappings. Evaluation Interaction

Calculation of Eigenfields for the European XFEL Cavities

High-Performance Computing and Big Data Challenge

Visualization methods for patent data

Using Photorealistic RenderMan for High-Quality Direct Volume Rendering

FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG

HPC technology and future architecture

Introduction to Computer Graphics

Improving Data Mining of Multi-dimension Objects Using a Hybrid Database and Visualization System

IBM Deep Computing Visualization Offering

Stream Processing on GPUs Using Distributed Multimedia Middleware

Visualization. For Novices. ( Ted Hall ) University of Michigan 3D Lab Digital Media Commons, Library

The STC for Event Analysis: Scalability Issues

Visualizing Uncertainty: Computer Science Perspective

Advanced discretisation techniques (a collection of first and second order schemes); Innovative algorithms and robust solvers for fast convergence.

Course Overview. CSCI 480 Computer Graphics Lecture 1. Administrative Issues Modeling Animation Rendering OpenGL Programming [Angel Ch.

Transcription:

Overview and Introduction to Scientific Visualization Texas Advanced Computing Center The University of Texas at Austin http://portal.longhorn.tacc.utexas.edu/training

Before We Begin Please: Open a browser Go to //www.dropbox.com/sh/ g1yteg6fp878hqm/bz3nxq8km_ Download disk_out_ref.ex2 Download and unzip smallwrf.zip

Scientific Visualization The purpose of computing is insight not numbers. -- R. W. Hamming (1961)

Visualization Process Summary A picture is worth not just 1000 words, but potentially tera- or peta-bytes of data Larger datasets demand not just visualization, but advanced visualization resources and techniques Visualization system technology leverages lots of advancing technologies: GPUs, high-speed networks, web technology. Visualization software takes time to adapt

Today Introduction to Visualization Information Visualization Scientific Data Visualization With Paraview Parallel Visualization For Very Large Data

Visualization of Large Scale Turbulent Flow Kelly Gaither, Hank Childs, Greg Johnson, Karl Schulz, Cyrus Harrison, Diego Donzis, Texas A&M; P.K. Yeung, Georgia Tech Remote interactive visualization of 17 time-steps (34 TB) of the largest turbulent flow simulation computed to date (4096 3 ). First time this had been visualized interactively at this scale. Equal parts data mining and remote interactive visualization goal was to characterize flow behavior over time. Gaither, K., Childs, H., Schulz, K., Harrison, C., Barth, W., Donzis, D., and Yeung, P.K., Using Visualization and Data Analysis to Understand Critical Structures in Massive Time Varying Turbulent Flow Simulations, IEEE Computer Graphics and Applications, 32(4), Jul/Aug 2012.

Stellar Magnetism Greg Foss, TACC; Ben Brown, University of Wisconsin, Madison A Sun-like star undergoes magnetic cyclic reversal shown by field lines. Shifts in positive and negative polarity demonstrate largescale polarity changes in the star. Wreath-like areas in the magnetic field may be the source of Sun spots. Terabytes of data to mine through and visualize.

Coronary Artery Nano-particle Drug Delivery Visualization Ben Urick, Jo Wozniak, Karla Vega, TACC; Erik Zumalt, FIC; Shaolie Hossain, Tom Hughes, ICES. A computational tool-set was developed to support the design and analysis of a catheter-based local drug delivery system that uses nanoparticles as drug carriers to treat vulnerable plaques and diffuse atherosclerosis. The tool is now poised to be used in medical device industry to address important design questions such as, "given a particular desired drug-tissue concentration in a specific patient, what would be the optimum location, particle release mechanism, drug release rate, drug properties, and so forth, for maximum efficacy? The goal of this project is to create a visualization that explains the process of simulating local nanoparticulate drug delivery systems. The visualization makes use of 3DS Max, Maya, EnSight and ParaView.

Volume Visualization of Tera-Scale Global Seismic Wave Propagation Carsten Burstedde, Omar Ghattas, James Martin, Georg Stadler and Lucas Wilcox, ICES; Greg Abram, TACC Modeling propagation of seismic waves through the earth helps assess seismic hazard at regional scales and aids in interpretation of earth's interior structure at global scales. Discontinuous Galerkin method used to for numerical solution of the seismic wave propagation partial differential equations. Visualization corresponds to a simulation of global wave propagation from a simplified model of the 2011 Tohoku earthquake with a central source frequency of 1/85 Hz, using 93 million unknowns on TACC s Lonestar system.

H1N1 Flu Outbreak Simulation: Integrating Science and Art Greg Johnson, Brandt Westing, Karla Vega, Kelly Gaither, TACC; Ned Dimitrov, Lauren Meyers, UT Comp. Bio; Francesca Samsel, Austin, TX. Visualization of a swine flu epidemic spreading throughout North America. Epidemic begins in Mexico City. Visualization classifies individuals into three groups: susceptible (blue), infected (red), and recovered (green). Available antivirals are shown in purple. Cities and transportation links are highlighted in red to indicate large numbers of infected individuals and infectious travelers.

Texas Pandemic Flu Toolkit Greg Johnson, Adam Kubach, TACC; Lauren Meyers & group, UT Biology; David Morton & group, UT ORIE.

Visualizing Distributed Learning Weijia Xu, Karla Vega, Makoto Sadahiro Researchers are conducting studies to compare the effectiveness of virtual and real manipulatives in the support and success of learning This research explores the use of information and scientific visualization tools for data mining and feature detection

Visualization Allows Us to See the Science Raw Data Visualization Application

Getting from Data to Insight Data Source Data Representation Visualization Algorithms Rendering Display Computational Refinement Changing Changing Steering Techniques View And using insight to get more insight

I Think Of Two Kinds Of Data For Visualization Data for Scientific Visualization F(spatial dimensions[, time]) -> attributes E.g. Weather data: F(latitude, longitude, altitude) -> temperature, wind velocity, direction humidity Data for Information Visualization List of facts, which have multiple attributes E.g. A list of movies: Title, year, director, length, gate, male/female leads..

Scientific Data altitude longitude

Info Data Gene Expression Data

Any Number of Packages Do Viz R (GenomeGraphs) Python (matplotlib)

Probably The Most Common Viz Tool

Geosciences

SciVis Data Data mapped onto a computational domain Heat distribution in a block of material F(x,y,z) -> temperature for (x,y,z) a point in the block of material Multiple variables (or properties) Weather F(x,y,z) -> pressure, temperature, wind-velocity for (x,y,z) a point in the atmosphere

SciVis Data Dimensionality Domain is generally 1, 2, 3 or more dimensions Directly interpreted geometrically (see heat distribution) Indirectly interpreted geometrically F(lat, lon) -> temperature X = (earth radius)*cos(lat)*cos(lon) Y = (earth radius)*cos(lat)*sin(lon) Z = (earth radius)*sin(lat)

SciVis Data and Time Time may vary also Heat transfer in a block of material F(x,y,z,t) -> temperature for (x,y,z) a point in the block of material and t a point in time

Higher Dimensional SciVis Content(l state(t) = = 00,(l, l 01 l, 1, l 12 l, 2, l 23 l, 3, l 34 l, 4, l 45 l, 5, l 56 l, 6 )) l 6 = ) - A 7D l 0 space + l 1 + lcurve 2 + l 3 + representing l 4 + l 5 + l 6 - the A 7D amount function of representing oil stored at a the given total point amount in time of oil stored at the point (l 0, l 1, l 2, l 3, l 4, l 5, l 6 ) - A contour surface at C represents all the ways C barrels of oil can be contained in 7 tanks An Oil Tank Farm

SciVis Data: Discrete vs. Continuous Data Discrete data is known at a finite set of points in the domain F(x,y,z) -> temperature for (x,y,z) from a finite set of points in the domain, unknown otherwise Continuous data is known throughout the domain F(x,y,z) -> temperature for all points (x,y,z) in domain

The Grid Points in the domain can be regular - specified by origin, delta vectors and counts, or explicitly listed For interpolated grids: Topology: how the points connected (implicit or explicitly listed) Interpolation Model: How data values at an arbitrary point are derived from nearby points

Types of data at a point/cell Scalar Vector Tensor/matrix Labels, identifiers Other tuples

Example

Example Points

Example Connectivity

Example Data Visualized

Example Varying in Time

Types of Input Data Point scattered values with no defined structure

Types of Input Data Grid regular structure, all voxels (cells) are the same size and shape

Types of Input Data Curvilinear regularly grided mesh shaping function applied

Types of Input Data Unstructured grid irregular mesh typically composed of tetrahedra, prisms, pyramids, or hexahedra.

Types of Input Data Non-mesh connected point data (molecular)

Visualization Operations Surface Shading (Pseudocolor) Isosufacing (Contours) Volume Rendering Clipping Planes Streamlines

Surface Shading (Pseudocolor) Given a scalar value at a point on the surface and a color map, find the corresponding color (and opacity) and apply it to the surface point. Most common operation, often combined with other ops

Isosurfaces (Contours) Surface that represents points of constant value with a volume Plot the surface for a given scalar value. Good for showing known values of interest Good for sampling through a data range

Clipping / Slicing Planes Extract a plane from the data to show features Hide part of dataset to expose features

Particle Traces (Streamlines) Given a vector field, extract a trace that follows that trajectory defined by the vector. P new = P current + V P t Streamlines trace in space Pathlines trace in time

Visualization Techniques Surface Rendering is an indirect geometry based technique Direct Volume Rendering is a technique for the visualization of 3D scalar data sets without a conversion to surface representations

Volume Rendering Expresses how light travels through a volume Color and opacity controlled by transfer function Smoother transitions than isosurfaces

Example Volume Rendered

Visualization Resources Personal machines Most accessible, least powerful Projection systems Seamless image, high purchase and maintenance costs Tiled-LCD displays Lowest per-pixel costs, bezels divide image Remote visualization Access to high-performance system, latency can affect user experience

TACC Support for Remote, Interactive Visualization Maverick 2004 2008 Longhorn 2010 2013 I don t have a picture Wrangler 2014(?) I don t have a picture Maverick 2014 Spur 2008 2013 Stampede 2013 Lonestar 2011 Scientific Visualization Visual Analytics

XSEDE Visualization Resources Maverick (TACC) - HP 132 20-core Ivy Bridge nodes 256GB system memory per node Nvidia Tesla K40 GPU FDR InfiniBand interconnect Designed for interactive visualization Stampede (TACC) - DELL 6400 16-core Sandy Bridge nodes 256 GB system memory per node (16 @ 1TB) FDR Infiniband All with 62-core Xeon Phi 128 with NVIDIA K20 GPUs 205 TB Aggregate Memory

XSEDE Visualization Resources Maverick (TACC) - HP 132 20-core Ivy Bridge nodes 256GB system memory per node Nvidia Tesla K40 GPU FDR InfiniBand interconnect Designed for interactive visualization Stampede (TACC) - DELL 6400 16-core Sandy Bridge nodes 256 GB system memory per node (16 @ 1TB) FDR Infiniband All with 62-core Xeon Phi 128 with NVIDIA K20 GPUs 205 TB Aggregate Memory

Visualization Challenges

Visualization Allows Us to See the Science Raw Data Geometric Primitives Pixels Realization as Visual Entities Render

But what about large, distributed data? Gather the data? Realization as Visual Entities Render Gather

But what about large, distributed data? Gather Render Realization as Visual Entities Gather the geometry?

But what about large, distributed data? Depth-composite pixels Realization as Visual Entities Render Render full sized frames in place and depth-composite?

Or large, distributed displays?

Visualization Scaling Challenges Moving data to the visualization machine Most applications built for shared memory machines, not distributed clusters Image resolution limits in some software cannot capture feature details Displays cannot show entire highresolution images at their native resolution

Moving Data How long can you wait? File Size 10 Gbps 54 Mbps 1 GB 1 sec 2.5 min 1 TB ~17 min ~43 hours 1 PB ~12 days ~5 years

Old Model (No Remote Capability) HPC System Local Visualization Resource Data Archive Display Pixels Mouse Remote Site Wide-Area Network Local Site

New Model Remote Capability HPC System Large-Scale Visualization Resource Pixels Mouse Display Data Archive Remote Site Wide-Area Network Local Site

Analyzing Data Visualization programs only beginning to efficiently handle ultrascale data 650 GB dataset -> 3 TB memory footprint Allocate HPC nodes for RAM not cores N-1 idle processors per node! Stability across many distributed nodes Rendering clusters typically number N <= 64 Data must be dividable onto N cores Remember this when resampling!

Displaying Data Dell 30 flat-panel LCD 4 Megapixel display 2560 x 1600 resolution

Displaying Data Stallion world s 2 nd highest-resolution tiled display 328 Megapixels 40960 x 8000 pixel resolution Dell 30 LCD

What s the solution?

Summary Challenges at every stage of visualization when operating on large data Partial solutions exist, though not integrated Problem sizes continue to grow at every stage Vis software community must keep pace with hardware innovations