Data Centric Interactive Visualization of Very Large Data



Similar documents
Data Centric Systems (DCS)

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect

NVIDIA GRID OVERVIEW SERVER POWERED BY NVIDIA GRID. WHY GPUs FOR VIRTUAL DESKTOPS AND APPLICATIONS? WHAT IS A VIRTUAL DESKTOP?

Equalizer. Parallel OpenGL Application Framework. Stefan Eilemann, Eyescale Software GmbH

Big Data Visualization on the MIC

DeIC Watson Agreement - hvad betyder den for DeIC medlemmerne

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

Packet-based Network Traffic Monitoring and Analysis with GPUs

IBM Deep Computing Visualization Offering

Jean-Pierre Panziera Teratec 2011

Crossing the Performance Chasm with OpenPOWER

IT of SPIM Data Storage and Compression. EMBO Course - August 27th! Jeff Oegema, Peter Steinbach, Oscar Gonzalez

STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

PNY Professional Solutions NVIDIA GRID - GPU Acceleration for the Cloud

Remote Graphical Visualization of Large Interactive Spatial Data

High Performance Applications over the Cloud: Gains and Losses

GTC Presentation March 19, Copyright 2012 Penguin Computing, Inc. All rights reserved

Network Traffic Monitoring and Analysis with GPUs

Network Traffic Monitoring & Analysis with GPUs

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

NVIDIA IndeX. Whitepaper. Document version June 2013

Amazon EC2 Product Details Page 1 of 5

Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child

Overview of HPC Resources at Vanderbilt

HP ProLiant SL270s Gen8 Server. Evaluation Report

Bringing Big Data Modelling into the Hands of Domain Experts

ECLIPSE Performance Benchmarks and Profiling. January 2009

Scala Storage Scale-Out Clustered Storage White Paper

FLOW-3D Performance Benchmark and Profiling. September 2012

Large-Data Software Defined Visualization on CPUs

A Hybrid Visualization System for Molecular Models

David Rioja Redondo Telecommunication Engineer Englobe Technologies and Systems

The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices

Performance Monitoring of Parallel Scientific Applications

Unified Computing Systems

Accelerating Simulation & Analysis with Hybrid GPU Parallelization and Cloud Computing

Cluster Implementation and Management; Scheduling

SUN. Linda Fellingham, Ph. D Manager, Visualization and Graphics Sun Microsystems

Performance Beyond PCI Express: Moving Storage to The Memory Bus A Technical Whitepaper

Visualisation of Large Datasets with Houdini

The Construction of Seismic and Geological Studies' Cloud Platform Using Desktop Cloud Visualization Technology

Technical bulletin: Remote visualisation with VirtualGL

GPU-Based Network Traffic Monitoring & Analysis Tools

Computer Graphics Hardware An Overview

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Petascale Visualization: Approaches and Initial Results

#OpenPOWERSummit. Join the conversation at #OpenPOWERSummit 1

Scientific Computing Data Management Visions

GPU File System Encryption Kartik Kulkarni and Eugene Linkov

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

Big Data for Investment Research Management

PCIe Over Cable Provides Greater Performance for Less Cost for High Performance Computing (HPC) Clusters. from One Stop Systems (OSS)

Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011

World s fastest database and big data analytics platform

SGI HPC Systems Help Fuel Manufacturing Rebirth

HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK

Big Data Functionality for Oracle 11 / 12 Using High Density Computing and Memory Centric DataBase (MCDB) Frequently Asked Questions

The search engine you can see. Connects people to information and services

MOAB CON 2009 RMSC Case Study Providing Supercomputing Platforms as a Service (SPaaS)

Parallel Analysis and Visualization on Cray Compute Node Linux

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Statistical Analysis and Visualization for Cyber Security

In-Situ Bitmaps Generation and Efficient Data Analysis based on Bitmaps. Yu Su, Yi Wang, Gagan Agrawal The Ohio State University

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

Storage, Cloud, Web 2.0, Big Data Driving Growth

PRIMERGY server-based High Performance Computing solutions

Parallel Large-Scale Visualization

SR-IOV In High Performance Computing

High-Performance Visualization of Geographic Data

Optimizing GPU-based application performance for the HP for the HP ProLiant SL390s G7 server

Parallel Programming Survey

CS 698: Special Topics in Big Data. Chapter 2. Computing Trends for Big Data

NVIDIA GPUs in the Cloud

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

Introduction to Computer Graphics

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect

Integrated Communication Systems

Cray DVS: Data Virtualization Service

Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca

Estonian Scientific Computing Infrastructure (ETAIS)

SMB Direct for SQL Server and Private Cloud

Parallel Visualization for GIS Applications

Recent Advances in HPC for Structural Mechanics Simulations

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Kriterien für ein PetaFlop System

REMOTE HIGH FIDELITY VISUALIZATION. May 2015 Jeremy Main, Sr. Solution Architect GRID

Data Center and Cloud Computing Market Landscape and Challenges

Aspirus Enterprise Backup Assessment and Implementation of Avamar and NetWorker

Software-defined Storage Architecture for Analytics Computing

Software. Enabling Technologies for the 3D Clouds. Paolo Maggi R&D Manager

White Paper. Recording Server Virtualization

Memory Channel Storage ( M C S ) Demystified. Jerome McFarland

Transcription:

Data Centric Interactive Visualization of Very Large Data Bruce D Amora, Senior Technical Staff Gordon Fossum, Advisory Engineer IBM T.J. Watson Research/Data Centric Systems #OpenPOWERSummit

Data Centric Domains Business Analytics Business Intelligence Social Analytics Financial Analytics Life Sciences System G Big Insights Complex analytics Watson DOD All-Source Intelligence Integrated Trading and VaR Health Care Analytics HPA Technical Computing Oil and Gas Climate & Environment Science Engineering Design, Prototyping, Analysis, Optimization Modeling, Simulation Integrated Wide Azimuth Imaging and Interpretation Production Weather DOE NNSA and Office of Science HPC Key Domain Characteristics: Big Data, Complex Analytics, Scale and Time to Solution Interactive, Real Time Visualization on Very Large Data in these domains is a key opportunity Join the conversation at #OpenPOWERSummit 2

Big Data Workflow Challenges Performance Challenges with Very Large Data Sets Storage and I/O impact data location and access Data movement bottlenecks workflow Data may be subject to legal/contractual constraints Export laws, e.g. Oil and Gas Exploration in foreign countries Privacy laws, e.g. Medical data Data security can be compromised by movement of data across networked systems Central data location easier to secure Remote users require access to visualization Join the conversation at #OpenPOWERSummit 3

Current Strategies Current systems hampered by end-to-end data movement Visualization solutions designed primarily for workstation plus GPU platforms Visualization loosely coupled with simulation and analytics Join the conversation at #OpenPOWERSummit 4

IB Network Data Centric System Architecture 256GB CPU0 256GB CPU 1 DCS D0 layer: Dense Compute 256GB CPU0 256GB CPU 1 DCS D1 layer: Compute + Flash Storage Flash System 840 DCS D2 layer: GPFS File Storage ESS Power 7 BE xcat Join the conversation at #OpenPOWERSummit 5

Big Data Capability Workflow Enablement (Potential) Join the conversation at #OpenPOWERSummit 6

Initial Prototype Socket connections HPC Applications IBM BlueGene/Q Server High Performance Compute Nodes (CN) CN0 CN1 CN2.. CNn Volume Rendering Application 1. Volume Rendering application can run on any number of compute nodes 2. Compute nodes run a light version of Linux to minimize OS jitter High Performance I/O Nodes (ION) Hybrid Scalable Solid-state Storage 3. I/O is vectored from CN to ION. ION have PCIe attached persistent storage Socket connections IBM Power login server Relay Server Application Socket connection 4. Socket connections between CN and relay server are managed by ION 5. Relay server is the conduit between BlueGene volume rendering server and client. 6. Client viewer sends user input to and receives displayable images from relay server via multiple socket connections Client Viewer Join the conversation at #OpenPOWERSummit 7

IBM Power 8 NVIDIA K40 System Power 8/GPU Cluster HPC Applications Volume Rendering Application High Performance Compute Nodes Ubuntu Ubuntu Ubuntu.. Ubuntu Ubuntu LE Linux on IBM Power 8 Fibre channel links to storage High Performance I/O IBM CAPI-accelerated FlashSystem 840 8GB/s per FlashSystem 840 drawer 16-48TB per drawer TCP socket connections IBM Power Login Node NodeJS relay & http server Relay server broadcasts client state and volume renderer sends new images Web socket connections Client viewer sends user input to and receives displayable images from relay server via multiple socket connections WebGL Client Browser Join the conversation at #OpenPOWERSummit 8

Prototype Features Scalable rendering of volumetric data Dozens of gigabytes per GPU, lots of GPUs Four sets of cut planes, perpendicular to three world axes, and Lookat vector adjustable alpha blending Multiplier and exponent sliders provided camera manipulation Latitude and longitude of eye Field of view angle Re-centering object after cutplane Support for remote client visualization Join the conversation at #OpenPOWERSummit 9

Big Data Visualizer Vision 3D arrays of data large enough that the number of data points exceeds the number of pixels available to display it by orders of magnitude. Implementation Data can be index, index+alpha, or RGBA Each MPI Rank has it s own subset of the data It extracts isoparametric surfaces, loading tiles with that data It uses raycasting to blend voxel colors, incorporating the previously computed surface data MPI Accumulate Ranks gather these tiles, sending them to the top Tiles are then shipped via socket to a (possibly remote) client Join the conversation at #OpenPOWERSummit 10

Isoparametric Surface Vision Huge data; rendered triangles would be far smaller than a pixel; why bother? Just find points of intersection along the x,y,z-aligned directions Voxels that intersect surface form a very sparse subset of all voxels Implementation Precompute all intersections and store location and normal info CUDA kernel call processes these intersections, adding colors to tiles Join the conversation at #OpenPOWERSummit 11

Example of Isoparametric points (Derived from the open-source female Visible Human dataset) Join the conversation at #OpenPOWERSummit 12

Raycasting Vision Previously computed opaque isoparametric points represent "starting color and depth" for a raycasted blend operation working back to front in each pixel for each tile that intersects the brick associated with this MPI Rank Implementation For each intersecting tile, make a CUDA kernel call Each thread is responsible for one pixel in that tile If the thread's ray pierces the brick { Read samples from back to front, blending color // note back depends on possible point data in that pixel } Join the conversation at #OpenPOWERSummit 13

Opaque point limits Raycasting 2D tile on screen Intersects an opaque isoparametric point Brick Eye point No opaque points: raycast through entire brick Join the conversation at #OpenPOWERSummit 14

Accumulation Vision Oct-tree hierarchy of MPI ranks Each leaf comprises a compute rank Each internal node comprises an accumulate rank with up to 8 children Children are guaranteed to be adjacent in the space of the large array Implementation Tiles from compute ranks are sent to their immediate parent Each accumulate rank knows where the eye point is Children are sorted based on eye location Tiles are blended in correct order Result is sent to parent Root accumulate ranks (eight of them, to facilitate socket efficiency) send tiles via socket to client for visualization within client graphics framework Join the conversation at #OpenPOWERSummit 15

Oct-tree structure (shown as Quad-tree) Root ranks First-level accumulate ranks Second-level accumulate ranks (some connections are not shown to keep chart simpler) Compute ranks Join the conversation at #OpenPOWERSummit 16

Example of blended image (Derived from the open-source female Visible Human dataset) Join the conversation at #OpenPOWERSummit 17

Future Work Utilize CAPI accelerated FLASH IO to enable workflow data transfers Graph visualization Merged Scientific and Info Visualization Latency mitigation Dynamic reallocation of aggregation ranks to more efficiently scale with viewing and window changes Improving IO performance by rearranging data into tiles matching page buffer size Join the conversation at #OpenPOWERSummit 18