Large-Data Software Defined Visualization on CPUs



Similar documents
Small Business Upgrades to Reliable, High-Performance Intel Xeon Processor-based Workstations to Satisfy Complex 3D Animation Needs

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Scaling up to Production

Vendor Update Intel 49 th IDC HPC User Forum. Mike Lafferty HPC Marketing Intel Americas Corp.

High Performance Computing and Big Data: The coming wave.

Petascale Visualization: Approaches and Initial Results

Parallel Large-Scale Visualization

Intel Platform and Big Data: Making big data work for you.

Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture

NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect

Remote & Collaborative Visualization. Texas Advanced Compu1ng Center

The Foundation for Better Business Intelligence

In-Situ Bitmaps Generation and Efficient Data Analysis based on Bitmaps. Yu Su, Yi Wang, Gagan Agrawal The Ohio State University

Virtualization of ArcGIS Pro. An Esri White Paper December 2015

Intel Many Integrated Core Architecture: An Overview and Programming Models

LA-UR- Title: Author(s): Intended for: Approved for public release; distribution is unlimited.

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Intel Media SDK Library Distribution and Dispatching Process

Towards OpenMP Support in LLVM

Keys to node-level performance analysis and threading in HPC applications

INTEL PARALLEL STUDIO XE EVALUATION GUIDE

Big Data for Big Science. Bernard Doering Business Development, EMEA Big Data Software

Recent Advances in HPC for Structural Mechanics Simulations

1 Bull, 2011 Bull Extreme Computing

Interactive Data Visualization with Focus on Climate Research

High Performance Computing in CST STUDIO SUITE

Overview of HPC Resources at Vanderbilt

Simulation Platform Overview

Business opportunities from IOT and Big Data. Joachim Aertebjerg Director Enterprise Solution Sales Intel EMEA

Advanced Rendering for Engineering & Styling

The Evolution of Computer Graphics. SVP, Content & Technology, NVIDIA

The Engine for Digital Transformation in the Data Center

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Big Data Performance Growth on the Rise

Contributed Article Program and Intel DPD Search Optimization Training. John McHugh and Steve Moore January 2012

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi

Writing Applications for the GPU Using the RapidMind Development Platform

ArcGIS Pro: Virtualizing in Citrix XenApp and XenDesktop. Emily Apsey Performance Engineer

CHAPTER FIVE RESULT ANALYSIS

Computer Graphics Hardware An Overview

SUN. Linda Fellingham, Ph. D Manager, Visualization and Graphics Sun Microsystems

Built for Business. Ready for the Future.

Powered by Intel Cloud Technology

Overview

Overview on Modern Accelerators and Programming Paradigms Ivan Giro7o

Parallel Computing with MATLAB

PRIMERGY server-based High Performance Computing solutions

NVIDIA IndeX. Whitepaper. Document version June 2013

Modernizing Servers and Software

Intel Solid-State Drives Increase Productivity of Product Design and Simulation

wu.cloud: Insights Gained from Operating a Private Cloud System

Big Data Visualization on the MIC

Oracle Database Reliability, Performance and scalability on Intel Xeon platforms Mitch Shults, Intel Corporation October 2011

新 一 代 軟 體 定 義 的 網 路 架 構 Software Defined Networking (SDN) and Network Function Virtualization (NFV)

Enterprise HPC & Cloud Computing for Engineering Simulation. Barbara Hutchings Director, Strategic Partnerships ANSYS, Inc.

HPC Wales Skills Academy Course Catalogue 2015

The ROI from Optimizing Software Performance with Intel Parallel Studio XE

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

Grid Computing for Artificial Intelligence

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

Dell Reference Configuration for Hortonworks Data Platform

Intel Xeon Processor E5-2600

SGI HPC Systems Help Fuel Manufacturing Rebirth

PLANNING GUIDE Modernizing Servers and Software

Accomplish Optimal I/O Performance on SAS 9.3 with

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age

MAQAO Performance Analysis and Optimization Tool

Exascale Challenges and General Purpose Processors. Avinash Sodani, Ph.D. Chief Architect, Knights Landing Processor Intel Corporation

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

Cloud-based Analytics and Map Reduce

Accelerating CFD using OpenFOAM with GPUs

GeoImaging Accelerator Pansharp Test Results

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Benchmarking Sahara-based Big-Data-as-a-Service Solutions. Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015

1. INTRODUCTION Graphics 2

Intel Media Server Studio - Metrics Monitor (v1.1.0) Reference Manual

Implementation and Performance of AES-NI in CyaSSL. Embedded SSL

Accelerate Discovery with Powerful New HPC Solutions

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

A Hybrid Visualization System for Molecular Models

Visualization Infrastructure and Services at the MPCDF

New Dimensions in Configurable Computing at runtime simultaneously allows Big Data and fine Grain HPC

Extended Attributes and Transparent Encryption in Apache Hadoop

The Design and Implement of Ultra-scale Data Parallel. In-situ Visualization System

Intel Desktop public roadmap

The Transition to PCI Express* for Client SSDs

Accelerating Business Intelligence with Large-Scale System Memory

Building a Top500-class Supercomputing Cluster at LNS-BUAP

Overview of Data Fitting Component in Intel Math Kernel Library (Intel MKL) Intel Corporation

HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK

Intel Network Builders Solution Brief. Intel and ASTRI* Help Mobile Network Operators Support Small Cell Networks

Finding Performance and Power Issues on Android Systems. By Eric W Moore

Infrastructure Matters: POWER8 vs. Xeon x86

Data Centric Interactive Visualization of Very Large Data

A Powerful solution for next generation Pcs

The Future Of Animation Is Games

:Introducing Star-P. The Open Platform for Parallel Application Development. Yoel Jacobsen E&M Computing LTD

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

Transcription:

Large-Data Software Defined Visualization on CPUs Greg P. Johnson, Bruce Cherniak 2015 Rice Oil & Gas HPC Workshop

Trend: Increasing Data Size Measuring / modeling increasingly complex phenomena Rendering is typically performed on discrete accelerators Implications Interactivity harder to achieve Requires moving data è bandwidth, storage, security issues Limits usage models (e.g. for in-situ visualization, computational steering) Limits scalability for visualization Increased spatial / temporal resolution: more difficult to interpret visually 2

Corollary: Workflow Moving into Data Center Instruments Simulation File I/O Filtering Geometry Rendering Display Old: Data Center Workstation New: Data Center Workstation 3

Software Defined Visualization The concept of doing visualization / rendering entirely in software on CPUs Treats visualization as another form of parallel compute Enables vis on compute resources without requiring special purpose hardware and APIs Advantages Access to large memory No need to move data Enables different models of rendering Based on standard open source development tools / programming paradigms Homogenous system design: general purpose nodes 4

Software Defined Visualization: Our Approach Option 1: Support existing APIs (OpenGL*) Works with existing applications No code changes or recompilation required è OpenSWR software rasterizer Option 2: Enable new functionality and improved performance through a new API Good option for new applications Application OpenGL* Renderer OSPRay Renderer OpenGL* OpenSWR OSPRay Xeon 1 Xeon 1 + Xeon Phi 2 Some effort required to enable existing applications è OSPRay ray tracing based rendering engine 1 Intel Xeon processor, 2 Intel Xeon Phi coprocessor. 5

OpenSWR Software Rasterizer High performance open source software implementation of OpenGL* Fully multi-threaded and vectorized for Intel CPUs Leverages community development effort (Mesa*) Drop in replacement for OpenGL* library Implements an increasing subset of OpenGL* as required by vis applications 6

OpenSWR Performance (Single Node, vs Mesa) 40 30 20 Frames Per Second (Higher is Better) Intel Xeon E5-2687W v3 Intel Software Rasterizer 1.4 2 x 10 cores, 3.1 GHz Intel Xeon E5-2687W v3 Mesa 10.2, Gallium-LLVMPipe 2 x 10 cores, 3.1 GHz 10 0 0.7 2.9 9.9 17.5 24.0 33.4 49.9 Millions of Triangles ParaView 4.1, Kitware build OpenGL 1.4, Display Lists Image Resolution: 1920 x 1080 Source: Intel Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. 7

OpenSWR Performance (Multi-node) 8 7 6 SWR Performance Scaling (ParaView 4.2) FPS 5 4 3 2 1 0 FIU SWR, 4000 streamlines (47m triangles) RM SWR (316m triangles) 1 2 4 8 16 32 Node count Results measured on TACC Stampede cluster (dual-socket Intel Xeon E5-2680, 32GB per node, NVIDIA K20 discrete GPUs, 5GB) and based on internal Intel analysis. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. 8

OpenSWR Software Rasterizer Open source: Apache 2.0 license Alpha released in December 2014 Currently supports OpenGL* 1.4 OpenGL* 2.x support coming soon Current users: TACC, Kitware, University of Tennessee Download at: http://www.openswr.org 9

OSPRay Ray Tracing Based Rendering Engine High performance, scalable open source rendering API Enables expanded functionality Additional visual cues (shadows, ambient occlusion, etc.) Traditional direct illumination only With ambient occlusion FIU Ground Water Flow data set courtesy of Florida International University and TACC. 10

OSPRay: Additional visual cues example Traditional direct illumination only FIU Ground Water Flow data set courtesy of Florida International University and TACC. 11

OSPRay: Additional visual cues example With ambient occlusion FIU Ground Water Flow data set courtesy of Florida International University and TACC. 12

OSPRay Ray Tracing Based Rendering Engine High performance, scalable open source rendering API Enables expanded functionality Volume rendering Heptane volume (256^3). Data courtesy SCI Institute, University of Utah. Richtmyer-Meshkov volume (2048^3 uint8, 8 GB). Data courtesy TACC. Isotropic Turbulence volume (2048^3 float, 32 GB). Data courtesy TACC. 13

OSPRay Ray Tracing Based Rendering Engine Provides improved performance Fully multi-threaded and vectorized for Intel CPUs and Intel Xeon Phi coprocessors Data parallel (multi-node) rendering Integrating into select vis applications VTK* ParaView* VisIt* 14

OSPRay Ray Tracing Based Rendering Engine Open source: Apache 2.0 license Alpha released in December 2014 Current users: TACC, Kitware, University of Tennessee, University of Utah, SURVICE Engineering Download at: http://www.ospray.org 15

Applications to Oil & Gas Visualization Large data is a key challenge Data sets often 100 GB to 1.5 TB in size (and beyond!) Large data can be more difficult to visually interpret Existing vis tools often not well optimized By rendering on CPUs We have access to larger memory Have access to additional rendering techniques Can use HPC systems for visualization 16

OSPRay Demo Volume Viewer (animation) 25 million triangle horizons (rendered into volume) Volume rendering Arbitrary slicing (can interactively move even in 100GB volumes) Data: University of Texas at Austin, Bureau of Economic Geology 17 17

Summary: Software Defined Visualization Compute Nodes Visualization Nodes General Purpose Nodes (Compute and Visualization) Cloud or Network Old Model Workstation New Model OpenSWR (http://www.openswr.org) OSPRay (http://www.ospray.org) 18

Legal Disclaimer Roadmap Notice: All products, computer systems, dates and figures specified are preliminary based on current expectations, and are subject to change without notice. Intel Advanced Vector Extensions (Intel AVX/ Intel AVX2): Intel AVX/AVX2 is designed to achieve higher throughput in certain integer and floating point operations. Depending on processor power and thermal characteristics, and system power and thermal conditions, AVX/AVX2 floating point instructions may run at lower frequency to maintain reliable operations at all times. For further details see product data sheet. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. Knights Landing and other code names featured are used internally within Intel to identify products that are in development and not yet publicly announced for release. Customers, licensees and other third parties are not authorized by Intel to use code names in advertising, promotion or marketing of any product or services and any such use of Intel's internal code names is at the sole risk of the user. Intel, the Intel logo, Xeon and Intel Xeon Phi are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others. 2015 Intel Corporation 19

Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804 20