The Scalable Data Management, Analysis, and Visualization (SDAV) Institute
|
|
|
- Emery Homer May
- 10 years ago
- Views:
Transcription
1 The Scalable Data Management, Analysis, and Visualization (SDAV) Institute Key Technical Accomplishments (Also, check out 8 posters) poster SciDAC PI meeting 2015
2 SDAV Institute Arie Shoshani (PI) Co-Principal Investigators from: Laboratories Argonne NL Lawrence Berkeley NL Lawrence Livermore NL Oak Ridge NL Los Alamos NL Sandia NL Kitware (Industry) Universities Georgia Inst of Technology North Carolina State U Northwestern U Ohio State U U of California, Davis Rutgers U U of Utah 24 people attending SciDAC PI meeting 2015
3 SDAV Approach to Productivity and Relevance Main Emphasis over the last year Parallel processing on multi/many cores In situ processing on leadership-class facilities (LCFs) Support ever growing volume of data (TBs PDs) Collaborate with application scientists to enhance their data understanding and insight Pro-active installation and support of software on LCFs Libraries, tools, and Frameworks Libraries software package that can be invoked (or embedded) by other programs through APIs e.g. in-situ data movement, code-coupling, indexing, rendering, compression Tools stand-alone software package usually support user interfaces e.g. topology-based analysis, feature tracking, flow analysis over space/time, I/O monitoring Frameworks software that can imbed multiple libraries SDAV has mainly I/O and visualization frameworks
4 SDAV Portfolio ( Data Management tools
5 SDAV Portfolio ( Analysis and Visualization tools
6 Deployment of SDAV Software
7 Visualization Frameworks and tools SciDAC PI meeting 2015
8 VTK-m: Accelerating the Visualization Toolkit for Multi-core and Many-core Architectures poster VTK-m goals A single place for the visualization community to collaborate, contribute, and leverage massively threaded algorithms. Reduce the challenges of writing highly concurrent algorithms by using data parallel algorithms VTK (Kitware) is a serial, single-threaded class library (data structures and algorithms) used as the basis for important applications (VisIt, PV) EAVL (ORNL) emphasizes the development of a new data model, Dax (Sandia) emphasizes the development of a new execution model, PISTON (LANL) emphasizes portability and parallel algorithm development. VisIt, ParaView VTK-m constituent technologies EAVL VTK-m Phi, Tesla, x86 PISTON VTK-M DAX
9 VTK-m Architecture Filters Worklets Data Model Post Processing In-Situ Data Parallel Algorithms Execution Arrays EAVL DAX PISTON VTK-m Data Set Model CellSet Data Set Explicit Structured QuadTree Subset Field Coords
10 VTK-m Progress and Results poster Features Device Interface (Serial, CUDA, TBB; OpenMP in progress) Architecture allows hardwareagnostic implementations Field and Topology Worklet and Dispatcher implemented Filters Isosurface for structured grids Statistical filters (histograms, moments, etc.) In development: stream lines, stream surfaces, tetrahedralization Example: ray casting with DAX and VTK-m Implementation of both ray-casting and cell projection volume rendering algorithms using Dax, one of VTK-m s constituent projects Complied for CUDA, OpenMP, and Intel s Thread Building Blocks Performance study on NVIDIA Titan X GPU, Intel Xeon, and Intel Xeon Phi Cell projection implementation using data parallel primitives renders image in near sub-second times. VTK-m Isosurface Performance (10-30 factor improvement) Volume rendering of type Ia supernova simulation data set using ray-casting
11 Continued collaboration and use of visualization tools with specific science code Example: in situ Visualization and Analysis of Particle Accelerator components Warp framework: PIC modeling of beams, accelerators, plasmas, VisIt: parallel visualization tool Problem Warp worked with the popular OpenDX (data explorer) 10 7 to10 9 particles are required for simulation but often only a small fraction form features of interest Solution Couple state-of-the-art in situ visualization tool VisIt with Warp Integrate in situ query capabilities with Warp Integrate high-performance I/O with Warp Provides dynamic feature exploration while simulation is in progress Simulations using WarpVisIt Viewer Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp Warp OpenDX Warp Warp Warp Warp Warp libsim Warp Warp Warp Warp Warp libsim Warp Warp Warp Warp Warp libsim libsim libsim libsim Gather simulation data libsim libsim libsim Send metadata and images or geometry libsim libsim libsim poster libsim libsim libsim Send commands
12 in situ Visualization and Analysis of Particle Accelerator Simulations using WarpVisIt poster Impact Enable flexible in situ analysis of particle features Reduce cost for visualization and I/O Enable analysis and collection of particle subsets of interest at higher temporal frequency a) Without halo filter Enable Filtered Species Easy definition and implementation of derived particle species while exposing to the analysis the same species interface as Warp b) With halo filter
13 Data Analysis Tools SciDAC PI meeting 2015
14 Analysis Tools Applied to Various Applications Feature-driven analysis Dynamic tracking graphs and multi-core exploration, e.g., Localized thresholds for vortex detection Importance driven analysis In-situ data triage/reduction, e.g., Explorable images Statistical analysis and data mining Feature-based algorithms in MPI, e.g., Memory efficient graph analytics Topological techniques Integrate higher order feature descriptors, e.g., Abstract representation of material structure to investigate Lithium diffusion Vector field analysis Incorporate advanced statistical methods, e.g., Flowline indexing via geometric signatures
15 Quantifying Lithium Ion Diffusion for Improved Battery Design (using MSCEER) poster Problem: Lithium Diffusion in Carbon Nanospheres understanding their cycling and capacity to store lithium is not possible with classical MD simulations Instead, models for lithium diffusion are used where the valence of a carbon ring determines whether or not lithium is able to pass through Tool used for modeling structure MSCEER (Morse-Smale Complex Exploration, Evaluation, and Reasoning) is a set of tools for delivering state-of-the art topologybased analytics to the scientific community. New algorithm developed A new algorithm in MSCEER based on discrete Morse theory operates on a Delaunay triangulation, explicitly identifying ring structures and their valence Result Allow interactive visualization of blocking and nonblocking structures, for the first time showing the planes along which lithium can diffuse. Penetration into the nanosphere is dominated by large-scale dislocations.
16 Analyzing and Tracking Features in Large-Scale Turbulent Combustion at Scale (using TALASS) poster Understanding Flame Stability in Turbulent Non-Premixed Combustion Find when and where a flame becomes unstable and undergo significant extinction and re-ignition decreasing efficiency and increasing emissions. Challenges Complex fast moving geometry Extinction threshold is uncertain and unstable Tool used to enhance understanding Topological Analysis of Large Scale Simulations (TALASS) Generate graphs that track flame progression over time Results Modified graph using adaptive threshold to produce cleaner tracks, easier to understand and analyze.
17 Data and Storage Management SciDAC PI meeting 2015
18 I/O Monitoring The Traditional Post-Processing I/O Stack high-level I/O interfaces: HDF5, Parallel netcdf MPI-IO implementation: ROMIO is the basis for all platforms today. production software: GPFS (IBM), Lustre (Intel/Whamcloud) Application High-Level I/O Libraries I/O Middleware I/O Forwarding / Routing Parallel File System I/O Hardware Monitoring application I/O behavior at scale: Darshan running on production plaforms, with negligible overhead.
19 Beyond Traditional I/O Capabilities SDAV Data management frameworks provide a vehicle for deployment of in situ data analysis/transformations Application Data Management Framework Data Model Interface(s) (e.g., adaptive mesh) Data Management Services (e.g., buffering, scheduling, code coupling) Data Transformations (e.g., indexing, compression, in situ analysis) Parallel File System Analysis/Vis. Code Other Apps
20 ADIOS (Adaptable I/O System) Abstracts Data-at-Rest to Data-in-Motion (in situ) for HPC Provides portable, fast, scalable, easy-to-use, metadata rich output Dynamically allows users to change the method during an experiment/simulation Provides solutions for a large number of the applications Applied to extremely large data volumes Interface to apps for descrip on of data (ADIOS, etc.) Feedback Mul -resolu on methods Provenance Data Management Services Buffering Data Compression methods Plugins to the hybrid staging area Schedule Data Indexing (FastBit) methods Workflow Engine Run me engine Data movement Analysis Plugins Parallel and Distributed File System Visualiza on Plugins Adios-bp IDX HDF5 pnetcdf raw data Image data Viz. Client Astrophysics Climate Combustion CFD Environmental Science Fusion Geoscience Materials Science Medical: Pathology Neutron Science Nuclear Science Quantum Turbulence Relativity Seismology Sub-surface modeling Weather
21 I/O Performance Improvement for Multiple Applications (mentioned in last PI meeting) Optimizations use: Synchronous I/O on leading HPC systems (Titan, Mira, Edison, Tiahne-1A, Garnet, ) Techniques are to optimize in-memory data copy, movement, and file system access Optimizations for GPFS, Lustre
22 I/O Frameworks Facilitate Embedding in situ Data Services Example: In-Memory Data Management for Coupled Simulation Workflows using Staging with DataSpaces and ADIOS The need enable in situ processing and analysis Staging as a persistent service applications can dynamically connect to use staging / staging hosted services Enable more complex and dynamic workflows Many use cases: I/O acceleration, data staging, coupling, data reorganization/manipulation, indexing/querying, data persistence, etc. poster Code Coupling: get, put, publish, subscribe service In transit data movement and workflow management
23 In-Memory Data Management for Coupled Simulation Workflows using Staging with DataSpaces and ADIOS Fusion Application (C.S.Chang) XGC1 and XGCa executed sequentially and share data In each iteration, the workflow first executes XGC1 for n time steps to compute turbulence data and particle state, and then executes XGCa for m time steps to evolve the state of plasma. XGC1 and XGCa use node-local shared memory segments as part of an inmemory data staging service Impact of using DataSpaces Enables tightly coupled simulations at very large scales use node-local shared memory segments as part of an in-memory data staging service Results in huge performance improvement over traditional file-based approaches Particle data read: reduced by ~98% compared to file-based
24 GLEAN: Topology-Aware Parallel I/O I/O performance is highly dependent on configuration settings Need to perform the right computation at the right place and time taking into account the characteristics of the simulation, resources and analysis e.g. HACC code often uses traditional I/O stack for checkpointing Using GLEAN Small code tweaks plus configuration adjustments led to 15x performance improvement for checkpoint operations. Achieved 160 GB/s for HACC production simulations on Mira BG/Q system
25 Large-scale Parallel Analysis of Applications with the DIY Library DIY Do It Yourself Large-scale parallel analysis (visual and numerical) on HPC machines For scientists, visualization researchers, tool builders In situ, co-processing, post-processing Data-parallel problem decomposition MPI + threads hybrid parallelism Scalable data movement algorithms Runs on Unix-like platforms, from laptop to supercomputer (including all IBM and Cray HPC leadership machines
26 Time (s) Examples of using DIY poster IMAGE SEGMENTATION IN POROUS MEDIA TESSELLATION-BASED DENSITY ESTIMATION COSMOLOGY AND ASTROPHYSICS Left: 3D image of a granular material (flexible sandstone) acquired at ALS (by Michael Manga and Dula Parkinson). Data: 8.4 billion points: LBNL (Dmitriy Morozov and Patrick O Neil) developed tools for segmentation and connectivity analysis of granular and porous media using DIY Right: segmentation of the material identifies individual grains Three representations of the same halo. From left to right: original raw particle data, Voronoi tessellation, and regular grid density sampling. Strong Scaling 8192^3 grid 4096^3 grid 2048^3 grid 1024^3 grid Perfect strong scaling Number of Processes Left and center: Cross sections of CIC (cloud-incell) and much more accurate TESS-based density estimators. Right: Strong scaling for grid sizes ranging from to (0.5 trillion).
27 Science Problem often Require Specialized Data Management Techniques poster Problem: detection of ice-calving events Large-scale calving events (free-floating icebergs and ice fracture) are of scientific interest to study global climate change Typical iceberg is 6x the size of Manhattan Scientific simulations are developed as part of the BISICLES partnership (Dan Martin) Simulation is AMR-based ice sheet modeling using Chombo Goal Identify events during the simulation runs -> in situ / real-time for every time step Technical approach Develop Parallel Connected Component Labeling (PCCL) for in situ AMR data Approach Result Great speedup (up to 6x) at larger per-process data sizes relative to previously used multiphase CCL (repeated scans) Schematic showing computed ice velocity for Antarctic continent (right), and meshing for the Pine Island Glacier (left). The grounding line location is shown as red line.
28 SDAV Future Outlook Experience with in situ processing has been very successful Typically less than 1% of simulation time used for in situ feature extraction In-memory code-coupling, in-memory workflow management, and In situ indexing and search capabilities help minimize data movement In situ visualization and analysis methods allow dynamic exploration of data and avoid writing full-resolution data We expect to continue and benefit from this approach Scalability has been successful Many SDAV tools scaled to hundred of thousands of cores Many SDAV tools took advantage of GPUs when available, especially Vis tools Many tools managed to reduce their memory use We see a clear path for taking advantage of future architectural characteristics Burst buffers may be useful for Checkpoint restart Fast read - staging of data Extension of memory Extension of storage We expect to benefit greatly from burst buffers in the SciDAC3 time frame
29 SDAV Future Outlook Hardware Gap: generally high concurrency platforms are mostly designed with other target applications (e.g. simulation-> high flops, low memory) Algorithmic Gap: generally data analysis algorithms have been traditionally executed in post processing and therefore less research has been focused on extreme parallelism New algorithmic research needed: New data structures that reduce memory usage New algorithmic structure that better exploits GPUs New algorithmic models that minimize communications How to exploit opportunities given by deep memory hierarchies? How can we better coordinate with I/O infrastructures?
30 Extra slides
31 Data Management Tools I/O Frameworks ADIOS: provides a simple, flexible way for scientists to describe the data in their code, and based on that to provide efficient I/O, and in situ data processing Darshan: captures an accurate picture of application I/O behavior Parallel netcdf: a library providing high-performance I/O while still maintaining file-format compatibility with Unidata's NetCDF ROMIO: is a high-performance, portable implementation of MPI-IO ViSUS/IDX: Provides data streaming techniques for progressive processing and visualization of large surface and volume meshes In Situ Processing GLEAN: is an extensible framework that takes system characteristics into account in order to facilitate simulation-time data analysis and I/O acceleration DIY: Provides scalable building blocks for data movement tailored to the needs of large-scale parallel analysis workloads DataSpaces: facilitates in situ code coupling using a shared-space abstraction EvPath: is an event transport middleware layer providing processing over virtual data paths Indexing / Compression FastBit: A very fast indexing method based on compressed bitmap representation specially suitable for scientific data ISABELA: a tools for lossy but highly accurate (>.99 correlation) compression of spatiotemporal scientific data
32 Analysis and Visualization tools Analysis and Visualization Frameworks VisIt: is an Open Source, interactive, scalable, visualization, animation and analysis tool. ParaView: is an open-source, multi-platform data analysis and visualization application. Analysis and Visualization Libraries ExMage: provides in situ pathtube generation and visualization. TALASS: is a collection of routines for parallel and distributed processing of particle data. Ultravis-P: is a collection of routines for parallel and distributed processing of particle data. MSCEER: is a set of tools and libraries for feature extraction and exploration in scalar fields. IceT: is a high-performance sort-last parallel rendering library that provides the unique ability to generate images for tiled displays. NDDAV: is an interactive analysis framework for high-dimensional data. VTK: is an open-source system for 3D computer graphics, image processing and visualization.
33 Analysis and Visualization tools Multi-/Many-core Visualization Libraries Dax: The Dax Toolkit supports the fine-grained concurrency for data analysis and visualization algorithms required to drive exascale computing. EAVL: is the Extreme-scale Analysis and Visualization Library that expands traditional data models to support current and forthcoming scientific data sets. PISTON: is a cross-platform software library providing frequently used operations for scientific visualization and analysis. Statistics and Data Mining NU-Minebench: is a data mining benchmark suite containing a mix of several representative data mining applications from different application domains. Importance-Driven Analysis: is a tool that uses a newly-designed spatial data structure, named parallel distance tree, to enable highly scalable parallel distance field computing.
Roadmap for Many-core Visualization Software in DOE. Jeremy Meredith Oak Ridge National Laboratory
Roadmap for Many-core Visualization Software in DOE Jeremy Meredith Oak Ridge National Laboratory Supercomputers! Supercomputer Hardware Advances Everyday More and more parallelism High-Level Parallelism
HPC & Visualization. Visualization and High-Performance Computing
HPC & Visualization Visualization and High-Performance Computing Visualization is a critical step in gaining in-depth insight into research problems, empowering understanding that is not possible with
Towards Scalable Visualization Plugins for Data Staging Workflows
Towards Scalable Visualization Plugins for Data Staging Workflows David Pugmire Oak Ridge National Laboratory Oak Ridge, Tennessee James Kress University of Oregon Eugene, Oregon Jeremy Meredith, Norbert
The Ultra-scale Visualization Climate Data Analysis Tools (UV-CDAT): A Vision for Large-Scale Climate Data
The Ultra-scale Visualization Climate Data Analysis Tools (UV-CDAT): A Vision for Large-Scale Climate Data Lawrence Livermore National Laboratory? Hank Childs (LBNL) and Charles Doutriaux (LLNL) September
DIY Parallel Data Analysis
I have had my results for a long time, but I do not yet know how I am to arrive at them. Carl Friedrich Gauss, 1777-1855 DIY Parallel Data Analysis APTESC Talk 8/6/13 Image courtesy pigtimes.com Tom Peterka
Parallel Large-Scale Visualization
Parallel Large-Scale Visualization Aaron Birkland Cornell Center for Advanced Computing Data Analysis on Ranger January 2012 Parallel Visualization Why? Performance Processing may be too slow on one CPU
Data Centric Interactive Visualization of Very Large Data
Data Centric Interactive Visualization of Very Large Data Bruce D Amora, Senior Technical Staff Gordon Fossum, Advisory Engineer IBM T.J. Watson Research/Data Centric Systems #OpenPOWERSummit Data Centric
GPU System Architecture. Alan Gray EPCC The University of Edinburgh
GPU System Architecture EPCC The University of Edinburgh Outline Why do we want/need accelerators such as GPUs? GPU-CPU comparison Architectural reasons for GPU performance advantages GPU accelerated systems
Visualization and Data Analysis
Working Group Outbrief Visualization and Data Analysis James Ahrens, David Rogers, Becky Springmeyer Eric Brugger, Cyrus Harrison, Laura Monroe, Dino Pavlakos Scott Klasky, Kwan-Liu Ma, Hank Childs LLNL-PRES-481881
Data Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
Parallel Analysis and Visualization on Cray Compute Node Linux
Parallel Analysis and Visualization on Cray Compute Node Linux David Pugmire, Oak Ridge National Laboratory and Hank Childs, Lawrence Livermore National Laboratory and Sean Ahern, Oak Ridge National Laboratory
Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED
Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory 21 st Century Research Continuum Theory Theory embodied in computation Hypotheses tested through experiment SCIENTIFIC METHODS
Large-Data Software Defined Visualization on CPUs
Large-Data Software Defined Visualization on CPUs Greg P. Johnson, Bruce Cherniak 2015 Rice Oil & Gas HPC Workshop Trend: Increasing Data Size Measuring / modeling increasingly complex phenomena Rendering
HPC Wales Skills Academy Course Catalogue 2015
HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses
Trends in High-Performance Computing for Power Grid Applications
Trends in High-Performance Computing for Power Grid Applications Franz Franchetti ECE, Carnegie Mellon University www.spiral.net Co-Founder, SpiralGen www.spiralgen.com This talk presents my personal views
Post-processing and Visualization with Open-Source Tools. Journée Scientifique Centre Image April 9, 2015 - Julien Jomier
Post-processing and Visualization with Open-Source Tools Journée Scientifique Centre Image April 9, 2015 - Julien Jomier Kitware - Leader in Open Source Software for Scientific Computing Software Development
Petascale Visualization: Approaches and Initial Results
Petascale Visualization: Approaches and Initial Results James Ahrens Li-Ta Lo, Boonthanome Nouanesengsy, John Patchett, Allen McPherson Los Alamos National Laboratory LA-UR- 08-07337 Operated by Los Alamos
Overview of HPC Resources at Vanderbilt
Overview of HPC Resources at Vanderbilt Will French Senior Application Developer and Research Computing Liaison Advanced Computing Center for Research and Education June 10, 2015 2 Computing Resources
Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer
Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer Stan Posey, MSc and Bill Loewe, PhD Panasas Inc., Fremont, CA, USA Paul Calleja, PhD University of Cambridge,
An Analytical Framework for Particle and Volume Data of Large-Scale Combustion Simulations. Franz Sauer 1, Hongfeng Yu 2, Kwan-Liu Ma 1
An Analytical Framework for Particle and Volume Data of Large-Scale Combustion Simulations Franz Sauer 1, Hongfeng Yu 2, Kwan-Liu Ma 1 1 University of California, Davis 2 University of Nebraska, Lincoln
So#ware Tools and Techniques for HPC, Clouds, and Server- Class SoCs Ron Brightwell
So#ware Tools and Techniques for HPC, Clouds, and Server- Class SoCs Ron Brightwell R&D Manager, Scalable System So#ware Department Sandia National Laboratories is a multi-program laboratory managed and
Scientific Visualization with Open Source Tools. HM 2014 Julien Jomier [email protected]
Scientific Visualization with Open Source Tools HM 2014 Julien Jomier [email protected] Visualization is Communication Challenges of Visualization Challenges of Visualization Heterogeneous data
Visualization of Adaptive Mesh Refinement Data with VisIt
Visualization of Adaptive Mesh Refinement Data with VisIt Gunther H. Weber Lawrence Berkeley National Laboratory VisIt Richly featured visualization and analysis tool for large data sets Built for five
VisIt: A Tool for Visualizing and Analyzing Very Large Data. Hank Childs, Lawrence Berkeley National Laboratory December 13, 2010
VisIt: A Tool for Visualizing and Analyzing Very Large Data Hank Childs, Lawrence Berkeley National Laboratory December 13, 2010 VisIt is an open source, richly featured, turn-key application for large
Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes
Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes Eric Petit, Loïc Thebault, Quang V. Dinh May 2014 EXA2CT Consortium 2 WPs Organization Proto-Applications
Large Vector-Field Visualization, Theory and Practice: Large Data and Parallel Visualization Hank Childs + D. Pugmire, D. Camp, C. Garth, G.
Large Vector-Field Visualization, Theory and Practice: Large Data and Parallel Visualization Hank Childs + D. Pugmire, D. Camp, C. Garth, G. Weber, S. Ahern, & K. Joy Lawrence Berkeley National Laboratory
HPC Deployment of OpenFOAM in an Industrial Setting
HPC Deployment of OpenFOAM in an Industrial Setting Hrvoje Jasak [email protected] Wikki Ltd, United Kingdom PRACE Seminar: Industrial Usage of HPC Stockholm, Sweden, 28-29 March 2011 HPC Deployment
HOW TO VISUALIZE YOUR GPU-ACCELERATED SIMULATION RESULTS. Peter Messmer, NVIDIA
HOW TO VISUALIZE YOUR GPU-ACCELERATED SIMULATION RESULTS Peter Messmer, NVIDIA RANGE OF ANALYSIS AND VIZ TASKS Analysis: Focus quantitative Visualization: Focus qualitative Monitoring, Steering TRADITIONAL
Designing and Building Applications for Extreme Scale Systems CS598 William Gropp www.cs.illinois.edu/~wgropp
Designing and Building Applications for Extreme Scale Systems CS598 William Gropp www.cs.illinois.edu/~wgropp Welcome! Who am I? William (Bill) Gropp Professor of Computer Science One of the Creators of
Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age
Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age Xuan Shi GRA: Bowei Xue University of Arkansas Spatiotemporal Modeling of Human Dynamics
Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage
White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage
Hank Childs, University of Oregon
Exascale Analysis & Visualization: Get Ready For a Whole New World Sept. 16, 2015 Hank Childs, University of Oregon Before I forget VisIt: visualization and analysis for very big data DOE Workshop for
Scientific Computing Programming with Parallel Objects
Scientific Computing Programming with Parallel Objects Esteban Meneses, PhD School of Computing, Costa Rica Institute of Technology Parallel Architectures Galore Personal Computing Embedded Computing Moore
Data Analytics at NERSC. Joaquin Correa [email protected] NERSC Data and Analytics Services
Data Analytics at NERSC Joaquin Correa [email protected] NERSC Data and Analytics Services NERSC User Meeting August, 2015 Data analytics at NERSC Science Applications Climate, Cosmology, Kbase, Materials,
Visualization Infrastructure and Services at the MPCDF
Visualization Infrastructure and Services at the MPCDF Markus Rampp & Klaus Reuter Max Planck Computing and Data Facility (MPCDF) ([email protected]) Interdisciplinary Cluster Workshop on Visualization
Data Deduplication in a Hybrid Architecture for Improving Write Performance
Data Deduplication in a Hybrid Architecture for Improving Write Performance Data-intensive Salable Computing Laboratory Department of Computer Science Texas Tech University Lubbock, Texas June 10th, 2013
Addressing Big Data Challenges in Simulation-based Science
Addressing Big Data Challenges in Simulation-based Science Manish Parashar* Rutgers Discovery Informatics Institute (RDI2) NSF Center for Cloud and Autonomic Computing (CAC) Department of Electrical and
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Qingyu Meng, Alan Humphrey, Martin Berzins Thanks to: John Schmidt and J. Davison de St. Germain, SCI Institute Justin Luitjens
HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK
HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK Steve Oberlin CTO, Accelerated Computing US to Build Two Flagship Supercomputers SUMMIT SIERRA Partnership for Science 100-300 PFLOPS Peak Performance
Mission Need Statement for the Next Generation High Performance Production Computing System Project (NERSC-8)
Mission Need Statement for the Next Generation High Performance Production Computing System Project () (Non-major acquisition project) Office of Advanced Scientific Computing Research Office of Science
Data-intensive HPC: opportunities and challenges. Patrick Valduriez
Data-intensive HPC: opportunities and challenges Patrick Valduriez Big Data Landscape Multi-$billion market! Big data = Hadoop = MapReduce? No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard,
Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley
P1.1 AN INTEGRATED DATA MANAGEMENT, RETRIEVAL AND VISUALIZATION SYSTEM FOR EARTH SCIENCE DATASETS Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University Xu Liang ** University
NA-ASC-128R-14-VOL.2-PN
L L N L - X X X X - X X X X X Advanced Simulation and Computing FY15 19 Program Notebook for Advanced Technology Development & Mitigation, Computational Systems and Software Environment and Facility Operations
Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel
Big Data and Analytics: Getting Started with ArcGIS Mike Park Erik Hoel Agenda Overview of big data Distributed computation User experience Data management Big data What is it? Big Data is a loosely defined
Cluster, Grid, Cloud Concepts
Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of
In-situ Visualization
In-situ Visualization Dr. Jean M. Favre Scientific Computing Research Group 13-01-2011 Outline Motivations How is parallel visualization done today Visualization pipelines Execution paradigms Many grids
IDL. Get the answers you need from your data. IDL
Get the answers you need from your data. IDL is the preferred computing environment for understanding complex data through interactive visualization and analysis. IDL Powerful visualization. Interactive
Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga
Programming models for heterogeneous computing Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Talk outline [30 slides] 1. Introduction [5 slides] 2.
A Case Study - Scaling Legacy Code on Next Generation Platforms
Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 00 (2015) 000 000 www.elsevier.com/locate/procedia 24th International Meshing Roundtable (IMR24) A Case Study - Scaling Legacy
Visualization with ParaView. Greg Johnson
Visualization with Greg Johnson Before we begin Make sure you have 3.8.0 installed so you can follow along in the lab section http://paraview.org/paraview/resources/software.html http://www.paraview.org/
ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU
Computer Science 14 (2) 2013 http://dx.doi.org/10.7494/csci.2013.14.2.243 Marcin Pietroń Pawe l Russek Kazimierz Wiatr ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU Abstract This paper presents
MEng, BSc Applied Computer Science
School of Computing FACULTY OF ENGINEERING MEng, BSc Applied Computer Science Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give a machine instructions
Summit and Sierra Supercomputers:
Whitepaper Summit and Sierra Supercomputers: An Inside Look at the U.S. Department of Energy s New Pre-Exascale Systems November 2014 1 Contents New Flagship Supercomputers in U.S. to Pave Path to Exascale
NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist
NVIDIA CUDA Software and GPU Parallel Computing Architecture David B. Kirk, Chief Scientist Outline Applications of GPU Computing CUDA Programming Model Overview Programming in CUDA The Basics How to Get
Parallel Computing for Data Science
Parallel Computing for Data Science With Examples in R, C++ and CUDA Norman Matloff University of California, Davis USA (g) CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint
Building Platform as a Service for Scientific Applications
Building Platform as a Service for Scientific Applications Moustafa AbdelBaky [email protected] Rutgers Discovery Informa=cs Ins=tute (RDI 2 ) The NSF Cloud and Autonomic Compu=ng Center Department
Big Data Challenges in Simulation-based Science
Big Data Challenges in Simulation-based Science Manish Parashar* Rutgers Discovery Informatics Institute (RDI 2 ) Department of Computer Science *Hoang Bui, Tong Jin, Qian Sun, Fan Zhang, ADIOS Team, and
Recent and Future Activities in HPC and Scientific Data Management Siegfried Benkner
Recent and Future Activities in HPC and Scientific Data Management Siegfried Benkner Research Group Scientific Computing Faculty of Computer Science University of Vienna AUSTRIA http://www.par.univie.ac.at
NEXT-GENERATION. EXTREME Scale. Visualization Technologies: Enabling Discoveries at ULTRA-SCALE VISUALIZATION
ULTRA-SCALE VISUALIZATION NEXT-GENERATION Visualization Technologies: Enabling Discoveries at EXTREME Scale The SciDAC Institute for Ultra-Scale Visualization aims to enable extreme-scale knowledge discovery
E6895 Advanced Big Data Analytics Lecture 14:! NVIDIA GPU Examples and GPU on ios devices
E6895 Advanced Big Data Analytics Lecture 14: NVIDIA GPU Examples and GPU on ios devices Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist,
Large-Scale Reservoir Simulation and Big Data Visualization
Large-Scale Reservoir Simulation and Big Data Visualization Dr. Zhangxing John Chen NSERC/Alberta Innovates Energy Environment Solutions/Foundation CMG Chair Alberta Innovates Technology Future (icore)
HPC with Multicore and GPUs
HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville CS 594 Lecture Notes March 4, 2015 1/18 Outline! Introduction - Hardware
The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015
The GPU Accelerated Data Center Marc Hamilton, August 27, 2015 THE GPU-ACCELERATED DATA CENTER HPC DEEP LEARNING PC VIRTUALIZATION CLOUD GAMING RENDERING 2 Product design FROM ADVANCED RENDERING TO VIRTUAL
Machine Learning for Data-Driven Discovery
Machine Learning for Data-Driven Discovery Thoughts on the Past, Present and Future Sreenivas Rangan Sukumar, PhD Data Scientist and R&D Staff Computaional Data Analytics Group Computational Sciences and
Part I Courses Syllabus
Part I Courses Syllabus This document provides detailed information about the basic courses of the MHPC first part activities. The list of courses is the following 1.1 Scientific Programming Environment
NVIDIA IndeX. Whitepaper. Document version 1.0 3 June 2013
NVIDIA IndeX Whitepaper Document version 1.0 3 June 2013 NVIDIA Advanced Rendering Center Fasanenstraße 81 10623 Berlin phone +49.30.315.99.70 fax +49.30.315.99.733 [email protected] Copyright Information
Performance Monitoring of Parallel Scientific Applications
Performance Monitoring of Parallel Scientific Applications Abstract. David Skinner National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory This paper introduces an infrastructure
Quick Reference Selling Guide for Intel Lustre Solutions Overview
Overview The 30 Second Pitch Intel Solutions for Lustre* solutions Deliver sustained storage performance needed that accelerate breakthrough innovations and deliver smarter, data-driven decisions for enterprise
SQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
Big Data Management in the Clouds and HPC Systems
Big Data Management in the Clouds and HPC Systems Hemera Final Evaluation Paris 17 th December 2014 Shadi Ibrahim [email protected] Era of Big Data! Source: CNRS Magazine 2013 2 Era of Big Data! Source:
Visualizing Large, Complex Data
Visualizing Large, Complex Data Outline Visualizing Large Scientific Simulation Data Importance-driven visualization Multidimensional filtering Visualizing Large Networks A layout method Filtering methods
MEng, BSc Computer Science with Artificial Intelligence
School of Computing FACULTY OF ENGINEERING MEng, BSc Computer Science with Artificial Intelligence Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give
Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data
Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data Amanda O Connor, Bryan Justice, and A. Thomas Harris IN52A. Big Data in the Geosciences:
www.xenon.com.au STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING
www.xenon.com.au STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING GPU COMPUTING VISUALISATION XENON Accelerating Exploration Mineral, oil and gas exploration is an expensive and challenging
Parallel Visualization for GIS Applications
Parallel Visualization for GIS Applications Alexandre Sorokine, Jamison Daniel, Cheng Liu Oak Ridge National Laboratory, Geographic Information Science & Technology, PO Box 2008 MS 6017, Oak Ridge National
NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect
SIGGRAPH 2013 Shaping the Future of Visual Computing NVIDIA IndeX Enabling Interactive and Scalable Visualization for Large Data Marc Nienhaus, NVIDIA IndeX Engineering Manager and Chief Architect NVIDIA
Xeon+FPGA Platform for the Data Center
Xeon+FPGA Platform for the Data Center ISCA/CARL 2015 PK Gupta, Director of Cloud Platform Technology, DCG/CPG Overview Data Center and Workloads Xeon+FPGA Accelerator Platform Applications and Eco-system
Parallel Computing. Benson Muite. [email protected] http://math.ut.ee/ benson. https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage
Parallel Computing Benson Muite [email protected] http://math.ut.ee/ benson https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage 3 November 2014 Hadoop, Review Hadoop Hadoop History Hadoop Framework
SciDAC Visualization and Analytics Center for Enabling Technology
SciDAC Visualization and Analytics Center for Enabling Technology E. Wes Bethel 1, Chris Johnson 2, Ken Joy 3, Sean Ahern 4, Valerio Pascucci 5, Hank Childs 5, Jonathan Cohen 5, Mark Duchaineau 5, Bernd
Parallel I/O on JUQUEEN
Parallel I/O on JUQUEEN 3. February 2015 3rd JUQUEEN Porting and Tuning Workshop Sebastian Lührs, Kay Thust [email protected], [email protected] Jülich Supercomputing Centre Overview Blue Gene/Q
The Visualization Pipeline
The Visualization Pipeline Conceptual perspective Implementation considerations Algorithms used in the visualization Structure of the visualization applications Contents The focus is on presenting the
Using an In-Memory Data Grid for Near Real-Time Data Analysis
SCALEOUT SOFTWARE Using an In-Memory Data Grid for Near Real-Time Data Analysis by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 IN today s competitive world, businesses
HPC Software Requirements to Support an HPC Cluster Supercomputer
HPC Software Requirements to Support an HPC Cluster Supercomputer Susan Kraus, Cray Cluster Solutions Software Product Manager Maria McLaughlin, Cray Cluster Solutions Product Marketing Cray Inc. WP-CCS-Software01-0417
