Accelerating Shallow Water Flow and Mass Transport Using Lattice Boltzmann Methods on GPUs. Kevin Tubbs Dell Global HPC Solutions Engineering

Size: px
Start display at page:

Download "Accelerating Shallow Water Flow and Mass Transport Using Lattice Boltzmann Methods on GPUs. Kevin Tubbs Dell Global HPC Solutions Engineering"

Transcription

1 Accelerating Shallow Water Flow and Mass Transport Using Lattice Boltzmann Methods on GPUs Kevin Tubbs Dell Global HPC Solutions Engineering

2 Introduction

3 Background and Overview Role of CFD Improve Environmental Predictions Used to Improve Decision and Policy Making Test and Evaluate Preventative Methods 3

4 Dell Global HPC Solutions Engineering Performance Analysis & Characterization - Node level - Cluster level Power Consumption and Energy Analysis Benchmark and Application Analysis - Industry Standard Benchmarks & Apps: HPL, NPB, NAMD - Focus Area Specific: CFD, Climate & Weather, Molecular Dynamics, etc. 4 Confidential

5 Why Accelereyes? Programmability Faster Writing Kernels Using Libraries Slower Instruction Set Optimization Compiler Directives Time - Consuming Easy-to-use

6 Lattice Boltzmann Methods

7 Lattice Boltzmann Method Overview: Modeling Shallow Water and Mass Transport Shallow Water Equation h ( hui ) + = 0 t x i 2 2 ( hu ) ( hu u ) h ( hu ) i i + = g + ν + Fi t xi xi 2 xj xj Advection Dispersion Equation for Solute Concentration ( hc) ( hu jc) i j C + = Dh ij + F t xj x x j C 1 ( ( eq) ) f + c f = f f t τ LB Numerical Formulation

8 Background: LBM f + f = f f τ Boltzmann equation e t Particle distribution function f(, xe,) t 1 ( ( eq) ) g(, xe,) t EDF ( ue ) 2 f h f h f 2 eq c s ue α 3 α 1 uu eq eq α = ωα + +, α > = α e e 2 e 2 e α > 0 LBM Grid e α ( hue ) 2 g C c hue 3 1 hu hu g hc g 2 eq sα α α eq eq α = ωα + +, α > α = α e e 2 he 2 he α > 0 ( 0,0) α = 0 ( α 1) π ( α 1) π = e cos,sin, α = 1,2,3,4 4 4 ( α 1) π ( α 1) π 2e cos,sin, α = 5, 6, 7,8 4 4 D2Q9 LBM

9 Background: LBM (cont.) Zeroth Moment First Moment LBM evolves in a two-step process Step 1. Collision Step 2. Streaming h= α f α hui = α eαi f 1 fα t fα t fα t fα t τ t + 2 e αi F i( x, t ) 6e eq ( x, ˆ) = ( x, ) ( ( x, ) ( x, )) α hc = α g α Chui = α eαig α ( x+ e, + ) = ( x, ˆ) f tt t f t α α α

10 Results: 1) Model Validation 2. Simulation Results

11 2D Dam Break: Backward Facing Step Domain: 12 m x 12 m Dam Break Initial Conditions Upstream: water depth 5 m Downstream: water depth 1m BC: Free Slip at walls for flow t = 0.5 s

12 2D Partial Dam Break Water Depth Surface t = 7.2 s Velocity Vector Field t = 7.2 s Domain: 200 m x 200 m Dam Break Initial Conditions Upstream: water depth 10m Downstream: water depth 5m BC: Free Slip at walls for flow, Impermeable for concentration

13 2D Continuous Release Mass Transport Problem Setup Domain: Infinite Shallow Water Pool Constant Water Depth 1 m τ s ( ( τ )) 1 A = sij hdij = t τ ( 12) Velocity Anisotropic Dispersion D h κ xx = u L Dyy h κt c = u κ / κ = 10 L T A

14 Domain: 800m x 200m Dam Break with Transport Initial Conditions o 2D Dam Break with Mass Transport Upstream: water depth 10m Downstream: water depth 5m BC: Free Slip at walls for flow Time Snap shots t = 10 s t = 30 s t = 60 s t = 90 s

15 2D Dam Break with Mass Transport Domain: 800 m x 200 m Dam Break with Transport Initial Conditions o Upstream: water depth 10m, Concentration 0.7 Downstream: water depth 5m, Concentration 0.2 BC: Free Slip at walls for flow, Impermeable for concentration Time Snap shots t = 10 s t = 30 s t = 60 s t = 90 s D = k u δ + k k ( ) ij L ij L T uu i j h g u Cz k L = 5.93 k = 0.23 T

16 Performance

17 2D Dam Break: Backward Facing Step

18 2D Continuous Release Mass Transport

19 2D Dam Break with Mass Transport

20 Summary LBM is a flexible algorithm for various CFD simulations Tools from Accelereyes allow rapid development and validation of LBM based CFD algorithms The GPU accelerated LBM algorithm achieved maximum of 24X speedup on GPU based architectures. The GPU parallel performance depends on size of simulation.

21 Resources Blogs Whitepapers HPC Advisor Online

22 Related Work Publications, Proceedings & Presentations: Publications Tubbs, K. R., and F. T.-C. Tsai, GPU Accelerated Lattice Boltzmann Model for Shallow Water Flow and Mass Transport. Int. J. Numer. Meth. Eng (2010) DOI: /nme Tubbs, K. R., and F. T.-C. Tsai, Multilayer Shallow Water Flow using Lattice Boltzmann Model with High Performance Computing. Advances in Water Resources, 32(12), , Presentations Tubbs, K. R., and F. T.-C. Tsai, A GPU Accelerated Lattice Boltzmann Model for Shallow Water Flow and Mass Transport, Sixth International Conference for Mesoscopic Methods in Engineering and Science (ICMMES), Guangzhou City, Guangdong Province, China, July 13-17, Proceedings Tubbs, K. R., and F. T.-C.. Tsai. Simulation of Multilayer Shallow Water Fluid Flow Using Lattice Boltzmann Modeling and High Performance Computing. ASCE/EWRI World Water & Environmental Resources Conference, Kansas City, Missouri, May 17-21, Tubbs, K. R., and F. T.-C.. Tsai, C. White, G. Allen. Lattice Boltzmann Modeling Using High Performance Computing for Shallow Water Fluid Flow and Mass Transport, American Geophysical Union 2008 Fall Meeting, San Francisco, CA, Dec 15-19,

23 Questions

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR Frédéric Kuznik, frederic.kuznik@insa lyon.fr 1 Framework Introduction Hardware architecture CUDA overview Implementation details A simple case:

More information

Boundary Conditions in lattice Boltzmann method

Boundary Conditions in lattice Boltzmann method Boundar Conditions in lattice Boltzmann method Goncalo Silva Department of Mechanical Engineering Instituto Superior Técnico (IST) Lisbon, Portugal Outline Introduction 1 Introduction Boundar Value Problems

More information

walberla: A software framework for CFD applications on 300.000 Compute Cores

walberla: A software framework for CFD applications on 300.000 Compute Cores walberla: A software framework for CFD applications on 300.000 Compute Cores J. Götz (LSS Erlangen, jan.goetz@cs.fau.de), K. Iglberger, S. Donath, C. Feichtinger, U. Rüde Lehrstuhl für Informatik 10 (Systemsimulation)

More information

Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures

Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures E Calore, S F Schifano, R Tripiccione Enrico Calore INFN Ferrara, Italy Perspectives of GPU Computing in Physics

More information

CFD Application on Food Industry; Energy Saving on the Bread Oven

CFD Application on Food Industry; Energy Saving on the Bread Oven Middle-East Journal of Scientific Research 13 (8): 1095-1100, 2013 ISSN 1990-9233 IDOSI Publications, 2013 DOI: 10.5829/idosi.mejsr.2013.13.8.548 CFD Application on Food Industry; Energy Saving on the

More information

walberla: A software framework for CFD applications

walberla: A software framework for CFD applications walberla: A software framework for CFD applications U. Rüde, S. Donath, C. Feichtinger, K. Iglberger, F. Deserno, M. Stürmer, C. Mihoubi, T. Preclic, D. Haspel (all LSS Erlangen), N. Thürey (LSS Erlangen/

More information

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster Aaron Hagan and Ye Zhao Kent State University Abstract. In this paper, we propose an inherent parallel scheme for 3D image segmentation

More information

Accelerating CFD using OpenFOAM with GPUs

Accelerating CFD using OpenFOAM with GPUs Accelerating CFD using OpenFOAM with GPUs Authors: Saeed Iqbal and Kevin Tubbs The OpenFOAM CFD Toolbox is a free, open source CFD software package produced by OpenCFD Ltd. Its user base represents a wide

More information

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms Björn Rocker Hamburg, June 17th 2010 Engineering Mathematics and Computing Lab (EMCL) KIT University of the State

More information

HPC enabling of OpenFOAM R for CFD applications

HPC enabling of OpenFOAM R for CFD applications HPC enabling of OpenFOAM R for CFD applications Towards the exascale: OpenFOAM perspective Ivan Spisso 25-27 March 2015, Casalecchio di Reno, BOLOGNA. SuperComputing Applications and Innovation Department,

More information

Using Computational Fluid Dynamics (CFD) for improving cooling system efficiency for Data centers

Using Computational Fluid Dynamics (CFD) for improving cooling system efficiency for Data centers Data Centre Best Practises Workshop Using Computational Fluid Dynamics (CFD) for improving cooling system efficiency for Data centers Shishir Gupta 17 th March 2009 You are Here Introduction to CFD Data

More information

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Haohuan Fu haohuan@tsinghua.edu.cn High Performance Geo-Computing (HPGC) Group Center for Earth System Science Tsinghua University

More information

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Amani AlOnazi, David E. Keyes, Alexey Lastovetsky, Vladimir Rychkov Extreme Computing Research Center,

More information

Parallel programming with Session Java

Parallel programming with Session Java 1/17 Parallel programming with Session Java Nicholas Ng (nickng@doc.ic.ac.uk) Imperial College London 2/17 Motivation Parallel designs are difficult, error prone (eg. MPI) Session types ensure communication

More information

Remote Graphical Visualization of Large Interactive Spatial Data

Remote Graphical Visualization of Large Interactive Spatial Data Remote Graphical Visualization of Large Interactive Spatial Data ComplexHPC Spring School 2011 International ComplexHPC Challenge Cristinel Mihai Mocan Computer Science Department Technical University

More information

walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation

walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation SIAM Parallel Processing for Scientific Computing 2012 February 16, 2012 Florian Schornbaum,

More information

NVIDIA Tesla K20-K20X GPU Accelerators Benchmarks Application Performance Technical Brief

NVIDIA Tesla K20-K20X GPU Accelerators Benchmarks Application Performance Technical Brief NVIDIA Tesla K20-K20X GPU Accelerators Benchmarks Application Performance Technical Brief NVIDIA changed the high performance computing (HPC) landscape by introducing its Fermibased GPUs that delivered

More information

Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries

Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries Jonas Fietz 2, Mathias J. Krause 2, Christian Schulz 1, Peter Sanders 1, and Vincent Heuveline 2 1 Karlsruhe Institute

More information

Fast Parallel Algorithms for Computational Bio-Medicine

Fast Parallel Algorithms for Computational Bio-Medicine Fast Parallel Algorithms for Computational Bio-Medicine H. Köstler, J. Habich, J. Götz, M. Stürmer, S. Donath, T. Gradl, D. Ritter, D. Bartuschat, C. Feichtinger, C. Mihoubi, K. Iglberger (LSS Erlangen)

More information

High Performance Computing

High Performance Computing High Parallel Computing Hybrid Program Coding Heterogeneous Program Coding Heterogeneous Parallel Coding Hybrid Parallel Coding High Performance Computing Highly Proficient Coding Highly Parallelized Code

More information

HPC-related R&D in 863 Program

HPC-related R&D in 863 Program HPC-related R&D in 863 Program Depei Qian Sino-German Joint Software Institute (JSI) Beihang University Aug. 27, 2010 Outline The 863 key project on HPC and Grid Status and Next 5 years 863 efforts on

More information

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance 11 th International LS-DYNA Users Conference Session # LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton 3, Onur Celebioglu

More information

Collaborative and Interactive CFD Simulation using High Performance Computers

Collaborative and Interactive CFD Simulation using High Performance Computers Collaborative and Interactive CFD Simulation using High Performance Computers Petra Wenisch, Andre Borrmann, Ernst Rank, Christoph van Treeck Technische Universität München {wenisch, borrmann, rank, treeck}@bv.tum.de

More information

The Green Index: A Metric for Evaluating System-Wide Energy Efficiency in HPC Systems

The Green Index: A Metric for Evaluating System-Wide Energy Efficiency in HPC Systems 202 IEEE 202 26th IEEE International 26th International Parallel Parallel and Distributed and Distributed Processing Processing Symposium Symposium Workshops Workshops & PhD Forum The Green Index: A Metric

More information

MODULE VII LARGE BODY WAVE DIFFRACTION

MODULE VII LARGE BODY WAVE DIFFRACTION MODULE VII LARGE BODY WAVE DIFFRACTION 1.0 INTRODUCTION In the wave-structure interaction problems, it is classical to divide into two major classification: slender body interaction and large body interaction.

More information

Workshop on Parallel and Distributed Scientific and Engineering Computing, Shanghai, 25 May 2012

Workshop on Parallel and Distributed Scientific and Engineering Computing, Shanghai, 25 May 2012 Scientific Application Performance on HPC, Private and Public Cloud Resources: A Case Study Using Climate, Cardiac Model Codes and the NPB Benchmark Suite Peter Strazdins (Research School of Computer Science),

More information

FLOODING AND DRYING IN DISCONTINUOUS GALERKIN DISCRETIZATIONS OF SHALLOW WATER EQUATIONS

FLOODING AND DRYING IN DISCONTINUOUS GALERKIN DISCRETIZATIONS OF SHALLOW WATER EQUATIONS European Conference on Computational Fluid Dynamics ECCOMAS CFD 26 P. Wesseling, E. Oñate and J. Périaux (Eds) c TU Delft, The Netherlands, 26 FLOODING AND DRING IN DISCONTINUOUS GALERKIN DISCRETIZATIONS

More information

Flow Physics Analysis of Three-Bucket Helical Savonius Rotor at Twist Angle Using CFD

Flow Physics Analysis of Three-Bucket Helical Savonius Rotor at Twist Angle Using CFD Vol.3, Issue.2, March-April. 2013 pp-739-746 ISSN: 2249-6645 Flow Physics Analysis of Three-Bucket Helical Savonius Rotor at Twist Angle Using CFD Pinku Debnath, 1 Rajat Gupta 2 12 Mechanical Engineering,

More information

OpenACC Parallelization and Optimization of NAS Parallel Benchmarks

OpenACC Parallelization and Optimization of NAS Parallel Benchmarks OpenACC Parallelization and Optimization of NAS Parallel Benchmarks Presented by Rengan Xu GTC 2014, S4340 03/26/2014 Rengan Xu, Xiaonan Tian, Sunita Chandrasekaran, Yonghong Yan, Barbara Chapman HPC Tools

More information

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi ICPP 6 th International Workshop on Parallel Programming Models and Systems Software for High-End Computing October 1, 2013 Lyon, France

More information

Performance of HPC Applications on the Amazon Web Services Cloud

Performance of HPC Applications on the Amazon Web Services Cloud Cloudcom 2010 November 1, 2010 Indianapolis, IN Performance of HPC Applications on the Amazon Web Services Cloud Keith R. Jackson, Lavanya Ramakrishnan, Krishna Muriki, Shane Canon, Shreyas Cholia, Harvey

More information

ACCELERATING COMMERCIAL LINEAR DYNAMIC AND NONLINEAR IMPLICIT FEA SOFTWARE THROUGH HIGH- PERFORMANCE COMPUTING

ACCELERATING COMMERCIAL LINEAR DYNAMIC AND NONLINEAR IMPLICIT FEA SOFTWARE THROUGH HIGH- PERFORMANCE COMPUTING ACCELERATING COMMERCIAL LINEAR DYNAMIC AND Vladimir Belsky Director of Solver Development* Luis Crivelli Director of Solver Development* Matt Dunbar Chief Architect* Mikhail Belyi Development Group Manager*

More information

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC HPC Architecture End to End Alexandre Chauvin Agenda HPC Software Stack Visualization National Scientific Center 2 Agenda HPC Software Stack Alexandre Chauvin Typical HPC Software Stack Externes LAN Typical

More information

Simulation Platform Overview

Simulation Platform Overview Simulation Platform Overview Build, compute, and analyze simulations on demand www.rescale.com CASE STUDIES Companies in the aerospace and automotive industries use Rescale to run faster simulations Aerospace

More information

Jean-Pierre Panziera Teratec 2011

Jean-Pierre Panziera Teratec 2011 Technologies for the future HPC systems Jean-Pierre Panziera Teratec 2011 3 petaflop systems : TERA 100, CURIE & IFERC Tera100 Curie IFERC 1.25 PetaFlops 256 TB ory 30 PB disk storage 140 000+ Xeon cores

More information

Application of a Development Time Productivity Metric to Parallel Software Development

Application of a Development Time Productivity Metric to Parallel Software Development Application of a Development Time Metric to Parallel Software Development Andrew Funk afunk@ll.mit.edu Victor Basili 2 basili@cs.umd.edu Lorin Hochstein 2 lorin@cs.umd.edu Jeremy Kepner kepner@ll.mit.edu

More information

Combining Scalability and Efficiency for SPMD Applications on Multicore Clusters*

Combining Scalability and Efficiency for SPMD Applications on Multicore Clusters* Combining Scalability and Efficiency for SPMD Applications on Multicore Clusters* Ronal Muresano, Dolores Rexachs and Emilio Luque Computer Architecture and Operating System Department (CAOS) Universitat

More information

Multi-GPU Load Balancing for Simulation and Rendering

Multi-GPU Load Balancing for Simulation and Rendering Multi- Load Balancing for Simulation and Rendering Yong Cao Computer Science Department, Virginia Tech, USA In-situ ualization and ual Analytics Instant visualization and interaction of computing tasks

More information

Interactive simulation of an ash cloud of the volcano Grímsvötn

Interactive simulation of an ash cloud of the volcano Grímsvötn Interactive simulation of an ash cloud of the volcano Grímsvötn 1 MATHEMATICAL BACKGROUND Simulating flows in the atmosphere, being part of CFD, is on of the research areas considered in the working group

More information

Multicore Parallel Computing with OpenMP

Multicore Parallel Computing with OpenMP Multicore Parallel Computing with OpenMP Tan Chee Chiang (SVU/Academic Computing, Computer Centre) 1. OpenMP Programming The death of OpenMP was anticipated when cluster systems rapidly replaced large

More information

Advances in Smart Systems Research : ISSN 2050-8662 : http://nimbusvault.net/publications/koala/assr/ Vol. 3. No. 3 : pp.

Advances in Smart Systems Research : ISSN 2050-8662 : http://nimbusvault.net/publications/koala/assr/ Vol. 3. No. 3 : pp. Advances in Smart Systems Research : ISSN 2050-8662 : http://nimbusvault.net/publications/koala/assr/ Vol. 3. No. 3 : pp.49-54 : isrp13-005 Optimized Communications on Cloud Computer Processor by Using

More information

Program Optimization for Multi-core Architectures

Program Optimization for Multi-core Architectures Program Optimization for Multi-core Architectures Sanjeev K Aggarwal (ska@iitk.ac.in) M Chaudhuri (mainak@iitk.ac.in) R Moona (moona@iitk.ac.in) Department of Computer Science and Engineering, IIT Kanpur

More information

Retour d expérience : portage d une application haute-performance vers un langage de haut niveau

Retour d expérience : portage d une application haute-performance vers un langage de haut niveau Retour d expérience : portage d une application haute-performance vers un langage de haut niveau ComPAS/RenPar 2013 Mathias Bourgoin - Emmanuel Chailloux - Jean-Luc Lamotte 16 Janvier 2013 Our Goals Globally

More information

Using Cloud-Resolving Model Simulations of Deep Convection to Inform Cloud Parameterizations in Large-Scale Models

Using Cloud-Resolving Model Simulations of Deep Convection to Inform Cloud Parameterizations in Large-Scale Models Using Cloud-Resolving Model Simulations of Deep Convection to Inform Cloud Parameterizations in Large-Scale Models S. A. Klein National Oceanic and Atmospheric Administration Geophysical Fluid Dynamics

More information

HPC technology and future architecture

HPC technology and future architecture HPC technology and future architecture Visual Analysis for Extremely Large-Scale Scientific Computing KGT2 Internal Meeting INRIA France Benoit Lange benoit.lange@inria.fr Toàn Nguyên toan.nguyen@inria.fr

More information

CLASSICAL CONCEPT REVIEW 8

CLASSICAL CONCEPT REVIEW 8 CLASSICAL CONCEPT REVIEW 8 Kinetic Theory Information concerning the initial motions of each of the atoms of macroscopic systems is not accessible, nor do we have the computational capability even with

More information

CFD analysis for road vehicles - case study

CFD analysis for road vehicles - case study CFD analysis for road vehicles - case study Dan BARBUT*,1, Eugen Mihai NEGRUS 1 *Corresponding author *,1 POLITEHNICA University of Bucharest, Faculty of Transport, Splaiul Independentei 313, 060042, Bucharest,

More information

Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004

Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004 FSI-02-TN59-R2 Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004 1. Introduction A major new extension of the capabilities of FLOW-3D -- the multi-block grid model -- has been incorporated

More information

Choices and Applications of 2D/3D models for supporting harbour & coastal management

Choices and Applications of 2D/3D models for supporting harbour & coastal management Choices and Applications of 2D/3D models for supporting harbour & coastal management Terug naar overzicht Rob Uittenbogaard (1,2) (1) WL Delft Hydraulics (2) Delft University of Technology ; J.M. Burgerscentre

More information

The Value of High-Performance Computing for Simulation

The Value of High-Performance Computing for Simulation White Paper The Value of High-Performance Computing for Simulation High-performance computing (HPC) is an enormous part of the present and future of engineering simulation. HPC allows best-in-class companies

More information

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud February 25, 2014 1 Agenda v Mapping clients needs to cloud technologies v Addressing your pain

More information

AMLIGHT, Simulation Datasets, and Global Data Sharing

AMLIGHT, Simulation Datasets, and Global Data Sharing AMLIGHT, Simulation Datasets, and Global Data Sharing Jean-Bernard Minster (1,2,4,6), John J. Helly (1,2), Steven M. Day (3,4), Raul Castro Escamilla (5), Philip Maechling (4),Thomas H. Jordan (4), Amit

More information

The Methodology of Application Development for Hybrid Architectures

The Methodology of Application Development for Hybrid Architectures Computer Technology and Application 4 (2013) 543-547 D DAVID PUBLISHING The Methodology of Application Development for Hybrid Architectures Vladimir Orekhov, Alexander Bogdanov and Vladimir Gaiduchok Department

More information

HPC with Multicore and GPUs

HPC with Multicore and GPUs HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville CS 594 Lecture Notes March 4, 2015 1/18 Outline! Introduction - Hardware

More information

Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster

Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster Mark J. Stock, Ph.D., Adrin Gharakhani, Sc.D. Applied Scientific Research, Santa Ana, CA Christopher P. Stone, Ph.D. Computational

More information

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU

Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview

More information

EFFECT ON HEAT TRANSFER AND THERMAL DEVELOPMENT OF A RADIATIVELY PARTICIPATING FLUID IN A CHANNEL FLOW

EFFECT ON HEAT TRANSFER AND THERMAL DEVELOPMENT OF A RADIATIVELY PARTICIPATING FLUID IN A CHANNEL FLOW INTERNATIONAL JOURNAL OF MECHANICAL ENGINEERING AND TECHNOLOGY (IJMET) International Journal of Mechanical Engineering and Technology (IJMET), ISSN 0976 6340(Print), ISSN 0976 6340 (Print) ISSN 0976 6359

More information

Kashif Iqbal - PhD Kashif.iqbal@ichec.ie

Kashif Iqbal - PhD Kashif.iqbal@ichec.ie HPC/HTC vs. Cloud Benchmarking An empirical evalua.on of the performance and cost implica.ons Kashif Iqbal - PhD Kashif.iqbal@ichec.ie ICHEC, NUI Galway, Ireland With acknowledgment to Michele MicheloDo

More information

DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER

DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER European Congress on Computational Methods in Applied Sciences and Engineering ECCOMAS 2000 Barcelona, 11-14 September 2000 ECCOMAS DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER

More information

The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015

The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015 The GPU Accelerated Data Center Marc Hamilton, August 27, 2015 THE GPU-ACCELERATED DATA CENTER HPC DEEP LEARNING PC VIRTUALIZATION CLOUD GAMING RENDERING 2 Product design FROM ADVANCED RENDERING TO VIRTUAL

More information

Pushing the limits. Turbine simulation for next-generation turbochargers

Pushing the limits. Turbine simulation for next-generation turbochargers Pushing the limits Turbine simulation for next-generation turbochargers KWOK-KAI SO, BENT PHILLIPSEN, MAGNUS FISCHER Computational fluid dynamics (CFD) has matured and is now an indispensable tool for

More information

Interconnect Analysis: 10GigE and InfiniBand in High Performance Computing

Interconnect Analysis: 10GigE and InfiniBand in High Performance Computing Interconnect Analysis: 10GigE and InfiniBand in High Performance Computing WHITE PAPER Highlights: There is a large number of HPC applications that need the lowest possible latency for best performance

More information

HPC Programming Framework Research Team

HPC Programming Framework Research Team HPC Programming Framework Research Team 1. Team Members Naoya Maruyama (Team Leader) Motohiko Matsuda (Research Scientist) Soichiro Suzuki (Technical Staff) Mohamed Wahib (Postdoctoral Researcher) Shinichiro

More information

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui Hardware-Aware Analysis and Optimization of Stable Fluids Presentation Date: Sep 15 th 2009 Chrissie C. Cui Outline Introduction Highlights Flop and Bandwidth Analysis Mehrstellen Schemes Advection Caching

More information

Building Platform as a Service for Scientific Applications

Building Platform as a Service for Scientific Applications Building Platform as a Service for Scientific Applications Moustafa AbdelBaky moustafa@cac.rutgers.edu Rutgers Discovery Informa=cs Ins=tute (RDI 2 ) The NSF Cloud and Autonomic Compu=ng Center Department

More information

Model of a flow in intersecting microchannels. Denis Semyonov

Model of a flow in intersecting microchannels. Denis Semyonov Model of a flow in intersecting microchannels Denis Semyonov LUT 2012 Content Objectives Motivation Model implementation Simulation Results Conclusion Objectives A flow and a reaction model is required

More information

Data Centric Systems (DCS)

Data Centric Systems (DCS) Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems

More information

PARALLEL JAVASCRIPT. Norm Rubin (NVIDIA) Jin Wang (Georgia School of Technology)

PARALLEL JAVASCRIPT. Norm Rubin (NVIDIA) Jin Wang (Georgia School of Technology) PARALLEL JAVASCRIPT Norm Rubin (NVIDIA) Jin Wang (Georgia School of Technology) JAVASCRIPT Not connected with Java Scheme and self (dressed in c clothing) Lots of design errors (like automatic semicolon

More information

SUPERCOMPUTING FACILITY INAUGURATED AT BARC

SUPERCOMPUTING FACILITY INAUGURATED AT BARC SUPERCOMPUTING FACILITY INAUGURATED AT BARC The Honourable Prime Minister of India, Dr Manmohan Singh, inaugurated a new Supercomputing Facility at Bhabha Atomic Research Centre, Mumbai, on November 15,

More information

Performance Improvement of Application on the K computer

Performance Improvement of Application on the K computer Performance Improvement of Application on the K computer November 13, 2011 Kazuo Minami Team Leader, Application Development Team Research and Development Group Next-Generation Supercomputer R & D Center

More information

Towards Multiscale Simulations of Cloud Turbulence

Towards Multiscale Simulations of Cloud Turbulence Towards Multiscale Simulations of Cloud Turbulence Ryo Onishi *1,2 and Keiko Takahashi *1 *1 Earth Simulator Center/JAMSTEC *2 visiting scientist at Imperial College, London (Prof. Christos Vassilicos)

More information

High Performance Computing

High Performance Computing High Performance Computing Trey Breckenridge Computing Systems Manager Engineering Research Center Mississippi State University What is High Performance Computing? HPC is ill defined and context dependent.

More information

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based) Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf Flow Visualization Image-Based Methods (integration-based) Spot Noise (Jarke van Wijk, Siggraph 1991) Flow Visualization:

More information

animation animation shape specification as a function of time

animation animation shape specification as a function of time animation animation shape specification as a function of time animation representation many ways to represent changes with time intent artistic motion physically-plausible motion efficiency control typically

More information

Abaqus/CFD Sample Problems. Abaqus 6.10

Abaqus/CFD Sample Problems. Abaqus 6.10 Abaqus/CFD Sample Problems Abaqus 6.10 Contents 1. Oscillatory Laminar Plane Poiseuille Flow 2. Flow in Shear Driven Cavities 3. Buoyancy Driven Flow in Cavities 4. Turbulent Flow in a Rectangular Channel

More information

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics 22S:295 Seminar in Applied Statistics High Performance Computing in Statistics Luke Tierney Department of Statistics & Actuarial Science University of Iowa August 30, 2007 Luke Tierney (U. of Iowa) HPC

More information

TWO-DIMENSIONAL FINITE ELEMENT ANALYSIS OF FORCED CONVECTION FLOW AND HEAT TRANSFER IN A LAMINAR CHANNEL FLOW

TWO-DIMENSIONAL FINITE ELEMENT ANALYSIS OF FORCED CONVECTION FLOW AND HEAT TRANSFER IN A LAMINAR CHANNEL FLOW TWO-DIMENSIONAL FINITE ELEMENT ANALYSIS OF FORCED CONVECTION FLOW AND HEAT TRANSFER IN A LAMINAR CHANNEL FLOW Rajesh Khatri 1, 1 M.Tech Scholar, Department of Mechanical Engineering, S.A.T.I., vidisha

More information

Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61

Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 F# Applications to Computational Financial and GPU Computing May 16th Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 Today! Why care about F#? Just another fashion?! Three success stories! How Alea.cuBase

More information

ALL GROUND-WATER HYDROLOGY WORK IS MODELING. A Model is a representation of a system.

ALL GROUND-WATER HYDROLOGY WORK IS MODELING. A Model is a representation of a system. ALL GROUND-WATER HYDROLOGY WORK IS MODELING A Model is a representation of a system. Modeling begins when one formulates a concept of a hydrologic system, continues with application of, for example, Darcy's

More information

Parallel Computing. Benson Muite. benson.muite@ut.ee http://math.ut.ee/ benson. https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage

Parallel Computing. Benson Muite. benson.muite@ut.ee http://math.ut.ee/ benson. https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage Parallel Computing Benson Muite benson.muite@ut.ee http://math.ut.ee/ benson https://courses.cs.ut.ee/2014/paralleel/fall/main/homepage 3 November 2014 Hadoop, Review Hadoop Hadoop History Hadoop Framework

More information

An Introduction to Parallel Computing/ Programming

An Introduction to Parallel Computing/ Programming An Introduction to Parallel Computing/ Programming Vicky Papadopoulou Lesta Astrophysics and High Performance Computing Research Group (http://ahpc.euc.ac.cy) Dep. of Computer Science and Engineering European

More information

Pedraforca: ARM + GPU prototype

Pedraforca: ARM + GPU prototype www.bsc.es Pedraforca: ARM + GPU prototype Filippo Mantovani Workshop on exascale and PRACE prototypes Barcelona, 20 May 2014 Overview Goals: Test the performance, scalability, and energy efficiency of

More information

A PERFORMANCE COMPARISON USING HPC BENCHMARKS: WINDOWS HPC SERVER 2008 AND RED HAT ENTERPRISE LINUX 5

A PERFORMANCE COMPARISON USING HPC BENCHMARKS: WINDOWS HPC SERVER 2008 AND RED HAT ENTERPRISE LINUX 5 A PERFORMANCE COMPARISON USING HPC BENCHMARKS: WINDOWS HPC SERVER 2008 AND RED HAT ENTERPRISE LINUX 5 R. Henschel, S. Teige, H. Li, J. Doleschal, M. S. Mueller October 2010 Contents HPC at Indiana University

More information

A quasi three-dimensional variably saturated groundwater flow model for climate modeling

A quasi three-dimensional variably saturated groundwater flow model for climate modeling A quasi three-dimensional variabl saturated groundwater flow model for climate modeling Zhenghui Xie Zhenhua Di Zhendong LuoQian Ma LASG Institute of Atmospheric Phsics Chinese Academ of Sciences China

More information

Automatic Selection of Loop Scheduling Algorithms Using Reinforcement Learning

Automatic Selection of Loop Scheduling Algorithms Using Reinforcement Learning Automatic Selection of Loop Scheduling Algorithms Using Reinforcement Learning Sumithra Dhandayuthapani [1,2], Ioana Banicescu [1,2], Ricolindo L. Cariño [2], Eric Hansen [1], Jaderick P. Pabico [1,2],

More information

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which

More information

Graduate Certificate Program in Energy Conversion & Transport Offered by the Department of Mechanical and Aerospace Engineering

Graduate Certificate Program in Energy Conversion & Transport Offered by the Department of Mechanical and Aerospace Engineering Graduate Certificate Program in Energy Conversion & Transport Offered by the Department of Mechanical and Aerospace Engineering Intended Audience: Main Campus Students Distance (online students) Both Purpose:

More information

Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code

Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code F. Rossi, S. Sinigardi, P. Londrillo & G. Turchetti University of Bologna & INFN GPU2014, Rome, Sept 17th

More information

Matrix Multiplication

Matrix Multiplication Matrix Multiplication CPS343 Parallel and High Performance Computing Spring 2016 CPS343 (Parallel and HPC) Matrix Multiplication Spring 2016 1 / 32 Outline 1 Matrix operations Importance Dense and sparse

More information

Waves. Wave Parameters. Krauss Chapter Nine

Waves. Wave Parameters. Krauss Chapter Nine Waves Krauss Chapter Nine Wave Parameters Wavelength = λ = Length between wave crests (or troughs) Wave Number = κ = 2π/λ (units of 1/length) Wave Period = T = Time it takes a wave crest to travel one

More information

Effect of Aspect Ratio on Laminar Natural Convection in Partially Heated Enclosure

Effect of Aspect Ratio on Laminar Natural Convection in Partially Heated Enclosure Universal Journal of Mechanical Engineering (1): 8-33, 014 DOI: 10.13189/ujme.014.00104 http://www.hrpub.org Effect of Aspect Ratio on Laminar Natural Convection in Partially Heated Enclosure Alireza Falahat

More information

HPC on AWS. Hiroshi Kobayashi, Dev./Lab. IT System HGST Japan, Ltd. Jun 3, 2015

HPC on AWS. Hiroshi Kobayashi, Dev./Lab. IT System HGST Japan, Ltd. Jun 3, 2015 HPC on AWS Hiroshi Kobayashi, Dev./Lab. IT System HGST Japan, Ltd. Jun 3, 2015 1 HPC on AWS HPC = High Performance Computing AWS = Amazon Web Service 2 Agenda HGST Why choose Cloud? Performance Flexibility

More information

Vista: A Multi-field Object Oriented CFD-package

Vista: A Multi-field Object Oriented CFD-package Vista: A Multi-field Object Oriented CFD-package T. Kvamsdal 1, R. Holdahl 1 and P. Böhm 2 1 SINTEF ICT, Applied Mathematics, Norway 2 inutech GmbH, Germany Outline inutech & SINTEF VISTA a CFD Solver

More information

Mapping the Universe; Challenge for the Big Data science

Mapping the Universe; Challenge for the Big Data science Mapping the Universe; Challenge for the Big Data science Dr.Utain Sawangwit National Astronomical Research Institute of Thailand utane@narit.or.th Observational Cosmology relies on big surveys and very

More information

Introduction to Numerical General Purpose GPU Computing with NVIDIA CUDA. Part 1: Hardware design and programming model

Introduction to Numerical General Purpose GPU Computing with NVIDIA CUDA. Part 1: Hardware design and programming model Introduction to Numerical General Purpose GPU Computing with NVIDIA CUDA Part 1: Hardware design and programming model Amin Safi Faculty of Mathematics, TU dortmund January 22, 2016 Table of Contents Set

More information

Fast Multipole Method for particle interactions: an open source parallel library component

Fast Multipole Method for particle interactions: an open source parallel library component Fast Multipole Method for particle interactions: an open source parallel library component F. A. Cruz 1,M.G.Knepley 2,andL.A.Barba 1 1 Department of Mathematics, University of Bristol, University Walk,

More information

1 The basic equations of fluid dynamics

1 The basic equations of fluid dynamics 1 The basic equations of fluid dynamics The main task in fluid dynamics is to find the velocity field describing the flow in a given domain. To do this, one uses the basic equations of fluid flow, which

More information

Integration of OpenFOAM Flow Simulation and Filament-Based Gas Propagation Models for Gas Dispersion Simulation

Integration of OpenFOAM Flow Simulation and Filament-Based Gas Propagation Models for Gas Dispersion Simulation Integration of OpenFOAM Flow Simulation and Filament-Based Gas Propagation Models for Gas Dispersion Simulation Sepideh Pashami, Sahar Asadi, Achim Lilienthal Centre of Applied Autonomous Sensor Systems,

More information

Fluid Dynamics and the Navier-Stokes Equation

Fluid Dynamics and the Navier-Stokes Equation Fluid Dynamics and the Navier-Stokes Equation CMSC498A: Spring 12 Semester By: Steven Dobek 5/17/2012 Introduction I began this project through a desire to simulate smoke and fire through the use of programming

More information