Generating Virtual Worlds with Supercomputer Simulations

Size: px
Start display at page:

Download "Generating Virtual Worlds with Supercomputer Simulations"

Transcription

1 Generating Virtual Worlds with Supercomputer Simulations U. Rüde (LSS Erlangen, joint work with many collaborators and students Lehrstuhl für Informatik 10 (Systemsimulation) Universität Erlangen-Nürnberg www10.informatik.uni-erlangen.de December 19,

2 Overview Computers as tools for scientists: What is Computational Science Examples of Simulation for Science and Engineering Simulating Flow Simulations Biomedical Applications Conclusions 2

3 Motivation 3

4 How much is a PetaFlops? 10 6 = 1 MegaFlops: Intel 486 Zur Anzeige wird der QuickTime Dekompressor TIFF (Unkomprimiert) benötigt. 33MHz PC (~1989) 10 9 = 1 GigaFlops: Intel Pentium III 1GHz (~2000) If every person on earth does one operation every 6 seconds, all humans together have 1 GigaFlops performance (less than a current laptop from Aldi) = 1 TeraFlops: HLRB-I 1344 Proc., ~ = 1 PetaFlops > Proc. Cores?, ~2008? If every person on earth runs a 486 PC, we all together have an aggregate Performance of 6 PetaFlops. Zur Anzeige wird der QuickTime Dekompressor TIFF (Unkomprimiert) benötigt. HLRB-I: 2 TFlops Zur Anzeige wird der QuickTime Dekompressor TIFF (Unkomprimiert) benötigt. HLRB-II: 63 TFlops 10

5 The Two Principles of Science Three Theory Mathematical Models, Differential Equations, Newton Experiments Observation and prototypes empirical Sciences Computational Science Simulation, Optimization (quantitative) virtual Reality 11

6 SIAM s Definition of CSE siam.org/cse/report.htm CSE is a broad multidisciplinary area that encompasses applications in science/engineering, applied mathematics, numerical analysis, and computer science. Computer models and computer simulations have become an important part of the research repertoire, supplementing (and in some cases replacing) experimentation. Going from application area to computational results requires domain expertise, mathematical modeling,, numerical analysis, algorithm development, software implementation, program execution, analysis, validation and visualization of results.. CSE involves all of this. 13

7 SIAM's Definition of CSE (2) Especially: What is it NOT! CSE makes use of the techniques of applied mathematics and computer science for the development of problem-solving methodologies and robust tools which will be the building blocks for solutions to scientific and engineering problems of ever-increasing complexity. It differs from mathematics or computer science in that analysis and methodologies are directed specifically at the solution of problem classes from science and engineering,, and will generally require a detailed knowledge or substantial collaboration from those disciplines. The computing and mathematical techniques used may be more domain specific, and the computer science and mathematics skills needed will be broader. CSE is more than a scientist or engineer using a canned code to generate and visualize results (skipping all of the intermediate steps). 14

8 Fluid Flow Simulation Metal Foams Nano Technology Fancy Physics In Collaboration with: Lehrstuhl Werkstoffkunde und Technologie der Metalle, Erlangen (R.F. Singer, C. Körner) Lehrstuhl für Bauinformatik, TU München (E. Rank) Institut für Computeranwendungen im Bauingenieurwesen, TU Braunschweig (M. Krafczyk) Lehrstuhl für Feststoff- und Grenzflächenverfahrenstechnik, Erlangen (W. Peukert, H.-J. Schmid) Computer Graphics Lab, ETH Zürich (M. Pauly) 15

9 First Test Breaking Dam Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 16

10 Falling Drop Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 17

11 Falling Meteor Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 18

12 The interface between Compute only fluid Liquid and Gas Special free surface conditions on interface 22

13 Why so compute intensive? Millions to billions of cells (1000x1000x1000) Thousands to millions of time steps hundreds of operations in each cell and time step The curse of dimensionality! 23

14 Visualization Ray-tracing Refraction Reflection Caustics About 15 Min per frame = 1 day for 4 secs About same compute time as flow simulation 24

15 Process Simulation of Foam Production poorly understood: coalescence collapse, drying, solidification etc. Simulation as tool to understand and control the process 26

16 Rising Bubbles Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 27

17 Simultaneously Rising Bubbles Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 28

18 Experimental Verification Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. Simulation and Experiment: Diplomarbeit N. Thürey 29

19 Foaming Simulation Zur Anzeige wird der QuickTime Dekompressor Cinepak benötigt. Zur Anzeige wird der QuickTime Dekompressor benötigt. 30

20 Fancy Physics Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 31

21 Moving Nano Particles in a Liquid Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. K. Iglberger - Master Thesis C. Feichtinger - Diplomarbeit 32

22 Bio-medical and Bio-chemical Simulation Bood Flow in an Aneurysma HIV-Protease Bio-Electrical Fields 33

23 Datensatz Pulsating Blood Flow in an Aneurysma Master Thesis Jan GötzG Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. Collaboration with Neuroradiologie (Prof. Dörfler, D Dr. Richter) Image Processing Simulation Fluid Mechanics (Prof. Durst) 34

24 Datensatz Pulsating Blood Flow in an Aneurysma Master Thesis Jan GötzG Collaboration with Neuro-Radiology (Prof. Dörfler, D Dr. Richter) Image Processing Simulation Fluid Mechanics (Prof. Durst) Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 35

25 Bio-Electromagnetic Fields Source Localisation Collaboration with: Chr. Johnson (Univ. of Utah), C. Popa (Ovidius Univ. Constanta), Bart Vanrumste, (Univ. of Canterbury, New Zealand), G. Greiner, F. Fahlbusch (Erlangen), C. Wolters (Münster) Erlangen Neuro Surgeons at work View through operation microscope 36

26 Simulation or better do experiments? Source localisation by open brain measurements Operation planning with a virtual head model (Chr. Johnson, Utah) 37

27 Molekular Dynamics Simulation of HIV-Protease Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. H. Sticht (Inst. für f r Biochemie) 38

28 International Master (and PhD) Programme Computational Engineering What is this about? it is not Computer Science it is not Mathematics it is not a conventional engineering field it is an interdisciplinary combination of all three - the foundation of future science Master Program in Erlangen Hours Option (Elite Program) jointly with TU Munich Information: 39

29 Acknowledgements Collaborators In Erlangen: WTM, LSE, LSTM, LGDV, RRZE, Neurozentrum, Radiologie, etc. Especially for foams: C. Körner (WTM) International: Utah, Technion, Constanta, Ghent, Boulder, München, Zürich,... Dissertationen Projects U. Fabricius (AMG-Methods and SW-Engineering for parallelization) C. Freundl (Parelle Expression Templates for PDE-solver) K. Iglberger (Rigid Body Dynamics) J. Götz (LBM, blood flow) T. Gradl (Parallel multigrid)... and 8 more 25 Diplom- /Master- Thesis Studien- /Bachelor- Thesis Especially for Performance-Analysis/ Optimization for LBM J. Wilke, K. Iglberger, S. Donath, B. Gmeiner... and 23 more KONWIHR, DFG, NATO, BMBF Elitenetzwerk Bayern Bavarian Graduate School in Computational Engineering (with TUM, since 2004) Special International PhD program: Identifikation, Optimierung und Steuerung für f r technische Anwendungen (with Bayreuth and Würzburg) W since Jan

30 Thank you for your interest! Questions? 41

31 Part II-a Towards Scalable FE Software Scalable Algorithms: Multigrid 42

32 What is Multigrid? Has nothing to do with grid computing A general methodology multi - scale (actually it is the original ) many different applications developped in the 1970s -... Useful e.g. for solving elliptic PDEs large sparse systems of equations iterative convergence rate independent of problem size asymptotically optimal complexity -> algorithmic scalability! can solve e.g. 2D Poisson Problem in ~ 30 operations per gridpoint efficient parallelization - if one knows how to do it best (maybe the only?) basis for fully scalable FE solvers 43

33 Multigrid: V-Cycle Goal: solve A h u h = f h using a hierarchy of grids Relax on Correct Residual Restrict Interpolate Solve by recursion 44

34 Part II - b Towards Scalable FE Software Scalable Architecture Hierarchical Hybrid Grids 46

35 Hierarchical Hybrid Grids (HHG) Unstructured input grid Resolves geometry of problem domain Patch-wise regular refinement generates nested grid hierarchies naturally suitable for geometric multigrid algorithms New: Modify storage formats and operations on the grid to exploit the regular substructures Does an unstructured grid with elements make sense? HHG - Ultimate Parallel FE Performance! 47

36 HHG refinement example Input Grid 48

37 HHG Refinement example Refinement Level one 49

38 HHG Refinement example Refinement Level Two 50

39 HHG Refinement example Structured Interior 51

40 HHG Refinement example Structured Interior 52

41 HHG Refinement example Edge Interior 53

42 HHG Refinement example Edge Interior 54

43 Common HHG Misconceptions Hierarchical hybrid grids (HHG) are not only another block structured grid HHG are more flexible (unstructured, hybrid input grids) are not only another unstructured geometric multigrid package HHG achieve better performance unstructured treatment of regular regions does not improve performance 55

44 Parallel HHG - Framework Design Goals To realize good parallel scalability: Minimize latency by reducing the number of messages that must be sent Optimize for high bandwidth interconnects large messages Avoid local copying into MPI buffers 56

45 HHG for Parallelization Use regular HHG patches for partitioning the domain 57

46 HHG Parallel Update Algorithm for each vertex do apply operation to vertex end for update vertex primary dependencies for each edge do copy from vertex interior apply operation to edge copy to vertex halo end for update edge primary dependencies for each element do copy from edge/vertex interiors apply operation to element copy to edge/vertex halos end for update secondary dependencies 58

47 Part II - c Towards Scalable FE Software Performance Results 59

48 Single Processor HHG Performance on Itanium for Relaxation of a Tetrahedral Finite Element Mesh 60

49 HHG: Parallel Scalability #Procs #DOFS x 10 6 #Els x 10 6 #Input Els GFLOP/s Time [s] 64 2,144 12, / ,288 25, / ,577 51, / , , / , , ,456/ Parallel scalability of Poisson problem discretized by tetrahedral finite elements - SGI Altix (Itanium GHz) B. Bergen, F. Hülsemann, U. Ruede: Is unknowns the largest finite element system that can be solved today? SuperComputing, Nov See also: ISC Award 2006 for Application Scalability. 61

50 Part III - a Free Surface Flow Simulation The Lattice Boltzmann Method 62

51 Free surface flow: Breaking Dam Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 63

52 The Lattice-Boltzmann Method (2) Weakly compressible approximation of the Navier-Stokes equations Easy implementation Applicable for small Mach numbers (< 0.1) Easy to adapt, e.g. for Complicated or time-varying geometries Free surfaces Additional physical and chemical effects 65

53 The Lattice-Boltzmann Method (3) Real valued representation of particles Discrete velocities and positions Algorithm proceeds in two steps: Stream Collide 66

54 Fluid Cell Treatment Algorithm proceeds in two steps: Stream: advect fluid elements (copy DFs to neighbors) Collide: compute collisions of fluid molecules 67

55 Fluid Cell Treatment Algorithm proceeds in two steps: Stream: advect fluid elements (copy DFs to neighbors) Collide: compute collisions of fluid molecules 68

56 Fluid Cell Treatment Algorithm proceeds in two steps: Stream: advect fluid elements (copy DFs to neighbors) Collide: compute collisions of fluid molecules 69

57 Fluid Cell Treatment Algorithm proceeds in two steps: Stream: advect fluid elements (copy DFs to neighbors) Collide: compute collisions of fluid molecules 70

58 Fluid Cell Treatment Algorithm proceeds in two steps: Stream: advect fluid elements (copy DFs to neighbors) Collide: compute collisions of fluid molecules 71

59 The Collide Step Amounts for collisions of particles during movement Weigh equilibrium velocities and velocities from streaming depending on fluid viscosity 72

60 Stream/Collide: LBM in Equations Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Equilibrium DF: Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. Zur Anzeige wird der QuickTime Dekompressor TIFF (LZW) benötigt. 73

61 Stability & Turbulence Modelling Smagorinsky Subgrid Model: Similar to approach for NS-Solvers Model subgrid-scale vortices by locally changing the viscosity Implementation for LBM Reynolds stress tensor computed for each cell Changes only in collision operator Ca. 20% slowdown, significant gain due to decreased resolution requirements 74

62 Falling Drop with Turbulence Model Zur Anzeige wird der QuickTime Dekompressor benötigt. 75

63 Falling Drop with Turbulence Model (slower( slower) Zur Anzeige wird der QuickTime Dekompressor benötigt. 76

64 Part III - b Free Surface Flow Simulation Volume of Fluids 77

65 Free surfaces with LBM Metal Foams huge gas volumes Only simulate and track fluid motion Compute boundary conditions at free surface Three cell types: Empty/Gas, Fluid, Interface 78

66 Boundary Conditions Problem: Missing distribution functions at interface cells after streaming! Gas Liquid Reconstruction such that macroscopic boundary conditions are satisfied. Körner et al. Lattice Boltzmann Model for Free Surface Flow, Journal of Computational Physics 79

67 Free surface simulations Algorithmic Overview: Before stream step, compute mass exchange across cell boundaries for interface cells Calculate bubble volumes and pressure Surface curvature for surface tension Change topology if interface cells become full or empty keep layer of interface cells closed 88

68 Free Surface Cell Conversions Emptied interface cell > gas Filled interface cell > fluid Guarantee closed layer of interface cells Redistribute mass in the neighborhood 89

69 Curvature calculation (version I) Alternative approaches: Integrate normals over surface (weighted triangles) Level set methods (track surface as implicit function) 90

70 Surface Tension (Vers. 2) Marching-cube surface triangulation Compute a curvature for each triangle κ = 1 2 δa δv δa = A A n r n r 3 1 A n r 2 δv A Associate with each LBM cell the average curvature of its triangles Complicated Beats level sets for our applications (mass conservation) 91

71 Part III - c Free Surface Flow Simulation Application: Metal Foam 92

72 Towards Simulating Metal Foams Bubble growth, coalescence, collapse, drainage, rheology,, etc. are still poorly understood Simulation as a tool to better understand, control and optimize the process 94

73 Rising Bubbles Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 95

74 More Rising Bubbles Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. 96

75 Simulation Verification by Experiment Zur Anzeige wird der QuickTime Dekompressor YUV420 codec benötigt. Simulation and Experiment: Diplomarbeit N. Thürey 97

76 Foaming Simulation 1 Zur Anzeige wird der QuickTime Dekompressor Cinepak benötigt. 98

77 Numerical Experiment: Single Rising Bubble 101

78 Part III - d Free Surface Flow Simulation Parallel Performance 102

79 Parallelization Standard LBM-Code: Scalability on SR 8000-F1 Largest Simulation: 1,08*10 9 cells 370 GByte memory Communication Cost because of large data volume (64 MByte) Efficiency ~ 75% Dissertation T. Pohl (2006) 103

80 Free surface LBM-Code Parallelization Standard LBM Free surface LBM 1 sweep through grid 5 sweeps through grid Cell type changes, Closed boundary for bubbles, Initialization of modified cells, Mass balance correction 104

81 Free surface LBM-Code: Parallelization Standard LBM Free surface LBM 1 sweep through grid 5 sweeps through grid 1 row of ghost nodes 4 rows of ghost nodes 105

82 Performance on SR 8000 Standard LBM-Code Free surface LBM-Code Performance lousy on a single node! Conditionals: 2,9 SLBM 51 free surface LBM Pentium 4: almost no degradation ~ 10% SR 8000: enormous degradation (pseudo-vector, predictable jumps) 106

83 Parallel Performance LSS- Cluster Fujitsu- Siemens 107

84 Part III - c Free Surface Flow Simulation Visualization and Animation 108

85 Adaptive Grids Performance Zur Anzeige wird der QuickTime Dekompressor benötigt. Speed up: factor 2-4 for larger resolutions Insignifcant overhead for small resolutions 109

86 Example Coupled Simulations Zur Anzeige wird der QuickTime Dekompressor benötigt. 113

87 Physically Based Animation Special Effects e.g. for Computer generated movies Realistic appearance necessary, but only where it s absolutely necessary > Control Fluid or other simulations Examples of Fluid Simulations in Movies: Harry Potter 4 (ship-scene), Ice Age 2 (throughout), Poseidon 114

88 Simulations with Fluid Control Zur Anzeige wird der QuickTime Dekompressor mpeg4 benötigt. 115

89 Part IV Outlook 116

90 Acknowledgements Collaborators In Erlangen: WTM, LSE, LSTM, LGDV, RRZE, Neurozentrum, Radiologie, etc. Especially for foams: C. Körner (WTM) International: Utah, Technion, Constanta, Ghent, Boulder, München, Zürich,... Dissertationen Projects U. Fabricius (AMG-Verfahren and SW-Engineering for parallelization) C. Freundl (Parelle Expression Templates for PDE-solver) J. Härtlein (Expression Templates for FE-Applications) N. Thürey (LBM, free surfaces) T. Pohl (Parallel LBM)... and 6 more 19 Diplom- /Master- Thesis Studien- /Bachelor- Thesis Especially for Performance-Analysis/ Optimization for LBM J. Wilke, K. Iglberger, S. Donath... and 23 more KONWIHR, DFG, NATO, BMBF Elitenetzwerk Bayern Bavarian Graduate School in Computational Engineering (with TUM, since 2004) Special International PhD program: Identifikation, Optimierung und Steuerung für f r technische Anwendungen (with Bayreuth and Würzburg) W since Jan

91 Talk is Over Please wake up! 118

walberla: A software framework for CFD applications

walberla: A software framework for CFD applications walberla: A software framework for CFD applications U. Rüde, S. Donath, C. Feichtinger, K. Iglberger, F. Deserno, M. Stürmer, C. Mihoubi, T. Preclic, D. Haspel (all LSS Erlangen), N. Thürey (LSS Erlangen/

More information

Fast Parallel Algorithms for Computational Bio-Medicine

Fast Parallel Algorithms for Computational Bio-Medicine Fast Parallel Algorithms for Computational Bio-Medicine H. Köstler, J. Habich, J. Götz, M. Stürmer, S. Donath, T. Gradl, D. Ritter, D. Bartuschat, C. Feichtinger, C. Mihoubi, K. Iglberger (LSS Erlangen)

More information

FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG

FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG INSTITUT FÜR INFORMATIK (MATHEMATISCHE MASCHINEN UND DATENVERARBEITUNG) Lehrstuhl für Informatik 10 (Systemsimulation) Massively Parallel Multilevel Finite

More information

walberla: A software framework for CFD applications on 300.000 Compute Cores

walberla: A software framework for CFD applications on 300.000 Compute Cores walberla: A software framework for CFD applications on 300.000 Compute Cores J. Götz (LSS Erlangen, jan.goetz@cs.fau.de), K. Iglberger, S. Donath, C. Feichtinger, U. Rüde Lehrstuhl für Informatik 10 (Systemsimulation)

More information

Towards real-time image processing with Hierarchical Hybrid Grids

Towards real-time image processing with Hierarchical Hybrid Grids Towards real-time image processing with Hierarchical Hybrid Grids International Doctorate Program - Summer School Björn Gmeiner Joint work with: Harald Köstler, Ulrich Rüde August, 2011 Contents The HHG

More information

Collaborative and Interactive CFD Simulation using High Performance Computers

Collaborative and Interactive CFD Simulation using High Performance Computers Collaborative and Interactive CFD Simulation using High Performance Computers Petra Wenisch, Andre Borrmann, Ernst Rank, Christoph van Treeck Technische Universität München {wenisch, borrmann, rank, treeck}@bv.tum.de

More information

Computational Engineering Programs at the University of Erlangen-Nuremberg

Computational Engineering Programs at the University of Erlangen-Nuremberg Computational Engineering Programs at the University of Erlangen-Nuremberg Ulrich Ruede Lehrstuhl für Simulation, Institut für Informatik Universität Erlangen http://www10.informatik.uni-erlangen.de/ ruede

More information

walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation

walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation SIAM Parallel Processing for Scientific Computing 2012 February 16, 2012 Florian Schornbaum,

More information

ME6130 An introduction to CFD 1-1

ME6130 An introduction to CFD 1-1 ME6130 An introduction to CFD 1-1 What is CFD? Computational fluid dynamics (CFD) is the science of predicting fluid flow, heat and mass transfer, chemical reactions, and related phenomena by solving numerically

More information

Efficient Storage, Compression and Transmission

Efficient Storage, Compression and Transmission Efficient Storage, Compression and Transmission of Complex 3D Models context & problem definition general framework & classification our new algorithm applications for digital documents Mesh Decimation

More information

P013 INTRODUCING A NEW GENERATION OF RESERVOIR SIMULATION SOFTWARE

P013 INTRODUCING A NEW GENERATION OF RESERVOIR SIMULATION SOFTWARE 1 P013 INTRODUCING A NEW GENERATION OF RESERVOIR SIMULATION SOFTWARE JEAN-MARC GRATIEN, JEAN-FRANÇOIS MAGRAS, PHILIPPE QUANDALLE, OLIVIER RICOIS 1&4, av. Bois-Préau. 92852 Rueil Malmaison Cedex. France

More information

and RISC Optimization Techniques for the Hitachi SR8000 Architecture

and RISC Optimization Techniques for the Hitachi SR8000 Architecture 1 KONWIHR Project: Centre of Excellence for High Performance Computing Pseudo-Vectorization and RISC Optimization Techniques for the Hitachi SR8000 Architecture F. Deserno, G. Hager, F. Brechtefeld, G.

More information

Express Introductory Training in ANSYS Fluent Lecture 1 Introduction to the CFD Methodology

Express Introductory Training in ANSYS Fluent Lecture 1 Introduction to the CFD Methodology Express Introductory Training in ANSYS Fluent Lecture 1 Introduction to the CFD Methodology Dimitrios Sofialidis Technical Manager, SimTec Ltd. Mechanical Engineer, PhD PRACE Autumn School 2013 - Industry

More information

Model of a flow in intersecting microchannels. Denis Semyonov

Model of a flow in intersecting microchannels. Denis Semyonov Model of a flow in intersecting microchannels Denis Semyonov LUT 2012 Content Objectives Motivation Model implementation Simulation Results Conclusion Objectives A flow and a reaction model is required

More information

Optimizing Performance of the Lattice Boltzmann Method for Complex Structures on Cache-based Architectures

Optimizing Performance of the Lattice Boltzmann Method for Complex Structures on Cache-based Architectures Optimizing Performance of the Lattice Boltzmann Method for Complex Structures on Cache-based Architectures Stefan Donath 1, Thomas Zeiser, Georg Hager, Johannes Habich, Gerhard Wellein Regionales Rechenzentrum

More information

HPC enabling of OpenFOAM R for CFD applications

HPC enabling of OpenFOAM R for CFD applications HPC enabling of OpenFOAM R for CFD applications Towards the exascale: OpenFOAM perspective Ivan Spisso 25-27 March 2015, Casalecchio di Reno, BOLOGNA. SuperComputing Applications and Innovation Department,

More information

Hash-Storage Techniques for Adaptive Multilevel Solvers and Their Domain Decomposition Parallelization

Hash-Storage Techniques for Adaptive Multilevel Solvers and Their Domain Decomposition Parallelization Contemporary Mathematics Volume 218, 1998 B 0-8218-0988-1-03018-1 Hash-Storage Techniques for Adaptive Multilevel Solvers and Their Domain Decomposition Parallelization Michael Griebel and Gerhard Zumbusch

More information

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR Frédéric Kuznik, frederic.kuznik@insa lyon.fr 1 Framework Introduction Hardware architecture CUDA overview Implementation details A simple case:

More information

The Application of a Black-Box Solver with Error Estimate to Different Systems of PDEs

The Application of a Black-Box Solver with Error Estimate to Different Systems of PDEs The Application of a Black-Bo Solver with Error Estimate to Different Systems of PDEs Torsten Adolph and Willi Schönauer Forschungszentrum Karlsruhe Institute for Scientific Computing Karlsruhe, Germany

More information

Basin simulation for complex geological settings

Basin simulation for complex geological settings Énergies renouvelables Production éco-responsable Transports innovants Procédés éco-efficients Ressources durables Basin simulation for complex geological settings Towards a realistic modeling P. Havé*,

More information

Iterative Solvers for Linear Systems

Iterative Solvers for Linear Systems 9th SimLab Course on Parallel Numerical Simulation, 4.10 8.10.2010 Iterative Solvers for Linear Systems Bernhard Gatzhammer Chair of Scientific Computing in Computer Science Technische Universität München

More information

HPC Deployment of OpenFOAM in an Industrial Setting

HPC Deployment of OpenFOAM in an Industrial Setting HPC Deployment of OpenFOAM in an Industrial Setting Hrvoje Jasak h.jasak@wikki.co.uk Wikki Ltd, United Kingdom PRACE Seminar: Industrial Usage of HPC Stockholm, Sweden, 28-29 March 2011 HPC Deployment

More information

High Performance Computing in the Multi-core Area

High Performance Computing in the Multi-core Area High Performance Computing in the Multi-core Area Arndt Bode Technische Universität München Technology Trends for Petascale Computing Architectures: Multicore Accelerators Special Purpose Reconfigurable

More information

CFD Applications using CFD++ Paul Batten & Vedat Akdag

CFD Applications using CFD++ Paul Batten & Vedat Akdag CFD Applications using CFD++ Paul Batten & Vedat Akdag Metacomp Products available under Altair Partner Program CFD++ Introduction Accurate multi dimensional polynomial framework Robust on wide variety

More information

Introduction to CFD Analysis

Introduction to CFD Analysis Introduction to CFD Analysis 2-1 What is CFD? Computational Fluid Dynamics (CFD) is the science of predicting fluid flow, heat and mass transfer, chemical reactions, and related phenomena by solving numerically

More information

Customer Training Material. Lecture 2. Introduction to. Methodology ANSYS FLUENT. ANSYS, Inc. Proprietary 2010 ANSYS, Inc. All rights reserved.

Customer Training Material. Lecture 2. Introduction to. Methodology ANSYS FLUENT. ANSYS, Inc. Proprietary 2010 ANSYS, Inc. All rights reserved. Lecture 2 Introduction to CFD Methodology Introduction to ANSYS FLUENT L2-1 What is CFD? Computational Fluid Dynamics (CFD) is the science of predicting fluid flow, heat and mass transfer, chemical reactions,

More information

Lecture 16 - Free Surface Flows. Applied Computational Fluid Dynamics

Lecture 16 - Free Surface Flows. Applied Computational Fluid Dynamics Lecture 16 - Free Surface Flows Applied Computational Fluid Dynamics Instructor: André Bakker http://www.bakker.org André Bakker (2002-2006) Fluent Inc. (2002) 1 Example: spinning bowl Example: flow in

More information

Introduction to CFD Analysis

Introduction to CFD Analysis Introduction to CFD Analysis Introductory FLUENT Training 2006 ANSYS, Inc. All rights reserved. 2006 ANSYS, Inc. All rights reserved. 2-2 What is CFD? Computational fluid dynamics (CFD) is the science

More information

Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries

Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries Jonas Fietz 2, Mathias J. Krause 2, Christian Schulz 1, Peter Sanders 1, and Vincent Heuveline 2 1 Karlsruhe Institute

More information

Applied mathematics and mathematical statistics

Applied mathematics and mathematical statistics Applied mathematics and mathematical statistics The graduate school is organised within the Department of Mathematical Sciences.. Deputy head of department: Aila Särkkä Director of Graduate Studies: Marija

More information

Simulation of Fluid-Structure Interactions in Aeronautical Applications

Simulation of Fluid-Structure Interactions in Aeronautical Applications Simulation of Fluid-Structure Interactions in Aeronautical Applications Martin Kuntz Jorge Carregal Ferreira ANSYS Germany D-83624 Otterfing Martin.Kuntz@ansys.com December 2003 3 rd FENET Annual Industry

More information

Aeroacoustic Analogy for the Computation of Aeroacoustic Fields in Partially Closed Domains

Aeroacoustic Analogy for the Computation of Aeroacoustic Fields in Partially Closed Domains INSTITUT FÜR MECHANIK UND MECHATRONIK Messtechnik und Aktorik Aeroacoustic Analogy for the Computation of Aeroacoustic Fields in Partially Closed Domains A. Hüppe 1, M. Kaltenbacher 1, A. Reppenhagen 2,

More information

Mesh Generation and Load Balancing

Mesh Generation and Load Balancing Mesh Generation and Load Balancing Stan Tomov Innovative Computing Laboratory Computer Science Department The University of Tennessee April 04, 2012 CS 594 04/04/2012 Slide 1 / 19 Outline Motivation Reliable

More information

AN EFFECT OF GRID QUALITY ON THE RESULTS OF NUMERICAL SIMULATIONS OF THE FLUID FLOW FIELD IN AN AGITATED VESSEL

AN EFFECT OF GRID QUALITY ON THE RESULTS OF NUMERICAL SIMULATIONS OF THE FLUID FLOW FIELD IN AN AGITATED VESSEL 14 th European Conference on Mixing Warszawa, 10-13 September 2012 AN EFFECT OF GRID QUALITY ON THE RESULTS OF NUMERICAL SIMULATIONS OF THE FLUID FLOW FIELD IN AN AGITATED VESSEL Joanna Karcz, Lukasz Kacperski

More information

Load Balancing Strategies for Parallel SAMR Algorithms

Load Balancing Strategies for Parallel SAMR Algorithms Proposal for a Summer Undergraduate Research Fellowship 2005 Computer science / Applied and Computational Mathematics Load Balancing Strategies for Parallel SAMR Algorithms Randolf Rotta Institut für Informatik,

More information

TESLA Report 2003-03

TESLA Report 2003-03 TESLA Report 23-3 A multigrid based 3D space-charge routine in the tracking code GPT Gisela Pöplau, Ursula van Rienen, Marieke de Loos and Bas van der Geer Institute of General Electrical Engineering,

More information

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Amani AlOnazi, David E. Keyes, Alexey Lastovetsky, Vladimir Rychkov Extreme Computing Research Center,

More information

Multi-GPU Load Balancing for Simulation and Rendering

Multi-GPU Load Balancing for Simulation and Rendering Multi- Load Balancing for Simulation and Rendering Yong Cao Computer Science Department, Virginia Tech, USA In-situ ualization and ual Analytics Instant visualization and interaction of computing tasks

More information

THE CFD SIMULATION OF THE FLOW AROUND THE AIRCRAFT USING OPENFOAM AND ANSA

THE CFD SIMULATION OF THE FLOW AROUND THE AIRCRAFT USING OPENFOAM AND ANSA THE CFD SIMULATION OF THE FLOW AROUND THE AIRCRAFT USING OPENFOAM AND ANSA Adam Kosík Evektor s.r.o., Czech Republic KEYWORDS CFD simulation, mesh generation, OpenFOAM, ANSA ABSTRACT In this paper we describe

More information

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory 21 st Century Research Continuum Theory Theory embodied in computation Hypotheses tested through experiment SCIENTIFIC METHODS

More information

Multiphase Flow - Appendices

Multiphase Flow - Appendices Discovery Laboratory Multiphase Flow - Appendices 1. Creating a Mesh 1.1. What is a geometry? The geometry used in a CFD simulation defines the problem domain and boundaries; it is the area (2D) or volume

More information

Calculation of Eigenmodes in Superconducting Cavities

Calculation of Eigenmodes in Superconducting Cavities Calculation of Eigenmodes in Superconducting Cavities W. Ackermann, C. Liu, W.F.O. Müller, T. Weiland Institut für Theorie Elektromagnetischer Felder, Technische Universität Darmstadt Status Meeting December

More information

Lecture 7 - Meshing. Applied Computational Fluid Dynamics

Lecture 7 - Meshing. Applied Computational Fluid Dynamics Lecture 7 - Meshing Applied Computational Fluid Dynamics Instructor: André Bakker http://www.bakker.org André Bakker (2002-2006) Fluent Inc. (2002) 1 Outline Why is a grid needed? Element types. Grid types.

More information

CCTech TM. ICEM-CFD & FLUENT Software Training. Course Brochure. Simulation is The Future

CCTech TM. ICEM-CFD & FLUENT Software Training. Course Brochure. Simulation is The Future . CCTech TM Simulation is The Future ICEM-CFD & FLUENT Software Training Course Brochure About. CCTech Established in 2006 by alumni of IIT Bombay. Our motive is to establish a knowledge centric organization

More information

Large-Scale Reservoir Simulation and Big Data Visualization

Large-Scale Reservoir Simulation and Big Data Visualization Large-Scale Reservoir Simulation and Big Data Visualization Dr. Zhangxing John Chen NSERC/Alberta Innovates Energy Environment Solutions/Foundation CMG Chair Alberta Innovates Technology Future (icore)

More information

Interactive simulation of an ash cloud of the volcano Grímsvötn

Interactive simulation of an ash cloud of the volcano Grímsvötn Interactive simulation of an ash cloud of the volcano Grímsvötn 1 MATHEMATICAL BACKGROUND Simulating flows in the atmosphere, being part of CFD, is on of the research areas considered in the working group

More information

How To Write A Program For The Pd Framework

How To Write A Program For The Pd Framework Enhanced divergence-free elements for efficient incompressible flow simulations in the PDE framework Peano, Miriam Mehl, Christoph Zenger, Fakultät für Informatik TU München Germany Outline Derivation

More information

CFD Based Air Flow and Contamination Modeling of Subway Stations

CFD Based Air Flow and Contamination Modeling of Subway Stations CFD Based Air Flow and Contamination Modeling of Subway Stations Greg Byrne Center for Nonlinear Science, Georgia Institute of Technology Fernando Camelli Center for Computational Fluid Dynamics, George

More information

Authors: Masahiro Watanabe*, Motoi Okuda**, Teruo Matsuzawa*** Speaker: Masahiro Watanabe**

Authors: Masahiro Watanabe*, Motoi Okuda**, Teruo Matsuzawa*** Speaker: Masahiro Watanabe** Visualization of the Blood flow and the Stress distribution with the Diagnostic Support System for Circulatory Disease in the Volume Communications Environment Authors: Masahiro Watanabe*, Motoi Okuda**,

More information

Computer Graphics AACHEN AACHEN AACHEN AACHEN. Public Perception of CG. Computer Graphics Research. Methodological Approaches - - - - - - - - - -

Computer Graphics AACHEN AACHEN AACHEN AACHEN. Public Perception of CG. Computer Graphics Research. Methodological Approaches - - - - - - - - - - Public Perception of CG Games Computer Graphics Movies Computer Graphics Research algorithms & data structures fundamental continuous & discrete mathematics optimization schemes 3D reconstruction global

More information

A Load Balancing Tool for Structured Multi-Block Grid CFD Applications

A Load Balancing Tool for Structured Multi-Block Grid CFD Applications A Load Balancing Tool for Structured Multi-Block Grid CFD Applications K. P. Apponsah and D. W. Zingg University of Toronto Institute for Aerospace Studies (UTIAS), Toronto, ON, M3H 5T6, Canada Email:

More information

Oliver Röhrle Ulrich Rüde Barbara Wohlmuth. Garching, September 17/18, 2012

Oliver Röhrle Ulrich Rüde Barbara Wohlmuth. Garching, September 17/18, 2012 Oliver Röhrle Ulrich Rüde Barbara Wohlmuth Garching, September 17/18, 2012 COMPUTATIONAL SCIENCE & ENGINEERING Leading-edge science and engineering depend on advanced computing for understanding, prediction,

More information

Introduction to DISC and Hadoop

Introduction to DISC and Hadoop Introduction to DISC and Hadoop Alice E. Fischer April 24, 2009 Alice E. Fischer DISC... 1/20 1 2 History Hadoop provides a three-layer paradigm Alice E. Fischer DISC... 2/20 Parallel Computing Past and

More information

Performance of the JMA NWP models on the PC cluster TSUBAME.

Performance of the JMA NWP models on the PC cluster TSUBAME. Performance of the JMA NWP models on the PC cluster TSUBAME. K.Takenouchi 1), S.Yokoi 1), T.Hara 1) *, T.Aoki 2), C.Muroi 1), K.Aranami 1), K.Iwamura 1), Y.Aikawa 1) 1) Japan Meteorological Agency (JMA)

More information

BOĞAZİÇİ UNIVERSITY COMPUTER ENGINEERING

BOĞAZİÇİ UNIVERSITY COMPUTER ENGINEERING Parallel l Tetrahedral Mesh Refinement Mehmet Balman Computer Engineering, Boğaziçi University Adaptive Mesh Refinement (AMR) A computation ti technique used to improve the efficiency i of numerical systems

More information

OpenFOAM Optimization Tools

OpenFOAM Optimization Tools OpenFOAM Optimization Tools Henrik Rusche and Aleks Jemcov h.rusche@wikki-gmbh.de and a.jemcov@wikki.co.uk Wikki, Germany and United Kingdom OpenFOAM Optimization Tools p. 1 Agenda Objective Review optimisation

More information

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Haohuan Fu haohuan@tsinghua.edu.cn High Performance Geo-Computing (HPGC) Group Center for Earth System Science Tsinghua University

More information

Parallel Simplification of Large Meshes on PC Clusters

Parallel Simplification of Large Meshes on PC Clusters Parallel Simplification of Large Meshes on PC Clusters Hua Xiong, Xiaohong Jiang, Yaping Zhang, Jiaoying Shi State Key Lab of CAD&CG, College of Computer Science Zhejiang University Hangzhou, China April

More information

Introduction to Visualization with VTK and ParaView

Introduction to Visualization with VTK and ParaView Introduction to Visualization with VTK and ParaView R. Sungkorn and J. Derksen Department of Chemical and Materials Engineering University of Alberta Canada August 24, 2011 / LBM Workshop 1 Introduction

More information

TwinMesh for Positive Displacement Machines: Structured Meshes and reliable CFD Simulations

TwinMesh for Positive Displacement Machines: Structured Meshes and reliable CFD Simulations TwinMesh for Positive Displacement Machines: Structured Meshes and reliable CFD Simulations 05.06.2014 Dipl.-Ing. Jan Hesse, Dr. Andreas Spille-Kohoff CFX Berlin Software GmbH Karl-Marx-Allee 90 A 10243

More information

O.F.Wind Wind Site Assessment Simulation in complex terrain based on OpenFOAM. Darmstadt, 27.06.2012

O.F.Wind Wind Site Assessment Simulation in complex terrain based on OpenFOAM. Darmstadt, 27.06.2012 O.F.Wind Wind Site Assessment Simulation in complex terrain based on OpenFOAM Darmstadt, 27.06.2012 Michael Ehlen IB Fischer CFD+engineering GmbH Lipowskystr. 12 81373 München Tel. 089/74118743 Fax 089/74118749

More information

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster Aaron Hagan and Ye Zhao Kent State University Abstract. In this paper, we propose an inherent parallel scheme for 3D image segmentation

More information

Performance Improvement of Application on the K computer

Performance Improvement of Application on the K computer Performance Improvement of Application on the K computer November 13, 2011 Kazuo Minami Team Leader, Application Development Team Research and Development Group Next-Generation Supercomputer R & D Center

More information

Very special thanks to Wolfgang Gentzsch and Burak Yenier for making the UberCloud HPC Experiment possible.

Very special thanks to Wolfgang Gentzsch and Burak Yenier for making the UberCloud HPC Experiment possible. Digital manufacturing technology and convenient access to High Performance Computing (HPC) in industry R&D are essential to increase the quality of our products and the competitiveness of our companies.

More information

Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows

Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows Lucian Anton 1, Edward Smith 2 1 Numerical Algorithms Group Ltd, Wilkinson House, Jordan Hill Road,

More information

OpenFOAM Workshop. Yağmur Gülkanat Res.Assist.

OpenFOAM Workshop. Yağmur Gülkanat Res.Assist. OpenFOAM Workshop Yağmur Gülkanat Res.Assist. Introduction to OpenFOAM What is OpenFOAM? FOAM = Field Operation And Manipulation OpenFOAM is a free-to-use open-source numerical simulation software with

More information

Advanced Volume Rendering Techniques for Medical Applications

Advanced Volume Rendering Techniques for Medical Applications Advanced Volume Rendering Techniques for Medical Applications Verbesserte Darstellungsmethoden für Volumendaten in medizinischen Anwendungen J. Georgii 1, J. Schneider 1, J. Krüger 1, R. Westermann 1,

More information

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS SUDHAKARAN.G APCF, AERO, VSSC, ISRO 914712564742 g_suhakaran@vssc.gov.in THOMAS.C.BABU APCF, AERO, VSSC, ISRO 914712565833

More information

COMPUTATIONAL FLUID DYNAMICS USING COMMERCIAL CFD CODES

COMPUTATIONAL FLUID DYNAMICS USING COMMERCIAL CFD CODES ME469B - Spring 2007 COMPUTATIONAL FLUID DYNAMICS USING COMMERCIAL CFD CODES Gianluca Iaccarino Dept. Mechanical Engineering Bldg. 500 RM 204 (RM500-I) Ph. 650-723-9599 Email: jops@stanford.edu ME469B/1/GI

More information

NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES

NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES Vol. XX 2012 No. 4 28 34 J. ŠIMIČEK O. HUBOVÁ NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES Jozef ŠIMIČEK email: jozef.simicek@stuba.sk Research field: Statics and Dynamics Fluids mechanics

More information

Chemical Engineering - CHEN

Chemical Engineering - CHEN Auburn University 1 Chemical Engineering - CHEN Courses CHEN 2100 PRINCIPLES OF CHEMICAL ENGINEERING (4) LEC. 3. LAB. 3. Pr. (CHEM 1110 or CHEM 1117 or CHEM 1030) and (MATH 1610 or MATH 1613 or MATH 1617

More information

Yousef Saad University of Minnesota Computer Science and Engineering. CRM Montreal - April 30, 2008

Yousef Saad University of Minnesota Computer Science and Engineering. CRM Montreal - April 30, 2008 A tutorial on: Iterative methods for Sparse Matrix Problems Yousef Saad University of Minnesota Computer Science and Engineering CRM Montreal - April 30, 2008 Outline Part 1 Sparse matrices and sparsity

More information

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms Björn Rocker Hamburg, June 17th 2010 Engineering Mathematics and Computing Lab (EMCL) KIT University of the State

More information

Real Time Simulation of Power Plants

Real Time Simulation of Power Plants Real Time Simulation of Power Plants Torsten Dreher 1 System Simulation Group Friedrich-Alexander-University Erlangen-Nuremberg Siemens Simulation Center, Erlangen December 14, 2008 1 torsten.dreher@informatik.uni-erlangen.de

More information

FLUID MECHANICS IM0235 DIFFERENTIAL EQUATIONS - CB0235 2014_1

FLUID MECHANICS IM0235 DIFFERENTIAL EQUATIONS - CB0235 2014_1 COURSE CODE INTENSITY PRE-REQUISITE CO-REQUISITE CREDITS ACTUALIZATION DATE FLUID MECHANICS IM0235 3 LECTURE HOURS PER WEEK 48 HOURS CLASSROOM ON 16 WEEKS, 32 HOURS LABORATORY, 112 HOURS OF INDEPENDENT

More information

How To Run A Cdef Simulation

How To Run A Cdef Simulation Simple CFD Simulations and Visualisation using OpenFOAM and ParaView Sachiko Arvelius, PhD Purpose of this presentation To show my competence in CFD (Computational Fluid Dynamics) simulation and visualisation

More information

High Performance Multi-Layer Ocean Modeling. University of Kentucky, Computer Science Department, 325 McVey Hall, Lexington, KY 40506-0045, USA.

High Performance Multi-Layer Ocean Modeling. University of Kentucky, Computer Science Department, 325 McVey Hall, Lexington, KY 40506-0045, USA. High Performance Multi-Layer Ocean Modeling Craig C. Douglas A, Gundolf Haase B, and Mohamed Iskandarani C A University of Kentucky, Computer Science Department, 325 McVey Hall, Lexington, KY 40506-0045,

More information

Curriculum Vitae. George Breyiannis. Mechanical Engineer, PhD

Curriculum Vitae. George Breyiannis. Mechanical Engineer, PhD Curriculum Vitae George Breyiannis Mechanical Engineer, PhD Sep 2006 Personal Nationality: Greek/Australian Place/Date of birth: Melbourne Australia, March 13, 1968. Marital status: Married, two children.

More information

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui Hardware-Aware Analysis and Optimization of Stable Fluids Presentation Date: Sep 15 th 2009 Chrissie C. Cui Outline Introduction Highlights Flop and Bandwidth Analysis Mehrstellen Schemes Advection Caching

More information

Towards Optimal Performance for Lattice Boltzmann Applications on Terascale Computers

Towards Optimal Performance for Lattice Boltzmann Applications on Terascale Computers Towards Optimal Performance for Lattice Boltzmann Applications on Terascale Computers G. Wellein a, P. Lammers b, G. Hager a, S. Donath a, and T. Zeiser a a Regionales Rechenzentrum Erlangen (RRZE) Martensstraße

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

C3.8 CRM wing/body Case

C3.8 CRM wing/body Case C3.8 CRM wing/body Case 1. Code description XFlow is a high-order discontinuous Galerkin (DG) finite element solver written in ANSI C, intended to be run on Linux-type platforms. Relevant supported equation

More information

Highly Scalable Dynamic Load Balancing in the Atmospheric Modeling System COSMO-SPECS+FD4

Highly Scalable Dynamic Load Balancing in the Atmospheric Modeling System COSMO-SPECS+FD4 Center for Information Services and High Performance Computing (ZIH) Highly Scalable Dynamic Load Balancing in the Atmospheric Modeling System COSMO-SPECS+FD4 PARA 2010, June 9, Reykjavík, Iceland Matthias

More information

Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures

Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures E Calore, S F Schifano, R Tripiccione Enrico Calore INFN Ferrara, Italy Perspectives of GPU Computing in Physics

More information

Fast Multipole Method for particle interactions: an open source parallel library component

Fast Multipole Method for particle interactions: an open source parallel library component Fast Multipole Method for particle interactions: an open source parallel library component F. A. Cruz 1,M.G.Knepley 2,andL.A.Barba 1 1 Department of Mathematics, University of Bristol, University Walk,

More information

CFD Application on Food Industry; Energy Saving on the Bread Oven

CFD Application on Food Industry; Energy Saving on the Bread Oven Middle-East Journal of Scientific Research 13 (8): 1095-1100, 2013 ISSN 1990-9233 IDOSI Publications, 2013 DOI: 10.5829/idosi.mejsr.2013.13.8.548 CFD Application on Food Industry; Energy Saving on the

More information

Computers in Film Making

Computers in Film Making Computers in Film Making Snow White (1937) Computers in Film Making Slide 1 Snow White - Disney s Folly Moral: Original Budget $250,000 Production Cost $1,488,422 Frames 127,000 Production time 3.5 years

More information

Master s in Computational Mathematics. Faculty of Mathematics, University of Waterloo

Master s in Computational Mathematics. Faculty of Mathematics, University of Waterloo Master s in Computational Mathematics Faculty of Mathematics, University of Waterloo Master s in Computational Mathematics One-year, interdisciplinary Master s (duration of study 12 months, starts September

More information

Steady Flow: Laminar and Turbulent in an S-Bend

Steady Flow: Laminar and Turbulent in an S-Bend STAR-CCM+ User Guide 6663 Steady Flow: Laminar and Turbulent in an S-Bend This tutorial demonstrates the flow of an incompressible gas through an s-bend of constant diameter (2 cm), for both laminar and

More information

Modeling of Earth Surface Dynamics and Related Problems Using OpenFOAM

Modeling of Earth Surface Dynamics and Related Problems Using OpenFOAM CSDMS 2013 Meeting Modeling of Earth Surface Dynamics and Related Problems Using OpenFOAM Xiaofeng Liu, Ph.D., P.E. Assistant Professor Department of Civil and Environmental Engineering University of Texas

More information

Spatial Discretisation Schemes in the PDE framework Peano for Fluid-Structure Interactions

Spatial Discretisation Schemes in the PDE framework Peano for Fluid-Structure Interactions Spatial Discretisation Schemes in the PDE framework Peano for Fluid-Structure Interactions T. Neckel, H.-J. Bungartz, B. Gatzhammer, M. Mehl, C. Zenger TUM Department of Informatics Chair of Scientific

More information

ParFUM: A Parallel Framework for Unstructured Meshes. Aaron Becker, Isaac Dooley, Terry Wilmarth, Sayantan Chakravorty Charm++ Workshop 2008

ParFUM: A Parallel Framework for Unstructured Meshes. Aaron Becker, Isaac Dooley, Terry Wilmarth, Sayantan Chakravorty Charm++ Workshop 2008 ParFUM: A Parallel Framework for Unstructured Meshes Aaron Becker, Isaac Dooley, Terry Wilmarth, Sayantan Chakravorty Charm++ Workshop 2008 What is ParFUM? A framework for writing parallel finite element

More information

Scientific Computing Programming with Parallel Objects

Scientific Computing Programming with Parallel Objects Scientific Computing Programming with Parallel Objects Esteban Meneses, PhD School of Computing, Costa Rica Institute of Technology Parallel Architectures Galore Personal Computing Embedded Computing Moore

More information

Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes

Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes Eric Petit, Loïc Thebault, Quang V. Dinh May 2014 EXA2CT Consortium 2 WPs Organization Proto-Applications

More information

Fire Simulations in Civil Engineering

Fire Simulations in Civil Engineering Contact: Lukas Arnold l.arnold@fz- juelich.de Nb: I had to remove slides containing input from our industrial partners. Sorry. Fire Simulations in Civil Engineering 07.05.2013 Lukas Arnold Fire Simulations

More information

Performance Evaluation of Amazon EC2 for NASA HPC Applications!

Performance Evaluation of Amazon EC2 for NASA HPC Applications! National Aeronautics and Space Administration Performance Evaluation of Amazon EC2 for NASA HPC Applications! Piyush Mehrotra!! J. Djomehri, S. Heistand, R. Hood, H. Jin, A. Lazanoff,! S. Saini, R. Biswas!

More information

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer Stan Posey, MSc and Bill Loewe, PhD Panasas Inc., Fremont, CA, USA Paul Calleja, PhD University of Cambridge,

More information

A Multi-layered Domain-specific Language for Stencil Computations

A Multi-layered Domain-specific Language for Stencil Computations A Multi-layered Domain-specific Language for Stencil Computations Christian Schmitt, Frank Hannig, Jürgen Teich Hardware/Software Co-Design, University of Erlangen-Nuremberg Workshop ExaStencils 2014,

More information

Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004

Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004 FSI-02-TN59-R2 Multi-Block Gridding Technique for FLOW-3D Flow Science, Inc. July 2004 1. Introduction A major new extension of the capabilities of FLOW-3D -- the multi-block grid model -- has been incorporated

More information