CSE Case Study: Optimising the CFD code DG-DES
|
|
|
- Tracy Brooks
- 9 years ago
- Views:
Transcription
1 CSE Case Study: Optimising the CFD code DG-DES CSE Team NAG Ltd., Naveed Durrani University of Sheffield June 2008 Introduction One of the activities of the NAG CSE (Computational Science and Engineering) team is to provide computational support to scientists using the HECToR service, which includes helping users get the most out of the machine by providing short periods of support to optimise code performance. This case study describes one such effort to improve the performance of a Computational Fluid Dynamics (CFD) code, DG-DES (Dynamic Grid - Detached Eddy Simulation), developed by a group from the Department of Mechanical Engineering at the University of Sheffield 1. Some members of the group attended NAG HECToR training (which is freely available to all HECToR users) and the CSE team were able to let them know how to get access to HECToR and help port DG- DES on to the machine. After porting, the group found that the code was crashing and submitted a query requesting help with debugging. The CSE team spotted and fixed two minor errors in the code (a mis-declared array resulting in a segmentation fault and a divide by zero). With the code working properly the next step was to improve its performance with specific production runs in mind (the users wanted to perform a series of 12 hour runs using around 256 PEs), so a request for a short period of help with optimisation was sent to SAFE. DG-DES DG-DES is a Fortran 90 MPI code which models turbulent fluid flows over time in a 3-dimensional spatial domain. The finite element spatial domain is decomposed using the Metis graph partitioning software 2 and time-stepping follows a multi-stage Runge-Kutta scheme. DES is a hybrid numerical approach that combines the strengths of two other numerical schemes called Reynolds Averaged Navier Stokes (RANS) and Large Eddy Simulation (LES). DG-DES is also capable of simulations that require a dynamic grid, where the locations of mesh points are altered at regular intervals according to the behaviour of the fluid. The test case supplied with the source code was for a comparatively small fixed mesh of around 35,000 nodes; production run meshes consist of more than 1.5 million nodes
2 Figure 1: Plot of vorticity magnitude at different spanwise locations around cylinder for a Reynolds Number of Initial Performance Tests In porting their code to HECToR the group employed the default options for compiling - simply replacing in their Makefile the old compiler command with the ftn command and using the PGI compiler without additional options. Therefore, the first experiment was to find out which compiler and combination of optimisation options to use. Figure 2 illustrates the results of tests performed on the Test and Development System (TDS) 3 for the PGI and Pathscale compilers with and without the vendor-suggested optimisation options. Figure 2 shows that the Pathscale (3.0) compiler produces the best-performing executable: compared with the case where the PGI (7.0.4) compiler is used without specifying optimisation options (top line, i.e. how the code was originally being compiled), simply switching to the Pathscale compiler and using -Ofast produces a significant speedup. The tests in Figure 2 were in dual core mode; the results in Figure 3 show clearly that there is no benefit to running DG-DES in single core mode. From browsing the source code it is apparent that DG-DES performs many subroutine and function calls within the main computational loops, which suggests the importance of inlining. Pathscale s -Ofast option expands to -O3 -ipa -OPT:ro=2:Olimit=0:div split=on:alias=typed -fno-math-errno -ffast-math, and although inlining is included as part of the Inter-Procedural Analysis option, -ipa, there are further, more aggressive options that control inlining. The following were tried in addition to-ofast: -IPA:plimit=20000:callee limit= INLINE:aggressive=on. Using -INLINE:list=on shows that more of the larger subroutines called within loops are inlined, but Figure 4 shows this has no effect on performance; the routines inlilned by -ipa are sufficient. 3 A test machine available to the CSE team for supporting users. 2
3 Figure 2: Performance of DG-DES for 4, 8, 16, 32, 64 and 128 Processing Elements (PEs) for the PGI and Pathscale compilers with and without optimisation flags. Figure 3: Performance of DG-DES running in single and dual core mode. 3
4 Figure 4: Performance of DG-DES with extra, aggressive inlining options. Profiling Profiling with CrayPAT reports that the subroutine residual mpi and its children account for around 75% of execution time. However, the incompatibility of the -ipa option and the -g option added by CrayPAT meant that CrayPAT was of limited further use in this exercise to, for example, compare utilisation rates between optimised and unoptimised code. CrayPAT was also found to produce very slow instrumented executables for tracing experiments, even when the Automatic Profiling Analysis (APA) option was used (tracing still fewer routines than APA also proved sluggish). A custom timing module was therefore incorporated into the code, using the MPI WTIME function. Simple Optimisations Subroutine residual mpi is called within the main computational loop and calls other routines many times per iteration, resulting in millions of calls even for short test runs. Two of the top time-consuming routines, mu and sonic, are one-line functions which can be inlined manually very easily. Two operands for the calculations performed by these routines are constants, but were not declared as such; doing so gives the compiler the opportunity to remove two loads per calculation. Inlining and changing these variables to constants produced around a 10% speedup 4 overall compared to -Ofast alone. It was observed that the statement if(teqs!=0) is executed repeatedly because the constant teqs (a flag for the turbulence model) declared as a variable. Re-declaring as a parameter allows the compiler to remove these conditionals, producing a further 5% speedup overall. Repeated execution of the following case statement was also redundant since the users are only interested in DES experiments: 4 Percentage speedup is measured as follows throughout: 100 (originaltime newtime) originaltime, i.e. percentage of the original execution time that has been saved. 4
5 Figure 5: Performance of DG-DES with: -Ofast alone; after inlining sonic and mu; after declaring teqs as a parameter; and after eliminating the case statements. The combined performance improvement is around 40% for 16 PEs. select case(trim(tmodel)) case( S-A, DES ) tfv = Vn*tFV tfv = tfv*face(id)%vol case( k-w )... end select Replacing this block with unconditional execution of the DES case produced a further 25% speedup overall. This change is controlled by a pre-processor macro, enabling users to switch back to the original code easily. Figure 5 shows the performance improvements made by each of these three changes. Scaling The performance curves in Figure 5 suggest that DG-DES may not scale well for the test case. This was confirmed by performing tests up to 512 PEs on HECToR: see Figure 6. Timing key sections of code, shown in Figure 7, reveals the source of the poor performance beyond 64 PEs: a broadcast in the timestepping loop, which indicates a load balancing problem. It was thought this was likely due to partitioning a relatively small, irregular test mesh among many processes; running the same code using a larger production mesh indicates that this seems to be the case (see Figure 8) and that there should be no problems scaling up to 256 PEs, as the users intend. 5
6 Figure 6: Performance of DG-DES in the test case up to 512 PEs: poor scaling. Figure 7: Timing key sections of code show that poor scaling is mainly due to the cost of a broadcast in the timestepping loop. 6
7 Figure 8: Timing the same sections of code as in Figure 7 for a production mesh of 1.8 million nodes shows that DG-DES does scale well up to 256 PEs, as required. Figure 9: Overall performance improvement of DG-DES: before CSE help the code was being compiled using PGI with no optimisation options; with CSE help, the code runs 70% faster due to switching to the Pathscale compiler with -Ofast and making simple code modifications. 7
8 Summary Figure 9 shows the performance of the code before and after CSE assistance around a 70% performance improvement gained by making simple, changes such as switching compiler, enabling optimisation options and removing redundant code. The NAG CSE team were able to support the DG-DES users by advising them how to gain access to HECToR, helping to port and debug the code, profiling and optimising for specific production runs, and providing advice in planning their experiments. The users have also been supplied with timing routines to help monitor the performance of their code in future. For example, the output below gives the cumulative time spent in key sections of code. In particular the problematic broadcast of Figure 7 is timed; if the difference between the maximum and minimum values is large then the problem is poorly load-balanced Overall run time max time = min time = Cumulative time for global loop -max time = min time = bcast with slamon ---max time = min time = This case study shows that it is expedient to allow the compiler to optimise your code, giving it as much help as possible. It also demonstrates how eliminating small sections of redundant code can have a large performance impact it is not always necessary to make drastic changes. These changes may reduce the flexibility of your code, but this may be worthwhile for production runs. Also make sure you take into account the test case you are using for benchmarking; it is important to check that changes you make scale up to the large cases you intend to use for your experiments. 8
Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows
Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows Lucian Anton 1, Edward Smith 2 1 Numerical Algorithms Group Ltd, Wilkinson House, Jordan Hill Road,
Introductory FLUENT Training
Chapter 10 Transient Flow Modeling Introductory FLUENT Training www.ptecgroup.ir 10-1 Motivation Nearly all flows in nature are transient! Steady-state assumption is possible if we: Ignore transient fluctuations
CONVERGE Features, Capabilities and Applications
CONVERGE Features, Capabilities and Applications CONVERGE CONVERGE The industry leading CFD code for complex geometries with moving boundaries. Start using CONVERGE and never make a CFD mesh again. CONVERGE
Multiphase Flow - Appendices
Discovery Laboratory Multiphase Flow - Appendices 1. Creating a Mesh 1.1. What is a geometry? The geometry used in a CFD simulation defines the problem domain and boundaries; it is the area (2D) or volume
HPC Wales Skills Academy Course Catalogue 2015
HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses
Application of CFD modelling to the Design of Modern Data Centres
Application of CFD modelling to the Design of Modern Data Centres White Paper March 2012 By Sam Wicks BEng CFD Applications Engineer Sudlows March 14, 2012 Application of CFD modelling to the Design of
Parallel and Distributed Computing Programming Assignment 1
Parallel and Distributed Computing Programming Assignment 1 Due Monday, February 7 For programming assignment 1, you should write two C programs. One should provide an estimate of the performance of ping-pong
Streamline Computing Linux Cluster User Training. ( Nottingham University)
1 Streamline Computing Linux Cluster User Training ( Nottingham University) 3 User Training Agenda System Overview System Access Description of Cluster Environment Code Development Job Schedulers Running
GAMBIT Demo Tutorial
GAMBIT Demo Tutorial Wake of a Cylinder. 1.1 Problem Description The problem to be considered is schematically in fig. 1. We consider flow across a cylinder and look at the wake behind the cylinder. Air
Differentiating a Time-dependent CFD Solver
Differentiating a Time-dependent CFD Solver Presented to The AD Workshop, Nice, April 2005 Mohamed Tadjouddine & Shaun Forth Engineering Systems Department Cranfield University (Shrivenham Campus) Swindon
Guide to Partitioning Unstructured Meshes for Parallel Computing
Guide to Partitioning Unstructured Meshes for Parallel Computing Phil Ridley Numerical Algorithms Group Ltd, Wilkinson House, Jordan Hill Road, Oxford, OX2 8DR, UK, email: [email protected] April 17,
OPTIMISE TANK DESIGN USING CFD. Lisa Brown. Parsons Brinckerhoff
OPTIMISE TANK DESIGN USING CFD Paper Presented by: Lisa Brown Authors: Lisa Brown, General Manager, Franz Jacobsen, Senior Water Engineer, Parsons Brinckerhoff 72 nd Annual Water Industry Engineers and
Chapter 7: Additional Topics
Chapter 7: Additional Topics In this chapter we ll briefly cover selected advanced topics in fortran programming. All the topics come in handy to add extra functionality to programs, but the feature you
Module 6 Case Studies
Module 6 Case Studies 1 Lecture 6.1 A CFD Code for Turbomachinery Flows 2 Development of a CFD Code The lecture material in the previous Modules help the student to understand the domain knowledge required
Good FORTRAN Programs
Good FORTRAN Programs Nick West Postgraduate Computing Lectures Good Fortran 1 What is a Good FORTRAN Program? It Works May be ~ impossible to prove e.g. Operating system. Robust Can handle bad data e.g.
Computational Fluid Dynamics Research Projects at Cenaero (2011)
Computational Fluid Dynamics Research Projects at Cenaero (2011) Cenaero (www.cenaero.be) is an applied research center focused on the development of advanced simulation technologies for aeronautics. Located
Advanced Computer Architecture-CS501. Computer Systems Design and Architecture 2.1, 2.2, 3.2
Lecture Handout Computer Architecture Lecture No. 2 Reading Material Vincent P. Heuring&Harry F. Jordan Chapter 2,Chapter3 Computer Systems Design and Architecture 2.1, 2.2, 3.2 Summary 1) A taxonomy of
CFD Simulation of a 3-Bladed Horizontal Axis Tidal Stream Turbine using RANS and LES
CFD Simulation of a 3-Bladed Horizontal Axis Tidal Stream Turbine using RANS and LES J. McNaughton, I. Afgan, D.D. Apsley, S. Rolfo, T. Stallard and P.K. Stansby Modelling and Simulation Centre, School
Lecture 16 - Free Surface Flows. Applied Computational Fluid Dynamics
Lecture 16 - Free Surface Flows Applied Computational Fluid Dynamics Instructor: André Bakker http://www.bakker.org André Bakker (2002-2006) Fluent Inc. (2002) 1 Example: spinning bowl Example: flow in
Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach
Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach S. M. Ashraful Kadir 1 and Tazrian Khan 2 1 Scientific Computing, Royal Institute of Technology (KTH), Stockholm, Sweden [email protected],
Information Processing, Big Data, and the Cloud
Information Processing, Big Data, and the Cloud James Horey Computational Sciences & Engineering Oak Ridge National Laboratory Fall Creek Falls 2010 Information Processing Systems Model Parameters Data-intensive
OpenFOAM Opensource and CFD
OpenFOAM Opensource and CFD Andrew King Department of Mechanical Engineering Curtin University Outline What is Opensource Software OpenFOAM Overview Utilities, Libraries and Solvers Data Formats The CFD
Turbomachinery CFD on many-core platforms experiences and strategies
Turbomachinery CFD on many-core platforms experiences and strategies Graham Pullan Whittle Laboratory, Department of Engineering, University of Cambridge MUSAF Colloquium, CERFACS, Toulouse September 27-29
TWO-DIMENSIONAL FINITE ELEMENT ANALYSIS OF FORCED CONVECTION FLOW AND HEAT TRANSFER IN A LAMINAR CHANNEL FLOW
TWO-DIMENSIONAL FINITE ELEMENT ANALYSIS OF FORCED CONVECTION FLOW AND HEAT TRANSFER IN A LAMINAR CHANNEL FLOW Rajesh Khatri 1, 1 M.Tech Scholar, Department of Mechanical Engineering, S.A.T.I., vidisha
C3.8 CRM wing/body Case
C3.8 CRM wing/body Case 1. Code description XFlow is a high-order discontinuous Galerkin (DG) finite element solver written in ANSI C, intended to be run on Linux-type platforms. Relevant supported equation
How To Run A Steady Case On A Creeper
Crash Course Introduction to OpenFOAM Artur Lidtke University of Southampton [email protected] November 4, 2014 Artur Lidtke Crash Course Introduction to OpenFOAM 1 / 32 What is OpenFOAM? Using OpenFOAM
WESTMORELAND COUNTY PUBLIC SCHOOLS 2011 2012 Integrated Instructional Pacing Guide and Checklist Computer Math
Textbook Correlation WESTMORELAND COUNTY PUBLIC SCHOOLS 2011 2012 Integrated Instructional Pacing Guide and Checklist Computer Math Following Directions Unit FIRST QUARTER AND SECOND QUARTER Logic Unit
2) Write in detail the issues in the design of code generator.
COMPUTER SCIENCE AND ENGINEERING VI SEM CSE Principles of Compiler Design Unit-IV Question and answers UNIT IV CODE GENERATION 9 Issues in the design of code generator The target machine Runtime Storage
Optimization of a large-scale parallel finite element finite volume C++ code
Optimization of a large-scale parallel finite element finite volume C++ code Qi Huangfu August 22, 2008 MSc in High Performance Computing The University of Edinburgh Year of Presentation: 2008 Abstract
External bluff-body flow-cfd simulation using ANSYS Fluent
External bluff-body flow-cfd simulation using ANSYS Fluent External flow over a bluff body is complex, three-dimensional, and vortical. It is massively separated and it exhibits vortex shedding. Thus,
HPC Deployment of OpenFOAM in an Industrial Setting
HPC Deployment of OpenFOAM in an Industrial Setting Hrvoje Jasak [email protected] Wikki Ltd, United Kingdom PRACE Seminar: Industrial Usage of HPC Stockholm, Sweden, 28-29 March 2011 HPC Deployment
TDDD56 Multicore and GPU computing Lab 1: Load balancing
TDDD56 Multicore and GPU computing Lab 1: Load balancing Nicolas Melot [email protected] November 20, 2015 1 Introduction This laboratory aims to implement a parallel algorithm to generates a visual
Spark: Cluster Computing with Working Sets
Spark: Cluster Computing with Working Sets Outline Why? Mesos Resilient Distributed Dataset Spark & Scala Examples Uses Why? MapReduce deficiencies: Standard Dataflows are Acyclic Prevents Iterative Jobs
UT69R000 MicroController Software Tools Product Brief
Military Standard Products UT69R000 MicroController Software Tools Product Brief July 1996 Introduction The UT69R000 MicroController Software Tools consist of a C Compiler (GCC), a RISC assembler (), a
PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms
PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms P. E. Vincent! Department of Aeronautics Imperial College London! 25 th March 2014 Overview Motivation Flux Reconstruction Many-Core
Load Imbalance Analysis
With CrayPat Load Imbalance Analysis Imbalance time is a metric based on execution time and is dependent on the type of activity: User functions Imbalance time = Maximum time Average time Synchronization
DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER
European Congress on Computational Methods in Applied Sciences and Engineering ECCOMAS 2000 Barcelona, 11-14 September 2000 ECCOMAS DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER
New Tools Using the Hardware Pertbrmaiihe... Monitor to Help Users Tune Programs on the Cray X-MP*
ANL/CP--7 4428 _' DE92 001926 New Tools Using the Hardware Pertbrmaiihe... Monitor to Help Users Tune Programs on the Cray X-MP* D. E. Engert, L. Rudsinski J. Doak Argonne National Laboratory Cray Research
Fourth generation techniques (4GT)
Fourth generation techniques (4GT) The term fourth generation techniques (4GT) encompasses a broad array of software tools that have one thing in common. Each enables the software engineer to specify some
ASSEMBLY PROGRAMMING ON A VIRTUAL COMPUTER
ASSEMBLY PROGRAMMING ON A VIRTUAL COMPUTER Pierre A. von Kaenel Mathematics and Computer Science Department Skidmore College Saratoga Springs, NY 12866 (518) 580-5292 [email protected] ABSTRACT This paper
Andrew McRae Megadata Pty Ltd. [email protected]
A UNIX Task Broker Andrew McRae Megadata Pty Ltd. [email protected] This abstract describes a UNIX Task Broker, an application which provides redundant processing configurations using multiple
XFlow CFD results for the 1st AIAA High Lift Prediction Workshop
XFlow CFD results for the 1st AIAA High Lift Prediction Workshop David M. Holman, Dr. Monica Mier-Torrecilla, Ruddy Brionnaud Next Limit Technologies, Spain THEME Computational Fluid Dynamics KEYWORDS
Turbulence Modeling in CFD Simulation of Intake Manifold for a 4 Cylinder Engine
HEFAT2012 9 th International Conference on Heat Transfer, Fluid Mechanics and Thermodynamics 16 18 July 2012 Malta Turbulence Modeling in CFD Simulation of Intake Manifold for a 4 Cylinder Engine Dr MK
Aerodynamic Department Institute of Aviation. Adam Dziubiński CFD group FLUENT
Adam Dziubiński CFD group IoA FLUENT Content Fluent CFD software 1. Short description of main features of Fluent 2. Examples of usage in CESAR Analysis of flow around an airfoil with a flap: VZLU + ILL4xx
GPU Acceleration of the SENSEI CFD Code Suite
GPU Acceleration of the SENSEI CFD Code Suite Chris Roy, Brent Pickering, Chip Jackson, Joe Derlaga, Xiao Xu Aerospace and Ocean Engineering Primary Collaborators: Tom Scogland, Wu Feng (Computer Science)
Embedded Programming in C/C++: Lesson-1: Programming Elements and Programming in C
Embedded Programming in C/C++: Lesson-1: Programming Elements and Programming in C 1 An essential part of any embedded system design Programming 2 Programming in Assembly or HLL Processor and memory-sensitive
SOLIDWORKS SOFTWARE OPTIMIZATION
W H I T E P A P E R SOLIDWORKS SOFTWARE OPTIMIZATION Overview Optimization is the calculation of weight, stress, cost, deflection, natural frequencies, and temperature factors, which are dependent on variables
Using CFD to improve the design of a circulating water channel
2-7 December 27 Using CFD to improve the design of a circulating water channel M.G. Pullinger and J.E. Sargison School of Engineering University of Tasmania, Hobart, TAS, 71 AUSTRALIA Abstract Computational
Acceleration of a CFD Code with a GPU
Acceleration of a CFD Code with a GPU Dennis C. Jespersen ABSTRACT The Computational Fluid Dynamics code Overflow includes as one of its solver options an algorithm which is a fairly small piece of code
Monitoring Software using Sun Spots. Corey Andalora February 19, 2008
Monitoring Software using Sun Spots Corey Andalora February 19, 2008 Abstract Sun has developed small devices named Spots designed to provide developers familiar with the Java programming language a platform
A Load Balancing Tool for Structured Multi-Block Grid CFD Applications
A Load Balancing Tool for Structured Multi-Block Grid CFD Applications K. P. Apponsah and D. W. Zingg University of Toronto Institute for Aerospace Studies (UTIAS), Toronto, ON, M3H 5T6, Canada Email:
How To Optimise A Boat'S Hull
Hull form optimisation with Computational Fluid Dynamics Aurélien Drouet, Erwan Jacquin, Pierre-Michel Guilcher Overview Context Software Bulbous bow optimisation Conclusions/perspectives Page 2 HydrOcean
Guide to Performance and Tuning: Query Performance and Sampled Selectivity
Guide to Performance and Tuning: Query Performance and Sampled Selectivity A feature of Oracle Rdb By Claude Proteau Oracle Rdb Relational Technology Group Oracle Corporation 1 Oracle Rdb Journal Sampled
Instruction Set Architecture (ISA)
Instruction Set Architecture (ISA) * Instruction set architecture of a machine fills the semantic gap between the user and the machine. * ISA serves as the starting point for the design of a new machine
Dynamic Load Balancing of Parallel Monte Carlo Transport Calculations
Dynamic Load Balancing of Parallel Monte Carlo Transport Calculations Richard Procassini, Matthew O'Brien and Janine Taylor Lawrence Livermore National Laboratory Joint Russian-American Five-Laboratory
Simulation of Fluid-Structure Interactions in Aeronautical Applications
Simulation of Fluid-Structure Interactions in Aeronautical Applications Martin Kuntz Jorge Carregal Ferreira ANSYS Germany D-83624 Otterfing [email protected] December 2003 3 rd FENET Annual Industry
FAC 2.1: Data Center Cooling Simulation. Jacob Harris, Application Engineer Yue Ma, Senior Product Manager
FAC 2.1: Data Center Cooling Simulation Jacob Harris, Application Engineer Yue Ma, Senior Product Manager FAC 2.1: Data Center Cooling Simulation Computational Fluid Dynamics (CFD) can be used to numerically
TwinCAT NC Configuration
TwinCAT NC Configuration NC Tasks The NC-System (Numeric Control) has 2 tasks 1 is the SVB task and the SAF task. The SVB task is the setpoint generator and generates the velocity and position control
The Asterope compute cluster
The Asterope compute cluster ÅA has a small cluster named asterope.abo.fi with 8 compute nodes Each node has 2 Intel Xeon X5650 processors (6-core) with a total of 24 GB RAM 2 NVIDIA Tesla M2050 GPGPU
Overlapping Data Transfer With Application Execution on Clusters
Overlapping Data Transfer With Application Execution on Clusters Karen L. Reid and Michael Stumm [email protected] [email protected] Department of Computer Science Department of Electrical and Computer
Compilers I - Chapter 4: Generating Better Code
Compilers I - Chapter 4: Generating Better Code Lecturers: Paul Kelly ([email protected]) Office: room 304, William Penney Building Naranker Dulay ([email protected]) Materials: Office: room 562 Textbook
Fire Simulations in Civil Engineering
Contact: Lukas Arnold l.arnold@fz- juelich.de Nb: I had to remove slides containing input from our industrial partners. Sorry. Fire Simulations in Civil Engineering 07.05.2013 Lukas Arnold Fire Simulations
Introduction to COMSOL. The Navier-Stokes Equations
Flow Between Parallel Plates Modified from the COMSOL ChE Library module rev 10/13/08 Modified by Robert P. Hesketh, Chemical Engineering, Rowan University Fall 2008 Introduction to COMSOL The following
OpenFOAM: Computational Fluid Dynamics. Gauss Siedel iteration : (L + D) * x new = b - U * x old
OpenFOAM: Computational Fluid Dynamics Gauss Siedel iteration : (L + D) * x new = b - U * x old What s unique about my tuning work The OpenFOAM (Open Field Operation and Manipulation) CFD Toolbox is a
Introduction to CFD Analysis
Introduction to CFD Analysis 2-1 What is CFD? Computational Fluid Dynamics (CFD) is the science of predicting fluid flow, heat and mass transfer, chemical reactions, and related phenomena by solving numerically
Optimization tools. 1) Improving Overall I/O
Optimization tools After your code is compiled, debugged, and capable of running to completion or planned termination, you can begin looking for ways in which to improve execution speed. In general, the
Hands-on exercise (FX10 pi): NPB-MZ-MPI / BT
Hands-on exercise (FX10 pi): NPB-MZ-MPI / BT VI-HPS Team 14th VI-HPS Tuning Workshop, 25-27 March 2014, RIKEN AICS, Kobe, Japan 1 Tutorial exercise objectives Familiarise with usage of VI-HPS tools complementary
1 Description of The Simpletron
Simulating The Simpletron Computer 50 points 1 Description of The Simpletron In this assignment you will write a program to simulate a fictional computer that we will call the Simpletron. As its name implies
Analysis of Micromouse Maze Solving Algorithms
1 Analysis of Micromouse Maze Solving Algorithms David M. Willardson ECE 557: Learning from Data, Spring 2001 Abstract This project involves a simulation of a mouse that is to find its way through a maze.
CAD Import Module and LiveLink for CAD V4.3a
CAD Import Module and LiveLink for CAD V4.3a LiveLink for AutoCAD, LiveLink for Creo Parametric, LiveLink for Inventor, LiveLink for Pro/ENGINEER, LiveLink for Solid Edge, LiveLink for SolidWorks, LiveLink
Scientific Computing Programming with Parallel Objects
Scientific Computing Programming with Parallel Objects Esteban Meneses, PhD School of Computing, Costa Rica Institute of Technology Parallel Architectures Galore Personal Computing Embedded Computing Moore
GEOMETRIC, THERMODYNAMIC AND CFD ANALYSES OF A REAL SCROLL EXPANDER FOR MICRO ORC APPLICATIONS
2 nd International Seminar on ORC Power Systems October 7 th & 8 th, 213 De Doelen, Rotterdam, NL GEOMETRIC, THERMODYNAMIC AND CFD ANALYSES OF A REAL SCROLL EXPANDER FOR MICRO ORC APPLICATIONS M. Morini,
Distributed Dynamic Load Balancing for Iterative-Stencil Applications
Distributed Dynamic Load Balancing for Iterative-Stencil Applications G. Dethier 1, P. Marchot 2 and P.A. de Marneffe 1 1 EECS Department, University of Liege, Belgium 2 Chemical Engineering Department,
Aeroacoustic simulation based on linearized Euler equations and stochastic sound source modelling
Aeroacoustic simulation based on linearized Euler equations and stochastic sound source modelling H. Dechipre a, M. Hartmann a, J. W Delfs b and R. Ewert b a Volkswagen AG, Brieffach 1777, 38436 Wolfsburg,
C++ INTERVIEW QUESTIONS
C++ INTERVIEW QUESTIONS http://www.tutorialspoint.com/cplusplus/cpp_interview_questions.htm Copyright tutorialspoint.com Dear readers, these C++ Interview Questions have been designed specially to get
Embedded Systems. Review of ANSI C Topics. A Review of ANSI C and Considerations for Embedded C Programming. Basic features of C
Embedded Systems A Review of ANSI C and Considerations for Embedded C Programming Dr. Jeff Jackson Lecture 2-1 Review of ANSI C Topics Basic features of C C fundamentals Basic data types Expressions Selection
DNS (Domain Name System) is the system & protocol that translates domain names to IP addresses.
Lab Exercise DNS Objective DNS (Domain Name System) is the system & protocol that translates domain names to IP addresses. Step 1: Analyse the supplied DNS Trace Here we examine the supplied trace of a
MPI Hands-On List of the exercises
MPI Hands-On List of the exercises 1 MPI Hands-On Exercise 1: MPI Environment.... 2 2 MPI Hands-On Exercise 2: Ping-pong...3 3 MPI Hands-On Exercise 3: Collective communications and reductions... 5 4 MPI
Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015
Numerical Calculation of Laminar Flame Propagation with Parallelism Assignment ZERO, CS 267, UC Berkeley, Spring 2015 Xian Shi 1 bio I am a second-year Ph.D. student from Combustion Analysis/Modeling Lab,
Multicore Parallel Computing with OpenMP
Multicore Parallel Computing with OpenMP Tan Chee Chiang (SVU/Academic Computing, Computer Centre) 1. OpenMP Programming The death of OpenMP was anticipated when cluster systems rapidly replaced large
ANSYS FLUENT. Using Moving Reference Frames and Sliding Meshes WS5-1. Customer Training Material
Workshop 5 Using Moving Reference Frames and Sliding Meshes Introduction to ANSYS FLUENT WS5-1 Introduction [1] Several solution strategies exist when there are moving parts in the domain. This workshop
CentOS Linux 5.2 and Apache 2.2 vs. Microsoft Windows Web Server 2008 and IIS 7.0 when Serving Static and PHP Content
Advances in Networks, Computing and Communications 6 92 CentOS Linux 5.2 and Apache 2.2 vs. Microsoft Windows Web Server 2008 and IIS 7.0 when Serving Static and PHP Content Abstract D.J.Moore and P.S.Dowland
Configuring Simulator 3 for IP
Configuring Simulator 3 for IP Application Note AN405 Revision v1.0 August 2012 AN405 Configuring Simulator 3 for IP 1 Overview This Application Note is designed to help to configure a Sim 3 to run in
High level code and machine code
High level code and machine code Teacher s Notes Lesson Plan x Length 60 mins Specification Link 2.1.7/cde Programming languages Learning objective Students should be able to (a) explain the difference
Get an Easy Performance Boost Even with Unthreaded Apps. with Intel Parallel Studio XE for Windows*
Get an Easy Performance Boost Even with Unthreaded Apps for Windows* Can recompiling just one file make a difference? Yes, in many cases it can! Often, you can achieve a major performance boost by recompiling
Performance Comparison of a Vertical Axis Wind Turbine using Commercial and Open Source Computational Fluid Dynamics based Codes
Performance Comparison of a Vertical Axis Wind Turbine using Commercial and Open Source Computational Fluid Dynamics based Codes Taimoor Asim 1, Rakesh Mishra 1, Sree Nirjhor Kaysthagir 1, Ghada Aboufares
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Visualization of 2D Domains
Visualization of 2D Domains This part of the visualization package is intended to supply a simple graphical interface for 2- dimensional finite element data structures. Furthermore, it is used as the low
QA Analysis of the WRF Program
QA Analysis of the WRF Program WRF Workshop, Boulder Colorado, 26-28th June 2012 Mark Anderson 1 John Collins 1,2,3,4 Brian Farrimond 1,2,4 [email protected] [email protected] [email protected]
Eu-NORSEWInD - Assessment of Viability of Open Source CFD Code for the Wind Industry
Downloaded from orbit.dtu.dk on: Jun 28, 2016 Eu-NORSEWInD - Assessment of Viability of Open Source CFD Code for the Wind Industry Stickland, Matt; Scanlon, Tom; Fabre, Sylvie; Ahmad, Abdul; Oldroyd, Andrew;
Performance Analysis and Optimization Tool
Performance Analysis and Optimization Tool Andres S. CHARIF-RUBIAL [email protected] Performance Analysis Team, University of Versailles http://www.maqao.org Introduction Performance Analysis Develop
THE UNIVERSITY OF AUCKLAND
COMPSCI 742 THE UNIVERSITY OF AUCKLAND SECOND SEMESTER, 2008 Campus: City COMPUTER SCIENCE Data Communications and Networks (Time allowed: TWO hours) NOTE: Attempt all questions. Calculators are NOT permitted.
