Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster

Similar documents
A New Solution Adaption Capability for the OVERFLOW CFD Code

CFD Lab Department of Engineering The University of Liverpool

Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms

Aerodynamic Department Institute of Aviation. Adam Dziubiński CFD group FLUENT

Accelerating CFD using OpenFOAM with GPUs

Acceleration of a CFD Code with a GPU

Coupled CFD and Vortex Methods for Modelling Hydro- and Aerodynamics of Tidal Current Turbines and On- and Offshore Wind Turbines

Turbomachinery CFD on many-core platforms experiences and strategies

PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms

Computational Modeling of Wind Turbines in OpenFOAM

TIME-ACCURATE SIMULATION OF THE FLOW AROUND THE COMPLETE BO105 WIND TUNNEL MODEL

Module 6 Case Studies

Simulation of Fluid-Structure Interactions in Aeronautical Applications

Aerodynamics of Rotating Discs

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms

GPU Acceleration of the SENSEI CFD Code Suite

OpenFOAM in Wind Energy: Wind Turbines as a source term. Paolo Schito, Luca Bernini, Alberto Zasso

AeroFluidX: A Next Generation GPU-Based CFD Solver for Engineering Applications

High Performance Computing in CST STUDIO SUITE

Purdue University - School of Mechanical Engineering. Objective: Study and predict fluid dynamics of a bluff body stabilized flame configuration.

Comparison between OpenFOAM CFD & BEM theory for variable speed variable pitch HAWT

OpenFOAM Optimization Tools

(Toward) Radiative transfer on AMR with GPUs. Dominique Aubert Université de Strasbourg Austin, TX,

ABSTRACT FOR THE 1ST INTERNATIONAL WORKSHOP ON HIGH ORDER CFD METHODS

Structured grid flow solvers such as NASA s OVERFLOW compressible Navier-Stokes flow solver 1, 2

Design and Optimization of a Portable Lattice Boltzmann Code for Heterogeneous Architectures

Load Balancing Strategies for Multi-Block Overset Grid Applications NAS

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster

Overset composite grids for the simulation of complex moving geometries

A Fast Double Precision CFD Code using CUDA

PASSIVE CONTROL OF SHOCK WAVE APPLIED TO HELICOPTER ROTOR HIGH-SPEED IMPULSIVE NOISE REDUCTION

CFD Applications using CFD++ Paul Batten & Vedat Akdag

CFD Simulation of the NREL Phase VI Rotor

HPC enabling of OpenFOAM R for CFD applications

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS

Unsteady CFD of a Marine Current Turbine using OpenFOAM with Generalised Grid Interface

Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age

C3.8 CRM wing/body Case

Lecture 11 Boundary Layers and Separation. Applied Computational Fluid Dynamics

walberla: A software framework for CFD applications on Compute Cores

Lecture 6 - Boundary Conditions. Applied Computational Fluid Dynamics

Lecture 16 - Free Surface Flows. Applied Computational Fluid Dynamics

AN INVESTIGATION ON THE AERODYNAMIC PERFORMANCE OF A VERTICAL AXIS WIND TURBINE ETESH VAISHNAV

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR

Hardware-Aware Analysis and. Presentation Date: Sep 15 th 2009 Chrissie C. Cui

Simulation at Aeronautics Test Facilities A University Perspective Helen L. Reed, Ph.D., P.E. ASEB meeting, Irvine CA 15 October

The Application of a Black-Box Solver with Error Estimate to Different Systems of PDEs

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

CFD Analysis of Civil Transport Aircraft

CFD Based Air Flow and Contamination Modeling of Subway Stations

Parallel Computing. Introduction

Fast Multipole Method for particle interactions: an open source parallel library component

High-Lift Systems. High Lift Systems -- Introduction. Flap Geometry. Outline of this Chapter

Application of CFD Simulation in the Design of a Parabolic Winglet on NACA 2412

High-Order CFD Methods: Current Status and Perspective

A CFD-BEM coupling technique for low Mach number flow induced noise

NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES

CFD simulations using an AMR-like approach in the PDE Framework Peano

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Spatial Discretisation Schemes in the PDE framework Peano for Fluid-Structure Interactions

CFD and wave and current induced loads on offshore wind turbines

Aeroacoustic Analogy for the Computation of Aeroacoustic Fields in Partially Closed Domains

Mesh Moving Techniques for Fluid-Structure Interactions With Large Displacements

CFD and EXPERIMENTAL ANALYSIS of VORTEX SHEDDING BEHIND D-SHAPED CYLINDER

Part II: Finite Difference/Volume Discretisation for CFD

Using Computational Fluid Dynamics for Aerodynamics Antony Jameson and Massimiliano Fatica Stanford University

HPC Deployment of OpenFOAM in an Industrial Setting

AN EFFECT OF GRID QUALITY ON THE RESULTS OF NUMERICAL SIMULATIONS OF THE FLUID FLOW FIELD IN AN AGITATED VESSEL

LS-DYNA Scalability on Cray Supercomputers. Tin-Ting Zhu, Cray Inc. Jason Wang, Livermore Software Technology Corp.

Development of Numerical Algorithm Based on a Modified Equation of Fluid Motion with Application to Turbomachinery Flow

Large-Scale Reservoir Simulation and Big Data Visualization

CS Software Engineering for Scientific Computing. Lecture 16: Particle Methods; Homework #4

Aeronautical Testing Service, Inc th DR NE Arlington, WA USA. CFD and Wind Tunnel Testing: Complimentary Methods for Aircraft Design

CFD ANALYSIS OF CONTROLLABLE PITCH PROPELLER USED IN MARINE VEHICLE

Overset Grids Technology in STAR-CCM+: Methodology and Applications

Multiphase Flow - Appendices

CFD MODELLING OF TOP SUBMERGED LANCE GAS INJECTION

Performance Comparison of a Vertical Axis Wind Turbine using Commercial and Open Source Computational Fluid Dynamics based Codes

DYNAMIC LOAD BALANCING APPLICATIONS ON A HETEROGENEOUS UNIX/NT CLUSTER

Contents. Microfluidics - Jens Ducrée Physics: Navier-Stokes Equation 1

Modelling and Computation of Compressible Liquid Flows with Phase Transition

Experiences Extending the CFD Solver of the PDE Framework Peano

Overset and Adaptive Meshes for Stabilized Finite-Element Scheme

Performance of the JMA NWP models on the PC cluster TSUBAME.

Aeroelastic Investigation of the Sandia 100m Blade Using Computational Fluid Dynamics

AERODYNAMIC ANALYSIS OF BLADE 1.5 KW OF DUAL ROTOR HORIZONTAL AXIS WIND TURBINE

Differential Relations for Fluid Flow. Acceleration field of a fluid. The differential equation of mass conservation

A Load Balancing Tool for Structured Multi-Block Grid CFD Applications

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi

GPUs for Scientific Computing

DESIGN OF THE MODERN FAMILY OF HELICOPTER AIRFOILS 51

The Influence of Aerodynamics on the Design of High-Performance Road Vehicles

Dimensional Analysis

Current Status and Challenges in CFD at the DLR Institute of Aerodynamics and Flow Technology

Transcription:

Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster Mark J. Stock, Ph.D., Adrin Gharakhani, Sc.D. Applied Scientific Research, Santa Ana, CA Christopher P. Stone, Ph.D. Computational Science & Engineering, Seattle, WA Funding: AMRDEC SBIR Phase I Grant, NSF grant via Teragrid, and NCRR award

Outline Motivation physical and numerical Method Lagrangian vortex particle methods Hybrid coupling scheme Test simulations

Motivation Rotor wakes are long-lived Blade interaction causes noise, loss of lift Obstructions make motion unpredictable Understanding vortex ring state http://flickr.com/photos/93995449@n00/1223845347 http://www.web.mek.dtu.dk/staff/jhw/projekter.htm

Eulerian CFD Advantages High-order methods Body-fitted grids with high aspect ratio Generality Disadvantages Numerical diffusion Gridding effort Start-up Far-field BCs

Lagrangian CFD Advantages Numerical diffusion can be explicit (not limited by CFL) Lower dimension grids Efficient use of computational elements/far-field BCs automatic Continuity preserved by construction No start-up problems Disadvantages Computationally expensive Largely incompressible Spatial adaptation difficult

Hybrid solution Euler solver near boundaries Capture boundary layer Proper vorticity creation at boundary Lagrangian solver in wake Maintain vorticity in wake No need for overset grids to far-field

OVERFLOW 2.1ae Developed over 20 years by P.G. Buning et al. Fully-compressible Navier-Stokes Eulerian solver Chimera overset structured grids Supports multi-threading via OpenMP Distributed parallelism via MPI

Ω-Flow v2 Developed over 10 years by A. Gharakhani et al. at ASR Multipole-accelerated treecode N-body solver Multipole-accelerated BEM solver Supports multi-threading via OpenMP Distributed parallelism via MPI Accelerates computations on capable GPUs See our other paper: A GPU-Accelerated Boundary Element Method and Vortex Particle Method, AIAA-2010-5099

Lagrangian vortex methods Navier-Stokes equations in vorticity Dω Dt = ω u + ν 2 ω Discretized onto Lagrangian particles N v ω( x, t) = i=1 Γ i = ω i V i Γ i (t) φ σ ( x x i )

Lagrangian vortex methods Helmholtz integral formula u( x) = U + V + S V ω ( x ) G( x, x ) dv ( x ) θ( x ) G( x, x ) dv ( x ) ( γ ( x ) + ˆn( x ) u( x ) ) G( x, x ) ds( x ) S (ˆn( x ) u( x ) ) G( x, x ) ds( x )

Lagrangian vortex methods Inviscid flow x i t = ū i Γ i t = Γ i ū i Viscous diffusion via VRM (Shankar and van Dommelen, 1996) Γ i t = ν 2 Γi Anisotropic SGS dissipation (Cottet et al. 1996, 2003)

Faster summations Barnes and Hut Treecode (1986), Greengard-Rokhlin FMM (1987) Distributed-memory parallelization (Salmon 1991) Data structures organized to prevent cache misses Real spherical multipoles cast to Cartesian for far-field

Faster summations Wall-clock time (s) 100000 10000 1000 100 10 1 GPU vs. CPU treecode performance 4-cores Phenom 8-cores Xeon 4-cores Opt. + 8800 GTX 8-cores Xeon + 2xTesla 2012 s 1495 s 34.51 s for 10M vortons 0.1 0.01 10000 100000 1e+06 1e+07 1e+08 Number of vortons, N v

Hybrid scheme Eulerian domain Requires only near-body grids, 10-20 particle separations wide Outer BC will be set by Lagrangian solution Lagrangian domain Particles exist throughout Eulerian domain Particle strengths are only modified when they are the middle of the Eulerian domain---away from either boundary BEM solution accounts for near-body vorticity

Hybrid method - procedure 1) Interpolate vorticity from Eulerian grid to particles a) Prepare a Cartesian grid covering the Euler domain b) Interpolate ω from all Euler grid cells not within Δx of body c) Mask out particles outside Euler region, too close to body, and too close to outer Euler boundary (2-3 Δx) d) Reset those particles strengths from this field 2) Advance Lagrangian solution, fill gaps 3) Compute velocities at outer Euler boundary nodes 4) Complete Euler BCs assuming constant ρ 5) Advance Euler solution

Hybrid method - interpolation

Hybrid method - interpolation

Hybrid method - compressibility Currently Lagrangian regime is incompressible Euler BCs are set using: ρ = const e = 1 γ(γ 1) + M2 2

Test cases Sphere, Re=100 Finite wing, 6.6:1 AR 4-bladed rotor (no hub), 0.23 advance ratio

Test systems Nodes cores per node GPUs GPU PEs/ node Local 1 4 @ 2.5GHz 2x GTX 275 480 @ 1.5 GHz Lincoln 8 8 @ 2.33 GHz 1/2x Tesla S1070 480 @ 1.44 GHz gfortran 4.3.2, OpenMP, CUDA 2.2, and OpenMPI

Sphere at ReD=100

Sphere at ReD=100 Primary Δx Δt Newton subiterations Euler grid nodes Full OVERFLOW 0.03D to x/d=3.5 0.025 5 4.12M Hybrid 0.0447D 0.05 / 0.025 5 1.41M plus 1M particles

Sphere at ReD=100

Sphere at ReD=100 0.8 OVERFLOW Hybrid Streamwise velocity, u/u! 0.6 0.4 0.2 0-0.2 1 2 3 4 5 6 Streamwise position, x/d

6.6:1 AR wing at 12 NACA 0015 section 6.6 AR regular planform at α =12 Flat tips M = 0.17, Re = 1,500,000 Similar to McAlister & Takahashi, NASA TP-3151, 1991.

6.6:1 AR wing at 12 Pure OVERFLOW

6.6:1 AR wing at 12 Primary Δx Δt Newton subiterations Euler grid nodes Full OVERFLOW 0.04c to x/c=8 0.05 12 28.5M Hybrid 0.04c 0.05 12* 4.3M plus 1M particles

6.6:1 AR wing at 12 Hybrid method

6.6:1 AR wing at 12 Hybrid method, ω magnitude 0.3, 0.5

6.6:1 AR wing at 12 Vertical velocity, w/u 0.5 0-0.5 x-x tip = 0.2c 0.5 0-0.5 x-x tip = 1c Experiments OVERFLOW, C-grid Hybrid, inviscid, O-grid 0 0.5 0 0.5 x-x tip = 2c x-x tip = 4c Vertical velocity, w/u! 0.5 0-0.5 0.5 0-0.5 0 0.5 Spanwise position, (y tip -y)/c 0 0.5 Spanwise position, (y tip -y)/c

4-Bladed isolated rotor NACA 0012 section 10.13 AR regular planform, root is 3.17c from hub Flat tips No cyclic pitch Twist is 0 at hub, 8 at tip M = 0.124, Re = 860,500 Advance ratio = 0.2265, α = -3.04 Similar to Elliot, Althoff, Saily, NASA TM-100542, 1988.

4-Bladed isolated rotor Near-body grid for both cases

4-Bladed isolated rotor Primary Δx Δt Newton subiterations Euler grid nodes Full OVERFLOW 0.06c to x/c=26 1/2 per step 9 46.2M Hybrid 0.06c 1 per step 10* 11.7M plus 55M particles

4-Bladed isolated rotor 720 Pure OVERFLOW Hybrid method

4-Bladed isolated rotor 720 Hybrid method

4-Bladed isolated rotor 720 Pure OVERFLOW

4-Bladed isolated rotor 900 Hybrid method

4-Bladed isolated rotor

4-Bladed isolated rotor

4-Bladed isolated rotor 970 Hybrid method

4-Bladed isolated rotor 970 Hybrid method

Conclusions Hybrid Lagrangian-Eulerian method created in short time Wake maintains all benefits of Vortex Particle Method Near-body retains all benefits of Eulerian overset grid method Coupling between regimes is fully two-way Connected GPU devices allow ~1 TFLOP/s per node

Thank you AIAA AMRDEC, NSF, NCRR You, for your attention Please see our other talk: A GPU-Accelerated Boundary Element Method and Vortex Particle Method AIAA-2010-5099, Thursday, 1100 hrs, room 23B http://www.applied-scientific.com/