Microscopic and mesoscopic simulations of amorphous systems using LAMMPS and GPU-based algorithms
|
|
|
- Jonah Cox
- 10 years ago
- Views:
Transcription
1 Microscopic and mesoscopic simulations of amorphous systems using LAMMPS and GPU-based algorithms E. Ferrero and F. Puosi Laboratoire Interdisciplinaire de Physique (LIPhy), UJF and CNRS 14/5/214 Journée des utilisateurs CIMENT
2 Amorphous materials Hard glasses Soft glasses Metallic/oxide glasses Polymers Colloids Foams Length scale 1 nm Length scale.1 μm Particles are arranged in a disordered way, like in liquids, but they are jammed like in solids Flow of amorphous media is an outstanding question in rheology and plasticity!2
3 Length scales in the deformation of amorphous systems Rodney et al, MSMSE (211) Microscopic scale: scale of individual objects (droplets, colloids, particles). Dynamics is extremely complex and numerically highly time consuming at this scale. Mesoscopic scale: scale where the elementary local plastic events that constitute the flow of amorphous materials do occur. Macroscopic scale: scale relevant to engineering problems. Phenomenological laws describe the flow behavior.!3
4 Length scales in the deformation of amorphous systems Rodney et al, MSMSE (211) 1st part by F. Puosi Microscopic scale: scale of individual objects (droplets, colloids, particles). Dynamics is extremely complex and numerically highly time consuming at this scale. Mesoscopic scale: scale where the elementary local plastic events that constitute the flow of amorphous materials do occur. 2nd part by E. Ferrero Macroscopic scale: scale relevant to engineering problems. Phenomenological laws describe the flow behavior.!4
5 Molecular Dynamics (MD) basics N particles (atoms, groups of atoms, meso-particles) in a box (rigid walls, periodic boundary conditions) All physics in energy potential (forces) - pair-wise interactions (LJ, Coulombic), many-body forces (EAM, Tersoff, REBO), molecular forces (springs, torsions) Integrate Newton s equation of motion Velocity-Verlet formulation: - update V by1/2 step (using F) - update X (using V) - build neighbor lists (occasionally) - compute F (using X) - apply constraints & boundary conditions - update V by1/2 step (using new F) - output and diagnostics CPU time break-down: - inter-particle forces = 8%! - neighbor lists = 15% - everything else = 5% Properties via time-averaging ensemble configurations!5
6 What is LAMMPS Large-scale Atomic/Molecular Massively Parallel Simulator! Classical MD code Open source (GPL), highly portable C++ Simulations at varying length and time scales: electrons atomistic corse-grained continuum Spatial-decomposition of simulation domain for parallelism GPU and OpenMP enhanced Can be coupled to other scales: QM, kmc, FE!6
7 LAMMPS on Froggy Benchmark: atomic fluid 32 atoms for 1 time-steps reduced density.8442 (liquid) force cutoff 2.5 sigma ( 2.5 diameters) neighbor skin.3 sigma (neighbor atom 55) NVE time integration 1 Parallel Efficiency (%) FROGGY Cray XT5 Cray XT3 Dual hex-core Xeon IBM BG/L IBM p69+ Xeon/Myrinet cluster Processors!7
8 GPU acceleration in LAMMPS USER-CUDA package! GPU version of pair styles, fixes and computes an entire LAMMPS calculation run entirely on the GPU for many time steps only one core per GPU better speed-up if the number of atom per GPU is large Milions of atom-timestep / second atomic fluid 1GPU (mixed) 2 GPUs (mixed) CPU (1 core) 1 k 1 k 1 k 1 M 1 M Atoms!8
9 GLASSDEF project Microscopic aspects: Parallel Replica Dynamics in glasses: a method to extend timescales accessible to Molecular Dynamics simulation (FP, D. Rodney and J.-L. Barrat) Plasticity in 2D sheared amorphous solids: investigation of the spatio temporal correlations of plastic activity in athermal amorphous solids (FP, A. Nicolas, J. Rottler and J.-L. Barrat) Microscopic test of a mean field model for the flow of amorphous systems (FP, J. Olivier, K. Martens and J.-L. Barrat) Avalanches in amorphous systems: scaling description of avalanches in the shear flow of athermal amorphous solids at finite shear rates (FP, E. Ferrero, C. Liu, L. Marradi, K. Martens, A. Nicolas and J.-L. Barrat)!9
10 Elementary plastic events At low temperature, the onset of plastic deformation in glasses is due to the accumulation of elementary plastic events, consisting of localized in space and time atomic rearrangements involving only a few tens of atoms, the so-called Shear Transformations (STs) Tanguy et al., EPJE (26) Open question: How do the elastic response to a ST build in time?!1
11 Fictitious STs in amorphous solids t=1 t= D binary mixture of athermal LennardJones particles. We apply an instantaneous local shear transformation and we observe the response of the system t=5 5 1 t= t= Disorder average of the response: average over different spatial realizations of the ST t=1 5 1 t= t= F. Puosi, J. Rottler and J.-L. Barrat, PRE (214)!11
12 Comparison with Continuum Elasticity Theory Equilibrium response: Eshelby inclusion problem Simulations Theory Transient regime: solution of a diffusion equation for t h e d i s p l a c e m e n t fi e l d i n a n h o m o g e n e o u s m e d i u m i n t h e presence of a perturbation due to a set of two force dipoles in the origin. <u r (r, = /4, t)> t = r F. Puosi, J. Rottler and J.-L. Barrat, PRE (214)!12
13 ... a mesoscopic approximation A 2D scalar field for the local stress propagator local in fourier space plastic strain change: Case of thermally activated events Plastic activation Elastic recovery + (eventually) independent tracers moving according to the resulting deformation fields.
14 Workflow (physical protocol) InitializeEverything(); for (i=; i<t_max; i++){ UpdateStateVariablesActivated(); ComputePlasticStrainChange(); TransformToFourierSpace(); Convolution(); #ifdef TRACERS CalculateDisplacementField(); #endif AntitransformFromFourierSpace(); EulerIntegrationStep(); #ifdef TRACERS UpdateTracersPositions(); #endif if (condition(i)){ DoSomeMeasurementsAndAccumulate(); } } PrintResults(); CUDA_Kernels, cudamemset CUDA_Kernel CUDA_Kernel or Thrust call cufft CUDA_Kernel CUDA_Kernel or Thrust call cufft CUDA_Kernel CUDA_Kernel Thrust calls + C host code C host code
15 CUDA basic kernel example Euler integration step Kernel Call pre-computed in Fourier space
16 cufft library call Plan declaration Plan usage
17 CUDA kernel RNG use Philox: A counter-based RNG from the Random 123 library (211) Fast, easy to parallelize, use minimal memory/cache resources (ideal for GPU), require very little code. Have passed all of the rigorous SmallCrush, Crush and BigCrush tests in the extensive TestU1 suite.
18 Thrust Library basic usage High-Level Parallel Algorithms Library Parallel Analog of the C++ Standard Template Library (STL) Portability and Interoperability (CUDA, TBB, OpenMP,...) (REAL is either float or double)
19 Thrust Library advanced usage
20 Profiling with NVVP
21 Performance Speedups ~ 5x ( CPU code not optimal ) A lot more to exploit, starting with CPU-GPU concurrency Unexpected similarity between Fermi and Kepler GPUs - cuda 6. vs cuda 5.? - ECC enabled/disabled? - very low occupancy in both cases?
22 Froggy use Nice nodes, nice GPUs, clear tutorials, everything runs smoothly. GPUs are evolving fast. Opportunity to upgrade the cluster compute power by replacing ONLY the accelerator. Oar GPU target can perhaps be improved. - Possible cross access socket-gpu when CUDA_SET_DEVICE() is used by the application. - Allow possibility of targeting the same GPU with many processes. timesharing=user,* - In general 1CPU+1GPU jobs --> we waste 15CPUs. A GPU process should block the full node if it wants to have exclusive access to the GPU. Annoying, but necessary thing: Max Wall-time = 96hs - Stop/Restart coding, is difficult when we compute many things on-the-fly. - Automatic checkpoints (BLCR like...)? Is it possible for a GPU context?
LIGGGHTS OPEN SOURCE DEM: COUPLING TO DNS OF TURBULENT CHANNEL FLOW
LIGGGHTS OPEN SOURCE DEM: COUPLING TO DNS OF TURBULENT CHANNEL FLOW 6th ERCOFTAC SIG43 Workshop, Udine, October 2013 Daniel Queteschiner 1, Christoph Kloss 1, Stefan Pirker 1 Cristian Marchioli 2, Alfredo
Data Visualization for Atomistic/Molecular Simulations. Douglas E. Spearot University of Arkansas
Data Visualization for Atomistic/Molecular Simulations Douglas E. Spearot University of Arkansas What is Atomistic Simulation? Molecular dynamics (MD) involves the explicit simulation of atomic scale particles
Overview of NAMD and Molecular Dynamics
Overview of NAMD and Molecular Dynamics Jim Phillips Low-cost Linux Clusters for Biomolecular Simulations Using NAMD Outline Overview of molecular dynamics Overview of NAMD NAMD parallel design NAMD config
walberla: A software framework for CFD applications on 300.000 Compute Cores
walberla: A software framework for CFD applications on 300.000 Compute Cores J. Götz (LSS Erlangen, [email protected]), K. Iglberger, S. Donath, C. Feichtinger, U. Rüde Lehrstuhl für Informatik 10 (Systemsimulation)
LAMMPS Developer Guide 23 Aug 2011
LAMMPS Developer Guide 23 Aug 2011 This document is a developer guide to the LAMMPS molecular dynamics package, whose WWW site is at lammps.sandia.gov. It describes the internal structure and algorithms
walberla: A software framework for CFD applications
walberla: A software framework for CFD applications U. Rüde, S. Donath, C. Feichtinger, K. Iglberger, F. Deserno, M. Stürmer, C. Mihoubi, T. Preclic, D. Haspel (all LSS Erlangen), N. Thürey (LSS Erlangen/
GPU System Architecture. Alan Gray EPCC The University of Edinburgh
GPU System Architecture EPCC The University of Edinburgh Outline Why do we want/need accelerators such as GPUs? GPU-CPU comparison Architectural reasons for GPU performance advantages GPU accelerated systems
Graphics Cards and Graphics Processing Units. Ben Johnstone Russ Martin November 15, 2011
Graphics Cards and Graphics Processing Units Ben Johnstone Russ Martin November 15, 2011 Contents Graphics Processing Units (GPUs) Graphics Pipeline Architectures 8800-GTX200 Fermi Cayman Performance Analysis
Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code
Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code F. Rossi, S. Sinigardi, P. Londrillo & G. Turchetti University of Bologna & INFN GPU2014, Rome, Sept 17th
4 Microscopic dynamics
4 Microscopic dynamics In this section we will look at the first model that people came up with when they started to model polymers from the microscopic level. It s called the Oldroyd B model. We will
HP ProLiant SL270s Gen8 Server. Evaluation Report
HP ProLiant SL270s Gen8 Server Evaluation Report Thomas Schoenemeyer, Hussein Harake and Daniel Peter Swiss National Supercomputing Centre (CSCS), Lugano Institute of Geophysics, ETH Zürich [email protected]
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Qingyu Meng, Alan Humphrey, Martin Berzins Thanks to: John Schmidt and J. Davison de St. Germain, SCI Institute Justin Luitjens
Simulation Platform Overview
Simulation Platform Overview Build, compute, and analyze simulations on demand www.rescale.com CASE STUDIES Companies in the aerospace and automotive industries use Rescale to run faster simulations Aerospace
Turbomachinery CFD on many-core platforms experiences and strategies
Turbomachinery CFD on many-core platforms experiences and strategies Graham Pullan Whittle Laboratory, Department of Engineering, University of Cambridge MUSAF Colloquium, CERFACS, Toulouse September 27-29
Overview of HPC Resources at Vanderbilt
Overview of HPC Resources at Vanderbilt Will French Senior Application Developer and Research Computing Liaison Advanced Computing Center for Research and Education June 10, 2015 2 Computing Resources
A Case Study - Scaling Legacy Code on Next Generation Platforms
Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 00 (2015) 000 000 www.elsevier.com/locate/procedia 24th International Meshing Roundtable (IMR24) A Case Study - Scaling Legacy
CBE 6333, R. Levicky 1 Review of Fluid Mechanics Terminology
CBE 6333, R. Levicky 1 Review of Fluid Mechanics Terminology The Continuum Hypothesis: We will regard macroscopic behavior of fluids as if the fluids are perfectly continuous in structure. In reality,
Accelerating CFD using OpenFOAM with GPUs
Accelerating CFD using OpenFOAM with GPUs Authors: Saeed Iqbal and Kevin Tubbs The OpenFOAM CFD Toolbox is a free, open source CFD software package produced by OpenCFD Ltd. Its user base represents a wide
Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster
Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster Aaron Hagan and Ye Zhao Kent State University Abstract. In this paper, we propose an inherent parallel scheme for 3D image segmentation
Lecture 24 - Surface tension, viscous flow, thermodynamics
Lecture 24 - Surface tension, viscous flow, thermodynamics Surface tension, surface energy The atoms at the surface of a solid or liquid are not happy. Their bonding is less ideal than the bonding of atoms
NVIDIA Tesla K20-K20X GPU Accelerators Benchmarks Application Performance Technical Brief
NVIDIA Tesla K20-K20X GPU Accelerators Benchmarks Application Performance Technical Brief NVIDIA changed the high performance computing (HPC) landscape by introducing its Fermibased GPUs that delivered
Case Study on Productivity and Performance of GPGPUs
Case Study on Productivity and Performance of GPGPUs Sandra Wienke [email protected] ZKI Arbeitskreis Supercomputing April 2012 Rechen- und Kommunikationszentrum (RZ) RWTH GPU-Cluster 56 Nvidia
High Performance Computing in CST STUDIO SUITE
High Performance Computing in CST STUDIO SUITE Felix Wolfheimer GPU Computing Performance Speedup 18 16 14 12 10 8 6 4 2 0 Promo offer for EUC participants: 25% discount for K40 cards Speedup of Solver
Basic Principles in Microfluidics
Basic Principles in Microfluidics 1 Newton s Second Law for Fluidics Newton s 2 nd Law (F= ma) : Time rate of change of momentum of a system equal to net force acting on system!f = dp dt Sum of forces
PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms
PyFR: Bringing Next Generation Computational Fluid Dynamics to GPU Platforms P. E. Vincent! Department of Aeronautics Imperial College London! 25 th March 2014 Overview Motivation Flux Reconstruction Many-Core
Next Generation GPU Architecture Code-named Fermi
Next Generation GPU Architecture Code-named Fermi The Soul of a Supercomputer in the Body of a GPU Why is NVIDIA at Super Computing? Graphics is a throughput problem paint every pixel within frame time
Molecular simulation of fluid dynamics on the nanoscale
VI. International Conference on Computational Fluid Dynamics Molecular simulation of fluid dynamics on the nanoscale St. Petersburg, July 16, 2010 M. Horsch, Y.-Tz. Hsiang, Z. Liu, S. K. Miroshnichenko,
Building Platform as a Service for Scientific Applications
Building Platform as a Service for Scientific Applications Moustafa AbdelBaky [email protected] Rutgers Discovery Informa=cs Ins=tute (RDI 2 ) The NSF Cloud and Autonomic Compu=ng Center Department
Adaptive Resolution Molecular Dynamics Simulation (AdResS): Changing the Degrees of Freedom on the Fly
Adaptive Resolution Molecular Dynamics Simulation (AdResS): Changing the Degrees of Freedom on the Fly Luigi Delle Site Max Planck Institute for Polymer Research Mainz Leipzig December 2007 Collaborators
Turbulence, Heat and Mass Transfer (THMT 09) Poiseuille flow of liquid methane in nanoscopic graphite channels by molecular dynamics simulation
Turbulence, Heat and Mass Transfer (THMT 09) Poiseuille flow of liquid methane in nanoscopic graphite channels by molecular dynamics simulation Sapienza Università di Roma, September 14, 2009 M. T. HORSCH,
Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms
Mixed Precision Iterative Refinement Methods Energy Efficiency on Hybrid Hardware Platforms Björn Rocker Hamburg, June 17th 2010 Engineering Mathematics and Computing Lab (EMCL) KIT University of the State
Lecture 16 - Free Surface Flows. Applied Computational Fluid Dynamics
Lecture 16 - Free Surface Flows Applied Computational Fluid Dynamics Instructor: André Bakker http://www.bakker.org André Bakker (2002-2006) Fluent Inc. (2002) 1 Example: spinning bowl Example: flow in
Interpretation of density profiles and pair correlation functions
Interpretation of density profiles and pair correlation functions Martin Horsch and Cemal Engin Paderborn, 22 nd June 12 IV. Annual Meeting of the Boltzmann Zuse Society Surface tension of nanodroplets
Chapter Outline. Mechanical Properties of Metals How do metals respond to external loads?
Mechanical Properties of Metals How do metals respond to external loads? Stress and Strain Tension Compression Shear Torsion Elastic deformation Plastic Deformation Yield Strength Tensile Strength Ductility
Hardware design for ray tracing
Hardware design for ray tracing Jae-sung Yoon Introduction Realtime ray tracing performance has recently been achieved even on single CPU. [Wald et al. 2001, 2002, 2004] However, higher resolutions, complex
Medical Image Processing on the GPU. Past, Present and Future. Anders Eklund, PhD Virginia Tech Carilion Research Institute [email protected].
Medical Image Processing on the GPU Past, Present and Future Anders Eklund, PhD Virginia Tech Carilion Research Institute [email protected] Outline Motivation why do we need GPUs? Past - how was GPU programming
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC Driving industry innovation The goal of the OpenPOWER Foundation is to create an open ecosystem, using the POWER Architecture to share expertise,
Overview on Modern Accelerators and Programming Paradigms Ivan Giro7o [email protected]
Overview on Modern Accelerators and Programming Paradigms Ivan Giro7o [email protected] Informa(on & Communica(on Technology Sec(on (ICTS) Interna(onal Centre for Theore(cal Physics (ICTP) Mul(ple Socket
Seeking Opportunities for Hardware Acceleration in Big Data Analytics
Seeking Opportunities for Hardware Acceleration in Big Data Analytics Paul Chow High-Performance Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Toronto Who
Introduction to Solid Modeling Using SolidWorks 2012 SolidWorks Simulation Tutorial Page 1
Introduction to Solid Modeling Using SolidWorks 2012 SolidWorks Simulation Tutorial Page 1 In this tutorial, we will use the SolidWorks Simulation finite element analysis (FEA) program to analyze the response
Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows
Scalable coupling of Molecular Dynamics (MD) and Direct Numerical Simulation (DNS) of multi-scale flows Lucian Anton 1, Edward Smith 2 1 Numerical Algorithms Group Ltd, Wilkinson House, Jordan Hill Road,
Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms
Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Amani AlOnazi, David E. Keyes, Alexey Lastovetsky, Vladimir Rychkov Extreme Computing Research Center,
HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK
HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK Steve Oberlin CTO, Accelerated Computing US to Build Two Flagship Supercomputers SUMMIT SIERRA Partnership for Science 100-300 PFLOPS Peak Performance
A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS
A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS SUDHAKARAN.G APCF, AERO, VSSC, ISRO 914712564742 [email protected] THOMAS.C.BABU APCF, AERO, VSSC, ISRO 914712565833
Chapter Outline Dislocations and Strengthening Mechanisms
Chapter Outline Dislocations and Strengthening Mechanisms What is happening in material during plastic deformation? Dislocations and Plastic Deformation Motion of dislocations in response to stress Slip
ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU
Computer Science 14 (2) 2013 http://dx.doi.org/10.7494/csci.2013.14.2.243 Marcin Pietroń Pawe l Russek Kazimierz Wiatr ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU Abstract This paper presents
1. Fluids Mechanics and Fluid Properties. 1.1 Objectives of this section. 1.2 Fluids
1. Fluids Mechanics and Fluid Properties What is fluid mechanics? As its name suggests it is the branch of applied mechanics concerned with the statics and dynamics of fluids - both liquids and gases.
Accelerating Simulation & Analysis with Hybrid GPU Parallelization and Cloud Computing
Accelerating Simulation & Analysis with Hybrid GPU Parallelization and Cloud Computing Innovation Intelligence Devin Jensen August 2012 Altair Knows HPC Altair is the only company that: makes HPC tools
Fluid Mechanics: Static s Kinematics Dynamics Fluid
Fluid Mechanics: Fluid mechanics may be defined as that branch of engineering science that deals with the behavior of fluid under the condition of rest and motion Fluid mechanics may be divided into three
HPC with Multicore and GPUs
HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville CS 594 Lecture Notes March 4, 2015 1/18 Outline! Introduction - Hardware
Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga
Programming models for heterogeneous computing Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Talk outline [30 slides] 1. Introduction [5 slides] 2.
Experiences on using GPU accelerators for data analysis in ROOT/RooFit
Experiences on using GPU accelerators for data analysis in ROOT/RooFit Sverre Jarp, Alfio Lazzaro, Julien Leduc, Yngve Sneen Lindal, Andrzej Nowak European Organization for Nuclear Research (CERN), Geneva,
Chapter Outline. Diffusion - how do atoms move through solids?
Chapter Outline iffusion - how do atoms move through solids? iffusion mechanisms Vacancy diffusion Interstitial diffusion Impurities The mathematics of diffusion Steady-state diffusion (Fick s first law)
Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age
Scalable and High Performance Computing for Big Data Analytics in Understanding the Human Dynamics in the Mobile Age Xuan Shi GRA: Bowei Xue University of Arkansas Spatiotemporal Modeling of Human Dynamics
Physics 9e/Cutnell. correlated to the. College Board AP Physics 1 Course Objectives
Physics 9e/Cutnell correlated to the College Board AP Physics 1 Course Objectives Big Idea 1: Objects and systems have properties such as mass and charge. Systems may have internal structure. Enduring
NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist
NVIDIA CUDA Software and GPU Parallel Computing Architecture David B. Kirk, Chief Scientist Outline Applications of GPU Computing CUDA Programming Model Overview Programming in CUDA The Basics How to Get
Chapter Outline Dislocations and Strengthening Mechanisms
Chapter Outline Dislocations and Strengthening Mechanisms What is happening in material during plastic deformation? Dislocations and Plastic Deformation Motion of dislocations in response to stress Slip
DISTANCE DEGREE PROGRAM CURRICULUM NOTE:
Bachelor of Science in Electrical Engineering DISTANCE DEGREE PROGRAM CURRICULUM NOTE: Some Courses May Not Be Offered At A Distance Every Semester. Chem 121C General Chemistry I 3 Credits Online Fall
Software Development around a Millisecond
Introduction Software Development around a Millisecond Geoffrey Fox In this column we consider software development methodologies with some emphasis on those relevant for large scale scientific computing.
Size effects. Lecture 6 OUTLINE
Size effects 1 MTX9100 Nanomaterials Lecture 6 OUTLINE -Why does size influence the material s properties? -How does size influence the material s performance? -Why are properties of nanoscale objects
HIGH PERFORMANCE BIG DATA ANALYTICS
HIGH PERFORMANCE BIG DATA ANALYTICS Kunle Olukotun Electrical Engineering and Computer Science Stanford University June 2, 2014 Explosion of Data Sources Sensors DoD is swimming in sensors and drowning
5x in 5 hours Porting SEISMIC_CPML using the PGI Accelerator Model
5x in 5 hours Porting SEISMIC_CPML using the PGI Accelerator Model C99, C++, F2003 Compilers Optimizing Vectorizing Parallelizing Graphical parallel tools PGDBG debugger PGPROF profiler Intel, AMD, NVIDIA
Recent and Future Activities in HPC and Scientific Data Management Siegfried Benkner
Recent and Future Activities in HPC and Scientific Data Management Siegfried Benkner Research Group Scientific Computing Faculty of Computer Science University of Vienna AUSTRIA http://www.par.univie.ac.at
Optimizing a 3D-FWT code in a cluster of CPUs+GPUs
Optimizing a 3D-FWT code in a cluster of CPUs+GPUs Gregorio Bernabé Javier Cuenca Domingo Giménez Universidad de Murcia Scientific Computing and Parallel Programming Group XXIX Simposium Nacional de la
Molecular Dynamics Simulations
Molecular Dynamics Simulations Yaoquan Tu Division of Theoretical Chemistry and Biology, Royal Institute of Technology (KTH) 2011-06 1 Outline I. Introduction II. Molecular Mechanics Force Field III. Molecular
Graduate Courses in Mechanical Engineering
Graduate Courses in Mechanical Engineering MEEG 501 ADVANCED MECHANICAL ENGINEERING ANALYSIS An advanced, unified approach to the solution of mechanical engineering problems, with emphasis on the formulation
Parallel Algorithm Engineering
Parallel Algorithm Engineering Kenneth S. Bøgh PhD Fellow Based on slides by Darius Sidlauskas Outline Background Current multicore architectures UMA vs NUMA The openmp framework Examples Software crisis
Introduction to GP-GPUs. Advanced Computer Architectures, Cristina Silvano, Politecnico di Milano 1
Introduction to GP-GPUs Advanced Computer Architectures, Cristina Silvano, Politecnico di Milano 1 GPU Architectures: How do we reach here? NVIDIA Fermi, 512 Processing Elements (PEs) 2 What Can It Do?
Pore pressure. Ordinary space
Fault Mechanics Laboratory Pore pressure scale Lowers normal stress, moves stress circle to left Doesn Doesn t change shear Deviatoric stress not affected This example: failure will be by tensile cracks
Texture Cache Approximation on GPUs
Texture Cache Approximation on GPUs Mark Sutherland Joshua San Miguel Natalie Enright Jerger {suther68,enright}@ece.utoronto.ca, [email protected] 1 Our Contribution GPU Core Cache Cache
Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61
F# Applications to Computational Financial and GPU Computing May 16th Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 Today! Why care about F#? Just another fashion?! Three success stories! How Alea.cuBase
NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES
Vol. XX 2012 No. 4 28 34 J. ŠIMIČEK O. HUBOVÁ NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES Jozef ŠIMIČEK email: [email protected] Research field: Statics and Dynamics Fluids mechanics
Benchmark Tests on ANSYS Parallel Processing Technology
Benchmark Tests on ANSYS Parallel Processing Technology Kentaro Suzuki ANSYS JAPAN LTD. Abstract It is extremely important for manufacturing industries to reduce their design process period in order to
Heat Transfer and Thermal-Stress Analysis with Abaqus
Heat Transfer and Thermal-Stress Analysis with Abaqus 2016 About this Course Course objectives Upon completion of this course you will be able to: Perform steady-state and transient heat transfer simulations
Evaluation of CUDA Fortran for the CFD code Strukti
Evaluation of CUDA Fortran for the CFD code Strukti Practical term report from Stephan Soller High performance computing center Stuttgart 1 Stuttgart Media University 2 High performance computing center
Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi
Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi ICPP 6 th International Workshop on Parallel Programming Models and Systems Software for High-End Computing October 1, 2013 Lyon, France
ACCELERATING COMMERCIAL LINEAR DYNAMIC AND NONLINEAR IMPLICIT FEA SOFTWARE THROUGH HIGH- PERFORMANCE COMPUTING
ACCELERATING COMMERCIAL LINEAR DYNAMIC AND Vladimir Belsky Director of Solver Development* Luis Crivelli Director of Solver Development* Matt Dunbar Chief Architect* Mikhail Belyi Development Group Manager*
Part I Courses Syllabus
Part I Courses Syllabus This document provides detailed information about the basic courses of the MHPC first part activities. The list of courses is the following 1.1 Scientific Programming Environment
Optimizing GPU-based application performance for the HP for the HP ProLiant SL390s G7 server
Optimizing GPU-based application performance for the HP for the HP ProLiant SL390s G7 server Technology brief Introduction... 2 GPU-based computing... 2 ProLiant SL390s GPU-enabled architecture... 2 Optimizing
Lecture 3 Fluid Dynamics and Balance Equa6ons for Reac6ng Flows
Lecture 3 Fluid Dynamics and Balance Equa6ons for Reac6ng Flows 3.- 1 Basics: equations of continuum mechanics - balance equations for mass and momentum - balance equations for the energy and the chemical
CONVERGE Features, Capabilities and Applications
CONVERGE Features, Capabilities and Applications CONVERGE CONVERGE The industry leading CFD code for complex geometries with moving boundaries. Start using CONVERGE and never make a CFD mesh again. CONVERGE
Jean-Pierre Panziera Teratec 2011
Technologies for the future HPC systems Jean-Pierre Panziera Teratec 2011 3 petaflop systems : TERA 100, CURIE & IFERC Tera100 Curie IFERC 1.25 PetaFlops 256 TB ory 30 PB disk storage 140 000+ Xeon cores
Dynamic Process Modeling. Process Dynamics and Control
Dynamic Process Modeling Process Dynamics and Control 1 Description of process dynamics Classes of models What do we need for control? Modeling for control Mechanical Systems Modeling Electrical circuits
Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors
Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors Joe Davis, Sandeep Patel, and Michela Taufer University of Delaware Outline Introduction Introduction to GPU programming Why MD
CE 204 FLUID MECHANICS
CE 204 FLUID MECHANICS Onur AKAY Assistant Professor Okan University Department of Civil Engineering Akfırat Campus 34959 Tuzla-Istanbul/TURKEY Phone: +90-216-677-1630 ext.1974 Fax: +90-216-677-1486 E-mail:
Best practices for efficient HPC performance with large models
Best practices for efficient HPC performance with large models Dr. Hößl Bernhard, CADFEM (Austria) GmbH PRACE Autumn School 2013 - Industry Oriented HPC Simulations, September 21-27, University of Ljubljana,
Finite Element Formulation for Beams - Handout 2 -
Finite Element Formulation for Beams - Handout 2 - Dr Fehmi Cirak (fc286@) Completed Version Review of Euler-Bernoulli Beam Physical beam model midline Beam domain in three-dimensions Midline, also called
Large-Scale Reservoir Simulation and Big Data Visualization
Large-Scale Reservoir Simulation and Big Data Visualization Dr. Zhangxing John Chen NSERC/Alberta Innovates Energy Environment Solutions/Foundation CMG Chair Alberta Innovates Technology Future (icore)
Finite Element Analysis
Finite Element Analysis (MCEN 4173/5173) Instructor: Dr. H. Jerry Qi Fall, 2006 What is Finite Element Analysis (FEA)? -- A numerical method. -- Traditionally, a branch of Solid Mechanics. -- Nowadays,
OpenACC Parallelization and Optimization of NAS Parallel Benchmarks
OpenACC Parallelization and Optimization of NAS Parallel Benchmarks Presented by Rengan Xu GTC 2014, S4340 03/26/2014 Rengan Xu, Xiaonan Tian, Sunita Chandrasekaran, Yonghong Yan, Barbara Chapman HPC Tools
Bytes and BTUs: Holistic Approaches to Data Center Energy Efficiency. Steve Hammond NREL
Bytes and BTUs: Holistic Approaches to Data Center Energy Efficiency NREL 1 National Renewable Energy Laboratory Presentation Road Map A Holistic Approach to Efficiency: Power, Packaging, Cooling, Integration
GPU File System Encryption Kartik Kulkarni and Eugene Linkov
GPU File System Encryption Kartik Kulkarni and Eugene Linkov 5/10/2012 SUMMARY. We implemented a file system that encrypts and decrypts files. The implementation uses the AES algorithm computed through
Cluster Implementation and Management; Scheduling
Cluster Implementation and Management; Scheduling CPS343 Parallel and High Performance Computing Spring 2013 CPS343 (Parallel and HPC) Cluster Implementation and Management; Scheduling Spring 2013 1 /
Working with HPC and HTC Apps. Abhinav Thota Research Technologies Indiana University
Working with HPC and HTC Apps Abhinav Thota Research Technologies Indiana University Outline What are HPC apps? Working with typical HPC apps Compilers - Optimizations and libraries Installation Modules
OpenFOAM Optimization Tools
OpenFOAM Optimization Tools Henrik Rusche and Aleks Jemcov [email protected] and [email protected] Wikki, Germany and United Kingdom OpenFOAM Optimization Tools p. 1 Agenda Objective Review optimisation
