Mathematical Libraries on JUQUEEN. JSC Training Course



Member of the Helmholtz Association
Mathematical Libraries on JUQUEEN
JSC Training Course, May 10, 2012

Outline
- General Information
- Sequential Libraries, planned
- Parallel Libraries and Application Systems:
  - Threaded Libraries
  - MPI-parallel Libraries, planned
- Further Information

General Information: JUQUEEN (I)
- All libraries are installed as modules in /bgsys/local/name
- module avail lists the names of the available libraries
- module help name explains how to use a library
- module load name sets the environment variables *_LIB and *_INCLUDE, to be used in a makefile as -L$(*_LIB) and -I$(*_INCLUDE)
- Link order matters: .o files always come before the libraries, and sometimes a library has to be linked twice
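As a rough illustration of the module workflow above, the following shell sketch turns the variables set by module load into compiler flags outside of a makefile. The function name lib_flags and the example paths are hypothetical, and it is an assumption that a given module sets both NAME_LIB and NAME_INCLUDE; module help name is the authoritative source.

```shell
#!/bin/sh
# Sketch only: build compiler flags from the variables a module is assumed
# to export. On JUQUEEN one would first run, for example:
#     module load lapack/3.3.0_g
# Assumption: the module exports <NAME>_LIB and possibly <NAME>_INCLUDE.

# lib_flags NAME  ->  prints "-I<include dir> -L<lib dir>" for prefix NAME.
# NAME must be a trusted identifier such as LAPACK, because eval is used
# to look up the variable by name in plain POSIX sh.
lib_flags() {
    name=$1
    eval "inc=\${${name}_INCLUDE:-}"
    eval "lib=\${${name}_LIB:-}"
    if [ -z "$lib" ]; then
        echo "lib_flags: ${name}_LIB is not set (module not loaded?)" >&2
        return 1
    fi
    # The include directory is optional; emit -I only when it is set.
    if [ -n "$inc" ]; then
        printf '%s ' "-I$inc"
    fi
    printf '%s\n' "-L$lib"
}
```

A call such as lib_flags LAPACK could then be placed on the compile line after the source or object files, in keeping with the link-order rule above.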

General Information: JUQUEEN (II)
- Initially, all libraries will be compiled with -O3 -qstrict -g -qsimd=noauto
- Additional versions compiled without -g will be added
- Versions with SIMD enabled may follow later
- See module avail for the available versions
- Only the most recent versions will be installed

Sequential Libraries and Packages (I)
- Vendor-specific libraries
  - ESSL (Engineering and Scientific Subroutine Library), version 5.1, in /opt/ibmmath/essl/5.1/lib64
- Public domain software, planned
  - LAPACK (Linear Algebra PACKage)
  - ARPACK (ARnoldi PACKage)
  - GSL (GNU Scientific Library)
  - GMP (GNU Multiple Precision Arithmetic Library)

Contents of ESSL Version 5.1
- BLAS levels 1-3 and additional vector, matrix-vector, and matrix-matrix operations
- Sparse vector and matrix operations
- LAPACK computational routines for linear equation systems and eigensystems
- Banded linear system solvers
- Linear least squares
- Fast Fourier transforms

- Numerical quadrature
- Random number generation
- Interpolation

All routines are thread-safe, i.e. they can be called from within OpenMP threads.

For further information see the IBM Engineering and Scientific Subroutine Library for Linux on POWER V5.1: Guide and Reference
http://www.fz-juelich.de/ias/jsc/en/expertise/support/software/SystemDependentLibraries/ESSL.html

Usage of ESSL
- Compiling and linking a Fortran program name.f that calls ESSL routines:
  mpixlf90_r name.f -L/opt/ibmmath/essl/5.1/lib64 -lesslbg
- Compiling and linking a C program name.c that calls ESSL routines: not yet tested

LAPACK (I)
- Public domain version 3.3 on JUQUEEN
- Must be used together with ESSL (or ESSLsmp)
- Some routines are already contained in ESSL
- Attention: some calling sequences differ between ESSL and LAPACK!

LAPACK (II)
Compiling and linking a Fortran program name.f that calls LAPACK routines on JUQUEEN:
  module load lapack/3.3.0_g
  mpixlf77_r name.f -L/opt/ibmmath/essl/5.1/lib64 [-lessl[smp]bg] -L$(LAPACK_LIB) -llapack -lessl[smp]bg
ESSL must be linked after LAPACK to resolve references.
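The link order above can be sketched as a small shell helper that assembles, and only prints, the command line: source file first, then LAPACK, then ESSL. The function name lapack_link_line is hypothetical, and the sketch assumes that module load lapack/3.3.0_g has set LAPACK_LIB as described earlier. Note that $(LAPACK_LIB) is makefile syntax; in the shell the same variable is written ${LAPACK_LIB}.

```shell
#!/bin/sh
# Sketch only: print a link line in the order the slide requires
# (source/object files, then LAPACK, then ESSL). Nothing is compiled here.
ESSL_DIR=/opt/ibmmath/essl/5.1/lib64   # ESSL path as given in the slides

# lapack_link_line SOURCE [smp]  ->  prints the mpixlf77_r command line.
# Passing "smp" as second argument selects the threaded ESSL (esslsmpbg).
lapack_link_line() {
    src=$1
    essl=esslbg
    if [ "${2:-}" = "smp" ]; then
        essl=esslsmpbg
    fi
    if [ -z "${LAPACK_LIB:-}" ]; then
        echo "LAPACK_LIB is not set; run 'module load lapack/3.3.0_g' first" >&2
        return 1
    fi
    # ESSL comes last so that it can resolve references made by LAPACK.
    echo "mpixlf77_r $src -L$ESSL_DIR -L$LAPACK_LIB -llapack -l$essl"
}
```

Printing the command instead of running it makes it easy to check the order of the libraries before handing the line to the compiler wrapper.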

Other sequential libraries
- ARPACK (ARnoldi PACKage), version 2.1: to be installed soon
- GSL (GNU Scientific Library): to be installed soon
- GMP (GNU Multiple Precision Arithmetic Library): to be installed soon

Parallel Libraries and Systems: Threaded Parallelism
- ESSLsmp 5.1 (JUQUEEN)
- Usage:
  mpixlf90_r name.f -L/opt/ibmmath/essl/5.1/lib64 -lesslsmpbg

Parallel Libraries: MPI Parallelism (all planned)
- ScaLAPACK (Scalable Linear Algebra PACKage)
- FFTW (Fastest Fourier Transform in the West)
- MUMPS (MUltifrontal Massively Parallel sparse direct Solver)
- ParMETIS (Parallel Graph Partitioning)
- hypre (high performance preconditioners)
- PARPACK (Parallel ARPACK)

MPI Parallelism (II): Status of ScaLAPACK 2.0.1
- BLACS is now part of ScaLAPACK, but LAPACK and BLAS have to be linked separately
- LAPACK 3.3.0 is already installed; BLAS comes from ESSL, where srotm and drotm are missing, so they will be put into liblapack.a
- ScaLAPACK is compiled and installed, but the tests give errors with MPI
- Executables from DD1 run without error
- Newly linked executables built from all the .o files from DD1 run into errors

MPI Parallelism (III)
- SPRNG (Scalable Parallel Random Number Generator)
- SUNDIALS (SUite of Nonlinear and DIfferential/ALgebraic equation Solvers)

Parallel Systems, MPI Parallelism
- PETSc, toolkit for partial differential equations

Further Information
http://www.fz-juelich.de/ias/jsc/en/expertise/juqueen/JUQUEEN_node.html
http://www.fz-juelich.de/ias/jsc/en/expertise/support/Software/Software_node.html

JSC People
- I. Gutheil: Parallel basic libraries, JUQUEEN, i.gutheil@fz-juelich.de
- Software: mathe-admin@fz-juelich.de