Mathematical Libraries on JUQUEEN. JSC Training Course

Size: px
Start display at page:

Download "Mathematical Libraries on JUQUEEN. JSC Training Course"

Transcription

1 Mitglied der Helmholtz-Gemeinschaft Mathematical Libraries on JUQUEEN JSC Training Course May 10, 2012

2 Outline General Informations Sequential Libraries, planned Parallel Libraries and Application Systems: Threaded Libraries MPI parallel Libraries, planned Further Information May 10, 2012 Folie 2

3 General Informations JUQUEEN (I) All libraries as modules in /bgsys/local/name module avail lists names of available libraries module help name tells how to use library module load name sets environment variables for L$(*_LIB) and I$(*_INCLUDE) to include in makefile Link sequence important,.o always before the libraries, sometimes double linking necessary May 10, 2012 Folie 3

4 General Informations JUQUEEN (II) First all libraries will be compiled with -O3 -qstrict -g qsimd=noauto Additional version compiled without -g will be added Perhaps later on versions with simd, too See module avail for available versions Only the most recent versions will be installed May 10, 2012 Folie 4

5 Sequential Libraries and Packages (I) Vendor specific libraries ESSL (Engineering and Scientific Subroutine Library) version 5.1 in /opt/ibmmath/essl/5.1/lib64 Public domain Software, planned LAPACK (Linear Algebra PACKage) ARPACK (Arnoldi PACKage) GSL (Gnu Scientific Library) GMP (Gnu Multiple Precision Arithmetic Library) May 10, 2012 Folie 5

6 Contents of ESSL Version 5.1 BLAS level 1-3 and additional vector, matrix-vector, and matrix-matrix operations Sparse vector and matrix operations LAPACK computational routines for linear equation systems and eigensystems Banded linear system solvers Linear Least Squares Fast Fourier Transforms May 10, 2012 Folie 6

7 Numerical Quadrature Random Number Generation Interpolation All routines are thread-save, i.e. can be used within OpenMP threads For further information see IBM Engineering and Scientific Subroutine Library for Linux on POWER V5.1: Guide and Reference SystemDependentLibraries/ESSL.html Guide and Reference May 10, 2012 Folie 7

8 Usage of ESSL Compilation and linking of program name.f calling ESSL routines mpixlf90_r name.f -L/opt/ibmmath/essl/5.1/lib64 lesslbg Compilation and linking of program name.c calling ESSL routines not yet tested May 10, 2012 Folie 8

9 Lapack (I) Public domain version 3.3 on JUQUEEN Must be used together with ESSL (or ESSLsmp) Some routines already in ESSL Attention, some calling sequences are different! May 10, 2012 Folie 9

10 Lapack (II) Compilation and linking of FORTRAN program name.f calling LAPACK routines JUQUEEN: module load lapack/3.3.0_g mpixlf77_r name.f -L/opt/ibmmath/essl/5.1/lib64 [-lessl[smp]bg] -L$(LAPACK_LIB) llapack lessl[smp]bg ESSL must be linked after LAPACK to resolve references May 10, 2012 Folie 10

11 Other sequential libraries ARPACK, ARnoldi PACKage, Version 2.1 To be installed soon GSL, GNU Scientific Library To be installed soon GMP GNU Multiple Precision Library To be installed soon May 10, 2012 Folie 11

12 Parallel Libraries and Systems Threaded Parallelism ESSLsmp 5.1 (JUQUEEN) Usage: mpixlf90_r name.f -L/opt/ibmmath/essl/5.1/lib64 -lesslsmpbg May 10, 2012 Folie 12

13 Parallel Libraries MPI Parallelism, all planned ScaLAPACK (Scalable Linear Algebra PACKage) FFTW (Fastest Fourier Transform of the West) MUMPS (Multifrontal Massively Parallel sparse direct Solver) ParMETIS (Parallel Graph Partitioning) hypre (high performance preconditioners) PARPACK (Parallel ARPACK) May 10, 2012 Folie 13

14 MPI Parallelism (II) Status of ScaLAPACK BLACS now part of ScaLAPACK, but LAPACK and BLAS have to be linked seperately LAPACK already installed, BLAS from essl, srotm and drotm are missing, will be put into liblapack.a ScaLAPACK compiled and installed, but tests give error with MPI Executables from DD1 run without error newly linked executables with all.o-files from DD1 run into error May 10, 2012 Folie 14

15 MPI Parallelism (III) SPRNG (Scalable Parallel Random Number Generator) sundials (Suite of Nonlinear and Differential/ALgebraic equation solvers) Parallel Systems, MPI Parallelism PETSc, toolkit for partial differential equations May 10, 2012 Folie 15

16 Further Information JUQUEEN_node.html Software/Software_node.html May 10, 2012 Folie 16

17 JSC People I.Gutheil: Parallel basic libraries, JUQUEEN Software: May 10, 2012 Folie 17

Mathematical Libraries and Application Software on JUROPA and JUQUEEN

Mathematical Libraries and Application Software on JUROPA and JUQUEEN Mitglied der Helmholtz-Gemeinschaft Mathematical Libraries and Application Software on JUROPA and JUQUEEN JSC Training Course May 2014 I.Gutheil Outline General Informations Sequential Libraries Parallel

More information

Advanced Computational Software

Advanced Computational Software Advanced Computational Software Scientific Libraries: Part 2 Blue Waters Undergraduate Petascale Education Program May 29 June 10 2011 Outline Quick review Fancy Linear Algebra libraries - ScaLAPACK -PETSc

More information

Optimization on Huygens

Optimization on Huygens Optimization on Huygens Wim Rijks wimr@sara.nl Contents Introductory Remarks Support team Optimization strategy Amdahls law Compiler options An example Optimization Introductory Remarks Modern day supercomputers

More information

JUROPA Linux Cluster An Overview. 19 May 2014 Ulrich Detert

JUROPA Linux Cluster An Overview. 19 May 2014 Ulrich Detert Mitglied der Helmholtz-Gemeinschaft JUROPA Linux Cluster An Overview 19 May 2014 Ulrich Detert JuRoPA JuRoPA Jülich Research on Petaflop Architectures Bull, Sun, ParTec, Intel, Mellanox, Novell, FZJ JUROPA

More information

Public Domain commercial vendor specific

Public Domain commercial vendor specific Numerical Libraries Numerical L ibraries Public Domain commercial vendor specific 1 Public Domain Lapack-3 linear equations, eigenproblems BLAS fast linear kernels Linpack linear equations Eispack eigenproblems

More information

Experiences of numerical simulations on a PC cluster Antti Vanne December 11, 2002

Experiences of numerical simulations on a PC cluster Antti Vanne December 11, 2002 xperiences of numerical simulations on a P cluster xperiences of numerical simulations on a P cluster ecember xperiences of numerical simulations on a P cluster Introduction eowulf concept Using commodity

More information

Mitglied der Helmholtz-Gemeinschaft JUQUEEN. Best Practices. Florian Janetzko / Wolfgang Frings. 2. Februar 2014

Mitglied der Helmholtz-Gemeinschaft JUQUEEN. Best Practices. Florian Janetzko / Wolfgang Frings. 2. Februar 2014 Mitglied der Helmholtz-Gemeinschaft JUQUEEN Best Practices 2. Februar 2014 Florian Janetzko / Wolfgang Frings Outline Production Environment Module Environment Job Execution Basic Porting Compilers and

More information

1 Bull, 2011 Bull Extreme Computing

1 Bull, 2011 Bull Extreme Computing 1 Bull, 2011 Bull Extreme Computing Table of Contents HPC Overview. Cluster Overview. FLOPS. 2 Bull, 2011 Bull Extreme Computing HPC Overview Ares, Gerardo, HPC Team HPC concepts HPC: High Performance

More information

Introduction to Linux and Cluster Basics for the CCR General Computing Cluster

Introduction to Linux and Cluster Basics for the CCR General Computing Cluster Introduction to Linux and Cluster Basics for the CCR General Computing Cluster Cynthia Cornelius Center for Computational Research University at Buffalo, SUNY 701 Ellicott St Buffalo, NY 14203 Phone: 716-881-8959

More information

It s Not A Disease: The Parallel Solver Packages MUMPS, PaStiX & SuperLU

It s Not A Disease: The Parallel Solver Packages MUMPS, PaStiX & SuperLU It s Not A Disease: The Parallel Solver Packages MUMPS, PaStiX & SuperLU A. Windisch PhD Seminar: High Performance Computing II G. Haase March 29 th, 2012, Graz Outline 1 MUMPS 2 PaStiX 3 SuperLU 4 Summary

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 19: SVD revisited; Software for Linear Algebra Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical Analysis I 1 / 9 Outline 1 Computing

More information

Service Partition Specialized Linux nodes. Compute PE Login PE Network PE System PE I/O PE

Service Partition Specialized Linux nodes. Compute PE Login PE Network PE System PE I/O PE 2 Service Partition Specialized Linux nodes Compute PE Login PE Network PE System PE I/O PE Microkernel on Compute PEs, full featured Linux on Service PEs. Service PEs specialize by function Software Architecture

More information

PARALLEL ALGORITHMS FOR PREDICTIVE MODELLING

PARALLEL ALGORITHMS FOR PREDICTIVE MODELLING PARALLEL ALGORITHMS FOR PREDICTIVE MODELLING MARKUS HEGLAND Abstract. Parallel computing enables the analysis of very large data sets using large collections of flexible models with many variables. The

More information

Report on Project: Advanced System Monitoring for the Parallel Tools Platform (PTP)

Report on Project: Advanced System Monitoring for the Parallel Tools Platform (PTP) Mitglied der Helmholtz-Gemeinschaft Report on Project: Advanced System Monitoring for the Parallel Tools Platform (PTP) September, 2014 Wolfgang Frings and Carsten Karbach Project progress Server caching

More information

A Grid-Aware Web Interface with Advanced Service Trading for Linear Algebra Calculations

A Grid-Aware Web Interface with Advanced Service Trading for Linear Algebra Calculations A Grid-Aware Web Interface with Advanced Service Trading for Linear Algebra Calculations Hrachya Astsatryan 1, Vladimir Sahakyan 1, Yuri Shoukouryan 1, Michel Daydé 2, Aurelie Hurault 2, Marc Pantel 2,

More information

PARALLEL PROGRAMMING

PARALLEL PROGRAMMING PARALLEL PROGRAMMING TECHNIQUES AND APPLICATIONS USING NETWORKED WORKSTATIONS AND PARALLEL COMPUTERS 2nd Edition BARRY WILKINSON University of North Carolina at Charlotte Western Carolina University MICHAEL

More information

DARPA, NSF-NGS/ITR,ACR,CPA,

DARPA, NSF-NGS/ITR,ACR,CPA, Spiral Automating Library Development Markus Püschel and the Spiral team (only part shown) With: Srinivas Chellappa Frédéric de Mesmay Franz Franchetti Daniel McFarlin Yevgen Voronenko Electrical and Computer

More information

The Forthcoming Petascale Systems Era Got Tools?

The Forthcoming Petascale Systems Era Got Tools? Era Got Tools? Tony Drummond Computational Research Division Lawrence Berkeley National Laboratory Salishan April 21, 2005 Where are the applications? Accelerator Science Astrophysics Biology Chemistry

More information

Cluster performance, how to get the most out of Abel. Ole W. Saastad, Dr.Scient USIT / UAV / FI April 18 th 2013

Cluster performance, how to get the most out of Abel. Ole W. Saastad, Dr.Scient USIT / UAV / FI April 18 th 2013 Cluster performance, how to get the most out of Abel Ole W. Saastad, Dr.Scient USIT / UAV / FI April 18 th 2013 Introduction Architecture x86-64 and NVIDIA Compilers MPI Interconnect Storage Batch queue

More information

Sourcery Overview & Virtual Machine Installation

Sourcery Overview & Virtual Machine Installation Sourcery Overview & Virtual Machine Installation Damian Rouson, Ph.D., P.E. Sourcery, Inc. www.sourceryinstitute.org Sourcery, Inc. About Us Sourcery, Inc., is a software consultancy founded by and for

More information

HPC enabling of OpenFOAM R for CFD applications

HPC enabling of OpenFOAM R for CFD applications HPC enabling of OpenFOAM R for CFD applications Towards the exascale: OpenFOAM perspective Ivan Spisso 25-27 March 2015, Casalecchio di Reno, BOLOGNA. SuperComputing Applications and Innovation Department,

More information

Poisson Equation Solver Parallelisation for Particle-in-Cell Model

Poisson Equation Solver Parallelisation for Particle-in-Cell Model WDS'14 Proceedings of Contributed Papers Physics, 233 237, 214. ISBN 978-8-7378-276-4 MATFYZPRESS Poisson Equation Solver Parallelisation for Particle-in-Cell Model A. Podolník, 1,2 M. Komm, 1 R. Dejarnac,

More information

INTEL PARALLEL STUDIO XE EVALUATION GUIDE

INTEL PARALLEL STUDIO XE EVALUATION GUIDE Introduction This guide will illustrate how you use Intel Parallel Studio XE to find the hotspots (areas that are taking a lot of time) in your application and then recompiling those parts to improve overall

More information

Overview of HPC systems and software available within

Overview of HPC systems and software available within Overview of HPC systems and software available within Overview Available HPC Systems Ba Cy-Tera Available Visualization Facilities Software Environments HPC System at Bibliotheca Alexandrina SUN cluster

More information

AN INTRODUCTION TO NUMERICAL METHODS AND ANALYSIS

AN INTRODUCTION TO NUMERICAL METHODS AND ANALYSIS AN INTRODUCTION TO NUMERICAL METHODS AND ANALYSIS Revised Edition James Epperson Mathematical Reviews BICENTENNIAL 0, 1 8 0 7 z ewiley wu 2007 r71 BICENTENNIAL WILEY-INTERSCIENCE A John Wiley & Sons, Inc.,

More information

NOTUR Technology Transfer Projects (TTP)

NOTUR Technology Transfer Projects (TTP) NOTUR Technology Transfer Projects (TTP) By Trond Kvamsdal NOTUR 10. Juni 2004, Tromsø, Norway CONTENTS The concept behind the TTPs Results obtained from the TTPs Concluding remarks Purpose Enable optimal

More information

Matrix Multiplication

Matrix Multiplication Matrix Multiplication CPS343 Parallel and High Performance Computing Spring 2016 CPS343 (Parallel and HPC) Matrix Multiplication Spring 2016 1 / 32 Outline 1 Matrix operations Importance Dense and sparse

More information

Cluster Computing at HRI

Cluster Computing at HRI Cluster Computing at HRI J.S.Bagla Harish-Chandra Research Institute, Chhatnag Road, Jhunsi, Allahabad 211019. E-mail: jasjeet@mri.ernet.in 1 Introduction and some local history High performance computing

More information

Numerical Algorithms Group. Embedded Analytics. A cure for the common code. www.nag.com. Results Matter. Trust NAG.

Numerical Algorithms Group. Embedded Analytics. A cure for the common code. www.nag.com. Results Matter. Trust NAG. Embedded Analytics A cure for the common code www.nag.com Results Matter. Trust NAG. Executive Summary How much information is there in your data? How much is hidden from you, because you don t have access

More information

Parallel Programming for Multi-Core, Distributed Systems, and GPUs Exercises

Parallel Programming for Multi-Core, Distributed Systems, and GPUs Exercises Parallel Programming for Multi-Core, Distributed Systems, and GPUs Exercises Pierre-Yves Taunay Research Computing and Cyberinfrastructure 224A Computer Building The Pennsylvania State University University

More information

The Assessment of Benchmarks Executed on Bare-Metal and Using Para-Virtualisation

The Assessment of Benchmarks Executed on Bare-Metal and Using Para-Virtualisation The Assessment of Benchmarks Executed on Bare-Metal and Using Para-Virtualisation Mark Baker, Garry Smith and Ahmad Hasaan SSE, University of Reading Paravirtualization A full assessment of paravirtualization

More information

Numerical Methods I Eigenvalue Problems

Numerical Methods I Eigenvalue Problems Numerical Methods I Eigenvalue Problems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 Course G63.2010.001 / G22.2420-001, Fall 2010 September 30th, 2010 A. Donev (Courant Institute)

More information

MPI Hands-On List of the exercises

MPI Hands-On List of the exercises MPI Hands-On List of the exercises 1 MPI Hands-On Exercise 1: MPI Environment.... 2 2 MPI Hands-On Exercise 2: Ping-pong...3 3 MPI Hands-On Exercise 3: Collective communications and reductions... 5 4 MPI

More information

A Parallel Lanczos Algorithm for Eigensystem Calculation

A Parallel Lanczos Algorithm for Eigensystem Calculation A Parallel Lanczos Algorithm for Eigensystem Calculation Hans-Peter Kersken / Uwe Küster Eigenvalue problems arise in many fields of physics and engineering science for example in structural engineering

More information

Numerical Libraries and Tools for Scalable Parallel Cluster Computing

Numerical Libraries and Tools for Scalable Parallel Cluster Computing Numerical Libraries and Tools for Scalable Parallel Cluster Computing Shirley Browne, Jack Dongarra, and Anne Trefethen* Introduction University of Tennessee Oak Ridge National Laboratory * Numerical Algorithms

More information

HSL and its out-of-core solver

HSL and its out-of-core solver HSL and its out-of-core solver Jennifer A. Scott j.a.scott@rl.ac.uk Prague November 2006 p. 1/37 Sparse systems Problem: we wish to solve where A is Ax = b LARGE Informal definition: A is sparse if many

More information

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics

22S:295 Seminar in Applied Statistics High Performance Computing in Statistics 22S:295 Seminar in Applied Statistics High Performance Computing in Statistics Luke Tierney Department of Statistics & Actuarial Science University of Iowa August 30, 2007 Luke Tierney (U. of Iowa) HPC

More information

AGNI: COUPLING MODEL ANALYSIS TOOLS AND HIGH-PERFORMANCE SUBSURFACE FLOW AND TRANSPORT SIMULATORS FOR RISK AND PERFORMANCE ASSESSMENTS

AGNI: COUPLING MODEL ANALYSIS TOOLS AND HIGH-PERFORMANCE SUBSURFACE FLOW AND TRANSPORT SIMULATORS FOR RISK AND PERFORMANCE ASSESSMENTS XIX International Conference on Water Resources CMWR 2012 University of Illinois at Urbana-Champaign June 17-22, 2012 AGNI: COUPLING MODEL ANALYSIS TOOLS AND HIGH-PERFORMANCE SUBSURFACE FLOW AND TRANSPORT

More information

Part I Courses Syllabus

Part I Courses Syllabus Part I Courses Syllabus This document provides detailed information about the basic courses of the MHPC first part activities. The list of courses is the following 1.1 Scientific Programming Environment

More information

Code Generation Tools for PDEs. Matthew Knepley PETSc Developer Mathematics and Computer Science Division Argonne National Laboratory

Code Generation Tools for PDEs. Matthew Knepley PETSc Developer Mathematics and Computer Science Division Argonne National Laboratory Code Generation Tools for PDEs Matthew Knepley PETSc Developer Mathematics and Computer Science Division Argonne National Laboratory Talk Objectives Introduce Code Generation Tools - Installation - Use

More information

Low Level. Software. Solution. extensions to handle. coarse grained task. compilers with. Data parallel. parallelism.

Low Level. Software. Solution. extensions to handle. coarse grained task. compilers with. Data parallel. parallelism. . 1 History 2 æ 1960s - First Organized Collections Problem Solving Environments for Parallel Scientiæc Computation Jack Dongarra Univ. of Tenn.èOak Ridge National Lab dongarra@cs.utk.edu æ 1970s - Advent

More information

Big Data and Big Analytics

Big Data and Big Analytics Big Data and Big Analytics Introducing SciDB Open source, massively parallel DBMS and analytic platform Array data model (rather than SQL, Unstructured, XML, or triple-store) Extensible micro-kernel architecture

More information

YML : un workow scientique pour le calcul haute performance

YML : un workow scientique pour le calcul haute performance YML : un workow scientique pour le calcul haute performance par Olivier Delannoy Thèse présentée à l'université de Versailles Saint-Quentin pour obtenir le titre de Docteur en informatique Commission d'examen

More information

Fast Multipole Method for particle interactions: an open source parallel library component

Fast Multipole Method for particle interactions: an open source parallel library component Fast Multipole Method for particle interactions: an open source parallel library component F. A. Cruz 1,M.G.Knepley 2,andL.A.Barba 1 1 Department of Mathematics, University of Bristol, University Walk,

More information

YALES2 porting on the Xeon- Phi Early results

YALES2 porting on the Xeon- Phi Early results YALES2 porting on the Xeon- Phi Early results Othman Bouizi Ghislain Lartigue Innovation and Pathfinding Architecture Group in Europe, Exascale Lab. Paris CRIHAN - Demi-journée calcul intensif, 16 juin

More information

How High a Degree is High Enough for High Order Finite Elements?

How High a Degree is High Enough for High Order Finite Elements? This space is reserved for the Procedia header, do not use it How High a Degree is High Enough for High Order Finite Elements? William F. National Institute of Standards and Technology, Gaithersburg, Maryland,

More information

GPUs for Scientific Computing

GPUs for Scientific Computing GPUs for Scientific Computing p. 1/16 GPUs for Scientific Computing Mike Giles mike.giles@maths.ox.ac.uk Oxford-Man Institute of Quantitative Finance Oxford University Mathematical Institute Oxford e-research

More information

Performance analysis of parallel applications on modern multithreaded processor architectures

Performance analysis of parallel applications on modern multithreaded processor architectures Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe Performance analysis of parallel applications on modern multithreaded processor architectures Maciej Cytowski* a, Maciej

More information

Designing and Building Applications for Extreme Scale Systems CS598 William Gropp www.cs.illinois.edu/~wgropp

Designing and Building Applications for Extreme Scale Systems CS598 William Gropp www.cs.illinois.edu/~wgropp Designing and Building Applications for Extreme Scale Systems CS598 William Gropp www.cs.illinois.edu/~wgropp Welcome! Who am I? William (Bill) Gropp Professor of Computer Science One of the Creators of

More information

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers Haohuan Fu haohuan@tsinghua.edu.cn High Performance Geo-Computing (HPGC) Group Center for Earth System Science Tsinghua University

More information

RevoScaleR Speed and Scalability

RevoScaleR Speed and Scalability EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution

More information

BLM 413E - Parallel Programming Lecture 3

BLM 413E - Parallel Programming Lecture 3 BLM 413E - Parallel Programming Lecture 3 FSMVU Bilgisayar Mühendisliği Öğr. Gör. Musa AYDIN 14.10.2015 2015-2016 M.A. 1 Parallel Programming Models Parallel Programming Models Overview There are several

More information

Why (and Why Not) to Use Fortran

Why (and Why Not) to Use Fortran Why (and Why Not) to Use Fortran p. 1/?? Why (and Why Not) to Use Fortran Instead of C++, Matlab, Python etc. Nick Maclaren University of Cambridge Computing Service nmm1@cam.ac.uk, 01223 334761 June 2012

More information

Numerical Libraries with C or Fortran. Shaohao Chen Research Computing, IS&T, Boston University

Numerical Libraries with C or Fortran. Shaohao Chen Research Computing, IS&T, Boston University Numerical Libraries with C or Fortran Shaohao Chen Research Computing, IS&T, Boston University Outline 1. Overview: What? Why? How to? 2. Fast Fourier transform: FFTw 3. Linear algebra libraries: LAPACK/BLAS

More information

Dynamic Load Balancing in CP2K

Dynamic Load Balancing in CP2K Dynamic Load Balancing in CP2K Pradeep Shivadasan August 19, 2014 MSc in High Performance Computing The University of Edinburgh Year of Presentation: 2014 Abstract CP2K is a widely used atomistic simulation

More information

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky Solving the World s Toughest Computational Problems with Parallel Computing Alan Kaminsky Solving the World s Toughest Computational Problems with Parallel Computing Alan Kaminsky Department of Computer

More information

APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder

APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder APPM4720/5720: Fast algorithms for big data Gunnar Martinsson The University of Colorado at Boulder Course objectives: The purpose of this course is to teach efficient algorithms for processing very large

More information

A Simultaneous Solution for General Linear Equations on a Ring or Hierarchical Cluster

A Simultaneous Solution for General Linear Equations on a Ring or Hierarchical Cluster Acta Technica Jaurinensis Vol. 3. No. 1. 010 A Simultaneous Solution for General Linear Equations on a Ring or Hierarchical Cluster G. Molnárka, N. Varjasi Széchenyi István University Győr, Hungary, H-906

More information

Giac/Xcas, a swiss knife for mathematics

Giac/Xcas, a swiss knife for mathematics Bernard Parisse Bernard Parisse University of Grenoble I Trophées du Libre 2007 Plan 1 : interface for CAS, dynamic geometry and spreadsheet, audience: scienti c students to research 2 : a C++ library,

More information

High Performance Matrix Inversion with Several GPUs

High Performance Matrix Inversion with Several GPUs High Performance Matrix Inversion on a Multi-core Platform with Several GPUs Pablo Ezzatti 1, Enrique S. Quintana-Ortí 2 and Alfredo Remón 2 1 Centro de Cálculo-Instituto de Computación, Univ. de la República

More information

Operation Count; Numerical Linear Algebra

Operation Count; Numerical Linear Algebra 10 Operation Count; Numerical Linear Algebra 10.1 Introduction Many computations are limited simply by the sheer number of required additions, multiplications, or function evaluations. If floating-point

More information

Software Development around a Millisecond

Software Development around a Millisecond Introduction Software Development around a Millisecond Geoffrey Fox In this column we consider software development methodologies with some emphasis on those relevant for large scale scientific computing.

More information

Introduction Installation Comparison. Department of Computer Science, Yazd University. SageMath. A.Rahiminasab. October9, 2015 1 / 17

Introduction Installation Comparison. Department of Computer Science, Yazd University. SageMath. A.Rahiminasab. October9, 2015 1 / 17 Department of Computer Science, Yazd University SageMath A.Rahiminasab October9, 2015 1 / 17 2 / 17 SageMath(previously Sage or SAGE) System for Algebra and Geometry Experimentation is mathematical software

More information

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi

Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi ICPP 6 th International Workshop on Parallel Programming Models and Systems Software for High-End Computing October 1, 2013 Lyon, France

More information

Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes

Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes Eric Petit, Loïc Thebault, Quang V. Dinh May 2014 EXA2CT Consortium 2 WPs Organization Proto-Applications

More information

Free software for scientific computing

Free software for scientific computing Free software for scientific computing F. Varas Departamento de Matemática Aplicada II Universidad de Vigo, Spain Sevilla Numérica Seville, 13-17 June 2011 Acknowledgments to the Organizing Commmittee

More information

Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach

Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach S. M. Ashraful Kadir 1 and Tazrian Khan 2 1 Scientific Computing, Royal Institute of Technology (KTH), Stockholm, Sweden smakadir@csc.kth.se,

More information

Trends in High-Performance Computing for Power Grid Applications

Trends in High-Performance Computing for Power Grid Applications Trends in High-Performance Computing for Power Grid Applications Franz Franchetti ECE, Carnegie Mellon University www.spiral.net Co-Founder, SpiralGen www.spiralgen.com This talk presents my personal views

More information

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University Linux clustering Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University PII 4-node clusters started in 1999 PIII 16 node cluster purchased in 2001. Plan for grid For test base HKBU -

More information

A Crash course to (The) Bighouse

A Crash course to (The) Bighouse A Crash course to (The) Bighouse Brock Palen brockp@umich.edu SVTI Users meeting Sep 20th Outline 1 Resources Configuration Hardware 2 Architecture ccnuma Altix 4700 Brick 3 Software Packaged Software

More information

CUDA programming on NVIDIA GPUs

CUDA programming on NVIDIA GPUs p. 1/21 on NVIDIA GPUs Mike Giles mike.giles@maths.ox.ac.uk Oxford University Mathematical Institute Oxford-Man Institute for Quantitative Finance Oxford eresearch Centre p. 2/21 Overview hardware view

More information

Linux tools for debugging and profiling MPI codes

Linux tools for debugging and profiling MPI codes Competence in High Performance Computing Linux tools for debugging and profiling MPI codes Werner Krotz-Vogel, Pallas GmbH MRCCS September 02000 Pallas GmbH Hermülheimer Straße 10 D-50321

More information

BookTOC.txt. 1. Functions, Graphs, and Models. Algebra Toolbox. Sets. The Real Numbers. Inequalities and Intervals on the Real Number Line

BookTOC.txt. 1. Functions, Graphs, and Models. Algebra Toolbox. Sets. The Real Numbers. Inequalities and Intervals on the Real Number Line College Algebra in Context with Applications for the Managerial, Life, and Social Sciences, 3rd Edition Ronald J. Harshbarger, University of South Carolina - Beaufort Lisa S. Yocco, Georgia Southern University

More information

Numerical Analysis. Professor Donna Calhoun. Fall 2013 Math 465/565. Office : MG241A Office Hours : Wednesday 10:00-12:00 and 1:00-3:00

Numerical Analysis. Professor Donna Calhoun. Fall 2013 Math 465/565. Office : MG241A Office Hours : Wednesday 10:00-12:00 and 1:00-3:00 Numerical Analysis Professor Donna Calhoun Office : MG241A Office Hours : Wednesday 10:00-12:00 and 1:00-3:00 Fall 2013 Math 465/565 http://math.boisestate.edu/~calhoun/teaching/math565_fall2013 What is

More information

The Asynchronous Dynamic Load-Balancing Library

The Asynchronous Dynamic Load-Balancing Library The Asynchronous Dynamic Load-Balancing Library Rusty Lusk, Steve Pieper, Ralph Butler, Anthony Chan Mathematics and Computer Science Division Nuclear Physics Division Outline The Nuclear Physics problem

More information

Algebra I Credit Recovery

Algebra I Credit Recovery Algebra I Credit Recovery COURSE DESCRIPTION: The purpose of this course is to allow the student to gain mastery in working with and evaluating mathematical expressions, equations, graphs, and other topics,

More information

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes Oluwapelumi Adenikinju1, Julian Gilyard2, Joshua Massey1, Thomas Stitt 1 Department of Computer Science and Electrical Engineering, UMBC

More information

CS 294-73 (CCN 27156) CS 194-73 (CCN 26880) Software Engineering for Scientific Computing. Lecture 1: Introduction

CS 294-73 (CCN 27156) CS 194-73 (CCN 26880) Software Engineering for Scientific Computing. Lecture 1: Introduction CS 294-73 (CCN 27156) CS 194-73 (CCN 26880) Software Engineering for Scientific Computing http://www.eecs.berkeley.edu/~colella/cs294fall2015/ colella@eecs.berkeley.edu pcolella@lbl.gov Lecture 1: Introduction

More information

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky Solving the World s Toughest Computational Problems with Parallel Computing Solving the World s Toughest Computational Problems with Parallel Computing Department of Computer Science B. Thomas Golisano

More information

(!' ) "' # "*# "!(!' +,

(!' ) ' # *# !(!' +, MATLAB is a numeric computation software for engineering and scientific calculations. The name MATLAB stands for MATRIX LABORATORY. MATLAB is primarily a tool for matrix computations. It was developed

More information

Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers

Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers The Boeing Company P.O.Box3707,MC7L-21 Seattle, WA 98124-2207 Final Technical Report February 1999 Document D6-82405 Copyright 1999 The Boeing Company All Rights Reserved Algorithmic Research and Software

More information

Application Performance Tools @ NERSC. David Skinner, Richard Gerber, Nick Wright, Karl Fuerlinger and 4000 others

Application Performance Tools @ NERSC. David Skinner, Richard Gerber, Nick Wright, Karl Fuerlinger and 4000 others Application Performance Tools @ NERSC David Skinner, Richard Gerber, Nick Wright, Karl Fuerlinger and 4000 others User demographics at NERSC Large scale parallelism and data needs of science teams Large

More information

Data Mining mit der JMSL Numerical Library for Java Applications

Data Mining mit der JMSL Numerical Library for Java Applications Data Mining mit der JMSL Numerical Library for Java Applications Stefan Sineux 8. Java Forum Stuttgart 07.07.2005 Agenda Visual Numerics JMSL TM Numerical Library Neuronale Netze (Hintergrund) Demos Neuronale

More information

NUMERICAL METHODS TOPICS FOR RESEARCH PAPERS

NUMERICAL METHODS TOPICS FOR RESEARCH PAPERS Faculty of Civil Engineering Belgrade Master Study COMPUTATIONAL ENGINEERING Fall semester 2004/2005 NUMERICAL METHODS TOPICS FOR RESEARCH PAPERS 1. NUMERICAL METHODS IN FINITE ELEMENT ANALYSIS - Matrices

More information

GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources

GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources Jack Dongarra University of Tennessee and Oak Ridge National Laboratory 2/25/2006 1 Overview Grid/NetSolve

More information

Parallel Computing using MATLAB Distributed Compute Server ZORRO HPC

Parallel Computing using MATLAB Distributed Compute Server ZORRO HPC Parallel Computing using MATLAB Distributed Compute Server ZORRO HPC Goals of the session Overview of parallel MATLAB Why parallel MATLAB? Multiprocessing in MATLAB Parallel MATLAB using the Parallel Computing

More information

TITLE: The NAS Parallel Benchmarks. AUTHOR: David H Bailey 1

TITLE: The NAS Parallel Benchmarks. AUTHOR: David H Bailey 1 TITLE: The NAS Parallel Benchmarks AUTHOR: David H Bailey 1 ACRONYMS: NAS, NPB DEFINITION: The NAS Parallel Benchmarks (NPB) are a suite of parallel computer performance benchmarks. They were originally

More information

GPU Hardware and Programming Models. Jeremy Appleyard, September 2015

GPU Hardware and Programming Models. Jeremy Appleyard, September 2015 GPU Hardware and Programming Models Jeremy Appleyard, September 2015 A brief history of GPUs In this talk Hardware Overview Programming Models Ask questions at any point! 2 A Brief History of GPUs 3 Once

More information

Computational Mathematics with Python

Computational Mathematics with Python Boolean Arrays Classes Computational Mathematics with Python Basics Olivier Verdier and Claus Führer 2009-03-24 Olivier Verdier and Claus Führer Computational Mathematics with Python 2009-03-24 1 / 40

More information

SR-IOV: Performance Benefits for Virtualized Interconnects!

SR-IOV: Performance Benefits for Virtualized Interconnects! SR-IOV: Performance Benefits for Virtualized Interconnects! Glenn K. Lockwood! Mahidhar Tatineni! Rick Wagner!! July 15, XSEDE14, Atlanta! Background! High Performance Computing (HPC) reaching beyond traditional

More information

Computer programming course in the Department of Physics, University of Calcutta

Computer programming course in the Department of Physics, University of Calcutta Computer programming course in the Department of Physics, University of Calcutta Parongama Sen with inputs from Prof. S. Dasgupta and Dr. J. Saha and feedback from students Computer programming course

More information

HPC Wales Skills Academy Course Catalogue 2015

HPC Wales Skills Academy Course Catalogue 2015 HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses

More information

The NEST 4g kernel: highly scalable simulation code from laptops to supercomputers. Susanne Kunkel. 28/01/2014 Code Jam SimLab Neuroscience, JSC

The NEST 4g kernel: highly scalable simulation code from laptops to supercomputers. Susanne Kunkel. 28/01/2014 Code Jam SimLab Neuroscience, JSC The NEST 4g kernel: highly scalable simulation code from laptops to supercomputers Susanne Kunkel 28/01/2014 Code Jam SimLab Neuroscience, JSC Overview Model of the memory usage of NEST 3 rd generation

More information

2: Computer Performance

2: Computer Performance 2: Computer Performance http://people.sc.fsu.edu/ jburkardt/presentations/ fdi 2008 lecture2.pdf... John Information Technology Department Virginia Tech... FDI Summer Track V: Parallel Programming 10-12

More information

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS * Students who scored a Level 3 or above on the Florida Assessment Test Math Florida Standards (FSA-MAFS) are strongly encouraged to make Advanced Placement and/or dual enrollment courses their first choices

More information

Data-Flow Awareness in Parallel Data Processing

Data-Flow Awareness in Parallel Data Processing Data-Flow Awareness in Parallel Data Processing D. Bednárek, J. Dokulil *, J. Yaghob, F. Zavoral Charles University Prague, Czech Republic * University of Vienna, Austria 6 th International Symposium on

More information

Arcane/ArcGeoSim, a software framework for geosciences simulation

Arcane/ArcGeoSim, a software framework for geosciences simulation Renewable energies Eco-friendly production Innovative transport Eco-efficient processes Sustainable resources Arcane/ArcGeoSim, a software framework for geosciences simulation Pascal Havé Outline these

More information

Systolic Computing. Fundamentals

Systolic Computing. Fundamentals Systolic Computing Fundamentals Motivations for Systolic Processing PARALLEL ALGORITHMS WHICH MODEL OF COMPUTATION IS THE BETTER TO USE? HOW MUCH TIME WE EXPECT TO SAVE USING A PARALLEL ALGORITHM? HOW

More information

The Top Six Advantages of CUDA-Ready Clusters. Ian Lumb Bright Evangelist

The Top Six Advantages of CUDA-Ready Clusters. Ian Lumb Bright Evangelist The Top Six Advantages of CUDA-Ready Clusters Ian Lumb Bright Evangelist GTC Express Webinar January 21, 2015 We scientists are time-constrained, said Dr. Yamanaka. Our priority is our research, not managing

More information