Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers



Similar documents
Report Documentation Page

EAD Expected Annual Flood Damage Computation

Integrated Force Method Solution to Indeterminate Structural Mechanics Problems

Overview Presented by: Boyd L. Summers

Using the Advancement Degree of Difficulty (AD 2 ) as an input to Risk Management

An Application of an Iterative Approach to DoD Software Migration Planning

RT 24 - Architecture, Modeling & Simulation, and Software Design

Guide to Using DoD PKI Certificates in Outlook 2000

An Oil-Free Thrust Foil Bearing Facility Design, Calibration, and Operation

REPORT DOCUMENTATION PAGE *

Headquarters U.S. Air Force

AFRL-RX-WP-TP

DEFENSE CONTRACT AUDIT AGENCY

THE MIMOSA OPEN SOLUTION COLLABORATIVE ENGINEERING AND IT ENVIRONMENTS WORKSHOP

Addressing the Real-World Challenges in the Development of Propulsion IVHM Technology Experiment (PITEX)

Microstructural Evaluation of KM4 and SR3 Samples Subjected to Various Heat Treatments

Asset Management- Acquisitions

DCAA and the Small Business Innovative Research (SBIR) Program

A New Empirical Relationship between Thrust Coefficient and Induction Factor for the Turbulent Windmill State

Obsolescence Considerations for Materials in the Lower Sub-Tiers of the Supply Chain

Simulation of Air Flow Through a Test Chamber

Interagency National Security Knowledge and Skills in the Department of Defense

Issue Paper. Wargaming Homeland Security and Army Reserve Component Issues. By Professor Michael Pasquarett

Dr. Gary S. E. Lagerloef Earth and Space Research, 1910 Fairview Ave E

DATA ITEM DESCRIPTION

John Mathieson US Air Force (WR ALC) Systems & Software Technology Conference Salt Lake City, Utah 19 May 2011

Pima Community College Planning Grant For Autonomous Intelligent Network of Systems (AINS) Science, Mathematics & Engineering Education Center

Assessment of NASA Dual Microstructure Heat Treatment Method Utilizing Ladish SuperCooler Cooling Technology

Poisson Equation Solver Parallelisation for Particle-in-Cell Model

SOLVING LINEAR SYSTEMS

ELECTRONIC HEALTH RECORDS. Fiscal Year 2013 Expenditure Plan Lacks Key Information Needed to Inform Future Funding Decisions

Non-Autoclave (Prepreg) Manufacturing Technology

DATA ITEM DESCRIPTION

REPORT DOCUMENTATION PAGE

A Manager's Checklist for Validating Software Cost and Schedule Estimates

HSL and its out-of-core solver

Mobile Robot Knowledge Base

CAPTURE-THE-FLAG: LEARNING COMPUTER SECURITY UNDER FIRE

Optical Blade Position Tracking System Test

PROBLEM STATEMENT: Will reducing the ASD for Kadena AB F-15 C/Ds increase the CPFH for this Mission Design Series (MDS)?

Mr. Steve Mayer, PMP, P.E. McClellan Remediation Program Manger Air Force Real Property Agency. May 11, 2011

ACCELERATING COMMERCIAL LINEAR DYNAMIC AND NONLINEAR IMPLICIT FEA SOFTWARE THROUGH HIGH- PERFORMANCE COMPUTING

Ultra-low Sulfur Diesel Classification with Near-Infrared Spectroscopy and Partial Least Squares

DATA ITEM DESCRIPTION

In June 1998 the Joint Military Intelligence. Intelligence Education for Joint Warfighting A. DENIS CLIFT

Regional Field Verification Operational Results from Four Small Wind Turbines in the Pacific Northwest

It s Not A Disease: The Parallel Solver Packages MUMPS, PaStiX & SuperLU

Advanced Micro Ring Resonator Filter Technology

Military Health System Conference

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Real-Time Task Scheduling for Energy-Aware Embedded Systems 1

Mathematical Libraries and Application Software on JUROPA and JUQUEEN

DATA ITEM DESCRIPTION

IISUP-. NAVAL SUPPLY SVSTE:MS COMMAND. Ready. Resourceful. Responsive!

Multiple Network Marketing coordination Model

Adapting scientific computing problems to cloud computing frameworks Ph.D. Thesis. Pelle Jakovits

DATA ITEM DESCRIPTION

DEFENSE BUSINESS PRACTICE IMPLEMENTATION BOARD

Best practices for efficient HPC performance with large models

Small PV Systems Performance Evaluation at NREL's Outdoor Test Facility Using the PVUSA Power Rating Method

ADVANCED NETWORK SECURITY PROJECT

Modification of the Minimum-Degree Algorithm by Multiple Elimination

REPORT DOCUMENTATION PAGE

I N S T I T U T E F O R D E FE N S E A N A L Y S E S NSD-5216

Software Security Engineering: A Guide for Project Managers

Online Tuning of Artificial Neural Networks for Induction Motor Control

73rd MORSS CD Cover Page UNCLASSIFIED DISCLOSURE FORM CD Presentation

Numerical Analysis. Professor Donna Calhoun. Fall 2013 Math 465/565. Office : MG241A Office Hours : Wednesday 10:00-12:00 and 1:00-3:00

How To Get A Computer Science Degree At Appalachian State

UMass at TREC 2008 Blog Distillation Task

Elastomer Compatibility Testing of Renewable Diesel Fuels

DATA ITEM DESCRIPTION

Software Vulnerabilities in Java

A Direct Numerical Method for Observability Analysis

FIRST IMPRESSION EXPERIMENT REPORT (FIER)

NAPA/MAESTRO Interface. Reducing the Level of Effort for Ship Structural Design

GAO ELECTRONIC HEALTH RECORDS. DOD and VA Should Remove Barriers and Improve Efforts to Meet Their Common System Needs

Cancellation of Nongroup Health Insurance Policies

TITLE: The Impact Of Prostate Cancer Treatment-Related Symptoms On Low-Income Latino Couples

Part I Courses Syllabus

Similar matrices and Jordan form

DATA ITEM DESCRIPTION

A GPS Digital Phased Array Antenna and Receiver

Large-Scale Reservoir Simulation and Big Data Visualization

Parallel Programming at the Exascale Era: A Case Study on Parallelizing Matrix Assembly For Unstructured Meshes

P164 Tomographic Velocity Model Building Using Iterative Eigendecomposition

THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS

CERT Virtual Flow Collection and Analysis

SALEM COMMUNITY COLLEGE Carneys Point, New Jersey COURSE SYLLABUS COVER SHEET. Action Taken (Please Check One) New Course Initiated

Transcription:

The Boeing Company P.O.Box3707,MC7L-21 Seattle, WA 98124-2207 Final Technical Report February 1999 Document D6-82405 Copyright 1999 The Boeing Company All Rights Reserved Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers Prepared by Roger G. Grimes, Manager, Computational Mathematics Mathematics and Engineering Analysis Mathematics and Computing Technology The Contractor, The Boeing Company, hereby certifies that to the best of its knowledge and belief, the technical data delivered herewith under Contract No. DABT63-95-C-0122 is complete, accurate, and complies with all requirement of the contract. l AsA il Progran Manager (or Designee) Prepared for: Directorate of Contracting ATZS-DKO-I (J. Bourne) PO Box 12748 Ft Huachuca, AZ 85670-2748 Contract: DABT63-95-C-0122 CDRLA001 Date Sponsored by: Defense Advanced Research Projects Agency no (Attn: G. Koob) 3701 N. Fairfax Drive Arlington, VA 22203-1714 PR&C: HJ1500-5212-0677 AAPNo: DAR5C999 Approved for public release; distribution is unlimited. Requests for this document may be referred to Director, DARPA, Attn: TIO. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of DARPA. ^CQUM-m EXPECTED 1

REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704-0188 Public report burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and to the Office of Management and Budget, Paperwork Reduction Project (0704-0188), Washington, DC 20503 1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 3. REPORT TYPE AND DATES COVERED 4. TITLE AND SUBTITLE 01 1999 Final Report, September 1995-January 1999 5. FUNDING NUMBERS Algorithmic Research And Software Development For An Industrial Strength Sparse Matrix Library For Parallel Computers 6. AUTHOR(S) Roger G. Grimes 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) The Boeing Company, P.O. Box 3707, M/C 7L-22, Seattle, WA 98124-2207 9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) DARPA(ITO) (G. Koob), 3701 N. Fairfax Drive, Arlington, VA 22203-1714 ATZS-DKO-I (J. Bourne), P.O. Box 12748, Ft Huachuca, AZ 85670-2748 (C) DABT63-95-C-0122 8. PERFORMING ORGANIZATION REPORT NUMBER D6-82405 10. SPONSORING/MONITORING AGENCY REPORT NUMBER DAR5 C999 CDRL A001 11. SUPPLEMENTARY NOTES 12a. DISTRIBUTION/AVAILABILITY STATEMENT Approved for public release; distribution is unlimited. Requests for this document shall be referred to ATZS-DKO-I (J. Bourne), P.O. Box 12748, Ft Huachuca, AZ 85670-2748 12b. DISTRIBUTION CODE 13. ABSTRACT (Maximum 200 words) This final report describes the status of work performed during the months of Sept 1995 through Jan 1999 on the Algorithmic Research And Software Development For An Industrial Strength Sparse Matrix Library For Parallel Computers. The objective of this effort was to: 1. Research and demonstrate new algorithms for the solution of systems of linear equations in parallel. 2. Research and demonstrate a new parallel approach for the real symmetric sparse generalized eigenvalue problem. 14. SUBJECT TERMS Sparse linear algebra, parallel computing 15. NUMBER OF PAGES 3 16. PRICE CODE 17. SECURITY CLASSIFICATION OF REPORT 18. SECURITY CLASSIFICATION OF THIS PAGE 19. SECURITY CLASSIFICATION OF ABSTRACT 20. LIMITATION OF ABSTRACT Unlimited Page li

FORWARD This Final Report, prepared by The Boeing Company Mathematics and Computing Technology organization, is provided under a Department of Defense contract for research entitled "Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers". The reporting period is from September 1995 to December 1998. The Defense Advanced Research Projects Agency provided funding under Contract No. DABT63-95-C-0122. Drs. Robert Lucas, Federica Darema, and Gary Koob, DARPA/rrO managed the program for DARPA. Page iii

ABSTRACT The objective of the effort was to build a parallel sparse matrix library that is needed by applications migrating to parallel computers. Research on robust algorithms for industrial problems and software development for a robust implementation for industrial users are both needed for a sparse matrix library in order for parallel computers to realize their potential to out-perform previous single-threaded hardware and software platforms. An additional challenge faced during the effort was to develop software that would run efficiently on a variety of parallel computers, including both shared memory and distributed memory computers. The effort included work on the solution of sparse systems of linear equations in parallel and work on computing some eigenvalues for sparse real symmetric generalized eigenvalue problems. These technology areas are key for the migration of industrial applications such as Finite Element Analysis to parallel computers. The effort included new and innovative research on parallel methods for solution of sparse systems of linear equations. A software package, Sparse Object Oriented Linear Equation Solver (SPOOLES), containing all of the developed software for solving real and complex system using both direct and iterative methods for serial, shared memory, and distributed memory computers is now available in the public domain via http ://www.netlib.org/linal g/spooles/spooles.html. A parallel eigensolver for parallel computers was also developed and successfully demonstrated on several test problems across a range of computers (Sun, SGI, and HP). Although the amount of code that was actually parallelized is substantial, it is not sufficient to replace the serial implementation in Fortran. In addition, the parallel eigensolver developed during the effort requires that much of the data be replicated on each processor instead of being fully distributed. This leads to significant limitations on problem size capacity, which in turn limits the applicability of the software. Further code development is therefore required for a sparse real symmetric generalized parallel eigensolver that can be used by application programs such as NASTRAN and ANSYS. KEY WORDS sparse linear algebra parallel computing direct solution iterative solution eigenvalue eigensolver Page iv

TABLE OF CONTENTS PAGE FOREWORD iii ABSTRACT iv KEY WORDS iv TABLE OF CONTENTS v ACRONYMS, ABBREVIATIONS, AND SYMBOLS vi 1.0 SUMMARY 1 2.0 INTRODUCTION 1 3.0 METHODS, ASSUMPTIONS, AND PROCEDURES 2 4.0 RESULTS AND DISCUSSION 2 5.0 RECOMMENDATIONS 3 6.0 REFERENCES 3 Page v

ACRONYMS, ABBREVIATIONS AND SYMBOLS DARPA Defense Advanced Research Programs Agency SPOOLES Sparse Object Oriented Linear Equation Solver Page vi

1.0 SUMMARY The Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers performed algorithmic research on issues associated with the parallel solution of sparse systems of linear equations. We developed a new set of sparse matrix ordering algorithms to both preserve sparsity for the numerical factorization, preserve locality for efficient local computation and to reduce communication and enhance opportunities for parallel computations. We implemented a suite of tools for the direct solution of sparse systems of linear equations supporting both real and complex matrices, both symmetric and unsymmetric systems, and providing pivoting for numerical stability. Based on the direct solution capabilities, we built a drop tolerance incomplete factorization and a suite of iterative methods to complement the direct solution capabilities. We also developed a capability for the parallel solution of large least squares problems. We also performed research work and preliminary software development for a parallel real sparse generalized eigensolver. While leveraging off our existing serial eigensolver was a viable approach, converting the serial eigensolver to the parallel computing environment was significantly more difficult than originally contemplated, and the developed software requires further research and development to improve both its reliability and parallel efficiency. 2.0 INTRODUCTION The Computational Mathematics staff of the Applied Research and Technology Division of Boeing Shared Services Group is responsible for mathematical software libraries used throughout The Boeing Company. This staff is also responsible for the commercially available software, BCSLIB-EXT, which is used extensively inside and outside of Boeing. In particular, it has become the software of choice for solving large sparse linear algebra equations arising in application areas such as finite element analysis. The Algorithmic Research and Software Development for an Industrial Strength Sparse Matrix Library for Parallel Computers contract sought to develop the basic technology for such a library for parallel computers and to make that technology available to those applications attempting to migrate to parallel computers. The lack of this enabling software technology has been a large stumbling block for the wide spread use of parallel computers for large-scale applications. The solution of sparse linear algebra problems in parallel has been an active area of research since the advent of parallel computers. However none of this research has produced a viable suite of software for the parallel solution of linear equations that meets the needs of industrial applications. This suite of software is required to extend the applicability of parallel computers to general use for application areas that have to solve sparse linear algebra problems such as finite element analysis. This contract was an attempt to meet this need. Page 1

3.0 METHODS, ASSUMPTIONS, AND PROCEDURES The research in this contract drew upon Boeing's experience supporting a commercial software library in this application area and knowledge of parallel sparse linear algebra. We identified the key research areas that required breakthroughs before a reliable industrial strength sparse matrix package could be developed. These areas were ordering methods, pivoting for numerical stability, and a robust and reliable direct factorization approach that would work well in serial, shared memory, and distributed memory parallel computing environments. We selected a development environment using the C language, with POSDC threads for the shared memory computing paradigm and MPI for the distributed memory computing paradigm. We discussed interface issues with technical representatives from computing hardware vendors and leading providers of engineering analysis application packages that were target end users of the results of the contract. We enlisted the services of Dr. Joseph W. H. Liu, York University, to assist in the first task, which was the development of a new ordering methodology for the direct solution of sparse systems of linear equations. Our starting point for the parallel eigensolver was the block shift and invert Lanczos software found in BCSLEB-EXT. 4.0 RESULTS AND DISCUSSION During the course of this contract we have developed a robust and reliable software package, SPOOLES, for the solution of sparse systems of linear equations which include the following features: Supports both Real and complex sparse matrices Solves systems that are Symmetric, unsymmetric, Hermitian, or overdetermined Provides pivoting for numerical stability Direct and iterative solution methods Available on serial, shared memory parallel, and distributed memory parallel computers SPOOLES is available in the public domain via NETLJJB, a numerical computing software repository at Oak Ridge National Laboratory. Both source code and documentation for SPOOLES is available on the World Wide Web via www.netlib.org/linal g/spooles/spooles.2.2.html. The sparse eigensolver developed during this research program requires further research and software development before it can be used on a widespread basis as a sparse real symmetric generalized parallel eigensolver. Boeing is pursuing additional funding from alternate sources to complete this work. Page 2

5.0 RECOMMENDATIONS We recommend that any application developer interested in solving sparse systems of linear equations examine the SPOOLES package for their use. 6.0 REFERENCES Ashcraft, C. C, Compressed graphs and the minimum degree algorithm, SIAM Journal of Scientific Computing, v. 16, p. 1404-1411, 1995. Ashcraft, C. C, and Grimes, R. G., SPOOLES: An object oriented sparse matrix library, Proceedings of the 1999 SIAM Conference on Parallel Processing for Scientific Computing, to appear in 1999. Ashcraft, C. C, Grimes, R. G., and Lewis, J. G, Accurate symmetric indefinite linear equation solvers, SIAM Journal of Matrix Analysis, v. 20, pp. 513-561, 1998. Ashcraft, C. C, and Liu, J. W. H., Robust ordering of sparse matrices using multisection, SIAM Journal of Matrix Analysis, v. 19, pp. 816-832, 1997. Ashcraft, C. C, and Liu, J. W. H., Applications of the Dulmage-Mendelsohn decomposition and network flow to graph bisection improvement, SIAM Journal of Matrix Analysis, v. 19, pp. 816-832, 1997. Ashcraft, C. C, and Liu, J. W. H., Using Domain Decompositions to find Graph Bisectors, BIT, v. 37, pp. 506-534, 1997. Page 3

REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704-0188 Public report burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and to the Office of Management and Budget, Paperwork Reduction Project (0704-0188), Washington, DC 20503 1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 3. REPORT TYPE AND DATES COVERED 4. TITLE AND SUBTITLE 01 1999 Final Report, September 1995-January 1999 5. FUNDING NUMBERS Algorithmic Research And Software Development For An Industrial Strength Sparse Matrix Library For Parallel Computers 6. AUTHOR(S) R. Grimes 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) The Boeing Company, SSG Applied Research & Technology P.O. Box 3707, M/C 7L-22, Seattle, WA 98124-2207 (C)DABT63-95-C-0122 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) DARPA(ITO) (G. Koob), ATZS-DKO-I (J. Bourne), P.O. Box 12748, Ft Huachuca, AZ 85670-2748 10. SPONSORING/MONITORING AGENCY REPORT NUMBER 11. SUPPLEMENTARY NOTES Export Control Restrictions Apply 12a. DISTRIBUTION/AVAILABILITY STATEMENT Distribution authorized to U.S. Government Agencies and U.S. DoD contractors only; critical technologies. Other requests for this document shall be referred to ATZS-DKO-I (J. Bourne), P.O. Box 12748, Ft Huachuca, AZ 85670-2748 12b. DISTRIBUTION CODE 13. ABSTRACT (Maximum 200 words) This final report describes the status of work performed during the months of Sept 1995 through Jan 1999 on the Algorithmic Research And Software Development For An Industrial Strength Sparse Matrix Library For Parallel Com The objective of this effort was to: 1. Demonstrate the survivability of the decoupled fuels design in large scale hardware. 2. Demonstrate the manufacturing and structural capabilities of the decoupled design in a complex structure. The structural concept examined is a co-cured composite cellular wing design. Manufacturing demonstrations, hydrodynamic ram ballistic tests, and analysis were conducted. 14. SUBJECT TERMS survivable, composite structures, fighter wing, decoupling, fuel containing structure, hydrodynamic ram, cellular structure, cellular wing, tubular structure, tubular wing 15. NUMBER OF PAGES XX 16. PRICE CODE 17. SECURITY CLASSIFICATION OF REPORT 18. SECURITY CLASSIFICATION OF THIS PAGE 19. SECURITY CLASSIFICATION OF ABSTRACT 20. LIMITATION OF ABSTRACT Unlimited \\NT-BLV-04\CONTRACTS\Extemal Programs\Final ReportsVREPORT DOCUMENTATION PAGE.doc