Risk Management Tools



Similar documents
Controlling Risks Risk Assessment

How can you support RIDM/CRM/RM through the use of Risk Analysis

Continuous Risk Management at NASA

3.0 Risk Assessment and Analysis Techniques and Tools

Managing Design Changes using Safety-Guided Design for a Safety Critical Automotive System

Goddard Procedures and Guidelines

Project Zeus. Risk Management Plan

Failure Analysis Methods What, Why and How. MEEG 466 Special Topics in Design Jim Glancey Spring, 2006

Reliability Analysis A Tool Set for. Aron Brall

Prepared for NASA Office of Safety and Mission Assurance NASA Headquarters Washington, DC 20546

Criteria for Flight Project Critical Milestone Reviews

Quality Risk Management Tools Quality Risk Management Tool Selection When to Select FMEA: QRM Tool Selection Matrix

An Introduction to Fault Tree Analysis (FTA)

University of Ljubljana. Faculty of Computer and Information Science

Risk Assessment and Management. Allen L. Burgenson Manager, Regulatory Affairs Lonza Walkersville Inc.

Fault Tree Analysis (FTA) and Event Tree Analysis (ETA)

RISK ASSESMENT: FAULT TREE ANALYSIS

ASSESSMENT OF THE ISO STANDARD, ROAD VEHICLES FUNCTIONAL SAFETY

Edwin Lindsay Principal Consultant. Compliance Solutions (Life Sciences) Ltd, Tel: + 44 (0) elindsay@blueyonder.co.

University of Paderborn Software Engineering Group II-25. Dr. Holger Giese. University of Paderborn Software Engineering Group. External facilities

FMEA and FTA Analysis

Upon completion of this activity, the participant should be able to:

A Risk Management Standard

(S&MA) Paradigm. Roy W. Malone, Jr., Director Safety & Mission Assurance, MSFC. Fayssal M. Safie, Ph. D. NASA Safety Center Tech.

Space project management

pm4dev, 2007 management for development series The Project Management Processes PROJECT MANAGEMENT FOR DEVELOPMENT ORGANIZATIONS

UPTIME MAGAZINE. june/july15 JUNE/JULY uptimemagazine.com

Risk Management Primer

Guidance note. Risk Assessment. Core concepts. N GN0165 Revision 4 December 2012

With the introduction of GAMP 5, A

Media fills Periodic performance qualification (Re-Validation)

What is a definition of insurance?

FlyntGroup.com. Enterprise Risk Management and Business Impact Analysis: Understanding, Treating and Monitoring Risk

IMPLEMENTING A RELIABILITY CENTERED MAINTENANCE PROGRAM AT NASA'S KENNEDY SPACE CENTER

Mission Assurance Manager (MAM) Life Cycle Risk Management Best Practices David Pinkley Ball Aerospace MA Chief Engineer September 23, 2014

Failure Modes & Effects Analysis

3.4.4 Description of risk management plan Unofficial Translation Only the Thai version of the text is legally binding.

Risk Assessment for Medical Devices. Linda Braddon, Ph.D. Bring your medical device to market faster 1

SOFTWARE DEVELOPMENT STANDARD FOR SPACECRAFT

Risk Assessment Tools for Identifying Hazards and Evaluating Risks Associated with IVD Assays

DRAFT RESEARCH SUPPORT BUILDING AND INFRASTRUCTURE MODERNIZATION RISK MANAGEMENT PLAN. April 2009 SLAC I

AS9100 B to C Revision

Elevator Malfunction Anyone Going Down?

Summary of GAO Cost Estimate Development Best Practices and GAO Cost Estimate Audit Criteria

Overview of how to test a. Business Continuity Plan

Reliability Block Diagram RBD

Motivations. spm adolfo villafiorita - introduction to software project management

The purpose of Capacity and Availability Management (CAM) is to plan and monitor the effective provision of resources to support service requirements.

PROJECT RISK MANAGEMENT

A System-safety process for by-wire automotive systems

RISK ASSESSMENT APPLICATIONS FOR THE MARINE AND OFFSHORE OIL AND GAS INDUSTRIES

FORKLIFT SAFETY BASICS & OPERATOR S DAILY CHECKLIST

MAJOR PROJECTS CONSTRUCTION SAFETY STANDARD HS-09 Revision 0

Risk Review Process Basics

Estimating Software Reliability In the Absence of Data

Measurement Information Model

ISO Introduction

General Insurance Definitions:

CDC UNIFIED PROCESS PRACTICES GUIDE

PROJECT RISK MANAGEMENT

OPERATIONAL RISK MANAGEMENT B STUDENT HANDOUT

4 Testing General and Automated Controls

Appendix V Risk Management Plan Template

Operational Risk Management Program Version 1.0 October 2013

Risk management for external beam radiotherapy Recommendations (draft)

CRISC Glossary. Scope Note: Risk: Can also refer to the verification of the correctness of a piece of data

SCADA Business Continuity and Disaster Recovery. Presented By: William Biehl, P.E (mobile)

National Infrastructure Protection Center

How To Improve Safety In The Nhs

Failure Mode and Effect Analysis. Software Development is Different

Reliability centered maintenance: managing cost and risk

J & Z Automotive Repair Teaneck road, Teaneck, NJ 07666

An introduction to designing reliable cloud services

Root Cause Analysis Concepts and Best Practices for IT Problem Managers

U.S. Dept. of Defense Systems Engineering & Implications for SE Implementation in Other Domains

ENERGISTICS E&P BUSINESS PROCESS REFERENCE MODEL

RISK MANAGEMENT FOR INFRASTRUCTURE

Enterprise Risk Management

Quality Risk Management in Pharmaceutical Industry: A Review

Strategic Risk Management for School Board Trustees

Application of Quality Risk Management to Pharmaceutical Operations. Eldon Henson, Vice President, Quality Operations

Introductions: Dr. Stephen P. Schultz

Monitoring and Evaluation Plan Primer for DRL Grantees

Hazard Identification and Risk Assessment for the Use of Booster Fans in Underground Coal Mines

Vehicle Recovery and Towing

A System-Safety Process For By-Wire Automotive Systems

REGULATORY GUIDE OFFICE OF NUCLEAR REGULATORY RESEARCH. REGULATORY GUIDE (Draft was issued as DG-1226, dated August 2009)

Corrective and Preventive Action Background & Examples Presented by:

Placing a Value on Enterprise Risk Management ADVISORY

IRM CERTIFICATE AND DIPLOMA OUTLINE SYLLABUS

Occupational safety risk management in Australian mining

Project Management Challenges in Software Development

Risk Management. Sharif Project Management Session 10.1

Applying Risk Assessment to Your Audit Plan Break-out Session T3, Tuesday, October 26 2:00-2:50pm

Product Excellence using 6 Sigma (PEUSS)

Bicycle Riding. WHAT ARE the. One Final Note... It is against the law to ride a bicycle under the influence of alcohol and/or drugs. (21200.

Transcription:

Risk Management Tools Langley Research Center May 2, 2000 Michael A. Greenfield Deputy Associate Administrator Office of Safety and Mission Assurance 1

Recent Reviews Focusing on NASA Failures Mars Climate Orbiter Mishap Investigation (MCO) Chair: Art Stephenson/MSFC NASA FBC (Faster, Better, Cheaper) Task (FBC) Chair: Tony Spears Shuttle Independent Assessment (SIA) Chair: Dr. Henry McDonald Mars Program Independent Assessment (MPIA) Chair: Tom Young 2

Recommendations: Risk Management Deficiencies Overall there are about 175 recommendations Most addressed issues applicable throughout the Agency, at all Centers and all Projects! Findings were grouped into 4 large areas: People, Process, Process Execution, Advanced Tools and Techniques Risk and Risk Management issues represent a continuing theme Weakness in Risk Identification and Analysis Poor Risk Mitigation and Tracking Lack of strong Systems Engineering Limited application of Risk Assessment tools NASA has formed an Integrated Action Team (NIAT) to develop suitable plans to correct the deficiencies Briefing to NASA Chief Engineer on June 15. 3

Outline Continuous Risk Management Process NASA Risk Management Requirements Fault Tree Analysis (FTA) Failure Mode And Effect Analysis (FMEA) Probabilistic Risk Assessment (PRA) 4

Continuous Risk Management Process Risk management is a continuous process which: Identifies risk Analyzes risk and its impact, and prioritizes risk Develops and implements risk mitigation or acceptance Tracks risks and risk mitigation implementation plans Assures risk information is communicated to all project/program levels Risk management planning Developed during the program/project formulation phase Included in the program/project plans Executed/maintained during the implementation phase Risk management responsibility Program/project manager has the overall responsibility for the Implementation of risk management, ensuring an integrated, coherent risk management approach throughout the project 5

NASA Risk Management Requirements NPG 7120.5, NASA Program and Project Management Processes and Requirements The program or project manager shall apply risk management principles as a decision-making tool which enables programmatic and technical success Program and project decisions shall be made on the basis of an orderly risk management effort Risk management includes identification, assessment, mitigation, and disposition of risk throughout the PAPAC (Provide Aerospace Products And Capabilities) process NPG 8705.x (draft), Risk Management Procedures and Guidelines Provides additional information for applying risk management as required by NPG 7120.5 6

Risk Management Process Mission Success Starts With Safety Program / Project constraints IDENTIFY Identify risk issues and concerns Statements of risk List of risks Risk data: test data, expert opinion, hazard analysis, FMEA, FTA, PRA, lessons learned, technical analysis ANALYZE Evaluate (impact/severity, probability, time frame), classify, and prioritize risks Risk evaluation Risk classification Risk prioritization Resources PLAN Decide what, if anything, should be done about risks Risk mitigation plans Risk acceptance rationale Risk tracking requirements Program/project data (metrics information) TRACK Monitor risk metrics and verify/validate mitigation actions Risk status reports on: Risks Risk mitigation plans Note: Communication and documentation extend throughout all of the functions. CONTROL Replan mitigations, close risks, invoke contingency plans, or track risks Risk decisions 7

NASA Risk Management Requirements NPG 8715.3, NASA Safety Manual Purpose of risk assessment is to identify and evaluate risks to support decision-making regarding actions to ensure safety and mission assurance Risk assessment analyses should use the simplest methods that adequately characterize the probability and severity of undesired events Qualitative methods that characterize hazards and failure modes and effects should be used first Quantitative methods are to be used when qualitative methods do not provide an adequate understanding of failures, consequences, and events System safety analysis must include early interaction with project engineering, integration, and operations functions to ensure all hazards are identified The hazard assessment process is a principle factor in the understanding and management of technical risk As part of the responsibility for overall risk management, the program/project manager must ensure that system safety analyses, appropriate to the program/project complexity, have been conducted 8

NASA Risk Management Requirements NSTS 22206, instructions for preparation of FMEA and CIL [for Space Shuttle] System and performance requirements are defined Analysis assumptions and groundrules are specified Block diagrams (functional or reliability) are developed Analysis worksheets which include identification of every failure mode are developed (the effects documented address the worst case.) Corrective actions and design improvements are evaluated and recommended Analysis is summarized in report form SSP 30234, instructions for preparation of FMEA and CIL [for Space Station] FMEA process, requirements, rules, reporting requirements are described CIL process, requirements, rules, reporting requirements are described Ground support equipment FMEA and CIL processes, requirements, approvals, and databases are described 9

Risk Management Tools NASA has been traditionally using one Risk Assessment and Risk Management (RA/RM) tool for some time: Failure Modes and Effects Analysis (FMEA) NASA has also selectively used another important RA/RM tool: Fault Tree Analysis (FTA) Additionally, NASA has been broadening its repertoire of RA/RM tools and has begun to systematically use a more comprehensive set of tools collectively called Probabilistic Risk Assessment (PRA) PRA is a systematic, logical, and comprehensive discipline that uses tools like FMEA, FTA, Event Tree Analysis (ETA), Event Sequence Diagrams (ESD), Master Logic Diagrams (MLD), Reliability Block Diagrams (RBD), etc., to quantify risk. 10

Risk Management Tools An Introduction (to whet your appetite) 11

Design/Develop a Car for the Interstate 64 Commute Start with requirements analysis Must be safe Must provide seating for two persons to allow HOV option High price of fuel dictates need for high gas mileage High reliability to protect against being stranded And of course must be stylish Apply a disciplined risk management process to uncover risks and develop risk mitigation strategies across the product lifecycle 12

Fault Tree Analysis Background FTA is a deductive analytical technique of reliability and safety analyses and generally is used for complex dynamic systems FTA provides an objective basis for analysis and justification for changes and additions First developed by Bell Telephone in 1961 then modified by Boeing for wide uses 13

Fault Tree Analysis Concept A model that logically and graphically represents the various combinations of possible events, both faulty and normal, occurring in a system that leads to the top undesired event, e.g., electrical fire in heater box. FTA uses a tree to show the cause-and-effect relationships between a single, undesired event (failure) and the various contributing causes The tree shows the logical branches from a single failure at the top of the tree to the root cause(s) at the bottom of the tree Standard logic symbols connect the branches of the tree. For example, gates permit or inhibit the passage of fault logic up the tree through the events. Fault tree does not necessarily contain all possible failure modes of the components of the system. Fault tree contains only those failure modes whose existence contribute to the existence of the top event. 14

FTA Provides a Top-Down View to Identify Risks and Possible Mitigation Car disabled on the interstate OR Engine stops and will not re-start Insufficient power to wheels Tire out of rolling configuration Car cannot be steered Brake failure Car catches fire AND Tire lacks adequate inflation Spare tire unavailable OR OR Blowout - road damage Blowout due to wear Spare tire is flat Spare tire is missing Cannot install spare tire 15

Failure Mode and Effect Analysis FMEA is an inductive engineering technique used at the component level to define, identify, and eliminate known and/or potential failures, problems, and errors from the system, design, process, and/or service before they reach the customer. (Also see MIL-STD-1629) FMEA is an early warning or preventative technique that is methodical Systematic method of examining all ways which a failure can occur For each failure, an estimate is made of: Effect on total system Occurrence Severity Detection Bottoms-up analysis based on historical or inferential data at component level FMEA will identify corrections required to prevent failures 16

Failure Mode and Effect Analysis Example For want of a nail, the shoe was lost; For want of a shoe, the horse was lost; For want of a horse, the rider was lost; For want of a rider, the battle was lost; For want of a battle, the kingdom was lost! How would you control the loss of a nail? Is there more you can do? 17

FMECA- A Bottoms-Up View to Design, Manufacture, Operations Part Name/ Part Number Potential Failure Modes Causes (failure mechanism) Tire FMECA with Reevaluation of Risks Effects Risk Priority Rating Sev* Freq Det RPN Recommended Corrective Action Improved Rating Sev* Freq Cord Fiber 1. Weak Ply failure 4 3 8 96 Incoming inspection 4 1 8 3 separation precursor material 2. Handling Ply failure 4 3 8 96 Increase process 4 2 2 16 damage controls during mfg 3. Cumulative Ply failure 4 2 8 64 Monitor tire life 4 2 2 16 fatigue Det RPN Ply Delamin- 1. Dirt or Loss of side 7 3 8 168 Toluene wipe down 7 1 1 7 ation grease wall integrity during layup 2. Twisted Loss of side 7 2 6 84 Automatic ply 7 1 1 7 plys wall integrity alignment 3. Poor bond Loss of side 7 2 8 144 Redundant 7 1 1 7 pressure wall integrity tensioning system Carcass Disinte- 1. Poor tire Vehicle loss 9 2 9 162 Planned periodic 9 1 1 9 gration alignment maintenance 2. Tire hits Vehicle loss 9 2 9 162 Driver training 9 1 1 9 curb *Severity ratings 8 to 10 request special effort in design improvement regardless of RPN rating 18

What is PRA? Probabilistic Risk Assessment Mission Success Starts With Safety It is an analysis of the probability (or frequency) of occurrence of a consequence of interest, and the magnitude of that consequence, including assessment and display of uncertainties It is an engineering process, based on comprehensive systems analysis with analytical support, repeated periodically as the design matures and new data become available It is a means to express quantitatively our state of knowledge about the risk of failure It does not guess failure rates, or otherwise create data 19

Probabilistic Risk Assessment What is PRA used for in NASA programs? For strategic decision support; e.g., What is the probability of successfully assembling the multi-billion dollar International Space Station? For systems under development, to guide trade-offs between safety, reliability, cost, performance, and other tradable resources For mature systems, to support decision-making on risk acceptability, and (when risk is considered to be too high) on choices among options for risk reduction; e.g., Space Shuttle upgrades To track risk levels: Throughout the life cycle To measure effectiveness of risk reduction options 20

Probabilistic Risk Assessment Helps Prioritize Risk Scenarios Master Logic Diagrams To identify initiating events Event Trees or Event Sequence Diagram To construct accident scenarios from initiating events to end states Fault Trees To quantify initiating and mitigating events 21

PRA Supports Design Decisions Ordinary tire stays inflated Changing tools are OK Spare tire is OK END STATES (S=success; F=failure) Tire rolls over road hazard 0.05 0.95 0.8 0.2 0.6 0.4 S=0.05 S=0.40 F=0.30 F=0.19 S=52% F=48% Tire rolls over road hazard Run-flat Tire Stays Inflated 0.9 0.1 Changing tools are OK 0.8 Spare tire is OK 0.6 0.4 S=0.90 S=0.05 F=0.03 S=96.5% F=3.5% 0.2 F=0.02 22

Another Risk - A Dead Battery Battery Dead (1 X 10-3 ) OR Lights Left On Batt. End of Life Elec. Short (7 X 10-4 ) (2.8 X 10-4 ) (0.2 X 10-4 ) 23

PRA Can Provide Quantitative Values for Failure and Facilitate Ranking of Risk Drivers Jumper Cables Available Donor Batt. OK Batt. Terminals OK Batt. Dead (1 X 10-3 )* 0.1 0.5 0.8 Can recover (4 X 10-5 ) 0.2 Stuck (1 X 10-5 ) 0.5 Stuck (5 X 10-5 ) 0.9 Stuck (90 X 10-5 ) * Probabilities expressed in terms of a single automobile use. 24

Space Shuttle Program Development Office Block IIA Configuration - Ascent * * Based on 1998 QRAS 25

Space Shuttle Program Development Office Reliability Sensitivities Analysis - Space Shuttle Ascent Shuttle Element SSME Current Reliability ** 1.0x10-03* 1/1000 5.0x10-4 1/560 1/2000 What ifs? Element Risk is Cut in 1/2 Element Reliability is Perfect Element Shuttle Element Shuttle Shuttle 1/779 SRB 7.6x10-04 1/1312 3.8x10-4 1/525 1/2624 1/657 RSRM 1.8x10-04 1/5682 1/438 8.8x10-5 1/455 1/11,364 1/474 ET 1.4x10-04 1/7246 6.5x10-5 1/451 1/4,492 1/466 Orbiter 4.8x10-05 1/20,619 2.4x10-5 1/441 1/447 1/41,238 * Based on SSME Block IIA Configuration ** Based on 1998 QRAS 26

The SMA Community is Ready to Help Risk Management tools and techniques continues to be a major thrust within OSMA We have enhanced our ability to support Quantitative Risk Assessment A recognized expert, Dr. Michael Stamatelatos, has joined our staff to support this effort (202-358-1668) Contact Bert Garrido, LaRC SMA Director, X4-3361 or Dr. Peter Rutledge (202-358-0579) or Michael to help you in tool application 27