RISK MITIGATION THROUGH STRATEGIC MAINTENANCE AND RELIABILITY



Similar documents
Directory. Maintenance Engineering

The Asset Management Landscape

The Lowitja Institute Risk Management Plan

ASSET MANAGEMENT. solutions. smec.com

SKF Asset Management Services. Your trusted resource for life cycle support and sustainability of physical assets

HEALTH, SAFETY, ENVIRONMENT AND COMMUNITY MANAGEMENT STANDARDS. OCTOBER ISSUE No 01. Doc No: HSEC MS 001

The Asset Management Landscape

Reliability Block Diagram RBD

Independent analysis methods for Data Centers

Core Infrastructure Risk Management Plan

Overview of EAM Services. A Fully Integrated Global EAM Service Provider

Customer requirements. Asset management planning Inspection and assessment Route asset planning Annual work plans Contracting strategy

Clinical Risk Management: Agile Development Implementation Guidance

Driving Transformational Change. Innovative Solutions for Optimising Performance in Quarrying & Mining Fleets

Drawing up an Asset Management plan that delivers Operational Reliability

Reliability Modeling Software Defined

ASSET MANAGEMENT. John Woodhouse. The Woodhouse Partnership Ltd

6 Reasons Why Outsourcing Equipment Maintenance Is Your Best Hedge In A Down Economy

RISK MANAGEMENT FOR INFRASTRUCTURE

Risk Management Policy and Framework

A Risk Management Standard

TEC Capital Asset Management Standard January 2011

Risk Management. Policy

The purpose of Capacity and Availability Management (CAM) is to plan and monitor the effective provision of resources to support service requirements.

Over 70,000 RELIABILITY CENTERED MAINTENANCE (RCM2 TM ) TRAINING COURSES

The 39 Subjects and 6 Subject Groups

Information Needed for Effective Asset Management

Master Data Management for Life Sciences Manufacturers

identify hazards, analyze or evaluate the risk associated with that hazard, and determine appropriate ways to eliminate or control the hazard.

IMPLEMENTING A RELIABILITY CENTERED MAINTENANCE PROGRAM AT NASA'S KENNEDY SPACE CENTER

Risk Management Procedure

Title: OHS Risk Management Procedure

Five steps to a successful material handling CMMS implementation

RISK MANAGEMENT POLICY

WATE R O&M S E RV I CE S An asset management perspective

Guidelines for the Application of Asset Management in Railway Infrastructure Organisations

Page 1 of 24. To present the Asset Management Policy 2014 for Council adoption.

Physical Asset Management

Training Pack. Kaizen PDCA ( Plan / Do / Check / Act )

REDUCING THE COST OF PREVENTIVE MAINTENANCE

ESKISP Manage security testing

Asset Management Plan Overview

An Introduction to Continuous Controls Monitoring

Audit & Inspection Management. Enterprise Cloud Audit & Inspection Management Solution

Predictive Asset Maintenance

Consultancy Services Proposal

Efficient Operations for Maintenance

Master Data Management For Life Sciences Manufacturers

2016 Asset Management & Maintenance Priorities Survey

Asset Management Strategy

Title: Rio Tinto management system

Risk Management Policy

GFMAM Competency Specification for an ISO Asset Management System Auditor/Assessor First Edition, Version 2

Mining & Construction Transforming your fleet performance

Rail Asset Management. Rail

Master Data Management for Life Science Manufacturers

Analyzing Risks in Healthcare. February 12, 2014

Quality Manual ISO 9001:2015 Quality Management System

Capacity Plan. Template. Version X.x October 11, 2012

An Introduction to Risk Management. For Event Holders in Western Australia. May 2014

Module 1 Study Guide

13 ENVIRONMENTAL AND SOCIAL MANAGEMENT SYSTEM

Enterprise Asset. Public Sector

IT Professional Standards. Information Security Discipline. Sub-discipline 605 Information Security Testing and Information Assurance Methodologies

Incident Reporting & Management

Optimising a LNG supply chain and its assets through performance prediction studies

Maintenance Strategy 2015 Owner: Kevin Bullimore Head of Infrastructure Next review 2020

RCM Success Starts with CMMS

Public Training Programs. Master classes in Maintenance Practical Courses in Maintenance Management

Stakeholder management and. communication PROJECT ADVISORY. Leadership Series 3

ASSET MANAGEMENT STRATEGY

Changing data needs from a life cycle perspective in the context of ISO 55000

A Perspective on Draft Embedded Digital RIS. Ron Jarrett Tennessee Valley Authority Embedded Digital Devices Workshop October 9, 2014 NRC

Physical Asset Management: What is it all about and why?

Risk Management: Coordinated activities to direct and control an organisation with regard to risk.

Compliance Management Framework. Managing Compliance at the University

SOUTHERN RURAL WATER POLICY RISK MANAGEMENT POLICY

ITIL Introducing service operation

2012 Americas School of Mines

of The New England Water Works Association

Quality Risk Management Understanding and Control the Risk in Pharmaceutical Manufacturing Industry

Risk & Hazard Management

Increase Equipment Uptime Through Robust Enterprise Asset Management. For the Upstream, Midstream & Downstream Sectors of the Oil & Gas Industry

Hazard Identification, Risk Assessment and Management Procedure. Documentation Control

ROAD NETWORK ASSET MANAGEMENT AS A BUSINESS PROCESS. Paper prepared for the REAAA Conference in Japan, 4 9 September 2000

ASSET MANAGEMENT STRATEGY

NORWEGIAN PROJECT MANAGEMENT PRINCIPLES APPLIED IN THE JURONG ROCK CAVERN PROJECT

COST EFFECTIVE APPROACHES FOR PRESCRIBED DAM ASSET MANAGEMENT

ENERGY EFFICIENCY IN BUSINESS PLANNING ENERGY EFFICIENCY FACT SHEET

Health and Safety Management Standards

Transcription:

RISK MITIGATION THROUGH STRATEGIC MAINTENANCE AND RELIABILITY Manufacturing plants today contain numerous equipment items that require some form of maintenance to help deliver company output and profit. This paper presents a practical case study detailing various methods that can be applied to achieve a structured approach to maintenance and reliability.

PAGE 2 INTRODUCTION Manufacturing plants today contain numerous equipment items that require some form of maintenance to help deliver company output and profit. This paper presents a practical case study from the initial Reliability Centered Maintenance concept using a computerised simulation approach, followed by the detailed Root Cause Analysis on some high consequence failure modes. The company had recognised through their internal data management processes that a particular critical equipment item was causing unexpected outages resulting in a loss of business profit. It was therefore decided by the business to embark on a continuous improvement path that would enable them to fully understand the equipment and how to prevent the unexpected losses from reoccurring. The approach taken for the critical equipment item was to initially perform a Reliability Centered Maintenance (RCM) study, in which all the likely and the dominant failure modes are identified. A Failure Mode Effect and Criticality Analysis (FMECA) model was then populated with the consequences of failure using the corporate risk matrix and financial cost of failure. The outputs of the study now gave the company the ability to optimise the choice of maintenance tasks from both a cost and safety perspective. Once the optimum maintenance plan was developed, lifetime simulations using a Monte Carlo simulator were performed to provide a forward maintenance budget prediction and also to assess the effectiveness of the plan in reducing the impact of any critical failures. Where maintenance was not able to reduce the failure effects to acceptable levels, they were subjected to detailed root cause analysis in order for effective solutions to be identified. In conclusion, this paper illustrates how a manufacturing plant developed a fully documented maintenance plan with associated work instructions for implementation through the site maintenance management system. The RCM study addressed a critical item of equipment within the manufacturing facility to develop a plan to ensure safe, reliable and economical use of the plant item throughout its life. 1.1 RELIABILITY CENTERED MAINTENANCE Reliability Centered Maintenance (RCM) is a systematic approach that focuses on preserving function rather than preserving the asset. It addresses only failures that matter using a logical process for making maintenance decisions. During this process, the equipment is broken down to the lowest maintainable item. The failure causes for each maintainable item are then identified and analysed. If reliable failure/ maintenance history is available, then this can be used to establish the failure rate or expected life of each failure cause. However, more often than not this information is not readily available or its accuracy is not very reliable. In such cases this information can be estimated based on technical, operational and maintenance experience on similar machines, as well as general engineering knowledge. The maintenance strategy for each failure cause identified must also be derived. These strategies may be predictive maintenance, preventive maintenance or a run-to-fail strategy. As part of the RCM process, the failure effect or failure consequence is assigned to each failure cause. It is important to identify the failure causes that have a potential impact on the availability of the equipment so that the equipment can be correctly modeled when the data is transferred into a Reliability Block Diagram.

PAGE 3 1.2 RELIABILITY BLOCK DIAGRAM Reliability Block Diagrams (RBD s) are a tool used to carry out a system availability analysis and produce performance predictions. They are made up of series and parallel relationships that represent equipment redundancy levels. The RBD can predict downtime, number of interruptions and the Mean Time Between Failures (MTBF). The information collated during the RCM process can be utilised to create the Reliability Block Diagram (RBD) model. 1.3 ROOT CAUSE ANALYSIS Root Cause Analysis is a problem solving methodology that is aimed at identifying the root causes of problems or events. If the root causes are identified and effective solutions implemented then it would be fair to say that we should not see the problem reoccur again in the future. 2. CASE STUDY ZINC STRIPPING MACHINE The equipment identified for further improvement by the business was a zinc stripping machine. The stripping machine is not something available off the shelf ; it was designed specifically for the business and is unique in its operation. The stripping machine main function is to strip two 20 kilogram sheets of zinc every 4 seconds with a daily target of 450 tonnes a day. 2.1 THE ANALYSIS The RCM study began by collating as much information as was available, including equipment lists and current maintenance plans. Work order history, and performance data were analysed using Weibull Analysis to determine what failure patterns were occurring across the stripping machine. This information was then used to develop a Desktop Reliability Centered Maintenance (RCM) model using the simulation software package Availability Workbench (AWB) 1. The asset hierarchy was built using AWB based on the current structure within the business SAP Computerised Maintenance Management System (CMMS). This alignment with SAP was required for future use of the AWB SAP portal and the uploading and downloading of maintenance plans. With the asset hierarchy developed the next stage of the process was to define equipment functions, functional failures and failure modes. This was predominately completed using available information and equipment experts located within the business. The first model developed was the current maintenance strategy which used the plans that were being currently performed on the stripping machine. Once the current maintenance plans were implemented the next step was to assign the consequences of failure using the corporate risk matrix. The severity levels within the table were calibrated to the companies risk matrix. See Table 1. With the current practice AWB model complete the business was now in a position to simulate using the Monte Carlo Simulator built within AWB to determine the impact of the maintenance tasks on the lifecycle costs and risk levels. The simulation results provided rapid analysis, and allowed for further optimisation of maintenance plans to take place. Where maintenance was not effective in either preventing or predicting a failure mode then Root Cause Analysis was performed to determine what effective solutions could be implemented to prevent reoccurrence. (Ref 1 - Availability Workbench (AWB) is a Trade Mark owned by Isograph Limited UK)

Table 1 Consequence Table PAGE 4 ID Description Cost Per Hour Cost Per Occurrence Safety Severity Operational Severity S&H1 Report Only No treatment 0 0 24 0 S&H2 First Aid Treatment 0 0 730 0 S&H3 Medical treatment or lost time injury 0 0 8760 0 S&H4 Extensive or Permanent Part Disability 0 0 87600 0 S&H5 Fatality (s) or permanent serious disability 0 0 876000 0 ENV1 No impact or measurable impairment 0 2500 24 0 ENV2 Minor effects on biological or physical environment 0 25000 730 0 ENV3 Measurable impairment on biological or physical environment 0 250000 8760 0 ENV4 Serious environmental effects with impairment of eco system 0 2500000 87600 0 ENV5 Very serious environmental effects 0 5000000 876000 0 LEG1 Unlikely to result in adverse regulatory response 0 0 0 24 LEG2 Minor non compliances of breaches or regulations 0 0 0 730 LEG3 Serious breach of regulation 0 0 0 8760 LEG4 Major breach of regulation 0 0 0 87600 LEG5 May be considered wilful or negligent. 0 0 0 876000 QS1 Can easily be absorbed through normal activity 0 0 0 24 QS2 Consequences can be absorbed. Minor delivery delays 0 0 0 730 QS3 QS4 QS5 Significant event which can be managed under special circumstances Major event with prioritised and focused management, Customers lost Extreme event with failure to lead to failure of business. Key customers lost 0 0 0 8760 0 0 0 87600 0 0 0 876000 STK1 Little or no stakeholder interest 0 0 0 24 STK2 Minor adverse local media attention 0 0 0 730 STK3 STK4 STK5 Attention from local media or heightened concern from local community Significant adverse national media/public attention. Share price may be affected Serious public outcry. International coverage. Share price seriously affected 0 0 0 8760 0 0 0 87600 0 0 0 876000 DT1 Downtime Cost Smelter 19000 0 0 0 2.2 THE RESULTS With the AWB desktop model now complete, we needed to perform a reality check and challenge the model results. This is carried out by first examining the Cost Criticality Pareto chart. The Cost Criticality Pareto ranks the failure modes based on total cost which includes any business costs associated with failure, labour cost, spares cost and equipment cost. See Figure 1. Figure 1 Cost Criticality Pareto Chart Failure Mode Prill Build Up Bench Chain Worn Busbars Dirty Prox Misaligned Broken Switches Brushes Worn Hangers Damaged Air Gap Damaged Flag damage Light Curtain Trip $0 2m 4m 6m 8m 10m 12m Total Cost Contribution

PAGE 5 Using the Cost Criticality Pareto Chart, the high cost failure modes were examined in detail by carrying out a validation of the failure mode information. Once validation was complete, the Cost Criticality Pareto identified areas for improvement. The type of recommendations made for improvement included root cause analysis (RCA), equipment redesign and maintenance strategy optimisation. The second method used to challenge the model results was to examine the historical maintenance costs and compare to the budget prediction using AWB. The goal of this was not primarily to ensure the budget prediction matched the historical spend but rather to clearly understand the reasons for any discrepancies in the results. In this case study, the predicted results were within 4% of actual maintenance spending. See Figure 2. Figure 2 Actual Cost v Predicted Cost If any failure modes exceeded this target then a safety risk exposure existed. With the stripping machine, four failure modes exceeded the target and showed safety risk exposure. These failure modes were associated to emergency stop switches and the testing regime for those switches. The comparison between current maintenance and optimised maintenance can be seen with a reduction in safety risk to below the target of 1 following maintenance optimisation. See Figure 3 and Figure 4. Figure 3 Safety Criticality Current Practice Failure Mode Edge Growth Not Removed Emergency Stop Panel Failure Emergency Stop Load Failure Cost $700k $600k $500k $400k $300k Emergency Stop Unload Failure Emergency Stop Switch Failure 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 Safety Criticality - Target = 1 $200k $100k 0 SAP Actual Year 1 SAP Actual Year 2 RCMCost Current RCMCost Optimised Spares $361,520 $404,287 $354,028 $353,121 Resources $214,507 $236,551 $241,344 $238,344 Now failure modes had been addressed from a cost perspective, the next step was to determine if the safety risk associated to the stripping machine was less than the acceptable business exposure threshold. This threshold was calibrated with the business risk matrix to align with a target of 1. Figure 4 Safety Criticality Optimised Failure Mode Edge Growth Not Removed Emergency Stop Panel Failure Emergency Stop Load Failure Emergency Stop Unload Failure Emergency Stop Switch Failure 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 Safety Citicality - Target = 1

PAGE 6 Figure 5 Average Annual Cost Strategy Comparisons - Stripping Machine Cost $10m $8m $6m $4m $2m 0 Run to Fail Current Optimised Opt Redesign Effects $22,056,284 $3,289,972 $2,334,326 $759,258 Resources $45,107 $241,344 $238,661 $238,344 Spares $320,654 $354,028 $353,121 $353,121 Figure 6 Average Annual Budget Predictions - Stripping Machine $700k $600k $500k $400k $300k Spares Labour With the model details validated, it was then important to recognise the comparison between the alternative strategies on offer to the business. In this case, run to failure, current practice, optimised and optimised with redesign strategies were compared. See Figure 5. Once optimisation was finalised a maintenance budget prediction was developed to determine spares and resource requirements. Using AWB, the budget prediction could be interrogated at many levels including system, subsystem, individual asset and individual failure mode level. See Figure 6. We were now in a position to determine what impact the optimised maintenance strategy was having on the stripping machine overall availability. This would indicate single points of failure and bottlenecks within the machine and identify further opportunity for improvement. See Figure 7 $200k $100k 0 1 2 3 4 5 6 7 8 9 10 Figure 7 Predicted Availability Performances - Stripping Machine Availabilty % 99.0 97.0 95.0 93.0 91.0 89.0 Predicted Availability: 91.6% Total Downtime: 8.4% Outage due to Planned Maintenance: 4.8% Outage due to Unplanned Maintenance: 3.6% 87.0 85.0 1 2 3 4 5 6 7 8 9 10 Year

PAGE 7 2.3 THE IMPROVEMENT WORKSHOPS With a new maintenance strategy developed for the stripping machine, it was important to recognise that two failure modes that were occurring on the machine could not be prevented or predicted with any form of regular maintenance. See Figure 1 Cost Criticality Pareto. It was therefore decided to conduct a detailed RCA (Root Cause Analysis) workshop on these two failures. Table 2 Failure Modes for RCA EQUIPMENT Monorail DESCRIPTION Light curtains trip COST PER YEAR FREQUENCY $1,107,070 Once/shift for 5 min Side Shifter Flag damage $493,765 Once/week for 30 min The approach was taken to create a common reality and build a cause and effect chart that would address the conditions and the actions surrounding both problems using the Apollo Root Cause Analysis Method 2. The goal of the Apollo Root Cause Analysis Method is to identify not only the cause and effects but identify the effective solutions that will prevent reoccurrence of the problem. The methodology is based on the four elements of cause and effect principle: Cause and effect are the same thing Causes and effects are part of an infinite continuum Every effect has at least two causes in the form of actions and conditions An effect exists only if its causes exist at the same point in time and space. The light curtain trip identified 63 cause and effects and 12 solutions. The flag damage identified 22 cause and effects and 3 solutions. With potential solutions identified a cost benefit analysis was carried out to ensure payback on the cost of solutions. Table 3 Solution Cost Benefit PROBLEM Light Curtain Trip COST/YEAR 2.4 FMECA SUMMARY The Stripping machine FMECA analysed a total of 1798 failure modes (see Table 4). The new optimised maintenance plan produced 51 Maintenance Schedules with a total of 1466 maintenance tasks. The maintenance strategy was categorised into 10% Preventive Work and 90% Inspection Work. Table 4 FMECA Summary OPTIMIZED FMECA SUMMARY Action 2.5 CONTINUOUS IMPROVEMENT COST OF SOLUTION Number of Tasks Use As Is 929 Deleted 0 Extend Interval 54 New Tasks 471 Reduce Interval 0 Run to Failure 344 PAYBACK DAYS $1,107,070 $34,800 11.47 Flag Damage $493,765 $2,880 2.17 With the new maintenance strategies uploaded into the CMMS via the AWB SAP portal, it was recognised that a periodic review would be performed to validate the strategy and update according to new information and data becoming available. This will form part of the business continuous improvement process. (Ref 1 - Availability Workbench (AWB) is a Trade Mark owned by Isograph Limited UK)

3. CONCLUSION PAGE 8 The paper has presented a Reliability Centered Maintenance approach to evaluating current maintenance regimes. Once complete, the models can be used to predict budgets, maintenance labour and resource requirements and spares usage. The approach also incorporated Root Cause Analysis on high cost failure modes that could not be prevented through the maintenance regime. The paper shows through the breadth of results available, the benefit in applying an RCM and RCA approach to plant strategy optimisation. The outcomes can help ensure that the throughput goals are achieved at least cost and minimum risk exposure to the business. About ARMS Reliability Since 1995, ARMS Reliability has been at the forefront of proactive asset management strategies for a range of blue chip companies throughout the world. These companies have entrusted us with delivering business goals through effective asset management and improvements in operational productivity. ARMS Reliability is a service, software, and training organisation providing a one stop shop for Reliability Engineering, RAMS, and Maintenance Optimisation for both new and existing projects. If you would like to discuss your situation and what are the essential elements to achieving best practice Asset Management, then give ARMS Reliability a call or Apollo Root Cause Analysis ARMS Reliability are global partners and providers of the Apollo Root Cause Analysis methodology. Please visit our website apollorootcause.com to Learn more or to view our Enquire now worldwide training schedule Share this! info@armsreliability.com NORTH / CENTRAL / SOUTH AMERICA P: +1 512 795 5291 AUSTRALIA / ASIA / NEW ZEALAND P: +61 3 5255 5357 EUROPE P: +44 122 445 9362 SOUTH AFRICA P: +27 10 500 8232 North America Latin America Europe Asia Africa Australia