Predictive Maintenance in a Mission Critical Environment



Similar documents
WHAT IS INFRARED (IR) THERMOGRAPHY

Predictive Maintenance

Online Hydro Machinery Monitoring Protection, Prediction and Performance Monitoring Solutions

Statement Of Work. Data Center Thermography Assessment. Service. Professional Services. Table of Contents. 1.0 Executive Summary

Equipment Performance Monitoring

Use your troubleshooting superpowers for good Application Note

IMPLEMENTING A RELIABILITY CENTERED MAINTENANCE PROGRAM AT NASA'S KENNEDY SPACE CENTER

Asset management systems increase reliability and efficiency

TURBINE FAN COMPRESSOR GEARBOX BOILER GENERATOR PUMP MOTOR

Emergency Power System Services Industrial UPS, Batteries, Chargers, Inverters and Static Switches

ARC VIEW. Emerson Asset Optimization Business Breaks New Ground. Summary. A Business of Vital Strategic Importance.

uptimeplus.co.uk predictive maintenance strategy galileo Supported by

Enterprise Remote Monitoring

A Practical Guide to Maximizing Machine Uptime

in-service inspections

Enhance Power Equipment Reliability with Predictive Maintenance Technologies

Interactive Computer Based Courses

PdM Overview. Predictive Maintenance Services

13 common causes of motor failure

SWITCHING STRATEGIES: MOVING TO CONDITION- BASED MAINTENANCE. By David Stevens, Technical Manager & Trainer, AVT Reliability

Asset Management Acceptance Testing Asset Control Parts and Equipment Sales

Optimization Services

Pump Vibration Analysis

Intelligent Vibration Monitoring

BLUESTAR INFRARED. Description: Thermal Properties Comparison. Project Location: Client: 759 South Federal Hwy 225 Bayshore Drive

Liebert GXT MT+ CX UPS, 1000VA VA Intelligent, Reliable UPS Protection. AC Power for Business-Critical Continuity

employed to ensure the continuing reliability of critical systems.

Operations & Maintenance 201 Using online monitoring and predictive diagnostics to reduce maintenance costs

Ultrasound Condition Monitoring

Appendix B Audit Forms for PdM Program Assessment

High Availability White Paper

Proficy Monitoring & Analysis. Software to harness the industrial internet

Response, Reliability and Results. for Managed Property

How To Run A Data Center Efficiently

Reducing operations & maintenance costs with PlantWeb digital plant architecture

Maximizing return on plant assets

WATCHMAN Reliability Services Enhance OEM Service Center Revenue

Intel RAID Controllers

Thermal Storage System Provides Emergency Data Center Cooling

Statement Of Work Professional Services

Five Nines and Infrared (IR) Testing at Data Centers

Appendix 4. Service Level Agreement (SLA) Data Center Services

Increase System Efficiency with Condition Monitoring. Embedded Control and Monitoring Summit National Instruments

Equipment Breakdown. The extended property coverage you need for the equipment you rely on.

CSI Oil Analysis Options

Oil and Coolant Circulating Heating System. Model - OCSM

, User s Manual

Preventative Maintenance & Statutory Testing

AMS Suite: Machinery Health Manager v5.4

INSPECTION AND TESTING OF EMERGENCY GENERATORS

DEXTER. Ship Health Monitoring Software. Efficiency & Reliability. Smart Solutions for

Considerations for a Highly Available Intelligent Rack Power Distribution Unit. A White Paper on Availability

Modern Power Distribution Systems

Online Condition Monitoring Reaps Benefits for Borealis Porvoo

Preventive and Predictive Maintenance

On-line PD Monitoring Makes Good Business Sense

SINGLE PHASE MOTORS. INSTALLATION AND MAINTENANCE MANUAL March 21, 2006

I-care headquarters. Rue R. Descartes 7000 Mons - Belgium T. +32 (0) BE@icareweb.com

COMMON EQUIPMENT FAILURES AND CAUSES. How breakdowns can impact the bottom line. A handy reference guide from Hartford Steam Boiler.

Best Practices, Benefits and Economics of Load Testing IEEE PES Fort Worth, TX Chapter July 16, 2013

Response, Reliability and Results in the delivery of leak detection services

Introduction. Background

Industry Solution. Predictive Asset Analytics at Power Utilities

Tune Up Your Condition Monitoring Program by Answering the Five W s

Failure to comply with the following cautions and warnings could cause equipment damage and personal injury.

DC MOTOR ANALYSIS & TROUBLESHOOTING

Program: Management Information Systems. David Pfafman 01/11/2006

Failure to comply with the following cautions and warnings could cause equipment damage and personal injury.

Federal Wage System Job Grading Standards for Air Conditioning Equipment Operating, Table of Contents

Information Needed for Effective Asset Management

MCM; An Inexpensive, Simple to Use Model Based Condition Monitoring Technology

GLYCOL CHILLER SYSTEM TROUBLESHOOTING 101

Proactive Asset Management with IIoT and Analytics

DATA CENTER ASSESSMENT SERVICES

Condition Monitoring Solutions

(3) Explosion proof. Designed to prevent ignition of any explosive gases or dust and dirt which may surround the motor.

Enhance Production in 6 Steps Using Preventive Maintenance

Perhaps the advent of Big Data possibly accounts for the reluctance of some individuals to believe that

WHITE PAPER. Automated IT Asset Management Maximize Organizational Value Using Numara Track-It! p: f:

Features. Emerson Solutions for Abnormal Situations

ProSmart. Predictive Condition Monitoring

ABC PLAZA. For. ABC PROPERTY MANAGEMENT (Client)

Maximize Your Investment! Machinery Health Training and Certification

Plan Number A Cross Beam Dr. Charlotte, NC Simplified Service Plan. Air Conditioning.

White Paper. Business Service Management Solution

Uptime for Wind Turbines. Dedicated Remote Condition Monitoring System for the Wind Power Industry

Acuren s Vision: To be a Trusted Partner by Providing Programs that clearly reduce our client s cost to operate.

Essential NCPI Management Requirements for Next Generation Data Centers

Failure to comply with the following cautions and warnings could cause equipment damage and personal injury.

Back-UPS Pro 1300/1500 Installation and Operation

Coal Handling Plant Alignment Standards For Shaft To Shaft Alignment. By Makarand Joshi M.Tech

can be defined as the ability

CDS TROUBLESHOOTING SECTION I. VACUUM Weak vacuum at wand. Gauge reads normal (10hg to 14hg)

Mobile field balancing reduces vibrations in energy and power plants. Published in VGB PowerTech 07/2012

Applications for Infrared Thermography at Computer Centers

Machinery condition monitoring software

Master Protection Extended Powertrain Warranty Program. Master Maintenance And Full Maintenance Programs

How to Analyze and Communicate Preventive Maintenance Address Now or Pay Later

Pump Maintenance - Repair

Oracle Maps Cloud Service Enterprise Hosting and Delivery Policies Effective Date: October 1, 2015 Version 1.0

Transcription:

March 2009 Predictive Maintenance in a Mission Critical Environment This paper discusses the need to proactively protect mechanical and electrical support systems in data processing centers and data storage facilities against unexpected failure that could cause an interruption in essential operations. By implementing predictive maintenance programs based on the use of advanced information gathering technologies, these facilities can maintain critical equipment at high performance levels and prevent failures that could cause unplanned downtime with catastrophic results. The contents of this publication are presented for informational purposes only, and while effort has been made to ensure their accuracy, they are not to be construed as warranties or guarantees, express or implied, regarding the products or services described herein or their use or applicability. All sales are governed by our terms and conditions, which are available on request. We reserve the right to modify or improve the designs or specifications of our products at any time without notice. AMS and Machinery Health are marks of Emerson Process Management. All other marks are the property of their respective owners.

March 2009 Page 1 Protecting mechanical and electrical support systems against unexpected failure Preventive maintenance is a misnomer! Periodic lubrication of mechanical equipment and other checks may not actually prevent a breakdown, because serious internal faults and electrical overloads can go unnoticed for a long time. A more effective maintenance program is necessary for support equipment that is critical to the uninterrupted operation of a data processing center or data storage facility. Motors, pumps, HVAC units, chillers, electrical control cabinets, and power distribution panels seldom fail without providing some kind of warning! More often than not, the signs of impending failure, such as elevated vibration levels or high internal temperatures, go unnoticed during routine tasks. Yet, the results can be catastrophic. Information about the health of rotating machinery and electrical equipment is often concealed in physical signals, which can be recognized using well proven techniques for vibration monitoring and analysis as well as infrared thermography. If properly interpreted, these hidden signs can pinpoint the nature, location, and even the severity of developing problems, so steps can be taken in time to avoid losses of data and money. While every data center backs up current information and maintains redundant systems to prevent the loss of irreplaceable data, the possibility of a faulty piece of support equipment shutting down your round-the-clock operation may never have been considered. Yet, the timely identification of a potentially bad bearing in a motor, pump, fan, or other mechanical equipment could avoid a catastrophic failure and save your operation thousands of dollars. Predictive Maintenance Sometimes called reliability centered maintenance, predictive maintenance goes considerably beyond the generally accepted rules for preventive maintenance, performing service only when necessary as dictated by the condition of the equipment. If a failure is imminent, that unit may need to be

March 2009 Page 2 repaired or replaced immediately. If performance is not significantly degraded, it may be possible to delay repairs until a backup can be arranged to avoid lost time. Machinery health management carries this approach a step further by prioritizing each machine according to its significance to the operation. Then, greater care is given to the most important machines to be sure they keep running. For example, vibration monitoring is much more frequent, and in some cases continuous online monitoring may be installed, on essential equipment that must continue to operate reliably. On support equipment deemed to be less important, collection of vibration data can be done less often. Information generated by the monitors is collected and used by analysts and maintenance managers to predict when a problem is likely to occur. With that insight, maintenance can be scheduled when needed before the problem becomes severe enough to adversely affect the performance of the asset. Predictive maintenance has been proved in the process industries as a great way to improve maintenance effectiveness while reducing costs for repairs and unexpected downtime. Reactive and preventive strategies are generally reserved for equipment that s not process-critical and will cause little or no collateral damage if allowed to run to failure. As shown in Figure 1, implementing a predictive maintenance program based on monitoring the health of machines similar to those found in data centers and storage facilities dramatically reduced the risk of downtime by making timely repairs. In this graph, Priority 1 calls for immediate repair action, Priority 2 requires repair at the earliest opportunity, and Priority 3 allows maintenance to be scheduled when convenient. Reactive and preventive strategies (Priorities 4 and 5) are generally reserved for equipment that s not process-critical and will cause little or no collateral damage if allowed to run to failure. In all cases, identifying potential problems early resulted in better maintenance at a lower overall cost.

March 2009 Page 3 Figure 1 Measuring Problems Found with Vibration Analysis The Reliability Challenge In order to handle the core business and operational data of an organization in the information technology age, data centers must reliably process the transactions and store important electronic information that allow a modern business and government to operate efficiently. Critical mechanical and electrical support assets in server rooms, data centers, and their UPS systems include: HVAC units for temperature and humidity control Chillers for cooling or dehumidification Cooling Towers to reject heat from chillers or HVAC units Pumps that support the cooling equipment Engine-Generator sets as source of backup power Electrical control cabinets, motor control centers and switch gear Uninterruptible power supplies (UPS), automatic transfer switches Power distribution panels

March 2009 Page 4 Preventing Equipment Failures Vibration Analysis is a non-intrusive technology by which users attach a spectrum analyzer to machinery and record waveform signatures. Analysis of this data makes it possible to diagnose: Mechanical wear in bearings, belts, couplings, gears, and support structures Imbalances and misalignments Other defects such as lubrication failure, bent shafts, broken motor rotor bars, resonances, and many more. Emerson Process Management was contracted by a major financial institution to perform vibration analysis on critical rotating machinery at their data center and trading floor. Maintenance technicians in the facility were trained to collect vibration data using a portable CSI 2130 Machinery Health Analyzer to record waveform measurements along a predetermined route of selected machinery. The data is dumped periodically via a secured Internet connection or e-mailed to an Emerson machinery health analyst who performs vibration analysis using the AMS Suite: Machinery Health Manager software. A monthly machinery condition report to the facility manager includes recommendations for repairs. The trend analysis chart (Figure 2) illustrates periodic measurements recorded on an HVAC unit at this facility over a five-month period. In March 2008, the initial data collection and analysis showed that the motor had a severe bearing defect, which was reported to the facility manager as critical, along with a recommendation to replace the motor at the earliest opportunity.

March 2009 Page 5 Figure 2 Vibration Analysis Reveals Failing Bearing That unit continued to deteriorate until the repair actually took place during a regularly scheduled outage in June 2008. A catastrophic failure might well have occurred if the warning signs had not been recognized in time. As a result, unexpected downtime in this mission critical operation was averted. Infrared Thermography is an effective means of identifying potential problems before an incident or failure occurs. Applying this technology in a data center environment can be extremely beneficial when trying to maintain system availability 24 hours a day, 7 days a week. This non-intrusive technology can identify abnormal thermal rises in electrical, mechanical, or structural components. A successful infrared thermography program will identify: Loose and or poor connections Unbalanced electrical loads Defective components Irregular surface temperatures The benefit of performing infrared thermography is its ability to identify problems before they impact the facility. A tripped circuit breaker may protect equipment, but the subsequent downtime could have disastrous consequences within a data center.

March 2009 Page 6 A Meta Group study identified the hourly impact on different organizations to system outages. (Figure 3) It s clear that any downtime in the data center of a retail brokerage comes with a very expensive price tag. With this amount of money at stake, infrared thermography should be a key component in the maintenance routine of any data center, especially those serving retail brokerage firms. Figure 3 Financial Impact of Data Center Downtime Type of Business Average Hourly Impact Retail brokerage $6,450,000 Credit card sales authorization $2,600,000 Home shopping channel $113,750 Catalog sales centers $90,000 Airline reservations centers $89,500 Cellular service activation $41,000 Package shipping services $28,250 Online network connect fees $25,250 ATM service fees $14,500 Source Meta Practice, 25 February 2004 Emerson Network Power was contracted by a major financial institution to perform an infrared thermographic survey on all electrical connections within their facility. The survey included electrical and mechanical equipment from the building s utility entrance through the branch circuit breakers within the data center and trading floor. Qualified maintenance technicians performed the survey using an infrared camera in accordance with the procedure set forth in the Standard for Maintenance Testing Specifications for Electrical Power Distribution Equipment and Systems by the International Electrical Testing Association. Scanning the electrical distribution panels that feed the IT equipment on the data center floor revealed a problem (Figure 4) where the camera indicated a temperature 140.2 F. Furthermore, current readings of the individual conductors revealed that Phase A of the circuit was carrying 120Amps, or 96 percent of the rated line capacity of 125Amps. The National Electrical Code 102-22(c) states a continuous load shall not exceed 80 percent of the branch circuit rating.

March 2009 Page 7 Figure 4 Infrared Camera Readings on a Data Center Floor The Emerson technician recommended the loads in the panel be distributed immediately to other circuits. If the load should exceed 125Amps, the circuit breaker will likely overheat and trip open, causing the panel fed from this device to lose power, as well as all the IT equipment fed from that panel. Depending on circuits fed from that panel, part or all of the data center s electronic equipment could be shut down due to that overheating situation. That disaster was averted by the timely infrared scan. Conclusion With no room for downtime, data centers need a strategy to ensure that supporting assets are reliable. Deteriorating machinery may cause severe problems, which can be avoided by utilizing advanced predictive technologies in a mission critical environment. A best-practices facility uses predictive maintenance for essential equipment where condition-monitoring is practical, limiting reactive and preventive strategies to equipment that s not mission-critical and will cause little or no collateral damage if allowed to run to failure. By applying infrared thermography and vibration monitoring, you can observe the health of critical support systems in order to detect impending failures and minimize the risk of a catastrophe.

March 2009 Page 8 About the Authors Frank Vitucci is a District Manager in New England & New York Metro regions for Emerson Process Management which helps process industries to better manage plants through automation systems, intelligent control systems and software, measurement devices, consulting and industry expertise. He has a B.S. from the University of Connecticut, Level 1 Certification in Infrared Thermography and Vibration Analysis and 20 years experience working with diverse industries to help implement Plant Asset Management Programs, Reliability Based Maintenance and Machinery Health Management Technologies. Brian Olsen is a Business Development Manager for Emerson Network Power, Liebert Service Division. He has a B.S. from the United States Merchant Marine Academy and a M.S. in Management from Stevens Institute of Technology. Brian has 8 years of experience in maintenance of critical data center equipment. Brandon Schaner is a Senior Machinery Health Analyst at Emerson Process Management. He has over 13 years experience in heavy and commercial industrial applications providing predictive & preventive maintenance implementation service, along with PdM mentoring services. In addition, he also provides troubleshooting, corrective and re-engineering expertise for problematic in-place machinery. Brandon is certified ISO Category II in Vibration Analysis and ASNT Level I in Thermography, Ultrasound and Visual Inspection. AMS Suite: Machinery Health Manager powers PlantWeb with predictive and proactive maintenance through condition monitoring of mechanical equipment to improve availability and performance. Emerson Process Management Asset Optimization Division 835 Innovation Drive Knoxville, TN 37932 T (865) 675-2400 F (865) 218-1401