A machine that is well aligned, well-balanced, and well lubricated is not only more efficient – it’s simply safer for the people who have to work around it. That’s the case we need to make because too many operations continue to treat safety compliance and equipment maintenance as two separate workstreams, managed by two separate teams.
Lockout/Tagout is non-negotiable, but it’s a starting point
Lockout/Tagout (LOTO) should be considered the minimum standard, not the maximum standard. Any maintenance or servicing activity for heavy industrial equipment must begin with total isolation of all energy sources; electrical, hydraulic, and pneumatic. No exceptions based on perceived simplicity.
Where LOTO can fall down is in the detail. A master LOTO document will not suffice. The unique stored energy sources, fluid line pressure zones, and capacitor discharge characteristics present in each machine category must be enumerated in a detailed document. Technicians must then confirm isolation using appropriate test equipment, not merely trust in the lockout.
This matters because a significant number of industrial workplace fatalities are estimated to occur while the equipment is shut down for maintenance, not during regular use. The ‘machine is safe’ assumption is consistently the cause of injuries.
Predictive maintenance outperforms reactive firefighting
Operating machinery continuously until it breaks down is an unnecessary and costly practice. Predictive maintenance (PdM) aims to eliminate this by assessing the equipment’s condition in real time and performing maintenance only when needed. Vibration analysis and oil analysis are the two key PdM tools used for rotating equipment. Vibration monitoring can detect wear and mechanical imbalance in the bearings and shafts of rotating machinery. Oil analysis can detect the existence of metal particles, which are produced by components as they wear. Metal particles in the oil are a sign of internal wear.
Rotating and fluid systems need specialist-level attention
Shaft misalignment and seal degradation account for more fluid leaks, pressure loss, and unplanned downtime in heavy pumps than nearly any other issue in process environments. This is all but impossible to manage with general maintenance practices, since the tolerances around this work and the potential consequences of failing to meet them are just too tight. Flooding or out-of-compliance emissions are unacceptable in most situations, but those are the stakes when you’re replacing a pump seal or realigning a shaft.
Those risks are compounded with high-pressure process pumps, where only the most precise calibration will do – this sort of work is not generally possible to execute at an OEM level with a standard in-house maintenance team. Dynamic balancing, one aspect of the work required to fully repair a damaged or worn pump, is a great example of this. The tolerances here are in the single mil, impossible without the right equipment and extensive training on measurements. When pump repair and maintenance get to this level, it’s worth engaging certified specialists. Operations that rely on westpower industrial pump repair services, for example, have pre-approved access to these sorts of technologies, as well as to pressure testing and component verification to the OEM’s design specs. This is work that safeguards the equipment as well as the employees who run it.
Train operators to read machines, not just monitors
Digital control panels provide operators with information. They do not provide operators with knowledge. Heavy industrial equipment operator training should involve training on how to “use” the operator’s natural sensing capacities to see, hear and feel warning signs.
Let’s take cavitation as an example. It is the bane of fluid-handling systems the world over. Cavitation occurs when low pressures in a pump cause the fluid to release vapor bubbles that collapse against any nearby solid surface with enough force to erode metal over time. The warning acoustic signature, a sound like rapping gravel, a coffee percolator or a jackhammer, often precedes measurable higher pressure drop by hours, days or sometimes weeks.
An operator who knows what to listen for can alert maintenance before the deteriorating component causes a broader failure.
Root cause analysis closes the loop
Whenever a piece of equipment malfunctions unexpectedly and has to be shut down, this occurrence should serve as a formal trigger for a Root Cause Analysis (RCA). This should not be a discussion only in the maintenance bay. RCA should be a documented, structured approach where you work backward from the failure until you find out what systemic condition allowed the failure to happen in the first place.
This is not just a reliability issue, it’s one of the most fundamental safety practices there is.
When you fail to perform RCA after a machine fails and it then fails in the same way again, you have put multiple people at serious risk. Each time a machine fails in the same way you have essentially been lucky. It’s like playing a terrible form of Russian Roulette.
For RCA to result in meaningful prevention, there are a few key ideas that must be understood:
- Doing RCA so you know who to blame for a misfortune isn’t how it works. RCA is misapplied 99% of the time if people are the outcome. It was a system that allowed events to unfold as they did. Fix the system so a similar event can’t be repeated – no blame required.
2. RCA is rock simple in theory: ask ‘why’ at least 5 times (usually it’s more) until you reveal the first event that set off the line of occurrences that ended in the machine failing. Fix that issue, check your work, and if you are lucky, you have found the only issue required to put safeguards in place in order that you can continue.
3. The limiting factor in performing RCA is almost never access to the expertise that needs to apply the 5 why questioning. It’s an unflinching decision not to get used to having that machine hit the shutdown critical event again.