Every control room that has run leak-detection AI knows the failure pattern. The system fires often enough on nothing that operators start discounting it, and the moment alarms become noise, the tool has failed even if it's technically accurate. The opposite failure is worse: a missed real leak is a safety and environmental incident. Living between those two is the actual job, and most teams discover it's an operating problem, not a model problem.

Two failure modes, one eroded trust

False positives and false negatives feel like opposites, but they damage the same thing: the control room's trust in the system. Too many false alarms and operators tune it out, which quietly converts into missed real events because nobody's listening. Too many misses and they stop relying on it deliberately. Either way you end up with an expensive system nobody trusts, which is worse than no system, because someone signed off on it.

Why this isn't a model-accuracy problem

Teams instinctively respond by chasing a more accurate model. Accuracy helps, but it doesn't resolve the tension, because tuning toward fewer false positives pushes toward more misses and vice versa. The threshold is a judgment about which error is more tolerable in your operation, and that judgment lives with people, not the model. Treating it as purely technical is how teams keep retuning forever and never build trust.

The three things to watch once it's live

Operating leak detection well comes down to a short list:

The false-positive rate, tracked over time. Not a one-time accuracy number. A trend the control room can see, so alarm fatigue is caught before it sets in.
Drift. Conditions and flow patterns change, and a model calibrated last year slowly stops matching this year. A scheduled check and a retrain trigger keep it current.
A named owner of the response. Someone whose job is to decide what happens when the system fires, and who is accountable when it misses. Not a committee. One person with the authority to act and to pause.

Building the response so the room trusts it

The system that earns trust is the one where every alarm has a clear, human-owned response path, and where operators understand enough about what the model is doing to calibrate their own confidence. The AI surfaces the signal; a qualified person decides; the decision is logged. That loop is what keeps both trust and accountability intact, and it's what a regulator will eventually want to see.

Leak detection doesn't fail because the model wasn't good enough. It fails because nobody designed how the control room would live with it. Design that, and the system stops crying wolf and starts earning the response a real alarm deserves.

Related field notes

March 20, 2026

Take the next step.

If this is the kind of work you want Claude doing inside your own operation, Cortland scopes engagements in three tiers: Walk (strategy), Run (build), Sprint (ongoing). Start wherever the risk fits.

Scope an engagement More from the blog

When Your Leak-Detection AI Cries Wolf

Two failure modes, one eroded trust

Why this isn't a model-accuracy problem

The three things to watch once it's live

Building the response so the room trusts it

AI Literacy for the Control Room and Integrity Desk

Turning SCADA and Historian Data Into Decisions

The Highest-Uptime AI Use Case in Midstream

Take the next step.