The line between noise and signal is not obvious
Every monitoring system generates data. The previous lesson established that monitoring creates accountability -- that the act of tracking makes you more likely to maintain what you track. But accountability without discrimination produces a new problem: you're paying attention to everything, which means you're paying attention to nothing in particular.
Your morning planning agent missed one meeting out of twenty this week. Is that a problem? Your weekly review habit took ninety minutes instead of the usual sixty. Should you investigate? Your reading agent -- the routine that selects and processes articles relevant to your work -- surfaced three irrelevant pieces out of ten. Does that warrant intervention?
Without a threshold, every deviation looks like it might be a problem. And "might be a problem" is the most exhausting state a monitoring system can produce. It keeps you in a perpetual state of semi-vigilance -- never alarmed enough to act, never confident enough to ignore. The solution is a line drawn in advance: above this value, the agent is operating normally and you leave it alone. Below this value, you stop and investigate.
That line is your alert threshold. Getting it right is harder than it sounds, and the consequences of getting it wrong are well-documented across engineering, medicine, and psychology.
Signal detection theory: the science of drawing lines
In 1954, a group of researchers at the University of Michigan -- Wilson Tanner and John Swets among them -- formalized a problem that radar operators had been struggling with since World War II: how do you decide whether a blip on the screen is an enemy aircraft or just atmospheric noise? The resulting framework, signal detection theory (SDT), has since been applied to medical diagnosis, quality control, cybersecurity, and any domain where you must decide if an observation is signal or noise.
SDT separates the threshold problem into two independent components. The first is sensitivity -- how different the signal actually is from the noise. If your agent's failure mode produces obviously bad outputs, sensitivity is high and detection is easy. If failure looks almost identical to normal operation, sensitivity is low and you're working with overlapping distributions. Sensitivity is measured as d-prime (d'), the standardized distance between the means of the noise distribution and the signal-plus-noise distribution. You cannot change d-prime by adjusting your threshold -- it's a property of the system itself.
The second component is your criterion -- the point on the evidence axis where you draw the line between "normal" and "investigate." This is what you control. Move the criterion to the left (lower threshold) and you catch more real problems, but you also generate more false alarms. Move it to the right (higher threshold) and you eliminate false alarms, but you miss real problems. There is no setting that maximizes both.
This tradeoff is visualized through the receiver operating characteristic (ROC) curve, which plots the hit rate (correctly detected problems) against the false alarm rate as you sweep the criterion across all possible values. Every point on the curve represents a different threshold setting. A system with high d-prime produces a curve that bows dramatically toward the upper-left corner -- strong separation between signal and noise. A system with low d-prime produces a curve that hugs the diagonal, which means no threshold setting will perform well.
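The geometry of this tradeoff can be sketched numerically. Assuming the textbook equal-variance Gaussian model (noise centered at 0, signal at d', both with unit variance), each criterion setting yields one point on the ROC curve; the function names here are illustrative, not a standard API:

```python
from math import erf, sqrt

def norm_cdf(x):
    # Standard normal cumulative distribution function.
    return 0.5 * (1 + erf(x / sqrt(2)))

def roc_point(d_prime, criterion):
    """Hit rate and false-alarm rate under equal-variance Gaussian SDT.

    Noise ~ N(0, 1), signal ~ N(d_prime, 1). The criterion is the point
    on the evidence axis above which we call the observation 'signal'.
    """
    hit_rate = 1 - norm_cdf(criterion - d_prime)  # P(evidence > c | signal)
    fa_rate = 1 - norm_cdf(criterion)             # P(evidence > c | noise)
    return hit_rate, fa_rate

# Sweep the criterion: every setting trades hits against false alarms.
for c in (-1.0, 0.0, 0.5, 1.0, 2.0):
    h, f = roc_point(d_prime=1.5, criterion=c)
    print(f"criterion={c:+.1f}  hit={h:.2f}  false_alarm={f:.2f}")
```

Raising the criterion always lowers both rates together, which is the point: no single setting maximizes hits while minimizing false alarms.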
The practical implication for your agents: before agonizing over where to set the threshold, first ask whether the metric you're monitoring actually separates good performance from bad. If your morning planning agent's success metric is "did it produce a plan?" then the answer is always yes -- even terrible plans are still plans. That metric has near-zero d-prime. Switch to "how many of the top-three priorities appeared in the plan?" and you've increased the distance between the noise and signal distributions. Better metrics make threshold-setting easier.
The four outcomes: hits, misses, false alarms, and correct rejections
Every time your threshold fires -- or doesn't -- one of four things happens. Understanding all four is essential because most people only think about two of them.
A hit is when your threshold fires and there's a real problem. Your agent's accuracy dropped to 60% when your threshold was set at 70%, you investigated, and you found a genuine issue. This is the outcome that makes thresholds feel worthwhile.
A miss (false negative) is when there's a real problem but your threshold didn't fire. The agent has been subtly degrading for weeks, but not enough to cross your line. Earlier lessons in this phase -- L-0548 on false positive rate and L-0549 on false negative rate -- addressed these concepts as properties of the agents themselves. Here the concern is your monitoring system's false negative rate. Misses are invisible by definition, which makes them the most dangerous outcome.
A false alarm (false positive) is when your threshold fires but there's no real problem. Normal variance briefly dipped below your line. You investigate, find nothing wrong, and lose thirty minutes. One false alarm is a minor cost. Repeated false alarms produce something far worse.
A correct rejection is when the threshold doesn't fire and everything is actually fine. This is the silent majority of monitoring cycles -- the outcome you want most often and think about least.
For a given metric and base rate of real problems, the balance among these four outcomes is determined by where you set your threshold. There is no threshold that maximizes hits and correct rejections while simultaneously minimizing misses and false alarms. Every threshold is a bet about which errors cost you more.
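The four outcomes can be tallied mechanically once you have a threshold and ground truth. A minimal sketch, with invented numbers for a week of monitoring an accuracy-style metric where the alert fires below the line:

```python
from collections import Counter

def classify(metric, problem, threshold):
    """Label one monitoring cycle as hit / miss / false_alarm / correct_rejection.

    Assumes higher metric = better, so the alert fires when the
    metric falls below the threshold.
    """
    fired = metric < threshold
    if fired and problem:
        return "hit"
    if not fired and problem:
        return "miss"
    if fired and not problem:
        return "false_alarm"
    return "correct_rejection"

# A week of (accuracy, was there really a problem?) observations.
week = [(0.88, False), (0.61, True), (0.69, False), (0.73, True), (0.92, False)]

counts = Counter(classify(m, p, threshold=0.70) for m, p in week)
print(counts)  # one hit, one miss, one false alarm, two correct rejections
```

Note that the miss (0.73 with a real problem) is invisible to the threshold itself -- you only learn about it later, which is why misses are the dangerous quadrant.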
Alert fatigue: what happens when you set thresholds too tight
In healthcare, clinical monitoring systems generate alarms when patient vital signs cross predefined thresholds. The Joint Commission estimates that 80 to 99 percent of these alarms do not require clinical intervention. Research across hospital ICUs shows that 72 to 99 percent of all clinical alarms are false. The result is predictable: an estimated 90 percent of medication alerts are overridden by physicians, and more than half of those overrides occur because clinicians have learned that most alerts are noise.
The consequences are not abstract. The FDA cataloged 566 deaths from ignored alarms between 2005 and 2008. A Boston Globe investigation identified more than 200 additional deaths over five years from failure to respond to physiologic monitoring alarms. The mechanism is straightforward: when most alerts are false, responders stop treating alerts as meaningful. They don't consciously decide to ignore them -- they develop a conditioned response to dismiss them, and that conditioned response doesn't distinguish between the ninety-nine false alarms and the one real one.
The same pattern appears in software engineering. Google's Site Reliability Engineering practice defines an explicit standard: every paging alert must be immediately actionable, meaning there must be a specific action a human can take that the system cannot take itself. Google warns that a low signal-to-noise ratio in alerting raises the risk for on-call engineers to develop alert fatigue -- the same phenomenon that kills patients in hospitals.
In cybersecurity, up to 30 percent of security alerts go completely uninvestigated. More than half of respondents in industry surveys report that critical alerts are missed weekly.
The lesson for your personal agents is identical: if your threshold fires too often and most firings lead nowhere, you will stop responding. Not through a deliberate decision, but through the same habituation mechanism that causes hospital nurses to ignore cardiac monitors. The threshold that catches everything catches nothing, because it trains you to stop looking.
The Shewhart principle: let the data draw the line
Walter Shewhart, working at Bell Telephone Laboratories in the 1920s, faced the threshold problem in industrial manufacturing. How do you know when a production process has shifted from normal variation to an actual problem? His solution -- the control chart -- remains the foundation of statistical process control a century later.
Shewhart's insight was that you should not set thresholds based on what you want the process to do. You should set them based on what the process actually does when it's running normally. Measure the process repeatedly, compute the mean and standard deviation, and place your control limits at three standard deviations from the mean. Any observation within those limits is attributed to common-cause variation -- the inherent randomness of a stable process. Any observation outside those limits is attributed to special-cause variation -- something has actually changed.
Why three sigma? Not because it's mathematically optimal in some absolute sense, but because decades of empirical application showed that three-sigma limits minimize the total cost of two competing errors: reacting to noise (false alarms) and ignoring real shifts (misses). At three sigma, only 0.27 percent of observations from a stable process will fall outside the limits by chance -- roughly one false alarm per 370 observations. Tighter limits catch smaller shifts but generate more false alarms. Looser limits reduce false alarms but let real shifts persist longer.
The Western Electric Company extended Shewhart's system with additional rules for detecting subtler shifts: two out of three consecutive points beyond two sigma on the same side, or eight consecutive points on the same side of the center line. Each rule trades off detection speed against false alarm rate, but all share Shewhart's fundamental approach -- the data defines normal, and the threshold is placed relative to the data.
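These run rules translate directly into code. A sketch, assuming a list of daily measurements plus a baseline mean and sigma; only Shewhart's three-sigma rule and the two Western Electric rules named above are implemented:

```python
def western_electric_flags(points, mean, sigma):
    """Return (index, rule) pairs where a run rule fires.

    Implements three rules: single point beyond three sigma,
    two of the last three points beyond two sigma on the same side,
    and eight consecutive points on the same side of the center line.
    """
    flags = []
    for i, x in enumerate(points):
        # Shewhart's rule: single point beyond three sigma.
        if abs(x - mean) > 3 * sigma:
            flags.append((i, "beyond 3-sigma"))
        # Two of the last three points beyond two sigma, same side.
        if i >= 2:
            last3 = points[i - 2:i + 1]
            if sum(1 for p in last3 if p > mean + 2 * sigma) >= 2:
                flags.append((i, "2 of 3 above 2-sigma"))
            elif sum(1 for p in last3 if p < mean - 2 * sigma) >= 2:
                flags.append((i, "2 of 3 below 2-sigma"))
        # Eight consecutive points on the same side of the center line.
        if i >= 7:
            last8 = points[i - 7:i + 1]
            if all(p > mean for p in last8) or all(p < mean for p in last8):
                flags.append((i, "8 on one side"))
    return flags
```

The eight-on-one-side rule is what catches the "subtle drift" failure mode: no single point looks alarming, but the streak is statistically very unlikely under a stable process.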
Applied to your agents: don't set a threshold based on what you think your morning planning agent should achieve. Run it for two to four weeks, measure the metric daily, and let the actual distribution of outcomes tell you where "normal" ends and "investigate" begins. An agent that hits 85 percent accuracy with a standard deviation of 5 points has a very different threshold than one that hits 85 percent with a standard deviation of 15 points -- even though the average performance is identical.
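Deriving the limits from baseline data takes a few lines of standard-library Python. The two agents below are invented to mirror the 85-percent example -- same average, very different spread, very different thresholds:

```python
from statistics import mean, stdev

def control_limits(baseline, n_sigma=3.0):
    """Derive Shewhart-style control limits from observed baseline data.

    `baseline` is two to four weeks of daily measurements of a metric
    where higher is better (e.g. planning accuracy in percent).
    Returns (lower_limit, upper_limit).
    """
    center = mean(baseline)
    sigma = stdev(baseline)
    return center - n_sigma * sigma, center + n_sigma * sigma

# Two hypothetical agents, both averaging roughly 85 percent:
steady = [85, 83, 88, 86, 84, 87, 85, 82, 86, 88, 84, 85, 87, 86]
noisy  = [85, 70, 98, 92, 74, 95, 78, 88, 69, 99, 81, 90, 76, 95]

print("steady lower limit: %.1f" % control_limits(steady)[0])
print("noisy  lower limit: %.1f" % control_limits(noisy)[0])
```

The steady agent's lower limit lands around 80; the noisy agent's falls into the 50s. A 72-percent day is an alarm for the first agent and routine variation for the second -- the data, not your aspiration, draws the line.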
Choosing where to set the line: costs, not comfort
Signal detection theory teaches that the optimal threshold depends on four factors: the base rate of actual problems (how often things really go wrong), the cost of a miss (what happens if you don't catch a real problem), the cost of a false alarm (what you lose investigating a non-problem), and the benefit of a hit (what you gain by catching a real problem early).
When the cost of a miss is catastrophic and the cost of a false alarm is minor, set the threshold aggressively low. A financial fraud detection system can tolerate investigating ten false alarms if it means catching the one real fraud that would cost millions. When the cost of a false alarm is high -- when every investigation pulls you out of deep work for an hour -- and the cost of a miss is recoverable, set the threshold higher. A reading agent that surfaces one irrelevant article per week doesn't need as aggressive a threshold as a medication dosing agent.
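These four factors can be folded into a single expected-cost comparison. The sketch below is illustrative only: the candidate thresholds, their hit and false-alarm rates, and the 50:1 cost ratio are made-up numbers, not recommendations:

```python
def expected_cost(hit_rate, fa_rate, base_rate, cost_miss, cost_fa):
    """Expected cost per monitoring cycle for one threshold setting.

    hit_rate / fa_rate come from the threshold's historical (or modeled)
    performance; base_rate is how often real problems actually occur.
    """
    miss_rate = 1 - hit_rate
    return (base_rate * miss_rate * cost_miss
            + (1 - base_rate) * fa_rate * cost_fa)

# Hypothetical candidate thresholds with assumed detection rates.
candidates = {
    "loose (65%)":  (0.60, 0.02),  # (hit_rate, false_alarm_rate)
    "medium (75%)": (0.85, 0.10),
    "tight (85%)":  (0.98, 0.40),
}

# Fraud-like stakes: a miss costs 50x more than a false alarm.
for name, (h, f) in candidates.items():
    c = expected_cost(h, f, base_rate=0.05, cost_miss=50, cost_fa=1)
    print(f"{name}: expected cost {c:.3f}")
```

With these particular stakes the tight threshold narrowly wins despite its 40 percent false-alarm rate; flip the cost ratio toward expensive false alarms and the loose threshold wins instead. The arithmetic, not comfort, should pick the line.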
The Yerkes-Dodson law from psychology suggests a parallel principle for cognitive thresholds. Yerkes and Dodson demonstrated in 1908 that performance follows an inverted-U relationship with arousal. Too little arousal (threshold too loose, no alerts ever fire) produces disengagement -- you stop caring about your agents. Too much arousal (threshold too tight, alerts firing constantly) produces frazzle -- you're overwhelmed and start ignoring everything. Optimal performance lives in the middle, where alerts fire rarely enough to be taken seriously and frequently enough to maintain accountability.
This maps directly to your monitoring practice. A well-calibrated threshold produces what Google SRE calls a "high signal-to-noise ratio" -- when the alert fires, it almost always means something real, so you almost always respond. That reliability of response is the entire point.
Tiered thresholds: not every deviation deserves the same response
One threshold is better than none. Multiple thresholds are better than one. Google's SRE practice distinguishes between paging alerts (wake someone up at 3 AM) and ticket alerts (create a work item for business hours). The triggering conditions differ: their recommended starting points are 2 percent error budget consumption in one hour for a page, versus 10 percent budget consumption in three days for a ticket.
The same principle applies to personal agents. Define at least two levels:
Warning threshold: the metric has entered an unusual range. You note it, increase the frequency of observation, but don't intervene yet. This corresponds to Shewhart's two-sigma warning limits. A single observation in this zone might be noise. Two or three in a row likely are not.
Action threshold: the metric has crossed into territory that demands investigation and likely intervention. This corresponds to the three-sigma action limits. When this fires, you stop what you're doing and examine the agent.
The warning threshold gives you lead time. Most real problems don't appear as sudden catastrophic failures -- they appear as gradual degradation that passes through the warning zone before reaching the action zone. If you only have an action threshold, you catch problems later than you need to. If you only have a warning threshold, you never know when to actually act.
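One way to encode the two levels in code, as a sketch: the `tier` function and its 80/75 cutoffs are hypothetical, and the escalate-on-three-consecutive-warnings rule borrows the run-rule idea that one unusual point may be noise but a streak is not.

```python
def tier(metric, history, warning=80.0, action=75.0):
    """Classify one observation against tiered thresholds.

    Assumes higher is better. `history` holds prior observations,
    oldest first. A single dip into the warning zone is noted;
    three consecutive warnings escalate to action.
    """
    if metric < action:
        return "action"
    if metric < warning:
        # Escalate if the last two observations were also warnings.
        if len(history) >= 2 and all(m < warning for m in history[-2:]):
            return "action"
        return "warning"
    return "normal"

print(tier(86, []))        # normal: leave the agent alone
print(tier(78, [86, 84]))  # warning: first dip, watch more closely
print(tier(79, [78, 77]))  # action: third consecutive warning
print(tier(72, []))        # action: crossed the hard line outright
```

The escalation clause is what buys you lead time: gradual degradation triggers investigation after a short streak in the warning zone, before it ever reaches the action line.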
The threshold is a commitment to your future self
Here is the connection to the previous lesson. L-0554 established that monitoring creates accountability -- that tracking makes you more likely to maintain your agents. The threshold extends that accountability from observation to action. Without a threshold, monitoring produces data. With a threshold, monitoring produces decisions.
A threshold set in advance is a commitment made by your calm, analytical self to your future reactive self. When your agent's accuracy drops to 68 percent, your in-the-moment self will rationalize: "It's probably fine. I'm busy. I'll check later." The threshold -- set during a calm moment, grounded in data, written down -- overrides that rationalization. It converts a subjective judgment ("should I worry about this?") into a binary check ("did it cross the line?"). And binary checks are much harder to rationalize away than continuous assessments.
This is why the threshold must be defined before you need it. A threshold set after you notice a problem is not a threshold -- it's a retroactive justification for whatever you were going to do anyway.
From threshold to trajectory
A well-set threshold answers one question: is this observation normal or abnormal? That's valuable, but it's incomplete. A single point below your threshold tells you something went wrong today. It doesn't tell you whether things are getting worse.
The next lesson introduces trend analysis -- the practice of looking at how metrics move over time rather than evaluating each observation in isolation. A metric sitting at 88 percent with a threshold of 80 percent looks safe. But if that metric was 95 percent a month ago, 92 percent two weeks ago, and 88 percent today, the trajectory tells a different story. Trend analysis converts your threshold from a static tripwire into a dynamic early warning system -- one that can detect problems before they cross the line, rather than after.
Your threshold defines the boundary. Trend analysis tells you how fast you're approaching it.