Question 1

How is the score calculated?

Accepted Answer

The composite score weights three factors: noise rate (50%) — the percentage of alerts that did not require action; flap rate (30%) — the percentage that auto-resolved without intervention; and after-hours skew (20%) — the percentage fired outside business hours. Higher number = more noise. Below 25 is healthy; 50-65 is significant noise; above 80 means your team is in alert fatigue territory.

Question 2

What counts as an "actionable" alert?

Accepted Answer

An alert is actionable if a human took at least one remediation step in response: ran a script, restarted a service, escalated to another team, opened a ticket, made a code change, or even confirmed it was a known false-positive. An alert is NOT actionable if the responder looked at it, decided nothing needed to be done, and went back to sleep — that alert should not have fired.

Question 3

How do I find these numbers?

Accepted Answer

In PagerDuty, look at "Notifications by status" reports — alerts that resolved without acknowledgment are usually flap. In Datadog, the alert review dashboard shows fire-resolved durations. In Prometheus alertmanager, the alert-fatigue exporter gives most of these. Or get them retroactively by exporting the last 30-90 days of alerts and labeling them yourself — even rough estimates produce a useful score.

Question 4

My noise rate is high. What do I fix first?

Accepted Answer

Sort alerts by name and look at the top 5 most-fired. In most teams, 5 alert rules produce 50%+ of the volume. For each: (1) is there a "for:" clause to filter transient blips? (2) is the threshold actually meaningful, or just an arbitrary number copied from a tutorial? (3) does anyone respond when this fires? Often the right answer is delete or convert to a daily digest instead of a page.

Question 5

What is the connection to Uptimes.ai?

Accepted Answer

Alert noise is the problem Uptimes.ai exists to solve at scale. Our platform automatically correlates and deduplicates alerts (typical reduction of 90-94%), and our AI SRE agent runs the first 30 minutes of investigation before paging a human. Customers commonly cut both alert volume and after-hours pages by ~50% within the first month. This tool helps you measure where you are starting from.

Alert Noise Score

Recommendations

Automate your incident response

Why alert noise is the metric that matters

The 80/20 rule of alert hygiene

From measurement to automation

Frequently Asked Questions

Related Tools

On-Call Fairness Analyzer

Burn Rate Calculator

MTTR Calculator