Alert Rules

The Alert Config page lets you create, manage, and monitor alert rules that evaluate conditions on your metrics, logs, and traces data.

Alert Rules Table

All configured alerts are listed with their current status:

Column	Description
Enable/Disable	Toggle to activate or deactivate a rule without deleting it
Name	Alert rule name
Description	What the alert monitors
Current State	`normal` (green) — condition not met, or `firing` (red) — condition met and notifications active
Evaluation Interval	How often the rule is checked (e.g., 1m, 2m, 5m)
Last Evaluation Time	When the rule was last evaluated
Created By	User who created the rule

Creating an Alert Rule

Step 1 — Define Alert Query

Field	Description
Data Source	Choose Logs, Metrics, or Traces
Filters	Filter data using common filters or advanced filter expressions (e.g., `type:ERROR`, `level IN (ERROR, WARN)`)
Aggregation	Aggregation function to apply — `count(*)`, `avg()`, `sum()`, `max()`, `min()`, `p50()`, `p90()`, `p95()`, `p99()`
Group By	Group results by a column (e.g., `workload`, `namespace`, `level`). Produces multi-series alerts that evaluate and fire independently per group.
Label Type	Controls how series labels are generated
Step	Bucket duration for time-series aggregation (Auto, 30s, 1m, etc.)

Step 2 — Alert Conditions

Threshold Alert

Field	Description
Run alert every	Evaluation interval (e.g., every 2 Minutes)
Fires, when metric	Which query or formula to evaluate (A, B, C, etc.)
Evaluated over the last	Time window for the data query lookback
Operator	`equal to`, `above`, `above or equal to`, `below`, `below or equal to`
Alert Threshold	The threshold value to compare against
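
As an illustration of how the operator and threshold fields combine, the following is a minimal sketch, not the product's implementation. The `OPERATORS` mapping and the `is_breach` function are hypothetical names; only the operator labels come from the table above.

```python
import operator

# Hypothetical mapping from the operator names in the table to comparisons.
OPERATORS = {
    "equal to": operator.eq,
    "above": operator.gt,
    "above or equal to": operator.ge,
    "below": operator.lt,
    "below or equal to": operator.le,
}


def is_breach(value: float, op_name: str, threshold: float) -> bool:
    """Return True when `value` meets the condition against `threshold`."""
    return OPERATORS[op_name](value, threshold)


# An error count of 12 evaluated against "above 10" is a breach.
print(is_breach(12, "above", 10))  # True
# A value exactly at the threshold is not "above" it.
print(is_breach(10, "above", 10))  # False
```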

Frequency	Behavior
At least once	Fire immediately when the condition is met
More than once	Fire only after the condition is breached more than N times within a specified window
Always	Fire only when the condition is met on every evaluation within the window

With More than once, you specify both the number of breaches and the time window over which to count them. For example, "for 2 times within 5 Minutes" means the condition must be breached more than 2 times within the last 5 minutes before the alert fires.

With Always, the alert fires only if every evaluation within the breach counting window resulted in a breach. This helps avoid false positives from transient spikes.
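
The three firing frequencies can be sketched as a single decision over the breach results collected in the window. This is an assumption-laden illustration: the `should_fire` function and its parameters are hypothetical, and the behavior for an empty window under `always` is a guess rather than documented behavior.

```python
def should_fire(breaches: list, frequency: str, n: int = 0) -> bool:
    """Decide whether to fire.

    breaches: one boolean per evaluation inside the breach counting window.
    n: breach count that must be exceeded (used by "more_than_once" only).
    """
    if frequency == "at_least_once":
        return any(breaches)
    if frequency == "more_than_once":
        return sum(breaches) > n
    if frequency == "always":
        # Assumption: an empty window never fires.
        return len(breaches) > 0 and all(breaches)
    raise ValueError(f"unknown frequency: {frequency}")


window = [True, False, True, True]  # three breaches out of four evaluations
print(should_fire(window, "at_least_once"))        # True
print(should_fire(window, "more_than_once", n=2))  # True, since 3 > 2
print(should_fire(window, "always"))               # False, one evaluation passed
```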

Change Alert

Fires when the query value changes by a specified amount relative to a previous time period.

Field	Description
Change type	`change in value` (absolute difference) or `change in %` (percentage difference)
Operator	`equal to`, `above`, `below`, etc.
Threshold compared to	The historical period to compare against (e.g., 5 Minutes ago)

Use change alerts to detect sudden spikes or drops relative to recent behavior. For example, "fire when error count changes by more than 50% compared to 5 minutes ago."
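
The two change types reduce to simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and returning `None` when the previous value is zero is an assumption, since the source does not say how that case is handled.

```python
from typing import Optional


def change_in_value(current: float, previous: float) -> float:
    """Absolute difference between the current and the earlier value."""
    return current - previous


def change_in_percent(current: float, previous: float) -> Optional[float]:
    """Percentage difference relative to the earlier value.

    Assumption: the result is undefined (None) when the earlier value is 0.
    """
    if previous == 0:
        return None
    return (current - previous) * 100.0 / abs(previous)


# Error count was 100 five minutes ago and is 160 now.
print(change_in_value(160, 100))    # 60
print(change_in_percent(160, 100))  # 60.0, which is above a 50% threshold
```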

New Value Alert

Fires when a value appears that was not seen in the comparison window.

Field	Description
New value compared to	The historical window to check against (e.g., 5 Minutes)

Use new value alerts to detect previously unseen patterns — new error types, new endpoints, new log levels, or any other unexpected values appearing in your data.
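
Conceptually, a new value check is a set difference between what is seen now and what was seen in the comparison window. The `new_values` helper below is a hypothetical sketch of that idea, not the product's actual logic.

```python
def new_values(current: set, historical: set) -> set:
    """Return values present now that were absent from the comparison window."""
    return current - historical


seen_in_window = {"INFO", "WARN", "ERROR"}  # log levels in the comparison window
seen_now = {"INFO", "ERROR", "FATAL"}       # log levels in the latest evaluation

print(new_values(seen_now, seen_in_window))  # {'FATAL'}
```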

Step 3 — Alert Configuration

Field	Description
Severity	`Critical`, `Warning`, or `Info`
Alert Name	A descriptive name for the alert
Alert Description	Optional text explaining what the alert monitors and why
Labels	Key-value pairs attached to the alert. Labels appear in notifications and can be used for routing (e.g., `team:backend`, `env:production`).

Step 4 — Alert Routing

Choose how notifications are delivered when the alert fires.

Notification Channel — Select a specific notification channel (Slack, Teams, Email, etc.) to receive this alert
Notification Channel Policy — Route alerts dynamically based on label matching. Alerts are sent to channels whose matching labels correspond to the alert's labels.
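
Label-based routing can be pictured as follows. This sketch assumes that a channel matches when every one of its matching labels appears on the alert with the same value. The exact matching semantics are not stated in the source, and the `route` function and channel names are hypothetical.

```python
def route(alert_labels: dict, policies: dict) -> list:
    """Return the channels whose matching labels are all present on the alert.

    policies maps a channel name to the labels it matches on.
    Assumption: a channel with no matching labels receives every alert.
    """
    return [
        channel
        for channel, match in policies.items()
        if all(alert_labels.get(key) == value for key, value in match.items())
    ]


alert = {"team": "backend", "env": "production"}
policies = {
    "slack-backend": {"team": "backend"},
    "email-staging": {"env": "staging"},
}
print(route(alert, policies))  # ['slack-backend']
```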

Alert Lifecycle

States

State	Description
Normal	The alert condition is not met. The system continues evaluating at each interval.
Firing	The alert condition is met and the required firing frequency is satisfied. Notifications are sent.

State Transitions

Normal → Firing: The condition is met and the firing frequency requirement is satisfied. A notification is sent and an alert event is recorded.
Firing → Normal: The condition is no longer met, and a resolved notification is sent. What counts as "no longer met" depends on the firing frequency:
At least once (`at_least_once`): no breaches in the breach counting window
More than once (`more_than_once`): the breach count drops below the threshold
Always (`always`): at least one evaluation in the window did not breach
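
The two transitions can be summarized as a small state machine. The sketch below is a simplification under stated assumptions: the `next_state` function is hypothetical, and it assumes notifications are emitted only on a transition, which the source does not confirm for alerts that remain in the firing state.

```python
from typing import Optional, Tuple


def next_state(current: str, frequency_satisfied: bool) -> Tuple[str, Optional[str]]:
    """Return (new_state, notification), where notification may be None.

    frequency_satisfied: whether the firing frequency requirement is met
    for the current breach counting window.
    """
    if current == "normal" and frequency_satisfied:
        return "firing", "alert"
    if current == "firing" and not frequency_satisfied:
        return "normal", "resolved"
    # Assumption: no notification when the state does not change.
    return current, None


print(next_state("normal", True))   # ('firing', 'alert')
print(next_state("firing", False))  # ('normal', 'resolved')
print(next_state("firing", True))   # ('firing', None)
```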

Resolved Notifications

When an alert transitions from firing back to normal, a resolved notification is automatically sent to all configured channels, so your team knows the condition has cleared.

Multi-Series Alerts

When a query uses Group By, the alert evaluates each group independently. For example, if you group by workload and three workloads breach the threshold, you receive three separate alert notifications — each with the specific workload name and value in the labels.

This is useful for scenarios like:

Error count per workload exceeding a threshold
Latency per namespace crossing an SLO boundary
Request rate per service dropping below expected levels
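
To make the per-group evaluation concrete, here is a minimal sketch that counts records per group and checks each count against a threshold independently. The record shape, the `breaching_groups` function, and the sample data are all hypothetical.

```python
from collections import Counter


def breaching_groups(records: list, group_by: str, threshold: int) -> dict:
    """Count records per group and keep the groups whose count is above threshold."""
    counts = Counter(record[group_by] for record in records)
    return {group: count for group, count in counts.items() if count > threshold}


error_logs = [
    {"workload": "api", "level": "ERROR"},
    {"workload": "api", "level": "ERROR"},
    {"workload": "worker", "level": "ERROR"},
    {"workload": "api", "level": "ERROR"},
]

# Each workload is evaluated on its own; only "api" exceeds 2 errors.
print(breaching_groups(error_logs, "workload", threshold=2))  # {'api': 3}
```

Each key in the result would correspond to one separate notification, labeled with the group name and its value.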

Example Alert Rules

Name	Type	What it monitors
Error log alert	Threshold	Error log count exceeds a limit
499 Error Spike - traces	Change	Sudden increase in 499 status codes from trace data
Low Login Activity	Threshold	Login activity drops below expected levels
Failed Login Attempts >= 1	Threshold	Any failed login attempt detected
Node max capacity	Threshold	Node CPU or memory approaching capacity
Detect Log level change	New Value	Unexpected log levels appearing
HTTP 500 Error Count	Threshold	HTTP 500 errors exceed a threshold
Application Error Alert	Threshold	Application error logs detected
