AI Error Analytics

AI Error Analytics automatically groups, classifies, and tracks errors across your services, providing a centralized view of all recorded incidents.

AI Error Analytics

Overview

Instead of manually sifting through logs and traces to find errors, AI Error Analytics automatically detects error patterns and groups them into distinct issues. Each issue shows the error type, affected services, and trend over time.

Incident Timeline

The top of the page shows a time-series chart of Recorded Incidents with color-coded categories:

Container crash — Pod/container crash events
gRPC — gRPC protocol errors
Unreachable deployment — Services that cannot be reached
HTTP — HTTP protocol errors
Kafka — Kafka message processing errors

Issues Table

Each issue row displays:

Column	Description
Issue	Error description (e.g., "404 - Not Found /api/v2/traces")
Service	The affected service/workload
Time	When the error was last seen (e.g., "35s ago")
Reason	Root cause classification (e.g., "Not Found")
Details	Protocol and HTTP method (e.g., "POST HTTP")

Filters

The left panel provides filtering by:

Type

HTTP — HTTP protocol errors
Kafka — Kafka processing errors
Container crash — Container/pod crash events
Unreachable deployment — Network reachability issues

Source

OTel, eBPF, agent, k8s_watcher, logs

Status Code

Filter by specific HTTP status codes (e.g., 400, 404, 405, 408, 500)

Error Details

Click any issue to see:

Full error message and stack trace
Affected services and endpoints
Error frequency and trend
Correlated traces and logs
Crash reason for container crashes (e.g., "OOMKilled - ExitCode: ...")

AI Error Analytics

ON THIS PAGE