Kubesense

AgentSRE

AgentSRE is KubeSense's AI-powered SRE companion that helps you investigate issues, perform root cause analysis, and answer questions about your infrastructure and applications.

AgentSRE

Overview

AgentSRE provides a conversational interface where you can ask questions about your infrastructure and get AI-powered answers backed by your actual observability data. It acts as an intelligent assistant for DevOps and SRE teams.

How It Works

  1. Type your question in the "Investigate anything" input box
  2. AgentSRE queries your metrics, traces, logs, and infrastructure data
  3. It returns an analysis with relevant data, charts, and actionable recommendations

AgentSRE provides suggested questions to get started:

General Health & Performance

  • "What are the top errors impacting my services right now?"
  • "Which services are experiencing latency spikes?"
  • "How is my infrastructure performing over the last 24 hours?"
  • "Are there any anomalies detected in my system today?"
  • "Which APIs or endpoints have the slowest response times?"

Investigation

  • "Why is service X returning 500 errors?"
  • "What changed in the last hour that might have caused this outage?"
  • "Show me the traces with the highest latency in the last 15 minutes"
  • "Which pods are consuming the most memory?"

Features

  • Root Cause Analysis (RCA) — Automatically correlates signals across metrics, logs, and traces to identify the root cause of incidents
  • Previous Investigations — Access past investigation results for reference
  • Data-backed answers — All responses are grounded in your actual observability data
  • Natural language queries — Ask questions in plain English without needing to write complex queries