
Observe Inc. is rolling out two new AI agents, AI SRE and o11y.ai, to automate software reliability tasks and cut down on the manual "dirty work" for developers, according to a company announcement on Tuesday.
The reliability bottleneck: The move targets key industry pain points as systems sprawl and generate mountains of data. Observe cites a 2025 Gartner survey where nearly three-quarters of infrastructure and operations heads named cost optimization a top goal, while more than half of CIOs focused on improving operational resilience. "As AI code generation accelerates software delivery, the bottleneck has shifted to running and maintaining systems reliably at scale," said Jeremy Burton, CEO of Observe Inc.
An automated partner: The first offering, AI SRE, acts as an automated partner for site reliability engineers. It autonomously digs through logs, metrics, and traces to pinpoint root causes and suggest fixes, with the company claiming it can slash incident resolution from hours to minutes.
A developer's shortcut: The second agent, o11y.ai, gives developers a shortcut for understanding their own applications. It automatically adds OpenTelemetry instrumentation and lets them ask plain-English questions about performance and errors, right from their editor.
Early customers are signaling the potential impact. "Observe's AI SRE and MCP Server have the potential to transform how we investigate incidents," said Adam Skobodzinski, a Software Engineer at Foursquare. Similarly, Topgolf's Senior Director of Engineering, Brian Schneider, called the technology a "major unlock for our SRE practice."
The goal isn't to replace engineers, but to reduce the daily toil they face from AI-generated code that is often verbose and a headache to debug. As one industry observer noted, a human engineer will remain central to making sense of it all. For a look under the hood, Observe has published technical documentation detailing how its MCP server shares context with third-party AI models.