Inside Snowflake’s Latest AI Push: New Platform Capabilities and a $200M OpenAI Deal

At its BUILD London event yesterday, Snowflake rolled out a new set of AI capabilities aimed at simplifying the development of agents and other advanced applications. The new capabilities include native integration of Snowflake Postgres, Semantic View Autopilot, and Cortex Code, among other tools.  These new capabilities come only a couple of days after Snowflake…

Read More

Lightrun unveils AI SRE to find and fix software production errors

Lightrun has announced Lightrun AI SRE, an AI-powered site reliability engineering (SRE) assistant designed to detect software production errors and performance degradations. Introduced February 25, the Lightrun AI SRE correlates the service-level issues it finds with proven root causes to propose solutions. Drawing on on live, in-line runtime context, the AI SRE allows AI agents…

Read More

The reliability cost of default timeouts

In user-facing distributed systems, latency is often a stronger signal of failure than errors. When responses exceed user expectations, the distinction between “slow” and “down” becomes largely irrelevant, even if every service is technically healthy. I’ve seen this pattern across multiple systems. One incident, in particular, forced me to confront how much production behavior is…

Read More