Deep dives into observability, infrastructure, and engineering culture.
A deep dive into the query optimizer rewrite that transformed our tail latencies from 180ms to under 50ms across all production clusters.
Our new ML-powered anomaly detection engine learns your system's baseline behavior and surfaces deviations before they become incidents.
A practical, step-by-step guide to instrumenting your services with OpenTelemetry and shipping traces, metrics, and logs to Kalleo.
The architecture decisions, failure modes, and operational practices that took us from three-nines to four-nines of availability.
Custom dashboards, AI-powered root cause analysis, expanded integrations, and a completely redesigned alerting experience.
Learn how to create tailored observability dashboards with drag-and-drop widgets, custom queries, and real-time data streaming.
The strategic thinking behind releasing our observability agent to the community and how it strengthens the entire ecosystem.
Lessons from instrumenting 500+ microservices: propagation patterns, sampling strategies, and avoiding the most common tracing pitfalls.
From ex-Googlers to indie hackers — get to know the engineers, designers, and operators building the future of infrastructure observability.