Operations | Monitoring | ITSM | DevOps | Cloud

How one partnership powers search for over 2 million WP Engine users

How do you make search faster, smarter, and more scalable? During our recent webinar, I sat down with Luke Patterson, senior product manager at WP Engine, and Delphin Barankanira, independent software vendor partner engineering lead and data & AI specialist at Google Cloud, to answer that question. We dug into the mechanics behind WP Engine’s ability to deliver near-instant updates to over 2 million users.

Eliminate noisy log lines with Adaptive Logs drop rules

Most platform and observability teams have logs they know are noise. These could be throwaway health check logs, forgotten DEBUG logs, or verbose INFO logs from little used services that only serve to inflate your bill. Regardless of what they contain and why they're there in the first place, the hard part is getting rid of them. Centralized teams want to easily and quickly prevent these logs from being ingested, without having to work with toilsome infrastructure change management to do so.

Why I Give My Engineers $5,000 Per Month Of Claude Code Tokens

A few weeks ago, a group of engineering leaders I trade notes with got into it over a question none... A few weeks ago, a group of engineering leaders I trade notes with got into it over a question none of us has a clean answer to: How much should you let an engineer spend on AI? One SVP at a company of similar size and stage is in calibration mode and capping engineers at $200 per month. Hit the cap, you can self-bump by $100. Hit that, you need your manager. I told the thread our number. $5,000.

Span or Attribute in OpenTelemetry Custom Instrumentation

TL;DR: Attribute. More information on one event gives us more correlation power. It’s also cheaper. When you want to add some information to your tracing telemetry, you could emit a log, create a span, or add a piece of data to your current span. Adding a piece of data to your current span is the best! Usually.

Infrastructure for AI Agents: what platform teams need to build now

If an AI agent in your development workflow needed to spin up a test environment tonight, how many manual steps would stand between the request and the environment being ready? By early 2026, AI agents have transitioned from simple code assistants to first-class platform citizens. They are running test suites, analyzing performance, and triggering deployments.

What is sovereign AI, and why does it matter for your business?

With AI reshaping every corner of the modern business, the highest-value workloads are often locked behind complex regulatory frameworks. Yet many organizations are still running them on infrastructure they don't fully control, trusting external platforms to decide where their data lives, where workloads run, and how their AI operates. Civo was built to change that.

Why Blast Radius Analysis Does Not End When Alerts Fire

Modern distributed systems fail in ways that can bypass even well-designed isolation patterns. When a failure is actively propagating across services at four in the morning, the question shifts from “how do we limit the blast radius” to “how do we confirm what it actually is.” Monitoring shows which services are in the impact zone, but it cannot show what code path caused the failure to spread, or whether it has stopped.

Turn Alerts into Action: Why Modern Operations Need More Than Monitoring

Modern ops stacks are very good at detecting problems. From IT infrastructure and cloud platforms to industrial systems, cybersecurity tools, and IoT environments, monitoring technologies generate alerts the moment something goes wrong. But there is a critical problem modern operations teams still struggle with: Detection does not ensure response. And that gap is becoming one of the biggest operational risks organizations face today.

Why database ownership is so fragmented in 2026 - and what you can do about it

In today’s cloud-driven, multi-platform environments, answering the simple question - who owns that database? - is no longer straightforward. As teams adopt open-source tools and spin up cloud services on demand, ownership is becoming fragmented across development, operations, and data teams. This shift is accelerating innovation but also creating new challenges in visibility, control, and accountability, as Grant Fritchey explains.