
Apple Developer outage on March 10th

On March 10, 2026, developers around the world began experiencing issues with Apple Developer services that prevented apps from being verified or launched on physical devices. For many teams building and testing iPhone apps, the outage disrupted development workflows and blocked deployment to test devices. The issue appeared to involve Apple’s developer certificate verification systems.

Improved Azure status integration

Monitoring Azure health across large environments should not require complicated setup. Until recently, connecting Azure to StatusGator required configuring access at the subscription level, which could become difficult for organizations managing dozens or even hundreds of subscriptions. We redesigned the Azure integration to make it simpler, more scalable, and easier to manage.

Let's Encrypt 45-Day Certificate Expiration: Monitoring & More

TLS certificate lifetimes are shrinking fast, and that changes how every organization handles renewals, validation, and outage prevention. Let’s Encrypt has confirmed it will move from 90-day certificates to 45-day certificates (with staged rollouts) and dramatically shorten authorization reuse windows. At the same time, the CA/Browser Forum’s Ballot SC-081v3 has adopted a broader industry schedule that ultimately caps public TLS certificates at 47 days by March 15, 2029.
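
To make the impact on renewal monitoring concrete, here is a minimal Python sketch. It assumes a rule of thumb of renewing once two thirds of a certificate's lifetime has elapsed; that fraction, and the helper names `days_remaining` and `should_renew`, are illustrative assumptions rather than anything prescribed by Let’s Encrypt or the ballot. The point it illustrates is that a threshold derived from the certificate's own validity window keeps working when lifetimes drop from 90 to 45 days, whereas a hardcoded "renew at 30 days left" alert would fire much earlier in a 45-day certificate's life.

```python
from datetime import datetime, timedelta, timezone


def days_remaining(not_after: datetime, now: datetime) -> float:
    """Days until the certificate expires (negative if already expired)."""
    return (not_after - now).total_seconds() / 86400


def should_renew(not_before: datetime, not_after: datetime, now: datetime,
                 renew_fraction: float = 2 / 3) -> bool:
    """True once `renew_fraction` of the validity window has elapsed.

    The 2/3 default is an assumed rule of thumb, not a mandated value.
    """
    lifetime = not_after - not_before
    return now >= not_before + lifetime * renew_fraction


# Hypothetical 45-day certificate issued on 2026-01-01 (UTC).
issued = datetime(2026, 1, 1, tzinfo=timezone.utc)
expires = issued + timedelta(days=45)
check_time = issued + timedelta(days=31)

print(days_remaining(expires, check_time))
print(should_renew(issued, expires, check_time))
```

In practice the `not_before` and `not_after` values would come from the certificate itself rather than being constructed by hand as they are here.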

Monitoring Your Node.js App Health on Fly.io

Your Node.js service has just been containerized and deployed across continents with a single fly deploy command. Everything seems fine, but a week later a user messages you to say the app is slow. You run fly logs, scroll through the output, and find nothing out of the ordinary. The Fly.io dashboard says the app is running and healthy, but something behind the scenes is slowing it down, and you have no idea what. You don’t even know where to start.

Seven early warning signs you're heading toward a governance crisis

Governance failures rarely start with a major outage or a failed audit. They start with small, localized signals that teams treat as isolated annoyances. By the time a crisis becomes visible, the structural breakdown is already expensive to fix. If you are in IT leadership or platform engineering, you have likely seen these signs. The risk is ignoring them until they consolidate into a systemic failure.

Redgate Test Data Manager Updates - March 2026

This is a guest post from James Hemson. Redgate Test Data Manager's latest release adds Entra ID authentication, multi-target anonymization, and direct treatment code editing, with workflow improvements to make pipeline management faster and more flexible. Entra ID authentication: you can now connect to SQL Server using token-based authentication via Azure Entra ID, for both anonymization and subsetting.

Native OpenTelemetry inside Alloy: Now you can get the best of both worlds

We're big proponents of OpenTelemetry, which has quickly become a new unified standard for delivering metrics, logs, traces, and even profiles. It's an essential component of Alloy, our popular telemetry agent, but we're also aware that some users would prefer to have a more "vanilla" OpenTelemetry experience.