Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Why Gaining Control of Your #telemetry Data Is a Game Changer

Disconnected pipelines. Unknown data sources. Costs that do not add up. Many teams struggle to answer a simple question. What data do we have and where is it going? In this clip, a Cribl customer explains how bringing all telemetry data together changed everything. With Cribl, their team can finally see what they collect, where it flows, and what it costs. That clarity unlocked smarter reduction, better routing decisions, and major optimization across security and observability workflows.

OnlineOrNot's lessons from Cloudflare's outage on 2025-11-18

On 2025-11-18 at 11:48 UTC, Cloudflare declared an incident affecting the global network (that also affected OnlineOrNot). OnlineOrNot monitors websites, APIs, web apps, and cron jobs, while providing status pages as well. While we partially mitigated the issue by enabling a fallback to AWS-based monitoring, between 13:00 UTC and 14:33 UTC failing checks went unreported, heartbeat checks over-reported, and status pages were unavailable.

AI-Suggested Alert Thresholds for Mobile Telemetry

Life is pretty good. I’ve shipped a mobile app and I’m (happily) drowning in telemetry. Battery impact, time in foreground/background per screen, crash rates, slow frames, network retries – the works. The data is brilliant; the challenge is turning signals into reliable alerts that catch real issues which are relevant to my app’s functions. So… what should I actually listen for, and where should I set the thresholds?

Outage map now available in your StatusGator board

We’re excited to introduce a helpful new update to your StatusGator experience – the service outage map is now built directly into your StatusGator account. StatusGator has displayed outage heatmaps on our public website’s service landing pages. These maps helped users understand where issues were being reported across the globe. Now, we’ve taken that same valuable visibility and placed it inside your board.

StatusGator earns SOC 2 Type 2 certification

We are absolutely thrilled to share some momentous news: StatusGator has officially achieved SOC 2 Type 2 certification! This isn’t just another checkbox on a compliance list – it’s a powerful validation of our dedication to safeguarding your data and delivering the reliable service you depend on.

Stay audit-ready with real-time file change alerts in Site24x7 server monitoring

Maintaining the integrity of server files and directories is essential for security, operational resilience, and compliance. Whether it’s business-critical application configurations, sensitive data files, or audit logs, any unauthorized, unexpected, or accidental modification can jeopardize service continuity and expose an organization to regulatory risks. Manual file monitoring is impractical at scale.

How OpManager powered IT reliability for DWHIN

In healthcare, every moment counts—and for Detroit Wayne Integrated Health Network (DWIHN), every heartbeat depends on a network that doesnt skip one. Serving over 75,000 patients across Detroit and Wayne County, DWIHN’s IT network powers essential behavioral health services, from autism care to crisis intervention. When its systems started showing signs of strain, DWIHN turned to ManageEngine OpManager to bring reliability, clarity, and calm back to its IT operations.

Introducing Kentik AI Advisor

Introducing Kentik AI Advisor. AI with a comprehensive understanding of your network that thinks critically and advises how to design, operate, and protect infrastructure at scale. With the rise of hybrid cloud networks and the growing demands of AI infrastructure, network teams are under pressure to balance cost, performance, and security, often with limited resources that delay critical strategic initiatives.

Better together: Cribl and Microsoft Fabric just got radically simpler

In September, I wrote about how Cribl and Microsoft Fabric Real-Time Intelligence provide a powerful combination, unlocking new analytics capabilities for security and IT teams. I also said there was more to come… Today, Cribl is thrilled to announce a new Cribl Destination for Microsoft Fabric Real-Time Intelligence, marking another big step forward in our collaboration with Microsoft to make it much easier for Cribl customers to use Fabric.