Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Linux Security Logs: Complete Guide for DevOps and SysAdmins

Security logs are the quiet sentinels of your Linux systems, recording critical information that can mean the difference between detecting an intrusion and discovering a breach months too late. For most DevOps professionals and system administrators, these logs contain valuable insights that often go untapped. While they're essential for compliance, their real value lies in providing visibility into your system's security posture and operational health.

You, Me, and BugSplat's MCP

Let's face it - from an experienced developer's perspective, most software trends are, put lightly, incredibly annoying. The last thing a grizzled, old, technical wizard wants to hear is some half-brained junior developer telling them to switch their SQL server to MongoDB, replace the PHP EC2 with serverless Python, or rewrite their entire front-end with HTMX. The hype-train is so intense that even watching TV feels risky, as you might see something as absurd as an ad for AI toothpaste.

Prometheus vs Zabbix: A Hands-On Technical Comparison and a Modern Alternative

When choosing a monitoring tool, two popular names often come up, Prometheus and Zabbix. Both are powerful and widely adopted but come with different approaches and learning curves. Prometheus is favored in cloud-native environments for its time-series data model and flexibility, while Zabbix has long served traditional IT infrastructures with its rich agent-based monitoring. But what if you are looking for a simpler, more unified solution?

Tracealyzer Was Just the Beginning

If you’ve been building embedded systems for a while, chances are you know Percepio for Tracealyzer. And we’re proud of that. For over a decade, Tracealyzer has been helping engineers visualize and solve complex RTOS issues faster, with over 30 ways to slice and understand system behavior. But in 2025, embedded systems demand more. They’re always on. Always connected. And increasingly, always business-critical.

Getting started with ServiceNow dashboards

ServiceNow is a cloud-based platform that streamlines IT service management, operations, and various business workflows across organizations. Dashboards in ServiceNow can play a valuable role by offering a clear view of key metrics, trends, and performance indicators. While there are dashboards locally in ServiceNow portal, they often fail to provide a fuller picture of the impact of the incidents in context with other key metrics from external tools.

vmalert - Maximize Your Monitoring - Tech Talk #5

This time, we're diving into a critical component for operational excellence: vmalert. Effective alerting is the backbone of proactive monitoring, enabling teams to detect and respond to issues swiftly before they impact users. But setting up truly effective alerting – alerts that are reliable, actionable, and low-noise – requires understanding the tools and best practices.

Baseline configuration management: Why it's critical for network stability

Imagine this: You've onboarded 30 new switches, 15 firewalls, and 20 routers into your network. You assume they all follow company policy. But months later, half of them are misconfigured, a few are running vulnerable firmware, and one rogue device is exposing ports it shouldn't. That’s not poor luck—that’s poor baseline configuration.

8 Network Statistics IT Pros Should Know to Understand and Optimize Network Performance

Slow Zoom calls, dropped VPN connections, and lagging applications sound familiar? These common network frustrations often stem from underlying performance issues that could be diagnosed and resolved with the right data. For IT professionals, raw network metrics alone aren’t enough. To truly optimize performance, you need network statistics: aggregated, analyzed, and interpreted insights that turn numbers into actionable decisions.

Third party API Monitoring powered by OpenTelemetry semantics

In today’s cloud-native world, third-party APIs are everywhere. Payments, notifications, search, AI, analytics as modern applications are built on a web of external services. But what happens when one of those APIs slows down, starts throwing errors, or gets rate-limited? Suddenly, your users are facing issues, and you’re stuck asking.