Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The hidden data challenges CIOs face on their quest to accelerate business outcomes

Navigating the complex terrain of IT systems, operational issues, and security breaches is no easy job, even for the seasoned CIO. And when tasked with the lofty goals of improving operational resilience, mitigating security risk, and enhancing customer experiences, dealing with the day-to-day operations is all the more challenging. Achieving these goals can often feel overwhelming, with no end to the journey in sight.

Celebrating Grafana 10: Top 10 Grafana features you need to know about

Since Grafana started 10 years ago, there have been more than 43,000 commits to the open source project. Grafana founder Torkel Ödegaard has made more than 7,600 of those commits, and he recently reflected on some personal favorites he’s worked on, ranging from early query builders to the latest navigation updates. Torkel isn’t the only one who has strong feelings.

Service Level Objectives: A Complete Overview for Beginners

DevOps engineers are under intense pressure to provide reliable, high-quality services to teams and stakeholders. In large part, this is because end users today demand seamless access to software and a great user experience – a trend that will only increase as digital transformation accelerates and we move further into the future. DevOps professionals rely on various metrics to meet performance and reliability goals, one of the most important being service level objectives (SLOs).

10 Key Application Performance Metrics & How to Measure Them

If you are trying to figure out how to measure the performance of your application, you are in the correct place. We spend a lot of time at Stackify thinking about application performance, especially about how to monitor and improve it. In this article, we cover some of our most important application performance metrics you should be tracking.

Unearthing Gold: Deriving Metrics from Logs with Mezmo Telemetry Pipeline

Logs are like gold ore. They have valuable nuggets of information, but those nuggets often come in a matrix of less helpful material. Extracting the gold from the ore is crucial because it is vital to unlocking insights and optimizing your system(s). Raw logs can be overwhelming, containing informational messages, debug statements, errors, etc. However, buried within this sea of data lies the key metrics you can use to understand your applications' performance, availability, and health.

Intel Leverages Telegraf to Deliver Platform Visibility

Since 2020, the Intel team has been contributing to Telegraf, including both telemetry from Intel-specific platform features (such as Intel® Resource Director Technology, Intel® Dynamic Load Balancer, or power statistics from Intel-based platforms) and telemetry gathered from generic tools and frameworks; for example, Data Plane Development Kit (DPDK), Libvirt, P4 Runtime, or Reliability Availability Serviceability (RAS).