Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Microsoft Outage MO842351: Understanding Impact & Scope Saves You From Raising Unnecessary Alarm Bells

Just ten days after the last major Microsoft 365 outage, Microsoft reported another incident at 8:48 am on July 30, 2024. The message on X was vague, offering limited details about the scope and impact of the problem. This left many IT teams preparing for what they anticipated would be another rocky day.

Understand your Kubernetes cost drivers and the best ways to rein in spending

In the previous blog post in this two-part series, we discussed the critical signals you need to monitor in your Kubernetes environment to ensure optimal resource provisioning. These signals include high CPU and memory utilization, frequent pod evictions, slow application performance, and other indicators that your resources are over- or under-provisioned. Monitoring these signals is essential for maintaining an efficient, cost-effective, and environmentally sustainable Kubernetes environment.

Achieving Autonomic IT: Your Journey to Highly Efficient Operations and Elevated Business Performance

In today’s fast-paced digital business landscape, IT service management teams face immense pressure to swiftly adapt to new technologies and meet stringent SLAs. To ensure optimal customer experiences and drive business growth, organizations need an approach that goes beyond current AIOps and semi-autonomous market offerings – they need Autonomic IT. Imagine a self-managing IT environment that monitors and optimizes technology investments as it runs.

Staying on Top: Nexthink's Continuous Pursuit of Excellence

"It's tough to get out of bed to do roadwork at 5 am when you've been sleeping in silk pajamas." This quote from boxing champion Marvin Hagler, I feel, perfectly encapsulates the relentless drive needed to sustain excellence in any endeavor. It speaks to Hagler’s vigilance against complacency, an ethos that resonates deeply with us at Nexthink, especially as we celebrate our 20th anniversary and our ongoing status as a Leader in the Forrester Wave.

The MING Stack: What It Is and How It Works

The Internet of Things (IoT) is rapidly reshaping the world. From smart devices in our homes to connected sensors in industrial settings, the amount of data generated is rapidly increasing. But what use is this data if we can’t collect and analyze it in real-time to gain key insights? This is where the MING stack (which includes Mosquitto/MQTT, InfluxDB, Node-RED, and Grafana) comes in. This powerful combination of open-source tools is intended to simplify IoT data management.

Top Nagios Alternatives for Advanced Network Monitoring

Monitoring the health and performance of IT infrastructure is crucial for practically all organizations to ensure the reliability, availability, and efficiency of an organization's technology environment. By continuously tracking servers, network devices, applications, and services, organizations can promptly detect and address issues before they escalate into significant problems and impact customers.