
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Instrumenting Lumigo for Python using OpenTelemetry

Standardized frameworks play a fundamental role in leveling the playing field and setting the standard within the tech industry, ensuring that everyone has access to the same tools and practices. These frameworks promote best practices and foster innovation and collaboration across different sectors. One example of such a framework is OpenTelemetry, a project that has rapidly gained traction and continued to flourish as an open-source initiative under the Cloud Native Computing Foundation (CNCF).

OpenTelemetry and Elastic: Working together to establish continuous profiling for the community

Profiling is emerging as a core pillar of observability, aptly dubbed the fourth pillar, with the OpenTelemetry (OTel) project leading this essential development. This blog post dives into the recent advancements in profiling within OTel and how Elastic® is actively contributing toward it. At Elastic, we’re big believers in and contributors to the OpenTelemetry project.

Schedule Cron Jobs in Node.js with Node-Cron

Cron jobs are tasks scheduled to run automatically at set times or intervals. They automate repetitive work such as backing up data, sending emails, and updating systems. In Node.js, cron jobs let an application run such tasks without manual intervention, which makes it more efficient and reliable. Several libraries and tools in the Node.js ecosystem provide this kind of scheduling.
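To make "certain times or intervals" concrete, here is a minimal sketch of how a five-field cron expression (minute, hour, day of month, month, day of week) can be checked against a date. The helpers `fieldMatches` and `cronMatches` are hypothetical names used for illustration and are not part of node-cron. The sketch assumes only `*`, `*/n`, single numbers, ranges such as `1-5`, and comma-separated lists. Unlike some cron implementations, it requires both day fields to match.

```typescript
// Illustrative sketch only: hypothetical helpers, not node-cron's implementation.

// Returns true when `value` satisfies one cron field such as "*", "*/15", "2", "1-5" or "0,6".
function fieldMatches(field: string, value: number, min: number): boolean {
  return field.split(",").some((part) => {
    const [range, stepText] = part.split("/");
    const step = stepText === undefined ? 1 : Number(stepText);
    if (!Number.isInteger(step) || step < 1) return false;
    // "*" covers the whole field; steps are counted from the field's minimum value.
    if (range === "*") return (value - min) % step === 0;
    const [loText, hiText] = range.split("-");
    const lo = Number(loText);
    const hi = hiText === undefined ? lo : Number(hiText);
    if (!Number.isInteger(lo) || !Number.isInteger(hi)) return false;
    return value >= lo && value <= hi && (value - lo) % step === 0;
  });
}

// Returns true when `date` (in local time) matches all five fields of `expression`.
function cronMatches(expression: string, date: Date): boolean {
  const fields = expression.trim().split(/\s+/);
  if (fields.length !== 5) {
    throw new Error("expected five fields: minute hour day-of-month month day-of-week");
  }
  const [minute, hour, dayOfMonth, month, dayOfWeek] = fields;
  return (
    fieldMatches(minute, date.getMinutes(), 0) &&
    fieldMatches(hour, date.getHours(), 0) &&
    fieldMatches(dayOfMonth, date.getDate(), 1) &&
    fieldMatches(month, date.getMonth() + 1, 1) && // getMonth() is zero-based
    fieldMatches(dayOfWeek, date.getDay(), 0) // 0 = Sunday
  );
}
```

A scheduler built on this idea would evaluate `cronMatches` once per minute and run the task whenever it returns true. With node-cron itself, that loop is handled for you by passing the expression and a callback to `cron.schedule`.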

Evolving Corporate Sustainability Solutions: An Interview with Sebastien Duprez & Nina Zellweger

As the world continues to feel the pressure of climate change, more and more actors in the private sector are implementing solutions to reduce their carbon emissions and slow down global warming. For many organizations, technology is a major focus in their carbon reduction strategy. And most emissions are linked to digital workplace equipment. In fact, the workplace represents 70% of overall IT-related emissions.

Improving your on-call schedule with runbooks

Incidents are a stressful time for your team: your service isn't working the way you expect and your customers/stakeholders want to know what's going on. The last thing you want to do is let your team improvise everything when it comes to responding to incidents. Google's own SRE book has great overall tips for incident management, part of which involves "develop(ing) and document(ing) your incident management procedures in advance", which this article dives into.

How to Monitor ClickHouse With Telegraf and MetricFire

Monitoring your ClickHouse database is a proactive measure that helps maintain its health and ensure that it continues to meet the needs of your applications and users efficiently. It allows you to address issues before they become critical, ensuring that your database environment is secure, reliable, and performing optimally. In this article, we'll detail how to use the Telegraf agent to collect performance metrics from your ClickHouse clusters and forward them to a data source.

The Value Hosted Graphite brings to the Heroku Marketplace

Hosted Graphite is a time-series metrics monitoring tool used for application, systems, infrastructure, and network monitoring. It is a hosted Graphite service that offers the full capabilities and benefits of Graphite, without the hassle of setting up your own open-source Graphite installation.

SolarWinds Observability helps you troubleshoot faster with New Log Patterns feature

SolarWinds® Observability now brings more intelligence to issue identification to help you troubleshoot smarter and faster. When an entity alert is triggered, Log Patterns automates an AIOps / ML-based analysis of events surrounding the triggering event. Using Log Patterns, you can skip the hours spent manually scrolling through event messages looking for unusual or significant patterns.

The cost of inaction: A CIO's primer on why investing in Internet Performance Monitoring can't wait

When John Wanamaker famously declared, “When a customer enters my store, forget me. He is king,” he unknowingly coined a mantra that remains as relevant today as it was in the 1900s. This philosophy, rooted in the customer service ideologies of his time, holds true not just for brick-and-mortar stores but also for eCommerce.

Signs You Are Suffering From Alert Fatigue

In an IT environment with multiple alerting channels and notifications, it is easy to become overwhelmed and desensitized to alerts. This tendency to avoid or respond negatively to incoming alerts is known as alert fatigue. It is a critical issue for IT teams, given the sheer volume of alerts that modern IT systems generate. You might prioritize the first five alerts you receive in a workday. Maybe even up to the tenth alert. But is the twentieth alert as important?