Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Top 15 Infrastructure Monitoring Tools

Infrastructure monitoring tools ensure systems’ optimal performance and availability, enabling the identification and resolution of potential issues before they become complex. This article delves into the different infrastructure monitoring tools available and their impact on business continuity and operational efficiency.

Stile Education's Best-of-Breed Observability Strategy

"One of the best things we’ve gotten out of ChaosSearch is the ability to keep all of our data in S3. It’s cheap and easy to keep all of our data available and indexed. We can search through it at any time to dig deeper into problems that crop up." Learn more about how the Stile's team can now retain log data indefinitely, versus saving only a week or two of data in Elasticsearch. That change has increased the team’s capacity to use log data to solve business problems, and unlocked new opportunities to discover deeper product insights.

Our lessons from the latest AWS us-east-1 outage

In case you missed it, AWS experienced an outage or "elevated error rates" on their AWS Lambda APIs in the us-east-1 region between 18:52 UTC and 20:15 UTC on June 13, 2023. If this sounds familiar, it's because it's almost a replay of what happened on December 7, 2021, although that outage was significantly more severe and took longer to restore.

What Is A Time-Series Metric?

Today, businesses and organizations rely heavily on metrics and analytics to make informed decisions. Metrics are important whether you’re a developer, a marketer, or the head of a company. One type of metric that is widely used is a time-series metric. Time-series metrics provide insights into how data changes over time. With time-series data, businesses can track trends, detect anomalies, and make predictions.

Celebrating Grafana 10: Torkel's top 10 moments from a decade of dashboarding

Grafana creator Torkel Ödegaard will never forget the very first GrafanaCON in 2015, when he shared some big news with the audience gathered in New York City. “I’ll always remember standing on stage and announcing that we just reached 12,000 instances and being super proud because it was just a couple of months after we started tracking these numbers,” says Torkel, who also launched Grafana Labs with co-founders Raj Dutt and Anthony Woods in 2014.

FWaaS (Firewall as a Service): How to Monitor Your Traffic Through Cloud

Cybersecurity remains a key concern for any organization. The cost of cybercrime is expected to rise to $8 trillion in 2023 and reach $10.5 trillion by 2025. Various cybersecurity solutions are available, with Firewall as a Service (FWaaS) emerging as one of the most valuable assets when it comes to protecting your interests. We will investigate FWaaS solutions, how they work, how they're different from traditional firewalls, and what benefits they can provide for a range of organizations.

What are Connectors in OpenTelemetry?

The OpenTelemetry Collector plays many different roles in the observability ecosystem. One of its most important roles is that of a telemetry processor. Recent upgrades to the Collector have enhanced its ability to condense, derive, replicate, and reason about telemetry streams. This is achieved with a new class of pipeline components called Connectors.

Introducing Goliath Technologies ChromeOS Device Monitoring and Troubleshooting Solution

Goliath Technologies recently introduced their ChromeOS Device Monitoring and Troubleshooting Solution. They have partnered with Google to be able to provide rich data about the performance and health of ChromeOS and ChromeOS Flex devices. Goliath Technologies is the only monitoring and troubleshooting platform that has access to the Google APIs to get this ChromeOS data. The Goliath Technologies solution tackles issues using a user experience monitoring model.