The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

15 years of unwavering customer trust: the Site24x7 story

Drawing synergy from ManageEngine, Zoho Corporation's business IT division, Site24x7 grew steadily to cover all geographies and sectors. We extended observability to cover the entire gamut of the rapidly changing IT infrastructure landscape. Today, Site24x7 is an AI-powered, comprehensive IT monitoring solution with a keen eye on privacy and security.

Introduction to Collecting Traces with OpenTelemetry

OpenTelemetry (also abbreviated as OTEL) is an increasingly popular open-source observability framework hosted by the Cloud Native Computing Foundation (CNCF), where it is currently the most active project after Kubernetes. It was created to establish a unified, vendor-agnostic way of instrumenting, collecting, and exporting telemetry data (traces, logs, and metrics) from your systems and applications.

How many metrics? A guide to estimating the size of your system in Grafana Cloud

Grafana Cloud, our composable observability platform, is billed based on usage. A common question we get is: “How much will it cost to monitor N servers?” Well, the recently expanded Grafana Cloud Free tier includes up to 10,000 active series. To help you understand what that translates to in terms of time series requirements, here’s a rough guide to estimating what you’ll need.
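As a rough illustration of the kind of estimate described above, the sketch below multiplies a server count by an assumed number of active series per server and compares the result with the 10,000-series free tier mentioned in the text. The per-server figure of 500 is a hypothetical placeholder, not a number from Grafana Labs; measure your own exporters before relying on it.

```python
# Minimal sizing sketch. Only the 10,000 active series limit comes from the
# text above; every per-server figure used here is a hypothetical assumption.

FREE_TIER_ACTIVE_SERIES = 10_000


def estimate_active_series(servers: int, series_per_server: int) -> int:
    """Return the estimated total number of active series for a fleet."""
    if servers < 0 or series_per_server < 0:
        raise ValueError("counts must be non-negative")
    return servers * series_per_server


def max_servers_within_limit(series_per_server: int,
                             limit: int = FREE_TIER_ACTIVE_SERIES) -> int:
    """Return how many servers fit under the limit at a given series rate."""
    if series_per_server <= 0:
        raise ValueError("series_per_server must be positive")
    return limit // series_per_server


if __name__ == "__main__":
    assumed_series_per_server = 500  # hypothetical; depends on your exporters
    total = estimate_active_series(12, assumed_series_per_server)
    print(f"Estimated active series for 12 servers: {total}")
    print(f"Fits in free tier: {total <= FREE_TIER_ACTIVE_SERIES}")
    print("Servers that fit:", max_servers_within_limit(assumed_series_per_server))
```

The estimate is linear by design; in practice, labels with many distinct values can multiply the series count well beyond a per-server average.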

How to observe your TensorFlow Serving instances with Grafana Cloud

The world of AI and machine learning has evolved at an accelerated pace these past few years, and the advent of ChatGPT, DALL-E, and Stable Diffusion has brought a lot of additional attention to the topic. Being aware of this, Grafana Labs prepared an integration for monitoring one of the most used machine learning model servers available: TensorFlow Serving. TensorFlow Serving is an open source, flexible serving system built to support the use of machine learning models at scale.

The Evils of Data Debt

In this livestream, Jackie McGuire and I discuss the harmful effects of data debt on observability and security teams. Data debt is a pervasive problem that increases costs and produces poor results across observability and security. Simply put, garbage in equals garbage out. We delve into what data debt is and some long-term solutions. You can also subscribe to Cribl’s podcast to listen on the go!

Understanding Network Traffic Monitoring

Network traffic monitoring has become critical in today's digital age, where businesses rely on various applications and services to operate. As the amount of data transmitted over networks continues to grow exponentially, network administrators must keep a close eye on traffic. Doing so requires a deep understanding of packet flows, collection methods, and analytics, so that networks stay secure and perform optimally.

Azure Integration with Graphite and Grafana

In this article, we will see how we can integrate an Azure data source with Graphite and Grafana. This will allow us to monitor metrics from the applications hosted in the Azure cloud on a Grafana dashboard. We will also see how to integrate Azure Active Directory with MetricFire’s Hosted Graphite and Grafana. You don’t need fully functional cloud services running with Azure to understand this article, but it assumes that you have basic familiarity with Azure Cloud.

Sponsored Post

Revolutionize Your Enterprise Operations with CloudFabrix Observability Data Modernization

If you research modern observability solutions to manage multi-cloud and hybrid IT environments, you will inevitably learn about OpenTelemetry (OTEL or OTel). The technology has become so widespread that dev or ops professionals still unaware of it are afraid to ask what it actually means. Fret not, as we’ll describe it for you here.

Impact of AI on IT Operations

The rise of artificial intelligence (AI) in every domain is apparent, and as a result, everyone needs to understand the impact of AI on IT operations. AI is a field of computer science that focuses on developing intelligent machines that can perform tasks that typically require human intelligence and decision-making. But what exactly are IT operations?