Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The ins, outs, and benefits of using Grafana Loki as a backend logging solution

As organizations have moved from monolithic to microservice-based architectures, there has been an explosion in the volume of logs generated. Most logging solutions create a full index of the logs and use SSD drives, which results in costly compute and storage resources for logs that are mostly write once, read never. We created Grafana Loki to solve these problems. Loki only indexes the metadata of the log lines, relies on inexpensive object storage, and is architected for scalability. In addition, Loki takes advantage of parallelism and sharding that results in fast query performance. In this session, we will discuss the benefits of using Loki as a backend logging solution.

Video: Get started with Grafana Mimir in minutes

Since we launched Grafana Mimir — the most scalable, most performant open source time series database in the world — we have answered many of your questions about our latest open source project, including how to pronounce it. (All together now: /mɪ’mir/.) We have walked through how we scaled Grafana Mimir to 1 billion active series. And we will be hosting webinars to showcase cutting-edge features like query sharding and the two-stage compactor.

Ask Miss O11y: Logs vs. Traces

Ah, good question! TL;DR: Trace instead of log. Traces show connection, performance, concurrency, and causality. Logs are the original observability, right? Back in the day, I did all my debugging with `printf.` Sometimes I still write `console.log(“JESS WAS HERE”)` to see that my code ran. That’s instrumentation, technically. What if I emitted a “JESS WAS HERE” span instead? What’s so great about a span in a trace? Yeah, and so do logs in any decent framework.

Businesses Must Know About the Best Practices in Asset Monitoring

Lots of organizations do not pay attention to their assets and pieces of equipment, who is using them and where are they located. These are particularly important questions but usually, they are ignored as a result assets are lost and nowhere to be found when they are required. Lots of employees waste their time finding the required assets and pieces of equipment. It also leads to delayed production work. Overall, the top line and bottom line suffer.

Network Alerts-Monitoring and Notifications

When it comes to IT, you can’t do anything with an asset you can’t see. When it comes to your networking, monitoring offers the eyeballs to know what is going on. But IT and network pros don’t spend all day staring at a dashboard waiting for something to happen. Like your local police department, they rely on notifications of trouble. Instead of 911 calls, IT depends on network alerts.

Kubernetes: Tips, Tricks, Pitfalls, and More

If you’re involved in IT, you’ve likely come across the word “Kubernetes.” It’s a Greek word that means “boat.” It’s one of the most exciting developments in cloud-native hosting in years. Kubernetes has unlocked a new universe of reliability, scalability, and observability, changing how organizations behave and redefining what’s possible. But what exactly is it?

Monitoring Hyper-V and ESXi-what should you do?

Over the years, I found that building out monitoring scripts and using them properly has proven to be a challenge. When I look back at my internal IT days using platforms like Whatsupgold, PRTG, or N-central, the question always remained the same: how can I monitor efficiently and get alerts that matter? In this blog post, I thought I’d tackle something that is a challenge for a lot of people: monitoring Hypervisors.

Monitoring critical systems at Roblox with Grafana and Grafana Agent

It’s like an obby unto itself: With roughly 100 million global active users, how does an observability team monitor operations and troubleshoot problems that pop up across more than 10,000 servers? In this session, you’ll get an inside look at how the Roblox team evolved their observability platform to combine a multitude of data sources, from low-level hardware health to high-level performance metrics. Grafana Agent plays a key role by replacing many special-purpose pipelines with a single, easy-to-manage tool. Roblox’s observability team has met growing demand to provide actionable insights to hundreds of engineers, covering dozens of data sources and thousands of Grafana dashboards, which all help keep its infrastructure running and ready for play.

Understanding data analysis and online activity with David Belson

Network AF host and Kentik CEO Avi Freedman discusses data analysis and trends in understanding online activity with David Belson. David is Cloudflare's Head of Data Insight, where he helps the organization communicate information about the internet such as outages and changes in protocol adoption.