Operations | Monitoring | ITSM | DevOps | Cloud

April 2023

Feature Spotlight: Dynamic Kubernetes Observability Dashboards

If you're a software engineer working with Kubernetes, you know how vital it is to have accurate, real-time information about your applications and resources. With StackState's dynamic Kubernetes observability dashboards, you can now access all the essential data you need for troubleshooting on a single screen. In this blog post, we'll discuss the key features of these dashboards, why they're valuable and how to get started with them.

Plexporters, Energize: How we monitor Plex with Grafana

As a Grafanista, you tend to find things to visualize — databases, microservices, classic video games, etc. It’s part of our “big tent” philosophy. So when our December hackathon rolled around, some of us in our internal homelab Slack channel decided to take a look at how we could get metrics out of our Plex Media Servers.

Grafana k6 v0.44.0 release: web crypto API, Web Vitals metrics, and more!

Grafana k6 v0.44.0 has been released, featuring new experimental modules, an upgraded browser module, and tons of improvements. Get Grafana k6 0.44.0 Here’s a quick overview of the latest k6 news from the team and the community.

Grafana 9.5 release: Grafana Alerting updates, stronger security with service accounts, upgraded dashboards, and more

Grafana 9.5 has arrived! 🔥🎉 Get Grafana 9.5 The latest Grafana release introduces new features and improvements, such as major Grafana Alerting improvements, dashboard and visualization enhancements, a redesigned navigation experience, support bundles for faster issue resolution, and much more to provide you with better insights into your data.

Multiple players, one stack: Inside Roblox's centralized observability stack

When you sign into the Roblox platform, you get 30 million immersive experiences, ranging from concerts to fashion shows to, of course, video games. But when the observability team at Roblox logs on, they’re not playing around. The Roblox observability engineers are responsible for keeping more than 214 million monthly users happy and engaged by making the wildly popular gaming platform highly available around the world.

Supercharging Grafana with the Power of Telemetry Pipelines

Grafana is a popular open-source tool for visualizing and analyzing data from various sources. It provides a platform for creating interactive, customizable dashboards that display real-time data in various formats, including graphs, tables, and alerts. When powered by Mezmo's Telemetry Pipeline, Grafana can access a wide range of data sources and provide a unified view of the performance and behavior of complex systems.

Find connections and expand your data visualization with new dashboards

One of my favorite movies of all time is WarGames, which depicted a teenage hacker accidentally breaking into NORAD and starting a nuclear war simulation that almost turned into a real catastrophe. The movie featured state-of-the-art dashboards (at least for 1983) showing simulated missile launches by different countries. Now you can create Sumo Logic dashboards like the ones shown in this movie using our new Connection Map panel.

Learn how to monitor IoT devices with Grafana

IoT devices open the door to all sorts of computing potential, but they can also produce a flood of telemetry data that users need to properly collect and monitor to ensure those devices are working properly. It’s no wonder so many individuals and businesses use Grafana for IoT use cases, whether they’re starting an aquaponic farm in South Africa, managing an industrial-scale electroplating factory in Ohio, or simply keeping tabs on Pretzel the python at its home in the UK.

Cloud Monitoring Console's Health Dashboard: Maximize Your Monitoring Efficiency

Are you a Splunk Cloud admin tired of sifting through various tools and dashboards to monitor the health of your Splunk Cloud deployment? Do you often find yourself wondering what actions you can take to keep your Splunk Cloud deployment running smoothly? Are you looking for ways to be alerted before something impacts your deployment performance? Look no further than the Cloud Monitoring Console's Health Dashboard!

Best Grafana dashboard for IoT Device monitored via MQTT metrics and Graphite

Internet-of-Things (IoT) devices and technologies have penetrated into our life. Nowadays, we can easily find IoT devices around us. Home appliance manufacturers produce IoT products for smart homes. Car makers invest in smart vehicles that can communicate with other smart cars and IoT objects nearby. Governments allocate huge resources to build smart cities with IoT technologies to improve the quality of life for people and promote economic growth.

Grafana vs Graphite: A Comparison for Data Visualization and Analysis

Data generation today is at an unprecedented level, and we are generating an inexhaustible amount of data. As a matter of fact, more data has been created over the past two years than ever before in the history of mankind. This throws a big complexity in front of us. How do we even manage such a huge amount of data? Where do we store them? Can they be segregated to fit into our needs, who would do that for us, and so on! The questions are endless, and so is the rate of generation of new data.

Observability overload: Insights into the rise of tools, data sources, and environments in use today

With countless observability tools, data sources, and environments to juggle, the organizations that deploy and manage today’s distributed applications often face an uphill battle to gain visibility into their application performance. That was a key takeaway from the Grafana Labs Observability Survey 2023, which incorporated input from more than 250 industry practitioners who are all too familiar with these complexities.

How to troubleshoot memory leaks in Go with Grafana Pyroscope

Memory leaks can be a significant issue in any programming language, and Go is no exception. Despite being a garbage-collected language, Go is still susceptible to memory leaks, which can lead to performance degradation and cause your operating system to run out of memory. To defend itself, the Linux operating system implements an Out-of-Memory (OOM) killer that identifies and terminates processes that consume too much memory and cause the system to become unresponsive.

Grafana Cloud is now available in AWS Marketplace

Grafana Labs is excited to announce that Grafana Cloud is now available in AWS Marketplace. With this new offering, existing AWS customers can procure, deploy, and scale the fully managed Grafana LGTM observability stack (Loki for logs, Grafana for visualization, Tempo for traces, Mimir for Prometheus metrics) with just a few clicks.

Tutorial: Querying InfluxDB with SQL and Grafana

Thank to a new and improved core database engine, InfluxDB now supports native SQL queries. Leveraging Flight SQL, InfluxData Developer Advocate, Jay Clifford, walks through how to set up querying InfluxDB using Grafana. He shows you how to configure Flight SQL and how to build a dashboard in Grafana for visualizing industrial IoT data.

Preview: Cloud 66 New Dashboard to Streamline Application Deployment and Management

If you're a developer looking for an easier way to manage your Rails, JAMstack, or containerized applications, you'll be pleased to know that our team has worked hard to launch a new dashboard that simplifies the process. The new dashboard will be released to general availability next week. Here is a quick preview, let us know your thoughts.

Observability for business decision making: bridging the gap between IT and the business

Hear from Corneile Britz, Observability and DevOps Specialist and Co-Founder of Boxfish how Observability can provide real-time data to support business decision-making and improve customer experience, enabling collaboration and trust between IT and business stakeholders.

Grafana Alerting: Searching for Grafana alerts just got faster, easier, and more accurate

Grafana Alerting enables users to create and customize alert rules as separate entities and link them to Grafana panels. It also supports various data sources with built-in alerting engines, such as Prometheus, Grafana Mimir, and Grafana Loki, allowing users to manage their alert rules directly from Grafana’s UI.

Dashboard Fridays: Sample VMware dashboard

These three VMware dashboards built in SquaredUp provide a full overview of the data in a VMware deployment so users can spot performance issues and fix them fast. Having all the VM performance and health metrics in one place – for your VMs, VM hosts, and guest VMs – allows engineers to pre-empt issues and fix them before they become problems for end users.

How to collect and query Kubernetes logs with Grafana Loki, Grafana, and Grafana Agent

Logging in Kubernetes can help you track the health of your cluster and its applications. Logs can be used to identify and debug any issues that occur. Logging can also be used to gain insights into application and system performance. Moreover, collecting and analyzing application and cluster logs can help identify bottlenecks and optimize your deployment for better performance.

How to monitor Microsoft SQL Server performance with Grafana Cloud

A database is one of the most critical components for almost every application. Making sure it is running with the expected read and write latencies is paramount. This can be the difference between a smooth, pleasing user experience and a slow, error-filled one that makes your customers turn their back on a product — and never come back.

How to get started with monitoring Apache Cassandra with Grafana Cloud

Apache Cassandra is a highly scalable, open source NoSQL database system designed to handle large amounts of data across multiple commodity servers with no single point of failure. Apache Cassandra can be run as a single node but starts making sense when its run in a cluster setup. The system is optimized for high write throughput and is known for its ability to handle big data workloads with ease at super-low latencies.

Scrape Azure metrics and monitor AKS using Grafana Agent

As more organizations adopt cloud-based services like Microsoft Azure Kubernetes Service (AKS), it becomes increasingly important to monitor and manage the performance and reliability of these services. If you’re using AKS today, then Grafana Cloud provides the flexibility, performance, and visualizations you need to monitor your distributed applications.

Grafana Loki 2.8 release: TSDB GA, LogQL enhancements, and a third target for scalable mode

Grafana Loki 2.8 is here — and it’s at least 0.1 better than Loki 2.7! Jokes aside, this release includes a number of improvements users will appreciate. In addition to graduating our TSDB index from Experimental to General Availability, we’ve added a number of nifty LogQL features, and we’ve made the Loki deployment and management experience much easier. This also marks the release of Grafana Enterprise Logs (GEL) 1.7.

Grafana Alerting: A beginner's guide to templating alert notifications

We often see questions about how to template alerts. In Grafana, you can template information about your alerts with custom labels and annotations, and you can also template how notifications look and what information they contain with notification templates. Many users confuse the two, despite being separate features with different use cases.

MetricFire - Dashboards Tutorial

In this video, we explore MetricFire’s powerful dashboarding platform. Discover how to customize your dashboard to view the metrics that matter most to your business. With a variety of visualization options and an intuitive interface, creating and editing charts, tables, and graphs has never been easier. Share your dashboard with your team and keep everyone on the same page. Join MetricFire and start tracking your business metrics today.

Frontend vs. backend: How to plan your performance testing strategy

There are many aspects of application performance, but they broadly fall into two categories: frontend performance and backend performance. As a tester, it’s important to know the differences between the two and how that impacts the way you approach your tests. In this blog, I’ll provide a high-level overview of frontend performance testing and backend performance testing, including pros and cons of each one.