The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

How to Scale and Standardize Observability Practices: Hear from Canva and Atlassian | Grafana

This panel discussion, featuring Jenna, Director of Engineering, Reliability Platforms at Canva and Andrew, Head of Engineering at Atlassian, explored the challenges and strategies of implementing standardization in large tech companies. Atlassian, known for its software development and collaboration tools, initially faced resistance to standardization but shifted as inefficiencies and compliance issues emerged. Canva, a graphic design platform, highlighted the balance between flexibility and standardization, using observability tools for accountability.

From "rebooting" to reliable and secure applications: Optimizing the customer experience

Not so long ago in my career, it was relatively acceptable for infrastructure or development teams to solve a problem by rebooting a server or just “turning things off and on again.” It didn’t matter what caused the problem or how long the fix would last, provided things worked for now. Security teams were always held to a different standard.

Mobile app observability with OpenTelemetry, Embrace, and Grafana Cloud

We are excited to announce an expansion of our partnership with Embrace to bring mobile observability to our users using open standards like OpenTelemetry. We first worked with Embrace last year when they created a plugin for Grafana that gives mobile teams an easy way to visualize and analyze real-time mobile metrics directly in a Grafana dashboard.

Kentik Close-Up 02. Support

Welcome to the second episode of Kentik Close-Up, where we explore the latest Kentik features, products, and capabilities. In this episode, Leon Adato is joined by Chris O'Brien, Product Manager, and Steve Meuse, Solutions Architect, to discuss the challenges and improvements in providing support for network monitoring systems like Kentik NMS. Learn about the innovative approaches Kentik has taken to enhance support experiences, including proactive monitoring, automation, and real-time data visibility.

Your Guide to Observability Engineering in 2024

It may sound complicated and daunting, but so much of observability is about discovering the unknown unknowns in your critical systems. The capabilities of observability engineering can help you make those discoveries. Most organizations have some form of monitoring, alerting and troubleshooting, which can be adequate to a point but fall short when trying to determine the root cause of unexpected outages.

The Importance of Observability for Healthcare Providers

The systems and data that healthcare providers use and process are fundamental to their successful operation. Therefore, these organizations must invest in appropriate and powerful observability solutions that enable them to effectively monitor their systems and valuable data. These tools and solutions allow healthcare providers to securely manage, deliver, and ensure uptime for their entire IT infrastructure.

Mastering Centralized Logging with OpenSearch

OpenSearch is well suited to centralized logging: it offers powerful querying and analysis capabilities, and it’s highly scalable and flexible. In this article, we will explain why you should use OpenSearch for centralized logging, and then show how to configure centralized logging in OpenSearch.

Reducing MTTR and the Hidden Costs of Downtime Through AI & Automation

Of all the KPIs that gauge the health and operational fitness of an enterprise, Mean Time to Repair (MTTR) after an outage or other downtime is one of the most crucial. Yet while MTTR is a universally recognized metric, many organizations still fail to consider the total cost of MTTR when deciding where and how to invest in their IT environments.