Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Key learnings from the State of Cloud Costs study

We recently released our initial State of Cloud Costs report, which identified factors shaping the costs of hundreds of organizations that use Datadog Cloud Cost Management to monitor their AWS spend. The report reveals several widely applicable themes, including the ways in which resource utilization, adoption of emerging technologies, and participation in commitment-based discount programs all shape cloud environments and costs.

Attach Screenshots to Your Playwright Test Reports

Today I want to show you how you can attach your screenshots directly to Playwright's test reports. Imagine you have a simple Playwright test that navigates to Checkly. You take a screenshot and store it in screenshots/home.png. Then, you click a link in the main navigation, expect a specific heading to be visible, and take another screenshot. When you run this test using npx playwright test, the test passes, and you find the screenshots in the /screenshots directory.

Deliver Peak Microsoft Teams Performance at Scale

Scale is a perennial challenge for most IT teams. While organizations expect the same performance and experience whether 500 users are accessing essential applications or 50,000, IT headcount rarely increases in proportion with organizational growth. This often leaves IT departments overtaxed and pressed to triage the most urgent concerns. But even that requires good data to inform decisions — which can be in short supply.

Better root cause analysis: Mastering alert insights with the new central history timeline

A year ago we rebuilt our alert rule state history, using Grafana Loki for storage and updating the UI to display a timeline of all state changes of an alert rule. As a result, users can now conduct better root cause analysis by going down to the level of an alert rule and seeing when certain alert instances started or stopped firing. But we aren’t stopping there. To ensure system stability and avert outages, you also need one place to see the state history for all the alerts in your system.

New Relic vs Grafana - 2024 Comparison

New Relic and Grafana are leading tools in monitoring and observability, each with distinct use cases. New Relic excels in Application Performance Monitoring (APM), providing detailed insights for application performance. In contrast, Grafana is designed for data visualization and monitoring, allowing users to create customizable dashboards for metrics and logs. This article provides a clear comparison of their features, including application performance monitoring, log management, and dashboards.

Nexthink the Clear #1 Vendor in DEX. Whichever Way You Look at It.

This headline is maybe a little surprising, even for Nexthinkers. We’re known for being conservative and letting our work do the talking. Nexthink first created the DEX category, then rolled out the world’s most capable and holistic DEX platform, and then delivered year on year as the most successful DEX vendor in the market. Something to do with our Swiss heritage perhaps – an emphasis on diligence and execution, rather than on singing our own praises.

Turbo360 FinOps and Cost Management Is Now Available in the Microsoft Azure Marketplace

Microsoft Azure customers worldwide now gain access to Turbo360 to take advantage of the scalability, reliability, and agility of Azure to drive application development and shape business strategies. Turbo360, an advanced cloud Management platform, today announced the availability of its flagship module, FinOps and Cost Management for Azure, in the Microsoft Azure Marketplace.

Essential Linux Logs To Monitor for System Health

Linux is an open-source operating system kernel originally created in 1991. It has a reputation for being versatile, stable, and secure, hence its wide use on computing devices, beginning from servers and mainframes down to desktop computers, smartphones, and embedded devices. The broad uses for Linux and its popularity have led to the demand for effective monitoring.

Is it Time to Version Observability? Signs Point to Yes

In 2016, we at Honeycomb first borrowed the term “observability” from the wikipedia entry for control systems observability, where it is a measure of your ability to understand internal system states just by observing its outputs. We then spent a couple of years trying to work out how that definition might apply to software systems. Many twitter threads, podcasts, blog posts, and lengthy laundry lists of technical criteria emerged from that work, including a whole ass book.