Operations | Monitoring | ITSM | DevOps | Cloud

Centrally govern and remotely manage Datadog Agents at scale with Fleet Automation

As customers scale to thousands of hosts and deploy increasingly complex applications, it can be difficult to ensure that every host is configured to give you the visibility you need to monitor your infrastructure and applications. To ensure visibility across a growing number of hosts, you need to know that your observability strategy is implemented uniformly across your entire fleet of Datadog Agents installed on these hosts.

NiCE VMware Management Pack 5.7

In the dynamic realm of virtualization management, the latest release of the NiCE VMware Management Pack is bringing new, much-desired monitoring features to the SCOM admin table. Packed with a slew of powerful features and enhancements, this release promises to enhance the monitoring capabilities for Microsoft System Center Operations Manager (SCOM) and Azure Monitor SCOM Managed Instance.

Announcing the Splunk Add-on for OpenTelemetry Collector

The Splunk Add-on for OpenTelemetry Collector is a variation of the Splunk Distribution of the OpenTelemetry Collector that simplifies metrics and traces data collection, configuration and management. Since it is an add-on, users can deploy it alongside Universal Forwarders using tools like Deployment Server to start collecting high-fidelity metrics and traces from 1000s of their hosts easily. We’re happy to announce that the Add-On is now generally available in Splunkbase.

Beta Announcement: Bring Your Own Kubernetes with Qovery

Today marks a significant milestone for Qovery and a highly anticipated evolution of our product, especially among Platform Engineers and DevOps professionals. We're thrilled to announce the "Bring Your Own Kubernetes" (BYOK) offer in beta access, a transformative step in our journey towards more flexible and adaptive infrastructure management. Please keep reading to understand why we extend our offer.

Introducing Responsive Pipelines from Mezmo

The ability to swiftly resolve incidents is central to SREs responsible for a service's reliability and its users' satisfaction. Mezmo has recognized this need and, at Kubecon, unveiled an innovative solution: Mezmo Responsive Pipelines. Responsive Pipelines enable users to pre-configure a Pipeline to respond automatically in the case of an incident.

Set and scale service level objectives in Grafana Cloud: Introducing Grafana SLO

When we began offering Grafana Cloud Metrics, we set a service level agreement (SLA) for 99.5% of requests to be completed within a few seconds. So we built an alert that would go off if more than 0.5% of requests were slower than a couple of seconds within a five-minute moving window. Sounds reasonable, right?

The Future of Operations: AI-powered Internet Performance Monitoring

At Catchpoint, our philosophy is that AI should not be adopted simply for the sake of AI itself. Instead, it should be embraced when it proves to be the most effective solution for addressing a particular business challenge. While the world is currently in the fervor of the oncoming AI revolution, our industry-leading IPM platform has quietly harnessed the potential of Artificial Intelligence for years.

Not Every Problem is an Error: Introducing Rage and Dead Clicks + New User Feedback Reports

I know, we’re Sentry the error and performance monitoring platform and we catch production issues. But as you (hopefully) saw during our Launch Week announcement, some broken experiences simply won’t throw an exception. So we built a way to detect when your users are slamming their keys on the keyboard in frustration, and to even let them contact you directly when that doesn’t go their way.

Manage log volumes, metrics cardinality, monthly bills: Explore Grafana Cloud cost management tools

As more organizations adopt observability at massive scale, they have also been grappling with rising costs. Over the past 12 months, we have been working on different solutions to help our users better understand and manage their observability stack, not to mention the bills that come with scaling it.

OnPage Releases Healthcare-Focused Slack Integration

In the healthcare realm, the need for communication platforms that meet HIPAA standards is undeniable. Enter Slack, a popular collaboration platform armed with robust security features. However, the real game-changer emerges through the integration with OnPage. This isn’t just an upgrade in collaboration; it’s a transformative shift in critical communication within healthcare—a field where every moment counts.