Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Cloud monitoring, security and related technologies.

How Datadog Feature Flags is resilient to cloud provider failures

As major incidents like AWS’s October 2025 outage illustrate, modern systems are immensely interconnected. A failure in one can lead to a cascade of downstream problems. In this case, issues with DNS resolution for DynamoDB led to widespread disruptions with other AWS services and, subsequently, thousands of applications and services that rely on that infrastructure.

Introducing Honeycomb Private Cloud

More and more enterprises are shifting toward private cloud and hybrid deployments for control, data residency, and security. At the same time, observability is no longer a “nice to have” tool. It's mission-critical for teams driving rapid change across cloud-native, multi-service architectures. Leaders are realizing they need deep visibility and rapid debugging everywhere their systems run.

How to Onboard AWS & Azure Hosts in SolarWinds Observability

Connecting your cloud infrastructure has never been easier. In this quick walkthrough, you’ll see how SolarWinds Observability natively integrates with AWS and Azure to onboard virtual machines and supported managed services—fast. Select your hyperscaler Click “Add Data” → Choose “Hosts” Follow simple steps to connect your cloud environment via API Whether you're running AWS EC2, Azure VMs, or other managed services, SolarWinds helps you get visibility in minutes.

Modern Service Architecture for High-Velocity Operations

Modern service architecture supports organizations that target sustained velocity, predictable delivery cycles, and scalable global operations. Cloud-native platforms, microservices patterns, and distributed execution models now anchor these environments. Modern service architecture emphasizes modularity and flexibility, which contrasts with traditional monolithic approaches. The 2025 Gartner Magic Quadrant for Cloud-Native Application Platforms identifies AWS, Red Hat OpenShift, and Heroku as leaders because they strengthen developer experience, platform engineering, and security.

Cloud Security Best Practices Every Company Should Follow

As more businesses move their data, applications, and daily operations to the cloud, securing that environment has become a top priority. Cloud platforms offer flexibility, scalability, and cost savings, but they also introduce shared responsibility-meaning both the provider and the business must play a role in keeping systems safe. Understanding essential cloud security best practices helps organizations reduce risk, protect sensitive information, and maintain compliance in an increasingly digital world.

AWS And Azure Outages Will Recur - Here's How You Ensure Resilience

The cloud has long promised limitless scalability and near-perfect uptime. But if you tried to access your Microsoft 365 dashboard or recline your smart bed last week, and got nothing but a spinning icon, you weren’t alone. In the span of 10 days, both Amazon Web Services (AWS) and Microsoft’s Azure Cloud suffered widespread outages that rippled across industries.

KubeCon Atlanta Signals Key Shift: From Cloud Cost To Value Engineering

After three days of demos, sessions, and hallway conversations at KubeCon Atlanta, one thing became clear to CloudZero CTO Erik Peterson: the cloud-native world is shifting from cost control to value engineering. Teams aren’t just fighting bills anymore. They’re fighting complexity, GPU scarcity, Kubernetes sprawl, and pressure from the business to justify every dollar of technical investment. And this year’s KubeCon attendees? They were ready for those conversations.

Elasticsearch: The context engine for grounding and orchestration in Microsoft Azure AI Foundry Agent Service

The rise of large language models (LLMs) and agentic applications promises to transform enterprise workflows. Yet, the core challenge remains: How do we ensure these powerful agents generate accurate, relevant, and trustworthy responses based on proprietary enterprise data rather than relying solely on their generic training knowledge? The answer lies in grounding — connecting the LLM to verified, trusted, and up-to-date information.

Azure Monitor offers Grafana dashboards natively for immediate, real-time operational monitoring

Editor’s note: This blog originally published in May 2025 when Azure Monitor dashboards with Grafana became available in public preview. It was updated in November 2025 to reflect general availability. The Grafanaverse just got a little bit bigger.