Operations | Monitoring | ITSM | DevOps | Cloud

Platform engineering unplugged: What nobody tells you about platform engineering at scale

Most platform engineering stories are told in hindsight, with the rough edges smoothed out. On June 17th, we are doing it differently. Join us for Platform Engineering Unplugged, a frank conversation with a practitioner who has navigated the real challenges of building and scaling platform engineering. What worked, what didn't, and what they would do differently. If you lead engineering teams and are thinking seriously about platform engineering, this is the session for you.

Datadog Data Observability: Be the first to know when data fails

Bad data doesn't announce itself. Datadog Data Observability gives you unified visibility across your entire data stack—from source systems and pipelines to dashboards and AI applications—so you catch silent failures before they cascade. Detect data quality and pipeline issues before stakeholders do, pinpoint root causes with end-to-end lineage, and reduce pipeline costs with job, cluster, and query recommendations.

Reduce Alert Fatigue with Composite Alerting in Hosted Graphite | Tutorial

Tired of noisy alerts waking you up for issues that are not actually impacting your services? In this tutorial, we walk through MetricFire's Composite Alerting capabilities and show how to combine multiple metric conditions into a single high-confidence alert using AND / OR logic. Learn how to: Reduce alert fatigue and false positives Create service level alerts in Graphite Combine CPU, latency, and database metrics into meaningful alerts Use conditional logic to improve signal quality Build smarter observability workflows with Hosted Graphite.

The sovereignty debate explained with Nine23

Who really owns your data? Data sovereignty has become one of the defining issues shaping digital infrastructure, cloud strategy and AI adoption. But what does it actually mean, and why has it become a board-level discussion for so many organisations? In Episode 4 of Perspectives from the Edge, Pulsant's Wendy Shearer is joined by Steve Jewell, CEO of Nine23, to explore data sovereignty and its relationship to security, resilience and digital transformation.

How one PM scaled customer discovery with AI

Customer interviews are one of the most powerful ways to build better products — but they’re also time-consuming. In this video, Avinoam “Avi” Zelenko, Principal Product Manager at Atlassian, shares how he transformed the way he runs customer interviews using AI automation and Rovo agents. What used to take hours of coordination, note-taking, and manual summaries now happens automatically. By stitching together the Teamwork Collection and Slack, Avi built a workflow that captures conversations, summarizes insights, and shares them across teams in real time.

How to Fix Azure Integration Errors in Minutes Instead of Days

Azure integration errors can be difficult to diagnose when messages flow across multiple services such as Logic Apps, Service Bus, Azure Functions, APIs, and external systems. Support teams often spend hours searching through logs and correlating events across services just to identify where a transaction failed.

AI Agents Are the New Employees: The Identity & Security Crisis Enterprise IT Must Solve

As AI agents become more autonomous, enterprises face a new challenge: How do you secure a workforce that isn't human? In this episode of Agents of IT, Fran Fernandez, Zach Austin, and Ian Coppock explore the growing identity and security challenges surrounding Agentic AI. From permissions and governance to digital identities and access controls, the team breaks down what enterprise leaders need to know before deploying AI agents at scale.

How Property Managers Can Respond Faster to Critical Issues | OnPage

When managing properties and facilities remotely, every minute matters. Whether it's an HVAC failure, maintenance request, or after-hours emergency, critical issues need immediate attention. Traditional communication methods like phone calls, emails, and text messages can easily be missed, delaying response times and impacting tenant satisfaction. In this video, discover how OnPage helps property managers and facilities teams receive critical alerts in real time, coordinate responses faster, and maintain visibility throughout the incident lifecycle.

Scaling Android development with Anbox Cloud

Discover how Anbox Cloud helps engineering teams scale Android development by moving Android workloads from physical hardware into the cloud. In this video, we showcase how developers can run, test, validate, and share Android environments on demand using containerized and virtualized Android instances. We explore how both approaches work, key differences, and use cases.