At Intercom, we focus on customer experience above all—our service’s availability and performance is our top priority. That requires a strong culture of observability across our teams and systems. As a result, we invest a lot in the reliability of our application. But unpredictable failures are inevitable, and when they happen it’s humans that fix them. We operate a socio-technical system, and its ability to recover when faced with adversity is called resilience.
Dish Network is on a mission to connect people and things by changing the way the world communicates. With products ranging from Dish and Sling TV to retail wireless services and 5G networks, monitoring their satellite communications equipment is mission critical to maintaining extreme uptime for Dish’s 20 million customers across the United States.
Integration and middleware staffing issues within IT Operations and Service Delivery teams are common, even in the largest enterprises. Here is the story of one large enterprise.
Tracking Service Level Objectives (SLOs) helps developers build more reliable software. At least, that’s the hope of Nobl9. The company’s self-titled SLO platform provides real-time data to software developers, DevOps practitioners, and reliability engineers so that they have the information they need to build reliable features quickly.
Clever.fm is developing a platform to connect podcast listeners with community and content to make podcasting financially sustainable for creators. Our application provides features such as capturing moments, sharing snippets, and having discussions on podcast episodes.