Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Service Reliability Engineering and related technologies.

What Is Site Reliability Engineering? Understanding the complexities of this crucial function

Site reliability engineers manage a lot, and often in incredibly high-stakes environments. Remember that scene from "The Matrix" where Neo dodges bullets in slow motion? Of course you do. As an SRE, it can feel like you're the person getting hit by those bullets, frantically trying to investigate performance issues, automate away toil, and support the engineers around you, all before the next wave of attacks.

Improve Visibility and Capture More Data with Triage Incidents

As new incidents emerge, there are often many unknowns about the size, severity, and cause of the problem. Sometimes it’s not clear if the problem is an incident at all. That’s where introducing a triage stage to your incident management process can help. In this post, we’ll look at the benefits of adding a triage layer to your incident management, and how Rootly’s Triage feature allows you to seamlessly transition from triage to real incident (or false alarm).

Docker Compose Logs: Guide & Best Practices

Docker Compose is a tool for defining and running multi-container Docker applications. It allows developers to streamline the process of configuring, building, and running multiple containers as a single unit with a docker-compose.yml. This configuration file specifies the services, networks, and volumes required for an application, and their relationships and dependencies. The docker-compose logs command displays the logs of all services defined in the docker-compose.yml file.
Sponsored Post

Squadcast's Improved Mobile App for Better Incident Response

The 2020 pandemic has definitely changed the way teams operate across the globe. Many of you may have already experienced moving from 100% office work to 100% remote work, and now that it has been almost three years since the pandemic started many of us have resorted to hybrid models. We at Squadcast value the importance of efficient communication, reaching the right people during a crisis, and the freedom to resolve critical incidents from anywhere, anytime. Keeping that in mind, we have made major improvements to our mobile app to help you effectively partake in Incident Response activities anytime from across the globe.

Bitrix24 + Squadcast Integration: Simplifying Alert Routing

Bitrix24 is a cloud-based business management and collaboration platform that provides a suite of tools for managing various business processes. If you use Bitrix24 for your collaboration and CRM requirements, you can integrate it with Squadcast, an end-to-end Incident Response tool, to route alerts such as creating a lead on Bitrix24 CRM or creating a task in Bitrix24 to Squadcast. ‍

Endtest + Squadcast Integration: Alert Routing Made Easy

Endtest is a low code test automation platform enabling organizations to efficiently build automated end-to-end tests for web and mobile applications. If you use Endtest for your test automation requirements, you can integrate it with Squadcast, an end-to-end Incident Response tool, to route detailed alerts from Endtest to the right users in Squadcast.