Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Terraform vs Pulumi

Terraform and Pulumi are both Infrastructure as Code (IaC) tools. They allow you to manage, provision, and configure your infrastructure using code, which makes it easy to automate your infrastructure deployments and manage them in a version control system. Terraform is an open source tool developed by Hashicorp. It’s popular among developers because it’s easy to use and has a wide range of community-developed plugins and integrations.

Site Reliability Engineering, Site Reliability Engineers and SRE Practices: State of Adoption

Site reliability engineering (SRE) is what you get when you treat operations as if it’s a software problem. The mission of an SRE practice is to protect, provide for and progress the software and systems offered and managed by an organization with an ever-watchful eye on their availability, latency, performance and capacity.1.

How to Measure Data Center Sustainability

Sustainability is one of the biggest, if not the biggest, topic in the data center industry today. A sustainable data center is a facility that can maintain operations at a high level of efficiency over time. It is important for data centers to be as sustainable as possible because they use a lot of resources which makes reducing their environmental impact and carbon footprint top priorities. It is also important because these facilities need to comply with corporate sustainability initiatives.

Native vs cross-platform mobile app development

In just a decade, smartphones have become ubiquitous. They facilitate communication via texting and calling, provide entertainment, enable administration, and offer utilities for their users in the form of applications. Users access these mobile applications through their app store, whether it is Apple’s App Store or the Google Play Store. Developers construct them with the smartphone’s operating system in mind. The two mainstream operating systems that are targeted are Android and iOS.

Feeling zen, finding DORA, and the policy police

We’ve had a bumper month here at incident.io HQ. We’ve welcomed 3 new joiners, celebrated two 1 year incident.io anniversaries (congrats Lisa and Lawrence!), released a whole load of exciting new features and (for those of you wondering what’s been causing the recent heatwave) we’ve redesigned our website and it is on fire 🔥 😎 Here’s a round-up of some of this month's highlights…

Updating our data stack

It’s been over 6 months since Lawrence’s excellent blog post on our data stack here at incident.io, and we thought it was about time for an update. This post runs through the tweaks we’ve made to our setup over the past 2 months and challenges we’ve found as we’ve scaled from a company of 10 people to 30, now with a 2 person data team (soon to be 3 - we’re hiring)!

Find the root cause faster with Datadog and Zebrium

When troubleshooting an incident, DevOps teams often get bogged down searching for errors and unexpected events in an ever-increasing volume of logs. The painstaking nature of this work can result in teams struggling to resolve issues before new incidents appear, potentially leading to an incident backlog, longer MTTR, and a degraded end-user experience.