Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Starting the XLA Journey: A Next-level Perspective for Enhanced Experiences

There’s a profound shift happening today that is taking businesses in a fresh, new direction. Outcomes are at the forefront of IT leaders’ minds, and they’re rightfully becoming a core business accelerator. It’s clear that employee and customer experiences are critical for growing businesses. The trend stems from a shift in priorities.

The importance of SDT and how to successfully schedule planned downtime

Scheduled downtime (SDT), also known as planned downtime, lets you perform maintenance, testing, or repairs on your systems, servers, software, data centers, and other infrastructure. While no business likes being offline, this preventative work is essential for ensuring your assets function correctly. Unlike unplanned outages, you can limit downtime so it minimizes impact on your company and customers.

How we've made Status Pages better over the last three months

A few months ago we announced Status Pages – the most delightful way to keep customers up-to-date about ongoing incidents. We built them because we realized that there was a disconnect between what customers needed to know about incidents, and how easily accessible this information was. For example: As we built them, we focused on designing a solution that powered crystal-clear communication, without the overhead — all beautifully integrated into incident.io.

Using Imposter Syndrome to your Advantage - Civo Navigate NA 2023

​@KunalKushwaha addresses the topic of Imposter Syndrome and its implications within the tech community. From identifying its presence among seasoned professionals to beginners, Kunal provides insights on how to turn this seemingly negative feeling into a powerful tool for growth and learning. Join the conversation and learn strategies to harness your potential.

Bill Kennedy: The mistake boot, building ACs, Black boxes & AI in software - The Reliability Podcast

The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

What is CMDB?

A Configuration Management Database (CMDB) like ServiceNow CMDB serves as a centralized repository for comprehensive information about the various components of an information system. These components, known as Configuration Items (CIs), encompass hardware (such as servers and switches), software applications, network paths, and even individuals or documentation.

Terraform is No Longer Open Source. Is OpenTofu (ex OpenTF) the Successor?

Terraform, a powerful Infrastructure as Code (IAC) tool, has long been the backbone of choice for DevOps professionals and developers seeking to manage their cloud infrastructure efficiently. However, recent shifts in its licensing have sent ripples of concern throughout the tech community. HashiCorp, the company behind Terraform, made a pivotal decision last month to move away from its longstanding open-source licensing, opting instead for the Business Source License (BSL) 1.1.

How a simple metric drives reliability culture at Slack

How do you track reliability in an organization with hundreds of engineers, dozens of daily production changes, and over 32 million monthly users? Even more, how do you do this in a way that's simple, presentable to executives, and doesn't dump a ton of extra work on to engineers' plates? Slack recently wrote about how they created the Service Delivery Index for Reliability (SDI-R), a simple yet comprehensive metric that became the basis for many of their reliability and performance indicators.

Automate Agent installation with the Datadog Ansible collection

Ansible is a configuration management tool that helps you automatically deploy, manage, and configure software on your hosts. By turning manual workflows into automated processes, you can quicken your deployment lifecycle and ensure that all hosts are equipped with the proper configurations and tools. The Datadog collection is now available in both Ansible Galaxy and Ansible Automation Hub.