DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How to make your services resilient to slow dependencies

When discussing reliability, we tend to focus on the things that we have control over: applications, virtual machine instances, deployment patterns, etc. But this ignores a significant and ever-growing part of nearly all modern software: dependencies. Dependencies are services that provide extra functionality for other services and applications. For instance, many websites depend on databases, caches, payment processors, and similar services in order to function.

Broadband Architecture for the Future

The pace of technological innovation, user demand, and the sophistication of new applications are all rapidly increasing. Will your network be able to keep up? If you build today, will the network support what your customers will require in 5 years? What about 10 years? Join us to discuss the latest developments in utilities-based middle-mile architecture and how you can confidently meet tomorrow's requirements. Learn how to create a customized and secure OT network while delivering high-revenue services to premium broadband customers.

HAProxy Fusion: New External Load Balancing & Multi-Cluster Routing Features

Recently, we added powerful new K8s features to HAProxy Fusion Control Plane—enabling service discovery in any Kubernetes or Consul environment without complex, technical workarounds. We've covered the headlining features in our HAProxy Fusion Control Plane 1.2 LTS release blog. But while service discovery, external load balancing, and multi-cluster routing are undeniably beneficial, context helps us understand their impact.

Bridging the IT-business comms gap comes down to this one word: Ask

A highlight of the SRE Report is the insightful analysis based on the organizational ranks of respondents. The 2023 installment exposed significant misalignment between practitioners and management in several key areas, including the benefits of AIOps, the challenge of tool sprawl, and attitudes towards blamelessness. While the 2024 SRE Report showed a rare consensus on the importance of monitoring external endpoints, it uncovered yet more ongoing differences. Let’s dive in.

Navigating Automation: Uniting Resolve Systems' Framework with TM Forum's Model for Operational Excellence

With the possibilities for increased productivity, reduced costs, and improved customer experiences, organizations are embracing automation across multiple areas of their operational activities. However, navigating the complexities of automation requires a structured approach. This is where frameworks such as Resolve Systems’ Automation Capability Framework and the TM Forum Automation Maturity Model come into play.

Streamlining Incident Management with Squadcast's Workflows

Watch this webinar to understand how automating with Squadcast's Workflows can save your team over 1,000 productive hours. Learn about the power of automation in the incident lifecycle and see a live demo on setting up and tailoring Workflows to boost efficiency. 🛠️

A Guide to Choosing The Best WordPress Hosting Provider

WordPress is essentially a self-hosted content management system, which means you'll need to find your own hosting provider for your WordPress website. There are several web hosting types to consider, including entry-level shared hosting, premium WordPress hosting, managed cloud hosting, and others. Beginners may feel lost among the endless choices and reviews, which can leave them going around in circles.

SRE and the Enterprise: Building a Culture of Reliability at Scale

As the digital landscape evolves at breakneck speed, enterprises face an increasingly complex challenge: how to ensure their systems remain reliable and available amidst the chaos of modern technology. In this journey, Site Reliability Engineering (SRE) emerges as a beacon of hope, offering a pragmatic approach to building a culture of reliability at scale.

Bridging the Skills Gap in Data Centers with DCIM Software

The Uptime Institute’s 2022 Global Data Center Survey highlights a growing challenge for operators: attracting and retaining qualified staff. With 53% struggling to find skilled employees and 42% losing staff to competitors—a sharp rise from 17% in 2018—there’s a clear need for solutions. DCIM software emerges as a key response, offering a holistic view of data center operations. This includes monitoring power usage, cooling systems, server space, and network operations.