The latest news and information on DevOps, CI/CD, automation, and related technologies.

Get your team to adopt DORA metrics when you're already on board

Need help getting your teams to adopt DORA metrics? Join this live stream on Thursday, February 23 at 9 a.m. PT to learn how. Sleuth TV Live is back for season 2! It's a common situation: you're already on board with DORA metrics and ready to run with them, but your teams have yet to adopt them. We'll share 5 ways we use DORA metrics at Sleuth to shed light on the possibilities for your own team.

Autonomously optimize AWS Lambda deployments with Sedai and Datadog

In dynamic production environments, unpredictable traffic loads and frequent code changes can make it difficult for organizations to consistently optimize their cloud infrastructure, resulting in application performance issues, latency, and wasted cloud spend. Teams that manage large-scale cloud infrastructure deployments are often forced to tune their workloads’ configurations using a complicated mesh of script jobs—or worse, manual remediation by on-call engineers prompted by alerts.

Site Reliability Engineer: Responsibilities, Roles and Salaries

DevOps gained popularity in order to combat siloed workflows, decreased collaboration and a lack of visibility across the software development lifecycle. While establishing a culture of DevOps has helped teams collaborate better and deliver reliable software faster, DevOps teams don’t necessarily have someone specifically dedicated to developing systems that increase site reliability and performance. That’s where a site reliability engineer (SRE) comes into the picture.

How to track failures in microservice applications

Microservices architecture (often shortened to microservices) is an architectural style for developing applications. Microservices allow a large application to be separated into smaller independent parts, each with its own realm of responsibility. To serve a single user request, a microservices-based application can call on many internal microservices to compose its response. Tracking failures in those microservices is critical so that teams can take corrective action and keep business processes running.
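As a rough illustration of the idea, the sketch below composes one response from several internal services and records every failed call under a shared request ID. It is a minimal sketch, not the approach of the article being summarized: the three service functions, the `handle_request` helper, and the in-memory `failure_counts` and `failure_log` stores are all hypothetical stand-ins for real network calls and a real monitoring backend.

```python
import uuid
from collections import Counter

# Hypothetical in-process stand-ins for internal microservices.
def inventory_service(order):
    return {"in_stock": True}

def pricing_service(order):
    # Simulated failure so the tracking path is exercised.
    raise TimeoutError("pricing backend timed out")

def shipping_service(order):
    return {"eta_days": 3}

SERVICES = {
    "inventory": inventory_service,
    "pricing": pricing_service,
    "shipping": shipping_service,
}

failure_counts = Counter()  # failures per service, across all requests
failure_log = []            # one record per failed call

def handle_request(order):
    """Compose a response from several services, recording each failure."""
    request_id = str(uuid.uuid4())  # correlation ID shared by every call
    response = {"request_id": request_id, "parts": {}, "failed": []}
    for name, call in SERVICES.items():
        try:
            response["parts"][name] = call(order)
        except Exception as exc:
            # Record which service failed, for which request, and why.
            failure_counts[name] += 1
            failure_log.append({
                "request_id": request_id,
                "service": name,
                "error": type(exc).__name__,
                "detail": str(exc),
            })
            response["failed"].append(name)
    return response

result = handle_request({"sku": "A-1", "qty": 2})
```

Because every failure record carries the same request ID as the response, a failed user request can be traced back to the specific service that caused it, while the per-service counter shows which services fail most often.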

Migrating On-Premises File Shares to Azure File Storage

As businesses grow, so does their need for scalable and secure data storage. For many companies, on-premises file shares have long been the preferred method of storing and sharing data. However, with the advent of cloud computing, more and more businesses are turning to cloud-based storage solutions like Azure File Storage. In this article, we will explore the challenges and solutions involved in migrating on-premises file shares to Azure File Storage.

Ensuring High Availability with Azure Datacentre Redundancy

In today’s fast-paced business environment, downtime can have severe consequences for any organization. It can lead to lost revenue, decreased productivity, and loss of customer trust. Therefore, it is essential to ensure that your business-critical applications are always available, no matter what. This is where Azure Datacentre Redundancy comes into play.

From Alert Madness to Incident Response Nirvana: An AIOps and ITPA Strategy

Complex environments are notorious for generating a high volume of alerts. For IT teams, this deluge presents a critical, time-consuming challenge. Managing alerts and incident response keeps these busy professionals under constant pressure and risks alert fatigue. Nonstop “noise” can desensitize people and actually lead to missed or ignored alerts—risking delayed responses and downtime. These high stakes make handling alerts a key security and productivity issue.