Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Flight to Success: Birdie's DevOps Evolution Fueled by Observability Insights

Birdie wanted to uplevel observability to a platform that would provide meaningful insights for application performance and debugging. Ensuring customers can provide seamless and timely care to in-home patients stands as a top priority for Birdie, and the development team takes pride in building and maintaining a high-quality platform distinguished by its reliability and responsiveness.

Mattermost wins 2024 DEVIES Award for Best Innovation in ITOps

We’re thrilled to announce that Mattermost has earned the 12th annual DEVIES Award for Best Innovation in ITOps! The award — given to a platform “responsible for acquiring, designing, deploying, configuring, and maintaining the physical and virtual components that comprise IT infrastructure” — was presented on Feb. 21 during DeveloperWeek 2024 to our very own Director of Product Marketing Amanda Cheong and Developer Advocate Andrew Zigler at the Oakland Marriott City Center.

How to find and test critical dependencies with Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Pop quiz—what are all of the dependencies your services rely on? If you’re like most engineers, you probably struggled to come up with the answer. Modern applications are complex and rely on dozens (if not hundreds) of dependencies. Many teams rely on spreadsheets, but manual processes like these break down over time. What if you had a tool that found and tracked dependencies for you?

How to use host redundancy to improve service reliability and availability

Cloud computing has made provisioning new servers easy, fast, and relatively cheap. Almost anyone can log into a cloud console, spin up a new server, and deploy an application. And if they need greater uptime, major cloud providers include all kinds of settings, services, and configurations to add fault tolerance and failover. So why is it that many services fail when a single server instance fails?

ArgoCD vs FluxCD vs Jenkins X - Battle of Declarative GitOps Tools

The need for automation is becoming more important day by day. The process of integrating written code with already working code and publishing new code to live environments is a very error-prone process. Performing static analysis, running tests, packaging, and versioning are tasks that require a lot of manual effort. It’s also a complex task to solve the problem of deploying the projects we develop to more than one environment, on more than one machine, without automation.

Migrate from SolarWinds to Tidal's Modern IPAM Solution

Here at Tidal, we have helped many organizations make the switch from SolarWinds to LightMesh. We developed our LightMesh IPAM (IP Address Management) solution to provide advanced automation and ease-of-use for network engineers. If you’ve considered migrating away from SolarWinds IP Address Manager, this post will walk through how to make the switch.

The Transformative Benefits Of AWS Well-Architected Reviews For Organizations In 2024

IDC predicted that over half of Asia Pacific's digital-first businesses plan to pump up their tech spend by 20% in the next year. They're betting on cutting-edge tech like AI and cloud platforms to stay ahead, innovate, and maintain their financial viability. As AWS is the leading cloud provider across the globe, most businesses rely on it for highly reliable quality cloud services.

AI-powered diagnostics for incident response: New Sift features in Grafana IRM

Sift is a machine-learning-powered diagnostic feature in Grafana Cloud that SREs and DevOps teams can use to automate routine parts of incident investigation, such as searching for new errors in logs, surfacing recent deployments, or identifying overloaded Kubernetes nodes. We want Sift to springboard you into an investigation, so useful context is already there by the time you see an alert or declare an incident.

Preview Confidential AI with Ubuntu Confidential VMs and Nvidia H100 GPUs on Microsoft Azure

With Ubuntu confidential AI on Azure, businesses can undertake various tasks including ML training, inference, confidential multi-party data analytics, and federated learning with confidence. The effectiveness of AI models depends heavily on having access to large amounts of good quality data. While using publicly available datasets has its place, for tasks like medical diagnosis or financial risk assessment, we need access to private data during both training and inference.