Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

OpenTelemetry Metrics in Quarkus Explained

When you run services on Quarkus, you need a steady stream of signals to understand how the application behaves—CPU trends, request timings, memory patterns, and how each endpoint responds under load. Metrics give you that visibility. They help answer questions like: OpenTelemetry fits well here because it gives Quarkus a common way to generate and export metrics without locking you into a specific monitoring tool.

Why 2025 & Beyond is The Builders Era

The tech world loves buzzwords. We’ve lived through the Cloud Era, the Mobile Era, the AI Era (we’re still in that one, apparently). But 2025 marks something different. Something developers have been craving for years but couldn’t quite name. Welcome to The Builders Era. Not because of some shiny new framework or yet another platform promising to 10x your productivity. The Builders Era is happening because developers are done being spectators in their own craft.

Jira Service Management (JSM) Review for On-Call Management (2025)

OpsGenie is shutting down. And Atlassian recommends migrating to Jira Service Management (JSM). But if you’re not sure JSM is the right fit for your team’s on-call management needs, this review will help you decide. I signed up for JSM and put it through real-world testing. I created on-call schedules, rotations, and overrides. Then, I reviewed JSM’s on-call management across 4 key criteria. For each criterion, I shared what I liked and what I didn’t.

How Rootly works with Slack | An end-to-end demo.

Rootly is the AI-native on-call and incident management platform that helps you resolve incidents faster, improve system resilience, and streamline on-call operations. It’s your always-on SRE copilot that automates root cause analysis and identifies patterns that drive continuous improvement—trusted by thousands of companies like LinkedIn, NVIDIA, Replit, Elastic, Canva, Clay, Tripadvisor, and Grammarly.

Latency, Loneliness, and Laundry: A Practical Field Guide to Remote Ops That Actually Feels Good

Remote ops is weird. You're juggling alerts, releases, tickets-and five meters away there's a pile of laundry silently negotiating your willpower. You want focus without turning into a hermit. You want flexibility without drifting into 11 p.m. "just one more thing" spirals. And you want your team to feel like a team, not just avatars in a status channel. This guide blends human factors with ops pragmatism. Short, testable ideas. Minimal ceremony. A little empathy for the person behind the keyboard.
Sponsored Post

Preparing for cloud failures: Monitoring strategies for distributed hybrid infrastructure

When AWS experienced its recent outage, the ripple effect was immediate. Critical workloads slowed, dashboards went blank, and many teams realized multi-cloud isn't automatically resilient. Cloud-level failures are inevitable due to the interdependent components and complex IT architecture. The recent AWS disruption reminded many teams that the cloud isn't a magic uptime guarantee. Even the most mature providers can-and do-experience large-scale service interruptions.

Devart ODBC Drivers vs Free ODBC and JDBC: Key Comparison

Most teams never question the JDBC or ODBC drivers they use. If it connects, it’s “good enough.” That assumption can cost more than $14,000 per minute during an outage, according to EMA’s 2024 IT downtime benchmark. Drivers are more than connectors. They dictate how efficiently data moves between databases, applications, and analytics tools. When overlooked, the entire stack slows down. Breakdowns at this level lead to failed reports, missed deadlines, and avoidable downtime.

What Is BigQuery? A Guide To How It Works And Costs

Data has exploded — and so have the challenges that come with it. Every click, transaction, and sensor ping generates mountains of data that traditional databases can’t handle. That’s why more than 94% of organizations now rely on cloud platforms, according to CloudZero’s 2025 cloud report. The goal isn’t just to store data, but rather, to make sense of it fast. And this is exactly where tools such as Google BigQuery step in.

Streamline Incident Management with the New Netdata-ServiceNow Integration

When a critical alert fires at 2 AM, the last thing your on-call engineer should be doing is manual administrative work. Yet, for many teams, that’s exactly what happens. You see the alert in your monitoring tool, then you have to switch contexts, open a new browser tab, log into your ITSM platform, and manually create an incident—all while your systems are failing.