Operations | Monitoring | ITSM | DevOps | Cloud

env zero Unveils Enhanced Cloud Governance Platform to Future-Proof Infrastructure Delivery in the AI Era

env zero today announced the next evolution of its Cloud Governance Platform. Designed for the speed, scale, and complexity of the AI era, and marked by a fresh new look and feel, the enhanced env zero platform empowers enterprises to deliver infrastructure 10x faster without losing control of infrastructure governance, security, compliance and cost oversight.
Sponsored Post

Accelerating Software Development: Modern SDLC Practices with AI and Automation

Modern software teams - especially in fast-paced SaaS startups - face constant pressure to deliver features quickly without compromising quality. The Software Development Life Cycle (SDLC) has evolved significantly in recent years, and embracing new AI-powered tools and automated workflows can dramatically increase a team's velocity. In this whitepaper, we'll explore how a small team of developers can work smarter and faster by integrating AI assistants, AI pair programming, modern Git workflows, and automated testing into their SDLC.

The Developer's Guide to Debugging AI-Generated Code

AI coding tools like ChatGPT, GitHub Copilot, and Claude have completely changed how we write software. From humble beginnings where non-AI-enabled code assistants made intelligent code suggestions, like Intellisense, the latest agentic tools can generate entire functions, suggest optimal algorithms, and even scaffold complete applications in minutes. However, as any developer who’s worked with AI-generated code knows, the output isn’t always perfect.

We've raised $13M Series A to make DevOps so simple, it feels unfair

I'm excited to announce our $13M Series A, led by IRIS and Crane Venture Partners with support from Datadog founders and Speedinvest. This investment will fuel our mission to make DevOps simple and scalable, expand in the US and Europe, and accelerate product innovation.

A Multidisciplinary Guide To Cloud Cost Intelligence

Cloud cost intelligence has moved beyond simple cost-cutting. Now, it’s about creating value. Cloud bills continue to rise, and workloads are becoming increasingly complex. Teams also need to understand what they’re spending, why, and how that spend ties to business results. FinOps has become the framework for bringing finance and engineering together. It’s helping teams manage costs, improve margins, and plan with confidence. But challenges remain.

The AI Velocity Paradox

AI-powered coding alone isn’t enough. True software delivery velocity requires end-to-end automation and intelligent governance across the entire lifecycle. Harness enables organizations to escape the AI Velocity Paradox by unifying speed, safety, and resilience, turning rapid development into a sustainable competitive advantage. The widespread adoption of AI coding assistants is transforming software engineering.

This Developer Built an App That Doesn't Spy on You

Rizel Scarlet (Staff Developer Advocate at Block) tackled a problem every developer should care about: apps that spy on users before they even know they need the service. Her solution? Build a privacy-first pregnancy app using AI agents and decentralized web nodes. This isn't just about pregnancy apps - it's about the future of user data ownership.

Docker Daemon Logs: How to Find, Read, and Use Them

Sometimes Docker behaves in ways that catch you off guard—containers don’t start as expected, images pause during pull, or networking takes longer than usual to respond. In those moments, the Docker daemon logs are your best reference point. These logs capture exactly what the Docker engine is doing at any given time. They give you a running account of system state, performance signals, and events that help you understand what’s happening beneath the surface.

Recapping SEV0 San Francisco 2025

Earlier this week, we gathered in San Francisco for our second SEV0—almost a year after our very first event. SEV0 has always been about shining a light on the biggest challenges (and opportunities) in incident response. Last year, we were still talking about the fundamentals: blameless culture, strong processes, and lessons from the best in reliability. This year felt different. AI has moved from background noise to front and center in every conversation, every team, everywhere.

Automation Observability: See It, Fix It, Skip the Firefighting

IT leaders know the drill. An alert storm rolls in and the tickets pile up. Your team scrambles to piece together root causes before service degradation kicks in. But the firefighting rages on, even when you have enough dashboards, monitoring, and alerts to light up a Christmas tree. Enterprise leaders need to quit burning budget on shiny dashboards that look good in the boardroom but do nothing to stop outages in the real world.

The Unit Economics Of Watering My Lawn: A Lesson On Runaway AI Costs

My wife and I spent hours this summer at home digging in the dirt. We planted new shrubs and perennials and created a small vegetable garden. We spread many square yards of fresh topsoil and grass seed over areas of lawn that needed rejuvenation. It turns out, I should have done all that landscaping with a FinOps leader’s mindset — before my water bill tripled when I wasn’t looking.

Ubuntu Pro Containers on AWS: Secure Your Workloads

‎‎ Subscribe and join the Ubuntu community. ‎ ‎‎ Learn how to build and run Ubuntu Pro containers on AWS with enhanced security coverage. In this step-by-step walkthrough, we demonstrate how Ubuntu Pro enhances protection for third-party open-source libraries, enabling you to patch CVEs more quickly and maintain secure workloads. You’ll see how to: · Launch an Ubuntu Pro server on AWS· Create and attach Pro containers using tokens· Build a simple PHP container with Pro security updates enabled· Verify security patches with ESM repositories· Run and test your container on AWS.

Ship features faster and safer with Datadog Feature Flags

Releasing new features is one of the highest-stakes moments in the software delivery life cycle. Even with CI/CD pipelines in place, plenty of things can still go wrong when a feature goes live for actual users. Most feature flagging tools operate in isolation from important observability tooling, forcing engineers to monitor changes across multiple disconnected systems to fully understand their impact. This slows down development and increases the chance of missing critical issues.

JFrog and ServiceNow: Accelerate Trusted Software Application Development

Today’s software organizations can’t make tradeoffs between speed and trust – you need both to succeed. But juggling them is tough. Moving too fast can lead to security vulnerabilities and compliance issues, while moving too slow means your competitors beat you to market. This tension creates friction that slows down every release, a problem that is rooted in your software pipeline.

Introducing Runner Replicas: Scalable, Reliable Automation for Modern Ops

When you’re responsible for the reliability of complex systems, the execution layer of your automation is not something you want to think about—it should just work. Whether you’re deploying code, patching servers, or responding to an incident at 3 a.m., your automation engine should be as resilient and scalable as the infrastructure it’s operating on.

AI-powered email automation with CI/CD pipelines

Email automation allows you to send emails automatically based on certain triggers or schedules, so you don’t have to click the Send button every time. This includes things like welcome messages, drip campaigns, and regular newsletters. In this tutorial, you will create a simple system that automatically welcomes new subscribers and sends them updates about technology, all with the help of AI.

AI is writing your code. Who's watching your standards?

As a platform integrator, we get a unique look at the tools our customers adopt every day. Of all the shifts I’ve seen, none has been as rapid as the adoption of coding assistants. The conversation has quickly gone from ‘is this tool really going to drive value?’ to ‘how quickly can we roll this out?’ No one can doubt the immense value these tools provide in shipping code faster.

A Path to Autonomous Networks

We’re getting ready to take the stage at Network X in Paris, where Ribbon has been nominated for Most Innovative Optical Transport Use Case & Most Innovative IP Transport Solution. But before that, we’re hosting a special 30-minute preview to show you what’s coming. Join Jonathan Homa and David Stokes from our Solutions Marketing team as they walk through how Ribbon is advancing toward autonomous networks with.

Resolve + Espressive: Effortless, Ticketless Agentic Automation at Scale

If you work in IT or in employee support, you know how much of a painful burden tickets are. They’ve been around forever and they’ve become the default unit of work for how organizations support employees. But let’s be honest: nobody loves tickets. Employees hate filing them. Agents hate working them. Leaders hate the ballooning costs and wasted cycles that tickets represent.

Harness Acquires Qwiet AI to Power Its Application Security for the AI Era

Harness acquires Qwiet AI to power application security in the AI era, embedding reachability analysis to cut noise and prioritize real risks. By Sanjay Nagaraj, SVP Global Engineering, Harness; Co-founder and CTO, Traceable by Harness Today, I am excited to share that Harness has acquired Qwiet AI (formerly ShiftLeft), a leader in agentic AI-powered vulnerability detection and reachability analysis.

Top 11 Java APM Tools: A Comprehensive Comparison

Are your Java applications running at their optimal performance, or is there room for improvement to make them faster and more efficient? With so many services depending on Java, keeping applications responsive and reliable is a core part of modern software engineering. This blog walks you through the leading Java Application Performance Monitoring (APM) tools, with a clear comparison to help you choose the right option for your needs.

The evolution of Integration technology through AI

Join us in this exciting podcast episode where integration pioneer Tom shares his 25+ year journey in tech, from message-oriented middleware in 1998 to leading AI projects at Microsoft. Tom dives into how AI is revolutionizing integration as the "backbone" of modern systems - think generative AI agents automating home damage inspections in minutes, reducing manufacturing downtime, and transforming financial trading.

Fortifying security for Ubuntu on Azure with Metadata Security Protocol (MSP)

We’re pleased to share a security enhancement for Ubuntu workloads on Microsoft Azure. In collaboration with Microsoft, Ubuntu now supports Azure’s Metadata Security Protocol (MSP)—a feature that hardens access to the Instance Metadata Service (IMDS) and WireServer. On Ubuntu, MSP is enabled by the azure-proxy-agent package, Canonical’s integration of Microsoft’s Guest Proxy Agent (GPA).

Cortex Overview

Cortex is the leading internal developer portal that helps engineering teams ship reliable, secure, and efficient software, faster. In this video, you’ll learn how Cortex empowers organizations to adopt best practices in DevOps and platform engineering while giving developers the tools they need to succeed. What you’ll learn in this video: With Cortex, engineering leaders get visibility into service health, teams embrace best practices with ease, and developers gain a frictionless path to building high-quality software.

Workflows | Cortex

Cortex Workflows enable teams to streamline engineering processes, enforce best practices, and improve developer productivity with automation. In this video, we’ll show you how Cortex empowers engineering and platform teams to design, manage, and scale workflows that keep teams aligned and services reliable. What you’ll learn in this video: With Cortex, teams can standardize processes across services, integrate automation into daily workflows, and gain visibility into engineering performance—all within a unified platform.

Catalogs & Entities | Cortex

Cortex makes it easy for engineering teams to organize, discover, and manage their software ecosystem with catalogs and entities. In this video, you’ll learn how Cortex helps teams build a unified developer portal and service catalog that improves visibility, adoption, and productivity. What you’ll learn in this video: With Cortex, organizations reduce silos, standardize operations, and give developers the tools they need to move faster—while maintaining reliability and compliance.

Scorecards, Initiatives and Reporting | Cortex

Discover how Cortex empowers engineering teams with scorecards, initiatives, and reporting to accelerate delivery, improve service health, and align engineering with business goals. In this video, you’ll see how Cortex helps teams adopt best practices, track progress, and measure impact at scale. What you’ll learn in this video: With Cortex, teams gain clarity, leaders get actionable insights, and organizations achieve alignment across engineering initiatives.

Need to do Integration Testing without a real Postgres SQL Database? #speedscale #postgres #sql

Struggling with integration testing because you need a real Postgres SQL database running? This video walks you through how to use Speedscale's proxymock to easily record and mock a live Postgres connection. You'll see how to: By the end, you'll be able to create realistic database mocks for your testing and development, saving you time and hassle.

Resolve + Espressive: Accelerating Zero Ticket IT with Agentic AI and Automation

Traditional IT operations are struggling under the weight of manual processes, growing complexity, and escalating expectations for employee experience. Without agentic AI driven by an intelligent orchestration engine, organizations risk falling behind. That’s why Resolve and Espressive have joined forces. Together, we’ll deliver the industry’s most advanced AI agents for enterprises, combining conversational intelligence with powerful automation to eliminate tickets end-to-end.

How to Overcome NaaS Integration Challenges

As network teams constantly search for new ways to simplify and optimize their networks, it helps to hear from experts who have spent years at the center of these transformations. One of these voices is Fabio D’Avino, a specialist in Network as a Service (NaaS) with more than seven years of experience researching, designing, and building global network services across industries.

Deploying a multimodal RAG application with Gemma 3 and CircleCI on GKE

Retrieval-Augmented Generation (RAG) has transformed how applications interact with Large Language Models (LLMs). RAGs ground LLM responses in external knowledge, improves accuracy, and reduces hallucinations. But traditional RAG systems have a significant limitation: they only process text. Multimodal RAG addresses this limitation by processing and understanding multiple data types (text, images, and potentially audio).

Streamline Software Delivery Right From Your IDE with Amazon Kiro and Harness

The integration of Amazon Kiro and Harness’s MCP server enables developers to manage, troubleshoot, and optimize CI/CD pipelines directly from their IDE using natural language, dramatically reducing manual effort and accelerating software delivery from code generation to production.

{Unscripted} Autonomous Code Maintenance

Nothing drains developer productivity like codebase maintenance. The endless cycle of dependency upgrades, bug fixes, refactoring, and paying down technical debt is tedious, error-prone work that pulls engineers away from building new features. Harness Autonomous Code Maintenance (ACM) turns these manual chores into automated, intent-driven workflows. Developers can now state their intent in plain English, with prompts like, "Upgrade the front end from React 15.6 to 16.4". From there, the Harness AI agent drives the workflow.

{unscripted} AI for DevOps and DBDevOps

Many software engineers are experts in application code but not in the nuances of creating a production-ready delivery pipeline. Architect Mode acts as a seasoned DevOps expert, engaging the user in a conversation to design a pipeline that incorporates organizational best practices for security, quality, and compliance from the very beginning. It’s like having a personal DevOps architect as a partner.

{unscripted} IDP Knowledge Agent

We're making Internal Developer Platforms (IDPs) more accessible with a natural language assistant. Developers can ask questions like, "What are the failing checks for my service's scorecard?" or "Who is the owner of a service?" to find metadata instantly. The agent also bridges the gap to action by suggesting and executing self-service workflows, like creating a new repo or onboarding a new engineer. It can even assist in generating new workflows, turning complex processes into simple conversational tasks.

{unscripted} AI Verification and Rollback

Our first AI/ML capability, Continuous Verification, made Harness the first Continuous Delivery tool to understand observability telemetry and trigger rollbacks when deployments caused trouble. We knew we could do more to eliminate the friction involved in its setup. Deploying with confidence shouldn't require a coordination meeting between DevOps, SREs, and developers just to configure the right health checks. That’s why we’re introducing the next generation: AI Verification and Rollback.

{unscripted} AI in Chaos Engineering

Harness AI enhances your chaos engineering capabilities by leveraging artificial intelligence to automate and optimize reliability testing and analysis. One of the challenges of scaling up the Chaos Engineering practice within the organization is skilling up the users to create or run chaos experiments and to come up with solutions to mitigate the risks that are identified during the chaos experiment execution. The Chaos Engineering module comes with an AI Agent called "AI Reliability Agent" that helps in these aspects.

API World 2025: Growth, Memories, and Next Steps

A couple of weeks ago, our team returned from API World. We’ve officially had a few weeks to decompress and get back into the swing of things after an incredible time at API World 2025. Looking back, the experience was even more rewarding than I had imagined in my Pre-API World blog. This year was especially memorable for me, as I had the opportunity to attend my first tech conference and travel across the country for work. I’m still buzzing from everything I learned and the people I met.

FinOps Training At Scale Webinar: Key Takeaways On Proven Strategies From Fred FinOps

Cloud costs are no longer just an engineering problem or a finance problem. It’s now an everyone problem. That was the central message from the FinOps Training At Scale webinar that took place on Sept. 25, 2025, where CloudZero’s Larry Advey (a.k.a. “Fred FinOps”) and Director of Tech Enablement Umesh Rao walked through the realities of building FinOps practices that work in the real world.

How To Tag AI Cloud Spend: A Practical Framework For FinOps Teams

The world of cloud costs is always evolving, and AI spend is quickly becoming one of the most unpredictable and confusing cost drivers. As more organizations integrate generative AI into their products, FinOps teams are struggling to account for — and control — these new, often mind-boggling cost streams. In fact, 44% of engineering professionals say improving AI explainability is a top priority in AI budgeting, according to CloudZero’s State Of AI Costs In 2025 report.

Deployment of AWS Step Functions with Lambda and CircleCI

In this guide, you will build and deploy a serverless data processing workflow using AWS Step Functions and AWS Lambda. This approach enables you to orchestrate discrete processing tasks in a scalable and cost-efficient way, leveraging the event-driven architecture that AWS offers. You will begin by creating individual Lambda functions that handle specific tasks in your data pipeline.

New in Flyway: Pre and post deployment scripts for state-based deployments via callbacks

Flyway provides a lot of flexibility for releasing database changes in a safe and repeatable manner. Earlier this year, we added the ability to automate state-based deployments. This means the structure of the database is defined in version control and Flyway handles updating a target database to match it. The Flyway comparison engine does all the hard work of identifying what’s different and creating a script that will run on the target database to alter it so it matches the latest state.

Building community-first brands ft. Edreece Arghandiwal, CMO & Co-Founder of Oakland Roots SC

In this special episode of The Confident Commit, Rob Zuber sits down with Edreece Arghandiwal, CMO and Co-Founder of Oakland Roots Soccer Club, for an inspiring conversation about building purpose-driven organizations from the ground up. Edreece shares how the Roots challenged traditional sports business models by putting community first, turning a simple question—"Why isn't there professional soccer in Oakland?"—into a movement that raised $3.5 million from 6,000+ community investors with zero paid advertising.

AI-Powered Chaos Engineering with Harness MCP Server and Cursor

The Harness MCP Server integration with Cursor transforms chaos engineering from a complex, specialized discipline into an accessible, conversational workflow that any developer can leverage directly within their AI-powered IDE. By combining natural language prompts with comprehensive resilience testing tools, teams can discover, execute, and analyze chaos experiments without vendor-specific expertise, democratizing system reliability across DevOps, QA, and SRE functions.

Harness GitOps: Scaling Argo CD with Enterprise-Grade Control

Harness GitOps extends Argo CD by preserving its reconciliation loop while adding governance, audit, and RBAC through the GitOps Agent’s secure connection to Harness SaaS. Teams can choose Harness-managed or bring-your-own Argo CD and scale to multi-cluster fleets with unified dashboards, promotion pipelines, and true rollback, while Git stays the single source of truth.

My Criteria for Automated Incident Response Tools

Managing incidents manually isn’t realistic when their number keeps growing. That’s where automated incident response tools come in. They handle routine tasks so you can focus on actual problem-solving. In this blog, I’ve put together a list of the 9 best automated incident response tools for you. I looked at each one based on four key areas of the incident response process. This will help you see how they handle everything from start to finish.

Streamline Software Delivery Right From Your IDE with Amazon Kiro and Harness

The integration of Amazon Kiro and Harness’s MCP server enables developers to manage, troubleshoot, and optimize CI/CD pipelines directly from their IDE using natural language, dramatically reducing manual effort and accelerating software delivery from code generation to production.

Global Online Meetup: K3k

Even though multi-tenancy isn't a new concept, when it comes to Kubernetes, implementing the concept can come with its own set of challenges - noisy neighbours, operational complexities, and, of course, security considerations. Sounds like a lot? Well, that's why it's essential to strike a balance between flexibility and optimising resource utilisation. Join Divya Mohan at 2 PM UTC on 25th September as she hosts Rossella Sblendido and Jean-Phillipe Gouin to explore how the K3k project from SUSE helps us achieve all this and more in this edition of the Global Online Meetup.

Why Security Must Include Cost Accountability In The Cloud

A SaaS team once spotted their first breach not in a SIEM dashboard, but in their AWS bill. Their compute costs spiked by 400% overnight. Turns out, an attacker had spun up dozens of high-powered instances for crypto mining. Logs eventually confirmed the intrusion, but the cost anomaly was the first signal that something was wrong. This incident isn’t unusual. Cloud costs often reflect consumption, but they can also reflect compromise.

Monitor Kubernetes Hosts with OpenTelemetry

It’s 3 AM. API latency just spiked from 200ms to 2s. Alerts are firing, and users are frustrated. You SSH into the first server: top, free -h, iostat — nothing unusual. On to the next host. And the next. That’s how most of us learned to debug. The tools worked, and we got good at using them. But as infrastructure became distributed and dynamic, this approach started to break down. Modern monitoring needs more than SSH and top. It needs unified telemetry.

Densify Talks, CNCF, OpenAPI, and Kubernetes with Dan Ciruli from Nutanix

<span data-mce-type="bookmark" style="display: inline-block; width: 0px; overflow: hidden; line-height: 0;" class="mce_SELRES_start"></span> Andrew Hillier sits down with Dan Ciruli, who leads the cloud native product management team at Nutanix.

14: CNCF, OpenAPI, and Kubernetes with Dan Ciruli from Nutanix

Andrew Hillier sits down with Dan Ciruli, who leads the cloud native product management team at Nutanix. Dan’s got some great stories from his days at Google—back when cloud native and Kubernetes were just getting started, in addition to the knowledge and wisdom he picked up along the way.

Build an automated ETL pipeline for cryptocurrency data with CircleCI

To stay ahead in the crypto world, you need latest information about cryptocurrencies. With so many coins out there and prices changing all the time, knowing which ones are doing the best gives you a quick snapshot of what’s hot right now. Whether you’re investing, just curious, or trying to understand the market better, this information makes it easier to spot trends and make smarter decisions.

Cisco ASA and IOS Vulnerabilities Expose Critical Systems, Making Edge Automation Essential for Rapid Remediation

The launch of Puppet Edge this week could not have been more timely. Within a day of its general availability, Cisco disclosed a vulnerability in its IOS and IOS XE software, followed almost immediately by an Event Response detailing two additional critical-severity CVEs affecting its firewalls.

SQL NTILE Function Explained with Practical Examples

NTILE in SQL transforms raw lists into structured distributions before they reach a dashboard. By pushing distribution logic upstream, it divides ordered rows at the query level, so the insights in Power BI, Tableau, or Excel are accurate, not just polished visuals. This guide explores its syntax, practical examples, and comparisons with other ranking functions. It also highlights how database IDEs bring NTILE insights directly into analysis and reporting workflows. Table of contents.

Learn to mock your MySQL database and get realistic test data without the hassle of a live server!

Proxymock allows you to record real interactions between your application and a MySQL database. Use proxymock to simulate your database during local development and testing. Get real data without running a live MySQL server. Modify mock responses to fit your testing needs. Simplify your testing workflow and replicate production data easily.

Security vs. ops: the two sides of reliability

Security and ops work together to keep your systems reliable, but why do we treat them so differently? Reliability results start when you proactively take charge of your infrastructure and application risks. Transcript: When we talk about reliability in the software space and the digital operations space, you really end up falling into these two different mindsets.

Why Organizations Choose Cycle for AI

The race on AI is heating up; the next generation of vibe coders and prompt engineers are entering the job market as we speak, and AI is the hottest line item on most IT budgets this year. Building the next great home automation software or adaptive learning platform with cutting-edge machine learning is great and all, but like all great software, it needs to start with the plumbing.

Harness Named a Leader in the 2025 Gartner Magic Quadrant for DevOps Platforms For the Second Consecutive Year

Harness has been recognized as a Leader in the 2025 Gartner Magic Quadrant for DevOps Platforms for the second consecutive year. To us, this recognition acknowledges our solutions and capabilities that we offer. . Harness continues to support the full software delivery lifecycle and remains focused on expanding capabilities and improving adoption across industries.

Now Available: Ready-to-Use Policies - Guardrails You Can Activate Instantly

Policies are essential for maintaining secure, cost-effective, and compliant infrastructure. But writing them from scratch takes time, time most teams would rather spend on building and shipping. The intent is there. Platform teams want to enforce standards. But figuring out what to enforce, writing the logic, validating parameters, and wiring it all into workflows adds friction. Even experienced teams find themselves blocked not by policy frameworks, but by the overhead of getting started.

Audit log streaming for real-time security visibility in your CI/CD pipeline

Security and compliance teams face a critical challenge: by the time they discover suspicious activity in their development pipeline, it’s often too late to prevent damage. Manual audit log requests create bottlenecks that delay incident response, and gaps in visibility leave organizations vulnerable to insider threats and compliance violations. If your team struggles with any of these issues, you need a systematic approach to real-time audit monitoring.

Complete Guide to HAProxy Visibility Using Promtail and Loki

HAProxy is the workhorse in front of countless APIs and apps because it’s fast, lean, and flexible. Because it sits on the traffic hot path, it’s also your earliest warning system when something slows down or breaks entirely. This means that monitoring it isn’t optional. You need to see connection queues and retries, per-stage timings, health-check failures, and spikes in error statuses to catch incidents before users do.

Streamline your code reviews with automated acceptance criteria checks | Rovo Dev | Atlassian

In this video, we explore how the Code Reviewer feature can enhance your pull request workflow by automatically analyzing your code against acceptance criteria from linked work items. Watch as we demonstrate how Code Reviewer scans your pull request and the associated ticket—checking the summary, description, and custom fields to ensure all requirements are addressed.

Visualize Jenkins CI/CD Pipelines: Introducing the New Jenkins Data Source Plugin in Grafana 12.2

Grafana 12.2 introduces the new Jenkins data source plugin, giving you real-time insights into your Jenkins CI/CD pipelines. With easy setup, you can connect your Jenkins instance and explore two built-in dashboards: See how Jenkins data becomes instantly actionable inside Grafana.

Build a versatile query agent with RAG, LlamaIndex, and Google Gemini

As a developer, you often face the challenge of retrieving information from multiple sources with different structures. What if you could create a single interface that automatically routes queries to the right data source? Imagine your application needing to answer both “What’s the population of California?” and “What are popular attractions in Hawaii?”.

Workstations at Scale: Challenges, Trade-offs, and Emerging Solutions

Scaling workstation deployments is never straightforward. IT teams need to balance performance, cost, manageability, and security, but the trade-offs between approaches can be significant. In this blog, we’ll walk through the three most common deployment models, highlight their limitations, and introduce how newer solutions like Computle illustrate a different path.

Anbox Cloud 1.27.0: what's new?

In this video, the Anbox team covers new features and changes in their latest 1.27.0 release: What is Anbox Cloud? Anbox Cloud lets you run virtualized Android environments securely, at any scale, to any device letting you focus on your use case. Run Android in system containers, not emulators, on AWS, OCI, Azure, GCP or your private cloud with ultra low streaming latency.

Understanding Variable Costs In The Cloud

Cloud costs don’t wait for your finance team to catch up. They spike on product launches, dip when usage slows, and sometimes blow past forecasts overnight. Every container spun up, every gigabyte stored, and every terabyte transferred adds to the tab. The main culprit is often variable costs. In this guide, we’ll break down how variable costs affect budgeting and the strategies you can use to turn cost variability into a competitive advantage.

GitKraken MCP: Give Copilot & Cursor the Git Context They're Missing

AI assistants like Cursor and GitHub Copilot are fun to play with. They autocomplete code, refactor functions, and occasionally argue with you about whether you really needed that semicolon. But the moment you ask them to do something grounded in your repo, say, “start work on JIRA-123”… you hit a wall. They don’t know your branching conventions. They don’t know how your team links issues.

Kubernetes v1.34: What You Need to Know

Kubernetes v1.34, codenamed “Of Wind & Will (O’ WaW)”, brings a wide range of enhancements aimed at making clusters more efficient, secure, and easier to manage. This release delivers 58 enhancements with 23 graduating to Stable, 22 entering Beta, and 13 in Alpha, reflecting the platform’s continued maturation as enterprises scale their container orchestration needs.

Introducing Puppet Edge: The Future of Network and Edge Device Management

Discover how Puppet Edge is revolutionizing infrastructure management by combining declarative and imperative approaches into one powerful solution. In this video, Margaret Lee, a product leader at Puppet, introduces Puppet Edge and explains how it simplifies the management of your entire estate, from network to edge devices.

#049 - The AI Translator: Using LLMs & MCP for K8s Operations & Self-Healing Infra with Alexei Le...

In this episode, Itiel Shwartz kicks off a series on MLOps, LLM, and GenAI in Kubernetes. Starting with Alexei Ledenev, who has over two decades in software development and deep experience in cloud architecture and distributed systems. He shares his journey from CoreOS Fleet to his current role on the Platform Team at Doit.

dbForge Hits a New Milestone With 2025.2!

A new milestone is achieved with the release of dbForge 2025.2! This time, it brings you enhancements in dbForge AI Assistant, long-awaited user interface improvements, and further significant advancements across dbForge tools for SQL Server, MySQL/MariaDB, Oracle, PostgreSQL. Interested in learning more? Let’s get started!

SecureBridge 11.0, EntityDAC 3.5, and dbExpress Drivers: Latest Versions Released

We are excited to announce a new release of SecureBridge, EntityDAC, and the dbExpress product line. This update introduces support for the newest RAD Studio version, enhances compatibility with popular databases, and adds new components and options to simplify development and strengthen security.

Authentication Methods and Features in Microsoft Entra ID

What is Microsoft Entra ID, and why is this important in the database ecosystem? If you have ever used the Microsoft Authenticator app, you’ve already seen how the MS Entra ID works, but only a fraction of it. Microsoft Entra Identity (formerly Azure Active Directory) is a modern identity platform that unifies authentication, strengthens access controls, and integrates with the applications and databases businesses rely on most.

The growing DDoS threat UK businesses can't ignore

In today’s small and medium-sized UK businesses, most of the cybersecurity budget goes into protecting data and strengthening authentication. Those are important measures – the cost of a data breach is still enough to close your firm, after all. But they aren’t enough. Because in 2025, thanks to increased capabilities of DDoS attacks, you don’t have to lose your data to lose your business– you just have to lose access to it.

Introducing SCION with Anapaya and Megaport

Discover how SCION is transforming internet routing, and how Anapaya and Megaport Virtual Edge are simplifying its adoption. Enterprise internet routing is anything but simple. With distributed endpoints, data security, and service reliability to consider, it doesn’t take long for network teams to feel overwhelmed with a complicated mess of workarounds and add-ons that become increasingly difficult to oversee and manage.

Introducing Chunk: The agent that validates code at AI speed

The software development landscape has fundamentally shifted. Teams are shipping more value faster than ever, leveraging AI to generate code at unprecedented speed. But with this velocity comes equally dramatic complexity — small teams now face challenges that once belonged only to large organizations, while large organizations grapple with codebases that strain human comprehension. This complexity isn’t a bug — it’s a productivity opportunity to embrace.

Meet Rovo Chat in Bitbucket: your AI-powered teammate

We’re excited to announce that we are starting a phased rollout of Rovo Chat in Bitbucket Cloud today! It will be available to 100% of users by September 30th, 2025. To check if it’s available on your account, look for the Rovo Chat button in your top navigation. Rovo is Atlassian’s AI-powered teammate that helps you find what matters, learn faster, and take action right in your workflow.

Cortex is now available in the Devin Marketplace, keeping your AI within the guardrails of your org wide best practices

We are thrilled to announce that the Cortex Model Context Protocol (MCP) is now available in the Devin marketplace. This integration connects the world’s first AI software engineer with the real-time context of your entire engineering ecosystem, as managed and measured by Cortex. The rise of AI software engineers like Devin fundamentally changes how organizations tackle their biggest technical challenges.

AI Hype vs. IT Reality: What to Expect from an IT Automation Platform That Actually Delivers

Artificial intelligence is everywhere: in keynotes, press releases, product announcements, and quarterly roadmaps. From help desk chatbots to predictive analytics, AI is being positioned as the silver bullet for almost every IT challenge. But under the surface, most IT leaders are asking a much more practical question: What’s actually real?

Agents of IT Podcast Episode 02 Sean and Ari's Hot Takes Agentic AI: Hype, Hope, or Hell No?

Welcome to the very first episode of Sean and Ari’s Hot Takes! AI agents are everywhere, but are they really ready for enterprise IT? In this episode, Resolve CEO Sean Heuer and COO Ari Stowe sit down with in-house guest Ian, flipping the mic for a hot-take-heavy discussion on the evolution of AI agents, what “production-ready” really means, and why context and guardrails matter more than hype. From real-world automation wins to the misconceptions about AI “black boxes,” we break down.

Key APM Metrics You Must Track

Application Performance Monitoring (APM) helps you understand how your software runs in production. When you track the right metrics, you see how requests move through your system, where slowdowns happen, and how resources are being used. With this knowledge, you can spot issues early and keep your applications reliable for your users. In this blog, we discuss the key APM metrics to monitor, grouped into categories, and why each one matters for performance and user experience.

Smarter AI Cost Optimization With Guardrails That Scale

AI adoption is reshaping how organizations innovate. It’s also driving cloud costs higher. CloudZero’s State Of AI Costs In 2025 report finds that for mature FinOps and engineering leaders, visibility into AI costs is a critical first step, but it’s not enough. To enable fast, responsible AI and machine learning innovation at scale, teams need pragmatic, flexible guardrails. They don’t need rigid budgets or knee-jerk shutdowns that slow progress or push teams into shadow ML.

{Unscripted} AI Verification and Rollback

Our first AI/ML capability, Continuous Verification, made Harness the first Continuous Delivery tool to understand observability telemetry and trigger rollbacks when deployments caused trouble. We knew we could do more to eliminate the friction involved in its setup. Deploying with confidence shouldn't require a coordination meeting between DevOps, SREs, and developers just to configure the right health checks. That’s why we’re introducing the next generation: AI Verification and Rollback. We’ve moved beyond just AI-powered analysis to AI-powered setup.

Automate Your Infrastructure Analysis with Scheduled AI Reports

The least exciting part of an operations or SRE role is often the manual, repetitive task of generating reports. It’s the Monday morning scramble to summarize weekly infrastructure health for the team, or the end-of-quarter push to build a capacity planning document. This is boilerplate work that pulls you away from critical engineering tasks. We believe that if a process is repeatable, it should be automated. That’s why we’re introducing Scheduled AI Investigations and Insights.

ECS Vs. EKS Vs. Fargate: AWS Container Services Compared

Amazon Web Services (AWS) provides more than 200 services. Among those, Amazon Elastic Compute Service (ECS), Elastic Kubernetes Service (EKS), and AWS Fargate help deploy and manage containers. Choosing between these services can be challenging. They seem similar on the surface (and are all popular). But each offers unique benefits and limitations. In this guide, we compare the three services, discussing the best use cases for each, and helping you choose the best fit for your business.

Acumen - AIOps Automation Platform

Ribbon Acumen is an AIOps & Automation platform for voice and data networks. It's comprised of a series of ready-made applications and a Builder capability that enables organizations to combine those applications with AI to create custom workflows. Those workflows can analyze and react to data from devices and applications, enabling Acumen to automate network deployment and operations, as well as provide tools to rapidly resolve issues.

Lightning-Fast Kubernetes Management with Rancher's Vai Project

If you manage Kubernetes at scale with Rancher, you know that UI performance is not just a “nice-to-have”—it’s crucial for productivity. The Rancher team is on a continuous journey to enhance our platform’s ability to handle increasingly complex environments. In this post take a deep dive into an exciting, evolving improvement we’ve been developing: a project codenamed “Vai” (also called UI Server-Side Pagination or SQLite-backed caching).

Introducing Cortex MCP in Devin: AI Engineering Guided by Your Best Practices

Cortex is now in the Devin Marketplace keeping your AI within the guardrails of your org wide best practices With this integration, Devin, the world’s first AI software engineer, can use Cortex data & best practices, like Scorecards, to understand your engineering standards and automatically fix issues at scale. Here’s how it works: Watch the demo to see how Eyal, one of the engineers at Cortex, used Cortex & Devin to turn best practices into action.

How to Connect Jaeger with Your APM

Microservices make it tough to understand how applications behave end-to-end. Most teams already rely on an Application Performance Monitoring (APM) tool to track system health. But as requests move across many services, you also need distributed tracing. Jaeger gives you that visibility. The real value comes from connecting the two. Instead of running APM and Jaeger in silos, you can combine their strengths, metrics from your APM, and traces from Jaeger, to get a clearer view of performance.

Feature Spotlight: PostGres Mocking #speedscale #postgres #postgresql

Struggling with integration testing because of a Postgres database dependency? Testing can feel impossible when you need to replicate realistic data in Postgres. That’s where Speedscale’s Postgres Mocking Tool comes in. Speedscale drops a recorder into your live system, observes all Postgres traffic, and shows you the actual sequence of statements and responses—making it easier than ever to test reliably.

Major DAC Update: Expanded Database Support, Enhanced Security, and AI-Powered Features

We are thrilled to announce a significant update across our DAC (Data Access Components) product line. This release focuses on expanding compatibility with the latest development environments, extending support for modern databases, strengthening security, and adding powerful new functionality for AI-driven applications.

Building a .NET PostgreSQL MCP Server for Claude Desktop

Querying PostgreSQL in plain English is no longer a futuristic idea; it’s something you can build today. With a Model Context Protocol (MCP) server, Claude Desktop connects securely to external systems, including databases, through a structured interface. Here, MCP acts as the bridge between Claude and PostgreSQL, allowing it to run queries, explore schemas, and return results instantly in Markdown. This guide walks you through building that bridge in C#.
Sponsored Post

Accelerating Cloudnative Development & DevOps

Cloud-native development, and the resultant rise of DevOps, has transformed how software is built, deployed, and maintained. By embracing containerization, microservices, and continuous delivery, organizations have been able to deliver features faster, scale with demand, and recover from failures more gracefully than ever before. Many organizations are adopting these practices to keep up with industry demands and improve efficiency and security. But this speed and flexibility come with a significant cost - complexity.

The real reason your AI initiatives are failing

AI has made it faster and easier to change a codebase than ever before. But in a system as complex and interdependent as modern software delivery, writing code has never been the biggest challenge. For most teams, the real constraint is getting that code safely into production. So while AI assistants and autonomous coding agents have dramatically accelerated the pace of change, for many organizations those changes are piling up against bottlenecks that were already slowing them down.

Reliability means smooth on-call and a strong team

True reliability is when your engineers have confidence in their systems and their teams. Full transcript: Reliability to me means my on-call shift is gonna be smooth because everybody is making the attempts to be smart about the type of code that we're writing. And we're regularly testing to make sure that our system has redundancy and can withstand latency spikes, it can withstand resource spikes.

Introducing PostgreSQL Static Data in Flyway

One kind of data in most relational databases is what we call static data. This is also referred to as lookup data, code data, domain data or even list data. Whatever you like to call it, it’s usually smaller data sets consisting of data that never changes, or changes very slowly. One example might be Canadian postal codes. Another example, and one I’m going to use, is the amateur radio band definitions within a given country.

Azure Integration Services and AI

Join Mick and Sebastian as they dive deep into the world of enterprise integration, exploring the evolution from BizTalk Server to Azure Integration Services and the growing impact of AI on integration projects. Discover how integration is crucial for breaking down data silos to power AI models, the importance of data privacy and compliance especially in the EU and the challenges developers face in keeping up with rapid technological change.

Modern Integration Stratergies: Moving from BizTalk to the Azure cloud

In this episode, Martin Abbott sits down with Ahmed Bayoumy, a Microsoft MVP and integration expert, to discuss his unique journey into the world of integration - from a background in construction project management to becoming a passionate integration professional. They explore the evolution of integration technologies, the impact of AI on integration, and the importance of adapting to new tools and trends.

The Best Cloud Cost Allocation Methods, Explained

All the major cloud providers enable users to attach business context to their infrastructure in some way. This process — known as cloud cost allocation — is how companies map spend to the teams, products, or features driving it. Done well, cost allocation fuels smarter business decisions. It connects cloud bills to business value, helping teams not just control spend but also understand unit economics and margins.

Go beyond the dashboard: Operationalize DORA with our new Scorecard and Academy course

If you've adopted DORA metrics as your standard for measuring DevOps performance, stop us if this hypothetical scenario doesn't sound familiar. You check your DORA dashboard during a lunch break, full of optimism that you’ll get a clear picture of your team’s performance. Instead, you leave with nothing but a sandwich in your stomach and the nagging feeling that you’re focusing too much on the results of the game instead of the people that are playing it.

How to build an awesome cloud gaming platform with Anbox Cloud

Cloud gaming is changing the way we play. Instead of buying expensive hardware, players stream games from the cloud, like Netflix for games. This is no longer a futuristic idea, it’s here. Services like NVIDIA GeForce Now, Sony PS Plus, and Xbox Cloud Gaming have shown what’s possible: playing high-end games on low-end devices by streaming all of your favorite games – from indie to AAA – from powerful cloud servers.

Rightsizing Cloud Infrastructure: Stop Leaving Money On The Table

In FinOps, rightsizing means adjusting cloud resources (instance types, number of CPUs, amount of memory, storage, databases, containers, and many other configuration parameters) to match actual workload requirements. It’s one of the most powerful levers in the FinOps toolbox, and for good reason. Consider: Average CPU utilization across Kubernetes clusters sits at just 10%, according to Cast AI’s 2025 Kubernetes Cost Benchmark Report.

AWS Prometheus: Production Patterns That Help You Scale

You've got Prometheus running in one cluster — maybe a dev environment, a single EKS cluster, or a proof-of-concept setup. The configuration is straightforward: node_exporter on a few EC2 instances, some service discovery for pods, and a single Prometheus server scraping everything. Storage is local, retention is 15 days, and you can keep all the default recording rules without worrying about costs.

From Feathers to Fiber: The Fast Lane of Battlefield Intel

Carrier pigeons once flapped through war zones with vital messages. Today? Sensor fusion delivers battlefield intel faster than thought. The fight isn’t just about who has the data, it’s about who moves on it first. Speed wins wars now! Read how it’s changing the game, how deployable command and control (C2) nodes can help, and why network slicing is a key ingredient in building a resilient, high-speed transport network: Carrier Pigeons to Sensor Fusion: Speed Matters in Information.

The Blind Spots That Haunt Legal IT

In a recent survey, Udacity’s team explored the evolving landscape of AI adoption by asking 2000 professionals (including those in the legal sector) if they used AI. Unsurprisingly, over 90% of respondents said they did. More concerning, 72% of managers reported personally paying out of pocket for AI tools to use at work, introducing uncontrolled risk into corporate environments.

8 IT Issues You Can Fix with Pulseway Mobile App

If you have worked in IT for more than five minutes, you know Murphy’s Law: anything that can go wrong will go wrong sooner or later (sometimes at the worst possible time). Systems do not wait for you to finish your coffee, much less for you to get back to your desk. So, yes, we know your pain. That is why more IT pros are leaning on mobile tools–like Pulseway RMM Mobile App–to stay ahead of problems instead of scrambling back to the office or dragging around a laptop everywhere.

Zero Ticket Video Series: Accelerated Troubleshooting & Diagnostics

In this real-world demo, watch how to use Resolve’s RITA Agent to accelerate troubleshooting, reduce resolution time, and capture knowledge without writing a single script. RITA’s Technician Assist works behind the scenes to: With RITA as your AI-powered co-pilot, IT teams can shift from manual triage to intelligent resolution, capturing fixes in the moment and transforming them into future automations.

Stop Fighting Kubernetes to Go Multi Region

Every engineering leader eventually asks the same question: What happens if my cloud region goes down? This isn't unheard of, or even rare, and the stakes are obvious. A single-region deployment might work fine on day one, but it leaves you exposed: one outage, one fiber cut, or one bad update from your provider, and your application is offline. In some cases, your entire business could be at risk. That's why recently, multi-region architecture has become the gold standard.

Database Monitoring Challenges Every DevOps Engineer Should Know

Databases form the critical foundation of modern applications, and maintaining their performance and reliability is essential for operational efficiency and user satisfaction. Effective database monitoring however presents numerous challenges. Modern systems produce extensive metrics, operate across diverse environments, and must scale in line with growing workloads, all while ensuring compliance and security.

From Productivity to Performance: SQL Prompt's Next Chapter with AI

For nearly 20 years, SQL Prompt has been a trusted companion for database professionals, helping them write cleaner code, reduce errors, and save time. From its earliest days, the focus has been on productivity while removing the repetitive, mundane tasks that slow people down. It’s been about building the reliability and performance that developers and DBAs depend on every day and that story doesn’t stop here.

Cloud vs Colocation: Choosing the right solution for your business

When you’re planning your IT strategy, deciding between cloud computing and colocation services isn’t always simple. Each option comes with its strengths and potential pitfalls. And with tech at the heart of most modern operations, knowing where to house your data and infrastructure is a big decision, and one that could shape your business's future. So, which one is best for your business: cloud computing vs colocation?

Simple Talk Podcast - Coffee Chat with Tom Hodgson

Steve Jones is at Redgate’s Cambridge, UK headquarters once again, this time joined by Tom Hodgson, Innovation Lead on Redgate’s Foundry team. Tom gives an overview of The Foundry, which is Redgate’s innovation hub, offering some insight on what they do and what they’re working on (spoiler: it involves AI)! Also, hear Tom’s thoughts on AI & LLMs in general – including why sometimes just the basic version of a model is enough.

From insight to impact: Key takeaways from our DORA webinar with Nathen Harvey

For most engineering leaders, getting a DORA dashboard up and running feels like a huge win. You can finally track performance, compare it to industry benchmarks, and report on your progress. But then a nagging question settles in: how do you actually make the numbers go up? That frustration points to a common gap between the dashboard and the daily engineering practices that drive those outcomes.

Canonical achieves IEC 62443-4-1 compliance in Industrial Automation and Control Systems

Canonical is proud to announce it has achieved compliance with IEC 62443-4-1 for cybersecurity in Industrial Automation and Control Systems (IACS). Building on Canonical’s existing ISO/SAE 21434 certification, this milestone expands Ubuntu’s leadership in securing critical infrastructure at the intersection of IT and operational technology (OT) environments.

Making AI Costs Make Sense: A FinOps Guide To Tagging And Tracking AI Spend

AI is reshaping the cost landscape. As a positive person, I’m going to call this change exciting! FinOps teams are integrating AI into cloud platforms and incurring the spend that comes with it. As a FinOps strategist who has helped several companies optimize cloud spend across industries, it became evident that clarity around AI spend unlocks swift, smart decisions. That’s AI … optimized.

Wipe It Clean, Sell It Smart: The Right Way to Retire Your Servers

Selling servers is about more than moving hardware, it's about controlling risk. Inside every decommissioned unit sits sensitive data most businesses forget to address. Deleting files or formatting disks doesn't erase information. That oversight can cost millions, damage trust, or lead to serious legal consequences. Before a sale, every server must go through proper data sanitization. This process isn't just technical, it's strategic. It protects your clients, your team, and your brand. This guide walks you through the process, from data erasure methods to resale pricing. The goal.

Why AIX Automation Starts with Better Monitoring: How Galileo Powers Smarter Action

If your automation can’t trust the data it’s acting on, it’s not automation. It’s a guess. That’s why AIX automation monitoring is the foundation for success. Many teams encounter this gap when trying to automate AIX operations. Red Hat Ansible Automation Platform (AAP) and Event-Driven Ansible (EDA) can absolutely streamline routine tasks, like expanding filesystems or tuning adapters. But every playbook still depends on one thing: accurate, real-time monitoring.

Troubleshooting Microservices with AI

Ever found yourself saying, "But it works on my machine!" when a bug pops up in a microservices environment? It's a common and frustrating problem. Unlike a monolithic application, microservices are a collection of independently deployed services that communicate with each other. This complexity makes it difficult to reproduce real-world issues on your local machine, as you may not have all the necessary services and dependencies running. But what if you could take a snapshot of a running application's behavior and bring it home for debugging?

Breaking Barriers: Insights From an Internet Pioneer

What does it take to shape the backbone of the internet? For Raphael Maunier, it started with three failed bachelor’s attempts and a leap into configuring Cisco routers with zero formal training. In this episode of Uplink, Raphael Maunier, Co-Founder and COO of OpsMill, joins host Michael Reid to share the remarkable story of how chance, risk-taking, and vision led him to co-found multiple internet infrastructure companies—France IX, Acorus Networks, and more—ultimately acquired by global tech players.

5 Key Azure FinOps Principles for Ultimate Cost Control.

Seeking ways to slash Azure costs? This video breaks down the challenges and reveals how FinOps can be the strategic answer to optimizing your Azure resources that can help you maximize Azure cost savings. Furthermore, learn how the Azure Cost Management Tool from Turbo360 (formerly Serverless360) perfectly aligns with the five essential FinOps principles, enabling you to boost your Azure savings by up to 30%.

What is Asynchronous Job Monitoring?

Modern applications don’t process everything inside the request/response path. To keep APIs responsive, time-consuming work like image resizing, payment processing, or data syncs is moved into background queues. Workers then pick up these asynchronous jobs and run them outside the main thread. Asynchronous job monitoring is the practice of tracking these background tasks: Without this visibility, background workers become a blind spot.

AI's Impact on Software Devs Productivity & Downsides

AI is boosting flow, job satisfaction, and productivity for devs. But here’s the twist: it’s not actually giving us more time to do valuable work. In his GitKon talk, Nathen Harvey digs into the real impacts of AI adoption in technical teams. Yes, AI reduces toil and improves satisfaction. But it also shifts how we spend our time and sometimes that means less “valuable work” gets done. So the question becomes: how can we use AI not just to work faster, but to work better?

How Data Centre Energy Consumption Impacts Business Efficiency and ESG Goals

Data centre energy use has become a critical factor in business infrastructure strategy. Once a background cost, it now plays a direct role in decisions about operational resilience, sustainability reporting, and future capacity planning. The scale of consumption is hard to ignore. Even smaller facilities can draw between 1 and 5 MW of continuous power, enough to supply thousands of homes. Larger hyperscale environments consume significantly more, 20 MW to over 100 MW of power.

What Does a Carrier Neutral Data Centre Really Mean for Your Business?

The demands placed on digital infrastructure have changed. Businesses are adding locations, connecting to cloud platforms, or responding to changing compliance requirements. Rigid network contracts and fixed provider models no longer make operational sense. Carrier neutral data centres offer a different approach. By enabling provider choice, flexible routing, and integration on your terms, they give infrastructure teams more control, and more room to move.

2025 Data Centre Security Threats: What Business Leaders Need to Know

Security expectations around data centres have changes, and it’s no longer enough to secure the perimeter or roll out a firewall policy. Today’s infrastructure relies on interconnected systems, from APIs and cloud endpoints to HVAC controls and building access tools, all of which can be targeted, misconfigured, or exploited.

The AI Cost 'Black Box' - And How CloudZero Provides Clarity Into Spend

AI adoption continues to explode, and so do their costs. By mid-2025, enterprise LLM spend had already hit $8.4 billion, more than double the year before. And in a major shift, Anthropic recently overtook OpenAI as the enterprise leader. Their Claude models are now core tools for companies adding generative AI technology into their products and workflows. CloudZero recently announced we are the first cloud cost platform to integrate with Anthropic.

How You Can Use Network as a Service (NaaS) to Future-Proof Your Network

Support global growth, AI integration, and complex use cases with a scalable, programmable connectivity layer. In a recent blog, we explored exactly what Network as a Service (NaaS) is and how it has redefined connectivity for enterprises. But in this blog, we take the next step of exploring how adopting NaaS future-proofs your network.

Best Practices for SQL Formatting: Write Clear and Consistent Code

Inconsistent SQL formatting is a silent productivity killer. The database will execute it, but for developers, poorly structured queries lead to slower reviews, harder debugging, and errors that slip through unnoticed. Over time, this lack of consistency compounds into costly technical debt. This guide shows how to format SQL code so it remains clear, consistent, and easy to maintain.

Mastering the User Off-Boarding Process

When someone leaves your organisation — whether they resign, retire, or are let go — it’s easy to think the hard work is over. But the moment an employee’s last day arrives, a new risk window opens. If their access isn’t revoked properly or their data isn’t captured, organisations face security breaches, data loss, compliance issues, and rising costs. This is why a well-designed user off-boarding process is just as important as onboarding.

AI's False Efficiency Curve: How To Save And Protect Your Margins

The popular narrative around AI economics is changing. At one time, Moore’s Law conditioned us to expect that smarter, faster computing would steadily get cheaper. When it comes to AI, that expectation holds true at the unit level. Per-token costs are indeed declining. But the number of tokens consumed per task is growing exponentially, making total costs spike. The tension here is important: on paper, inference is getting cheaper.

The 5 Generations of Programming

Programming just evolved from syntax-focused to intent-driven development. GitHub Cloud Solutions Engineer Ambily Kavumkal Kamalasanan breaks down how we've moved through 5 generations of programming languages, from binary code to natural language processing + why GitHub Copilot is changing how developers actually work. Perfect for developers tired of syntax battles and ready to embrace AI-assisted workflows that actually save time.

Build or Buy AI? Why Homegrown Service Desk Tools Fail (and How Leading Vendors Get It Right)

Service desk AI has become one of the hottest topics in IT. Everyone wants to slash ticket volumes, resolve incidents faster, and give employees the kind of instant, self-service support they expect in the modern workplace. The appeal here is obvious: fewer tickets, happier users, and IT teams finally freed from the grind of repetitive, reactive firefighting.

Using DCIM to Consolidate Multiple Tools for a Single Source of Truth

Modern data centers depend on multiple teams, each using their own systems—CMDBs, ticketing platforms, cloud and virtualization tools, network and server management software, observability stacks, collaboration apps, and countless spreadsheets. Each tool provides important insights, but together they create a complex and sprawling technology landscape.

Logstash Alternative: Why Security Teams Are Choosing Modern Data Pipelines

Logstash has been a workhorse in data processing pipelines for years, but it was not designed with today’s security operations in mind. Security teams now deal with massive telemetry volumes, rising SIEM costs, and diverse log formats that require constant normalization. In this environment, Logstash shows its age: manual configuration, outdated parsing, and scalability bottlenecks introduce fragility instead of efficiency.

Kubernetes Service Discovery Explained with Practical Examples

In Kubernetes, applications are constantly changing — new pods start, old ones shut down, workloads shift across nodes. The challenge is making sure that different parts of your system, and even external clients, can still find each other when the actual locations keep moving. That’s what service discovery handles. It provides a stable way for applications to connect and communicate, no matter where they’re running or how often the underlying infrastructure changes.

Looking Back, Looking Ahead: Thoughts on My First Year at Speedscale

When I started at Speedscale, I looked like this: And after one year of learning, growing, and keeping pace with innovation well, let’s just say the journey has left its mark: Of course, I’m joking (sort of). The truth is, this past year has been intense, energizing, and filled with new challenges. If anything, it’s made me feel younger in spirit, even if the mirror might disagree some mornings.

You Built Your Own Certificate Management System - It's Already Broken

You were tired of renewing all those certificates, and Certbot looked so easy. Now you have scripts thousands of lines long filled with command line incantations you have to Google every time you open it. The script is running on all the critical servers. And some of the printers. If someone looks at it the wrong way, a certificate expires.

The Ultimate Guide To Container Orchestration Tools

Managing containerized applications or microservices can be difficult. It is even more demanding and prone to error if you do it manually. So, what’s the alternative? Container orchestration. Container orchestration is an automation technology that enables engineers to coordinate when containers start and stop, schedule and execute tasks, manage failovers, and perform recovery processes. The technology helps automate these tasks throughout a container’s lifecycle.

What is Database Monitoring? A Guide for Developers, DevOps, and SREs

Databases handle critical operations for applications, from online banking to e-commerce and streaming services. Any slowdown or failure can directly affect application performance and user experience. Database monitoring tracks performance, detects issues, and helps prevent downtime. It also ensures efficient use of resources, maintains security, and supports compliance requirements.

Background Job Observability Beyond the Queue

Background jobs handle the critical work that happens outside the request path: processing payments, sending emails, generating reports, syncing data. They keep applications running smoothly, but the signals they produce look different from API endpoints. Most teams start with queue metrics—how many jobs are waiting and how quickly they complete. These metrics provide the foundation, but job health extends beyond throughput.

Simulating Multi-Agent Workflows to Find Hidden API Vulnerabilities

API gateways are often viewed as the centralized entry point for client HTTP requests in a distributed system. They act as intermediaries between clients and backend services, managing API request routing, load balancing, rate limiting, access control, and traffic shaping across multiple backend services. This API management is vital for many services and products, but many organizations can put too much stock in it.

Snowflake Pricing In 2025: Your Usage And Cost Guide

Snowflake’s scalable architecture, minimal latency, advanced analytics, simplified data handling, flexible pay-as-you-go model, and always-on security make the data cloud a top choice for many businesses. You can also purchase Snowflake resources on demand or upfront. But if you struggle to control your Snowflake costs, you’re not alone. With the help of this guide, you’ll know how to manage your Snowflake costs better.

What is Service Catalog Observability and How Does It Work?

A service catalog gives teams a shared view of their systems—what services exist, who owns them, how dependencies are structured, and the SLAs that guide expectations. It’s an important part of development infrastructure because it helps everyone speak the same language about services. Service catalog observability builds on that foundation.

Configuring Data Loss Prevention

Redacting PII (DLP): Speedscale can be configured to redact personally identifiable (PII) or other sensitive information (PII) from traffic via it's data loss prevention (DLP) features. This redaction happens before data leaves your network, preventing the Speedscale service from seeing the data at all. However, the overall shape or structure of the data is retained in order to facilitate useful testing against systems.

Strategic career decisions ft. Cate Huston, Engineering Director at DuckDuckGo

In this episode of The Confident Commit, Rob Zuber sits down with Cate Huston, Engineering Director at DuckDuckGo and author of "The Engineering Leader," for a deep dive into career ownership and sustainable engineering leadership. Cate challenges the common misconception that career growth equals promotion, introducing the concept of being the "directly responsible individual" for your own career and the crucial difference between "buying" versus "renting" your skills in the marketplace.

AI Reliability Insights: How to Build a Gremlin MCP Server

Gremlin’s Reliability Intelligence helps teams uncover the cause behind failure modes so they can move faster and improve reliability without sacrificing velocity. The new Gremlin MCP Server, part of Reliability Intelligence, gives you new ways to explore your data, giving you access to insights and recommendations to improve reliability and better run your systems using Gremlin. In this webinar, Gremlin CTO Sam Rossoff shows you how to integrate your favorite LLM and use plain language to query data, uncover insights, create dynamic dashboards, and more.

Guardrails and Gains: How Flyway Brings Stability to Cloud Migrations

Avoid the pitfalls of cloud database migration with Redgate Flyway. Learn how automation, schema discipline, rollback strategies, and traceability reduce risk and enable fast, compliant cloud deployments, with these insights from John Q. Martin, Technology Partner Manager at Redgate Software. Cloud migrations promise faster releases and more flexible scaling, but a poorly executed database migration will stop you from exploiting them.

How to migrate data from Snowflake to SQL Server using SSIS

Learn how to connect SSIS to Snowflake, access your cloud data warehouse, and load data into SQL Server using Devart SSIS Data Flow Components. This clear, step-by-step guide covers setting up the connection, selecting and previewing Snowflake data, and running smooth, efficient data transfers—all without coding. Ideal for data professionals looking to simplify Snowflake integrations with SSIS.

How to make Netflix reliable: Address low-hanging fruit

Reliability doesn’t have to be fancy and dramatic. Kolton and his team dramatically improved Netflix reliability by focusing on low-hanging fruit. FULL TRANSCRIPT: My first holiday peak at Netflix, where my VP of engineering came to me and he said, "Kolton, what do you think the chance we make it through the holiday peak without an outage is?"  I thought about it for a minute and I said, "50/50.".

FireHydrant 4-Minute Demo

Get a quick walkthrough of the FireHydrant platform. FireHydrant is the all-in-one incident management platform that helps teams resolve incidents up to 90% faster — and prevent them from happening again. From flexible alerting and powerful automation to retros and AI insights, it brings clarity and control to every step of your response.

DevOps Guide to Monitoring in Serverless Applications

Serverless computing helps teams move faster by removing the need to manage servers. Code runs only when needed, scaling up or down automatically. For DevOps engineers, this means quicker deployments and less infrastructure work. But serverless also brings new challenges. Functions run for short periods, making it hard to track errors, performance, and costs.

Behind Megaport's Network Automation Platform

We’ve teamed up with the Heavy Networking podcast to take you under the hood of Megaport’s resilient, software-driven network. Luke Gollan, Network Automation Engineer at Megaport, joins Heavy Networking hosts Ethan Banks and Drew Conry-Murray to unpack what happens when you click “provision” in the Megaport portal.

Puppet Control Repository: Your Source of Truth for Infrastructure Management

Learn the fundamentals of Puppet's Control Repository with Margaret and Tony in this comprehensive walkthrough. See how Control Repos serve as your single source of truth for managing configuration across your entire infrastructure, driving collaboration and standardization while simplifying code deployments.

ClickUp Integration Guide

ClickUp integration has become a key driver of operational efficiency for over 3 million teams worldwide. By connecting the ClickUp platform with business applications, organizations simplify operations, eliminate manual handoffs, and give leaders the visibility needed to act with confidence. But achieving these outcomes requires a stable integration approach because not all methods offer the same flexibility or access.

How to Connect to MariaDB Using Devart ODBC Driver

MariaDB has become a preferred backend for many enterprises, thanks to features like Galera clustering, JSON functions, and ColumnStore for analytics. But to extract value from MariaDB, organizations need reliable integration with BI tools, ETL pipelines, and custom apps, and this is where MariaDB ODBC drivers come in. These drivers bridge the gap between your database and external systems. But not all of them are production-ready.

Best Applications and Tools for Connecting to Snowflake

Snowflake is the backbone of modern enterprise data architecture, powering over 10,600 organizations—including 800+ Fortune Global 2000 leaders. But thriving in this ecosystem takes more than just the platform. It requires a data architecture that can handle real-world complexity—and that’s where Snowflake connectors come in.

Finding the Ghost in the Machine

The industry is rapidly moving towards deeper AI integration than ever before. What was once simply focused on chatbots or recommendation engines has pivoted significantly to AI systems communicating with other AI systems. These AI tools are leveraging multi-agent workflows to accomplish complex tasks that traditional systems have struggled with. Innovation without validation is a liability. Any developer worth their salt will know that these systems require ample testability and validation.

How to Export Data from Zendesk: Best Methods Explained

Zendesk is a goldmine of customer insights, but extracting that value from its usage is not simple. Teams trying to export data from Zendesk often run into paywall restrictions, API rate limits, and third-party tools that promise simplicity but falter at scale. For organizations integrating support data into BI platforms, migrating systems, or automating reporting pipelines, these challenges stall analytics and strategy.

The End of "Good Code"? AI, Throughput, and Reliability with CircleCI CTO Rob Zuber

Is “good code” still the right measure of engineering success in an AI-driven world? In this episode of *Humans of Reliability*, Rob Zuber, CircleCI CTO, joins Sylvain to explore how coding assistants are reshaping developer workflows and changing what teams value. Rob shares what he’s seeing across CircleCI’s customer base: a clear boost in throughput, new bottlenecks shifting from code creation to code review, and the rise of “vibe coding,” where engineers trust AI-generated code they may not fully understand.

Expanding in Europe: Balancing Security, Regulation, and Innovation

What does data sovereignty mean in practice for European cloud adoption? In this episode of Uplink, Nicolas Duffour, Strategic Development Director at Cloud Temple, joins host Michael Reid to unpack how enterprises and public agencies balance compliance, security, and innovation. From leading IT for French government ministries to helping Cloud Temple secure the prestigious SecNumCloud qualification, Nicolas brings first-hand expertise in navigating Europe’s regulatory landscape.

K3s Vs. K8s: Which Kubernetes Is Right For You?

Kubernetes, also known as K8s, is an open-source, portable, and scalable container orchestration platform. With K8s, you can reliably manage distributed systems for your applications, enabling declarative configuration and automatic deployment. Yet, K8s can be resource-intensive and costly, with a rather steep learning curve. But in 2019, a lighter, faster, and potentially more cost-effective alternative appeared: K3s. Still, K3s is not a magic wand that works for all Kubernetes deployments.

What are dependencies, and how do you secure them?

Open source software is everywhere. Research shows that around 97% of codebases contain open source software, and it’s clear to see why. It’s always magical to realize that there are thousands of free-to-use, ready-built programs and code repositories that solve problems you’d otherwise need to spend weeks building the solutions for from scratch. However, like with all software, you still need to ensure that your software supply chain is secure and safe to consume.

What's the state of open source adoption in Europe?

The Linux Foundation’s latest report, Open source as Europe’s strategic advantage: trends, barriers, and priorities for the European open source community amid regulatory and geopolitical shifts, provides key insights into how European enterprises are using open source software (OSS), as well as the barriers towards further development of open source in the continent.

Granular Allocation, Accurate Unit Costs: The New Standard For FinOps In The Outcome Era

If you’re struggling to contain cloud costs in this suddenly volatile AI-fixated environment, it might be time to consider FinOps as an exercise in granular allocation and unit economics, with a focus on outcome.

APM for Kubernetes: Monitor Distributed Applications at Scale

When a payment service runs across 12 pods — each serving different customer segments — and an authentication layer spans three namespaces, performance issues can originate in both the application code and the orchestration layer. The challenge is linking request-level performance data with what’s happening inside the cluster: container CPU limits, pod scheduling decisions, and node-level events.

Unlocking insights: Introducing Step Metrics for Bitbucket Pipelines

We’re excited to announce step metrics – a new capability coming to Bitbucket Pipelines to help you better manage and optimise your CI/CD workflows. Ever wondered what’s happening under the hood during your pipeline runs? Step metrics provide a window into the resource usage of your build and service containers. More specifically, step metrics let you monitor CPU and memory usage for each build and service container in your pipeline steps.

Visualize Logs Alongside Metrics: Complete Observability Elasticsearch Performance

Elasticsearch is a distributed search and analytics engine that powers everything from log management platforms to e-commerce search bars. It excels at indexing and retrieving large volumes of data quickly, but like any complex system it can slow down under heavy load or inefficient queries.

Resolve's Agents of IT podcast - Ep. 1 - Pat Calhoun at Espressive - AI in IT: A Combined Operation

Welcome to the very first episode of Agents of IT, the podcast for IT leaders accelerating the shift to Zero Ticket IT! For our inaugural episode, Sean Heuer (CEO) and Ari Stowe (COO) sit down with Espressive founder Pat Calhoun to discuss why agentic AI and intelligent orchestration are the superhero duo that teams need to meaningfully transform IT. This is a conversation that tech leaders looking to shake things up won't want to miss.

How to migrate data from Salesforce to SQL Server using SSIS

Learn how to easily connect SSIS to Salesforce, access your CRM data, and load it into SQL Server using Devart SSIS Data Flow Components. This step-by-step guide shows you how to set up the connection, select and preview Salesforce data, and run efficient data transfers—all without coding. Perfect for data professionals looking to simplify Salesforce integrations with SSIS.

How to Automate Away Access Request Tickets for Good | The Zero Ticket IT Video Series with Resolve

Access requests: they’re one of the most common (and frustrating) tickets clogging up IT service desks. In this episode of the Zero Ticket IT Video Series, we show you how Resolve + RITA handle access requests end-to-end without opening a single ticket. Request access via Teams, Slack, or your portal RITA checks internal policies Grants pre-approved access instantly Notifies the user No manual routing. No delays. No ticket.

AWS HIPAA Compliance: A Comprehensive Guide & Checklist

Learn how to achieve and maintain HIPAA compliance on AWS with this comprehensive guide. Understand the shared responsibility model, essential architectural principles, and a practical checklist to protect PHI and avoid costly compliance violations. Discover how automation can reduce human error and streamline your security posture.

13 Cloud Cost Management Strategies (And How CloudZero Can Help)

Cloud cost management is a big deal right now. For instance, 58% of organizations say their cloud costs are too high, according to our State of Cloud Cost report. Over the last five years, several other studies have shown that controlling cloud spend is a top cloud computing challenge. There’s more. As AI adoption accelerates, a new challenge has emerged: managing the rapidly growing costs of AI in a scalable and intelligent way.

Can Your Team Survive the AI Revolution?

AI is transforming how development teams work, but many organizations struggle to adapt effectively. Snowflake Director of Product Jeff Hollan shares four essential principles for building adaptive development teams that thrive in an AI-driven world. Drawing from his experience at Microsoft Azure and Snowflake, Jeff provides practical strategies for embracing change, fostering innovation, and preparing your team for the future of software development.

The Enterprise Automation Platform Driving the Zero-Ticket Future

The surge of interest in artificial intelligence has opened exciting new doors, but many CIOs are finding themselves in the same bind: lots of promising pilots, but very few at-scale results. Intelligent agents can interpret requests, classify tickets, and even recommend fixes, but unless they are connected into broader workflows, these efforts remain isolated experiments.

Capacity Planning Still a Major Issue for Data Center Managers

Uptime Institute’s 2025 Global Data Center Survey shows that capacity planning remains a top challenge for operators. Nearly one-third of vendors identify forecasting future capacity requirements as their customers’ single biggest issue, more than any other concern. Modern data centers face new complexities as digital services expand and hybrid IT architectures shift workloads across on-premises, colocation, and cloud environments.

How To Communicate Cloud Economics To Executives Effectively

We’ve seen the same story play out time and time again in numerous SaaS companies: The problem often begins when engineers — with a technical understanding of cloud costs and a deep understanding of how to build robust products — struggle to communicate the actual business impact of their efforts to company leaders.

Mastering Kubernetes Testing with Traffic Replay

Kubernetes has become the backbone of many modern application deployment pipelines, and for good reason as a container orchestration platform, Kubernetes automates the scaling, deployment, and management of workloads, allowing developers to make their applications easier to manage and deploy at scale without worrying about their service’s dependencies, their user’s operating system, or the intricacies of their data center or infrastructure provider.

5 Ways To Align Engineering And Finance On Cloud Spend

Finance and engineering both thrive on efficiency. So when companies realize they’re wasting cloud spend, but aren’t sure where or how, both teams become frustrated. It’s remarkable how common this scenario has become, particularly over the past five years. Before the pandemic, Gartner reported that companies wasted over $14 billion on cloud services. During the cloud adoption surge in 2020, IDG found that 70% of US businesses overspent by as much as 62%.

Software-Defined Healthcare: Modernizing Through DevOps, Observability & AIOps

Healthcare delivery is undergoing a transformation unlike any other. Digital systems now shape how physicians deliver care, how practices are managed, and how patients experience the health system. From cloud-native platforms to intelligent automation, the shift toward software-defined healthcare is revolutionizing clinical operations. At the heart of this change are three critical enablers: DevOps, Observability, and AIOps. Together, they form the backbone of a modern healthcare IT environment, driving resilience, agility, and patient-centered outcomes.

Kubernetes Monitoring Metrics That Improve Cluster Reliability

A Kubernetes cluster can generate more than 1,400 metrics out of the box. That’s a lot of numbers to sift through, especially when you’re troubleshooting a production slowdown in the middle of the night. The key is knowing which metrics tell you the most, with the least noise. These are the signals worth paying attention to when you need answers fast.

Proactive testing means less stress and better results

Proactive reliability not only prevents costly outages, it also means your engineers are less stressed so they do their best work. Full transcript: It's not only helping when outages occur, but it's also helping reduce outages. It's this whole culture of blamelessness, right? And oftentimes, when you're in an environment where people are pointing fingers and saying, "Whose fault was it? And why is this thing broken?" and all these other things that are stressing you out.

How to Improve MariaDB Performance: Track Slow Queries with Logs and Metrics

Database latency rarely starts in your app layer because it’s almost always a query doing more work than it should. Metrics tell you when that happens, but slow-query logging tells you which statement did it and how. That’s gold for tracking down missing indexes, inefficient filters, or accidental full scans. Pair the logging with a some lightweight counter metrics, and you get both an early warning and a clear path to a fix.

Bringing Canonical Kubernetes to Sylva: a new chapter for European telco clouds

The telecommunications industry is undergoing its most significant transformation in decades. The move from vertically integrated, proprietary systems to disaggregated, cloud-native infrastructure has unlocked enormous potential for agility and innovation. Yet, for many operators, the challenge has been how to realize that potential while meeting the stringent performance, security, and interoperability requirements that telecom networks demand.

Re-Solution Data Ltd Among Top 5 Cisco Managed Service Providers in the UK

In today's ever-evolving digital landscape, Cisco Managed Services have become a must-have for businesses aiming to maintain secure, efficient, and scalable IT networks. These services allow organizations to tap into Cisco's state-of-the-art technologies without the hassle of managing complex infrastructures on their own. Among the top providers in the UK, Re-Solution Data Ltd has made a name for itself as a trusted Cisco partner, earning recognition as the Top Cisco Networking Provider in UK.

Considerations for Testing gRPC Streams

If you’ve spent any time building cloud-native systems, you’ve probably tripped over the tricky beast that is gRPC streaming. It’s powerful, flexible, and feels like magic when it works. But the minute you need to test it? Suddenly, you’re in “hold my coffee, I need a week” territory. One of the most common places we see gRPC streams in the wild is when clients connect to asynchronous message buses like Google Pub/Sub.

The Next Evolution of AI: Forget Smarter Models - It's All About the Data

It’s been a noisy summer in the AI world. Headlines have been filled with doom and gloom: For example, OpenAI’s ChatGPT-5 landing with a thud, and an MIT report claiming 95% of AI pilots are failing. For the sceptics, this is “proof” that AI is just hype. I don’t buy it. The MIT study looked at just 50 projects, a sample size so small you’d fail a basic stats exam for using it. And as someone who uses AI every single day, I can tell you the benefits are real.

10 Mistakes I've Made When Migrating Databases to the Cloud

John Q Martin, Technology & Alliances Partner Manager at Redgate, covers 10 mistakes he's made when migrating databases to the cloud - and the strategies he now implements to avoid them happening again. But first, John explains exactly why databases are the hardest part of cloud migration.

Ubuntu Lead Dev Answers Your Comments About 'sudo-rs'

Ubuntu will be the first major Linux distribution to adopt sudo-rs, a Rust-based reimplementation of sudo. That announcement sparked plenty of discussion across our social channels. In this episode, Jon, VP Engineering and Ubuntu Lead Developer, answers your comments and questions from our socials. He shares the thinking behind the transition, why it matters for security and sustainability, and what it means for everyday Ubuntu users. Along the way, he adds context and reflects on how these changes shape our community.

Speed vs Security? In DevSecOps, You Can Have Both

Speed vs security has long been treated as an impossible choice: move fast and risk instability, or stay safe and fall behind. For DevOps, DevSecOps, and Governance, Risk, and Compliance (GRC) leaders, that tension often plays out between the demand to ship updates quickly and the need to maintain airtight security and compliance.

Reliability results require visibility & accountability

Reliability doesn’t just happen if you build a good tool. It takes visibility and accountability to get results. FULL TRANSCRIPT:  One of the things I've observed over the last 10 years in the software engineering culture is this idea of kind of Field of Dreams DevOps. If you build it, they will come. And there's a lot of this in the developer tool space in particular. "Hey, if we just build a great tool and we make it easy to use, engineers will use it because they want to do the right thing.".

Implement an enterprise-ready data lakehouse architecture with Spark and Kyuubi

Here at Canonical we are excited to announce that we have shipped the first release of our solution for enterprise-ready data lakehouses, built on the combination of Apache Spark and Apache Kyuubi. Using our Charmed Apache Kyuubi in integration with Spark, you can deliver a robust, production-level, and open source data lakehouse. Our Apache Kyuubi charm integrates tightly as part of the Charmed Apache Spark bundle, providing a single and simpler-to-use SQL interface to big data analytics enthusiasts.

The Hidden Costs of VMware

VMware. A name synonymous with virtualization, a powerhouse in the infrastructure space and seemingly the enterprise standard for those managing VMs, to put it simply. But if you haven't noticed yet, behind that familiar name is a new age of license complexity, unexpected renewals, and sharp price hikes that have left many organizations scrambling. Since Broadcom acquired VMware, these challenges have only intensified, with customers blindsided by costs increasing several hundred percent overnight.

Simple Talks Podcast | S3, Episode 2 - Coffee chat with Jeff Foster

Steve Jones sits down for a chat with Jeff Foster, Redgate’s Director of Technology & Innovation, at the company’s Cambridge, UK headquarters. The main subject covered is hugely topical – AI – with Jeff explaining how he feels about AI today and how it might impact us, and our jobs, in the future. You’ll also learn a bit more about Jeff’s background and Redgate’s ways of working.

How do you choose the right Web Hosting Solution for Your Business in India?

In today's digital age, having a solid online presence is critical for company success. The often-overlooked choice of a web hosting provider is essential to this quest. The significance of this choice cannot be emphasized for enterprise decision-makers and business owners preparing to enter the online sphere. This thorough guide is designed to assist you in navigating the complicated web hosting market and ensuring that your selected solution matches smoothly with your company goals.

Why We Built CertKit

SSL Certificates have always been a pain in the butt. From the magical OpenSSL incantations to generate a CSR to the various formats that each webserver requires. Remembering what hardware needs which certificates. Managing scheduled renewals and runbooks for which file goes where. Screw anything up and your site is “Not Secure”. And now Apple wants us to do it every 47 days. Remember when we had HTTP-only websites? Or when certificates lasted three years? Then one?

CFO Reveals: 30% of Your Servers Are Wasting Money RIGHT NOW

In this exclusive interview with Ticker's business news show, Joyda Bianco, CFO of Hyperview, exposes a shocking industry secret: nearly one-third of servers in enterprise data centers are obsolete, unused, or "comatose"—consuming electricity and occupying valuable rack space while contributing absolutely nothing to business operations. That's potentially millions in wasted operational expenses and thousands of tons of unnecessary carbon emissions. But as Bianco explains, this crisis represents your single biggest quick-win for cost reduction and sustainability improvement.

54% of European enterprises want long term open source support: how Ubuntu Pro + Support delivers

Europe’s open source ecosystem is at a turning point. The Linux Foundation’s Open Source as Europe’s Strategic Advantage: Trends, Barriers, and Priorities for the European Open Source Community amid Regulatory and Geopolitical Shifts report shows organizations across the continent are broadly adopting open source software (OSS). But adoption alone doesn’t guarantee resilience, innovation, or security.

Using ISO/SAE 21434 to stay ahead of the Cyber Resilience Act

If you work in automotive, you’ve probably already heard of the CRA – the EU’s Cyber Resilience Act. It’s one of the most ambitious pieces of cybersecurity regulation in years. And while it wasn’t written specifically for cars, it’s going to impact a huge part of how software gets built, updated, and maintained across the automotive stack. So here’s the question: how do you prepare for something like the CRA?

OpenAI Pricing: The Models, Features, And Costs To Know

If your SaaS organization is experimenting with OpenAI, your cloud bill just got a new line item. And unless you know exactly what drives it, that line item can go from manageable to margin-killer, fast. It’s also worth clarifying that OpenAI is not the same as ChatGPT. ChatGPT is the familiar end-user app with a flat monthly subscription. OpenAI, meanwhile, is the platform behind it — a mix of models, features, and usage-based pricing that shifts depending on what you build.

CloudZero Is The First Cloud Cost Platform To Integrate With Anthropic

The most challenging question in AI today isn’t how to build with it. It’s whether you can prove it’s worth what you’re spending on it. Every week, I hear the same thing from engineering and finance leaders: “We know the AI bills are big.

What is APM Tracing?

APM tracing records the complete execution path of a request as it travels through your system, including database queries, external API calls, cache lookups, message queue events, and inter-service requests. Each step is captured with precise start and end timestamps, duration, and context such as service name, operation name, and relevant attributes. This lets you pinpoint where latency or errors originate without piecing together metrics and logs manually.

Building a DORA metrics Scorecard

There are a lot of ways to gauge the performance of your DevOps teams and the health of your software, but DORA metrics have emerged as the industry standard. If you aren’t familiar with DORA metrics, take a few minutes to read this comprehensive guide to understanding DORA metrics. DORA metrics were designed to offer a high-level, long-term view of how your teams are performing.

Self-Healing Networks and Sovereign AI: The Future of Global Connectivity

Bridging East and West in today’s digital landscape takes more than connectivity – it requires navigating regulations, geopolitics, and the rise of AI. In this episode of Uplink, Elena Chernykh, Head of Europe Enterprise Sales at CITIC Telecom CPC, joins Michael Reid to discuss how enterprises can thrive in a world where data sovereignty and AI demands are reshaping infrastructure.

Visualize Logs Alongside Metrics: Complete Observability for Slow MongoDB Operations

MongoDB’s strength of flexible schema and fast iteration can also hide costly queries until they surface as user-facing latency, replica lag, or spiky CPU. A handful of slow operations can impact the cache, starve other workloads, and cascade into timeouts across services. Monitoring slow queries gives you an early warning system for index gaps and query-plan regressions introduced by code deploys, schema changes, or shifting data shapes.

10 Ways to Optimize Data Center Operations

Running a data center efficiently is no small feat. From managing energy costs to preventing downtime, there's a lot that can go wrong—and a lot that can be optimized. Discover 10 actionable ways to enhance your data center operations, with practical tips on how Hyperview DCIM software can help you achieve these improvements more easily and effectively.

New in Redgate Monitor: Oracle Data Guard support

Redgate Monitor now supports Oracle Data Guard environments, giving DBAs instant visibility into replication health, lag and role transitions, so Standbys stay in sync and are ready to protect availability and data when needed. DBAs running Oracle Data Guard know that keeping replicas healthy requires constant vigilance. It often involves querying dynamic performance views, such as V$DATAGUARD_STATS, or running Data Guard Broker commands to check lag and role status.

Build and deploy a Pinecone question answering RAG application

Vector databases allow you to store, manage, and efficiently query high-dimensional vector data, which are numerical representations of data like text, images, or audio. Pinecone is a fully managed vector database optimized for fast, scalable similarity search—to power a Retrieval-Augmented Generation (RAG) system. This allows you to enhance language model responses by grounding them in relevant context retrieved from your own documents.

#048 - Shaping the Future of Software Development with Idan Gazit (GitHub Next)

Meet Idan Gazit from GitHub Next, a team responsible for projects like GitHub Copilot. Gazit, despite jokingly claiming to be "the least knowledgeable about Kubernetes," shares his diverse career journey, spanning from early web development with Perl and Django to his time at Heroku and eventually GitHub. He discusses his team's role in prototyping future software development solutions, emphasizing the importance of identifying and nurturing risky, impactful ideas for developers, even if it means "killing projects" that don't gain traction.

Why Cost Optimization Should Be More Like Pulling Levers, Not Using Scissors

The cloud, as we know it today, was created as recently as 2006. For most of its lifespan since then, companies have been throwing money at cloud services with abandon. The competitive edge gained by having the newest, best, and most powerful tools at their disposal made it worthwhile for companies to spend ever-increasing amounts without too much worry.

What is Automated Incident Response

While writing our 2024 recap, we found that teams handled over 2.2 million new incidents. Critical incidents alone tripled, increasing from 3,000 in 2023 to 9,200 in 2024. Dealing with such a large volume of incidents is not an easy task. And dealing with them manually is definitely not easy. Your valuable time goes into routine tasks like creating tickets, setting up war rooms, and notifying stakeholders. These keep you from fixing the actual problem.

AI's Impact on Developer Experience: GitLens Creator Eric Amodio on the Future of Coding

AI is reshaping how developers work, from enhanced autocomplete to agentic workflows. GitKraken CTO and GitLens creator Eric Amodio breaks down the current state of AI in development, potential risks of over-reliance, and where the industry is heading. Learn about the evolution from simple code completion to sophisticated agents, the challenges facing junior vs senior developers, and practical advice for leveraging AI tools effectively.

What is Single Pane of Glass Monitoring and How Can Enterprises Leverage It for Enhanced Visibility?

Large enterprises today grapple with increasingly complex IT environments - spanning multiple cloud services, hybrid infrastructures and countless applications. Exacerbated by technology silos, the sheer volumes of data generated in such environments can quickly overwhelm IT teams, impairing their ability to identify and respond to customer impacting issues before outages strike.

Digital Infrastructure Expertise: The Secret Sauce for Scaling AI

The past few years have seen the incredible rise of cloud-native AI start-ups, many of them born during the pandemic. These companies emerged agile, experimental, and ready to scale. But as their ambitions grow and their AI models become more complex, they face a critical crossroads: how to manage infrastructure sustainably while continuing to innovate at speed. In the early days, public cloud services were the obvious choice.

Every AI Agent Needs a Sidekick: An AI Orchestration Platform

Agentic AI has sparked a ton of excitement in IT. These intelligent agents can analyze signals, interpret requests, and recommend actions with surprising accuracy. But left on their own, they struggle to translate those insights into reliable execution. The end result is a fragmented picture of great thinking... but limited doing. This is why orchestration matters.

Netdata AI Troubleshooting is Now Generally Available with On-Demand Credits

Since launching our AI investigations and insights in a research preview, one thing has become clear: automated root cause analysis delivers a significant return on investment. Teams have confirmed that instant insights don’t just save a few minutes; they fundamentally shorten incident response cycles, free up valuable engineering hours, and reduce the business impact of downtime.

Console Connect becomes first to achieve Mplify's new LSO API certification

Console Connect has become the first platform to be certified under Mplify’s (formerly MEF) expanded Lifecycle Service Orchestration (LSO) API Certification Program. The certification recognises Console Connect’s achievement in validating four of Mplify’s key LSO Business APIs – Address Validation, Quote, Product Order, and Product Inventory – ensuring they meet strict standards for conformance and interoperability.

A Single Hub for Telemetry: OpenTelemetry Gateway

The OpenTelemetry Gateway (OTel Gateway) is a centralized service that collects, processes, and routes telemetry data—metrics, traces, and logs—across your infrastructure. In a typical setup, each service pushes telemetry directly to an observability backend. While this approach works well for small environments, it becomes increasingly difficult to manage as systems grow.

Gemini AI Pricing: What You'll Really Pay In 2025

If your team is experimenting with Google’s latest Gemini models, you’ve probably noticed the pricing can get murky. And like any cloud service, usage-based billing means those features can quickly rack up your SaaS costs. Free tiers fade fast. Token usage gets fuzzy. And before long, your GenAI bill feels more like a guessing game than a budget line item.

The Essential Guide to Azure Infrastructure, Monitoring, and Management Tools

Master Azure infrastructure management with this comprehensive guide. Learn the four critical pillars—governance, cost control, security, and operations—and discover the essential native and third-party tools needed to scale your cloud strategy effectively.

Measuring the Impact of AI Development Tools

You're adopting AI code generation tools to enhance your engineering team's output, but how do you quantify the real return on investment? Without precise measurement, you're navigating in the dark, unable to identify true productivity gains or pinpoint areas for optimization. Justifying these critical AI investments becomes difficult.