Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How sum_over_time Works in Prometheus

The sum_over_time() function in Prometheus adds up every sample value a series recorded within a specific time window. Instead of seeing point-in-time values, you get the total of all data points within your chosen range, which is useful for calculating totals from rate data, tracking accumulated errors, or understanding resource consumption patterns over custom intervals.
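As a rough illustration of the idea, the sketch below mimics in plain Python what a query such as sum_over_time(some_gauge[30s]) computes for a single series. The metric name, the sample data, and the helper function are all hypothetical, and the window boundaries are assumed to be left-open and right-closed, as in recent Prometheus releases (older releases treated boundaries differently). It is not how Prometheus itself is implemented.

```python
from typing import List, Tuple

# One sample is (unix timestamp in seconds, value). Hypothetical data model.
Sample = Tuple[float, float]


def sum_over_time(samples: List[Sample], eval_time: float, window_seconds: float) -> float:
    """Sum the values of all samples that fall inside the lookback window.

    Assumption: the window is (eval_time - window_seconds, eval_time],
    i.e. the oldest boundary is excluded and the evaluation time is included.
    An empty window yields 0 here, whereas a real query returns no result.
    """
    start = eval_time - window_seconds
    return sum(value for ts, value in samples if start < ts <= eval_time)


# Made-up samples scraped every 15 seconds.
samples = [(0.0, 1.0), (15.0, 2.0), (30.0, 4.0), (45.0, 8.0), (60.0, 16.0)]

# Evaluate at t=60 with a 30-second window: only the samples at t=45 and t=60 count.
total = sum_over_time(samples, eval_time=60.0, window_seconds=30.0)
print(total)  # 24.0
```

Because the function adds raw sample values, the result depends on how often the series is scraped: the same gauge sampled twice as often would produce roughly twice the total over the same window.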

What to expect in a Gremlin workshop

Gremlin workshops give your team hands-on training with Gremlin so they can get real results and dramatically improve your reliability. Full transcript: The goal of our workshops is really to accelerate you and the team in your reliability journey. Whether you're starting out for the first time, or you're a more advanced user, this workshop is really designed for you to take you to the next level.

From Dial-Up to Colo: The Impact of AI on Data Center Design

In this episode of Uplink, we’re joined by Jay Smith, VP of Data Center Operations and Engineering at Evocative. With nearly 30 years in the industry, Jay unpacks how data centers are adapting to support AI’s massive power and cooling demands. This episode covers: why colo is thriving in the AI era; liquid cooling and rear-door heat exchangers; powering 275kW racks and beyond; how AI inference is shifting compute to the edge; and career opportunities in infrastructure without a degree.

What is Java Performance Monitoring? [A Guide to DevOps Engineers]

You rolled out a Java application that worked fine in development. Fast, clean, no errors. However, once it went into production, things began to change. Suddenly, the app feels slow. CPU usage climbs without warning. Some users start getting timeouts. You check the dashboards, but nothing jumps out. You look through the logs, but it's mostly noise. And then the questions start coming in - "Is the JVM the problem?" If you've been in that situation, you're not alone.

Build EF Core Models Visually with Entity Developer - No More Manual Mapping!

Want to simplify your EF Core development? In this video, discover how Entity Developer, a powerful visual ORM tool from Devart, helps .NET developers design, generate, and maintain EF Core models faster and with fewer errors. Entity Developer works as both a standalone app and a Visual Studio plugin, giving you flexibility across any development environment. With support for EF Core, NHibernate, and LinqConnect, it's your all-in-one solution for ORM design in .NET.

How To Sell Cloud Cost Optimization To Your CFO

You know you’re bleeding money in the cloud. Maybe not everywhere, but enough to feel it. Your engineers know it too. You’ve got idle resources humming away, AI workloads scaling like wildfire, and nobody can quite explain why last month’s bill jumped by 17%. So, you bring up the idea of investing in a cloud cost optimization product. Cue the skeptical glance from your CFO.

Tutorial: Visualize Your Puppet Data in Grafana with the Observability Data Connector

When you manage complex IT infrastructure, it becomes critical to use tooling to understand what’s happening across all of your systems in terms of performance, reliability, and compliance. Monitoring key indicators manually is simply no longer possible at that scale. Puppet has long been known as a solution for managing large environments and collecting a vast amount of data about your infrastructure, but accessing and visualizing that data in a meaningful way can be a challenge.

Understanding GPUs for AI success: Insights from our panel discussion

This blog is based on the webinar “Panel Discussion: Understanding the importance of GPUs for AI success”; the full recording is available to watch. Last week, we hosted a panel discussion on the importance of GPUs for AI success that featured Kunal Kushwaha (Field CTO), Ben Norris (AI Engineer), and Kendall Miller (Strategic Business Development).

Is your cloud data truly sovereign? The CLOUD Act & FISA 702 reality check

As UK public sector bodies, financial institutions, and enterprises accelerate cloud adoption, a pivotal question emerges: Who truly controls your data, and under which laws? With data breaches and regulatory scrutiny intensifying, storing data and workloads in a host country alone doesn't guarantee sovereignty. U.S.

Early preview: Auto-translation in Mattermost channels

In this demo, we’re showcasing an early prototype of channel-based auto-translation in Mattermost — designed to break down language barriers in global operations. Watch how users can seamlessly read messages in their preferred language across any channel, enabling inclusive, multilingual collaboration in real time. Note: This is an early prototype demo. The capability has not yet been released in Mattermost but is on our roadmap for the near future (release date TBD).