Operations | Monitoring | ITSM | DevOps | Cloud

What is the SPACE developer productivity framework?

The SPACE framework is an acronym for an approach to measuring, understanding, and optimizing engineering productivity. Outlined by researchers from GitHub, Microsoft, and the University of Victoria, it encourages leaders to look at productivity holistically, placing leading metrics in context with each other and linking them to output, often team goals, rather than individual effort. The SPACE framework breaks productivity into five metrics.

10 FinOps Diagrams To Help You Better Understand The Value Of FinOps

By now FinOps has proven its value — so much so that the web is full of helpful in-depth guides, broad overviews, bite-sized articles, and resource lists on the subject. But if you're preparing to pitch a FinOps program at your company, you need to convey only the most important points in a way that's both impactful and succinct. That's when it's time to call in the visuals.

Scaling Monitoring Administration with Experience-Driven NetOps: AppNeta and DX NetOps

Today, pretty much every critical business service, every critical employee job function, every critical customer transaction, and so much more are all reliant upon network connectivity. It falls to network operations (NetOps) teams to ensure network connections continue to support these demands. Over time, the scale and the complexity of the networks the organization relies upon have continued to grow, making the job of NetOps teams increasingly challenging.

Managing Users and User Groups: A Guide to OKTA and Cloudsmith Integration

Explore Cloudsmith’s powerful OKTA integration for user and user group management. Dive into the benefits, security considerations, and best practices to optimize user access, streamline workflows, and bolster security in your software operations. User management is the backbone of secure and efficient software operations. As businesses grow and evolve, the tools they use must keep pace. Enter OKTA and Cloudsmith.

What is Website Maintenance: Your Ultimate Guide to Keeping Your Site Functional

Website maintenance is not that different from keeping up with the maintenance of real brick-and-mortar stores. Would you shop at a dirty store, filled with broken furniture, and selling outdated products? We didn’t think so. Website maintenance plays the same role: it makes the business inviting, makes you look professional, and engages customers.

Observability and the DORA metrics

The Accelerate State of Devops Report highlights four key metrics (known as the DORA metrics, for DevOps Research & Assessment) that distinguish high-performing software organizations: deployment frequency, lead time for changes, time-to-restore, and change fail rate. Observability can kickstart a virtuous cycle that improves all the DORA metrics.

Less is more: How Grafana Mimir queries run faster and more cost efficiently with fewer indexes

Over the past six months, we have been working on optimizing query performance in Grafana Mimir, the open source TSDB for long-term metrics storage. First, we tackled most of the out-of-memory errors in the Mimir store-gateway component by streaming results, as we discussed in a previous blog post. We also wrote about how we eliminated mmap from the store-gateway and as a result, health check timeouts largely disappeared.

A Closer Look at AlertBot's Alert Group Feature

If we start by sharing that AlertBot’s alert group feature lets you, well, alert certain groups, then you might wonder what earth-shattering revelations we have in store — such as water is wet, fire is hot, and the pain of Game of Throne’s final season will never, ever go away (seriously, whatever happened to Gendry?!). Yes, you’re right: the alert group feature IS about alerting groups of people about a site failure — but as George R.R.

The Power of Visibility: Energy Company Uncovers the True Root Cause At Last

If you work in end user computing, you’re no stranger to the irritation of mystery issues. Tickets come in weekly but no matter how many teams you talk to, or fixes you try to implement, the issues never go away. You search and search for the root cause - but can’t find it. Frustrated, you assume it’s something outside of your control. Maybe the issues is caused by home Wi-Fi or end user error. That must be it – right?