Operations | Monitoring | ITSM | DevOps | Cloud

Icinga Module for AWS

Some say you have to move all of your server infrastructure into the cloud. Others counter that you should keep your data safe and secure in your own datacenter. And then there are many people in between who use cloud services as an addition to their self-hosted servers. In fact, there’s no right or wrong, because as always in IT: It depends. We at Icinga always try to find a way to make everyone happy with their monitoring – be it in the cloud or on premise.

Unplanned Work, Part 2: The Impact on the Enterprise

Today, technology problems can alter the trajectory of a business. Minutes of downtime or latency (slow is the new down) cost organizations dearly in lost revenue and can jeopardize customer relationships. However, there’s an even more important consequence of technology problems than top-line risk: reduced innovation as teams are forced into reactive fire drills that take time away from product development.

Large Diamond Mining Organization Adopts OnPage

Diamond mining is recognized as a dangerous occupation, causing serious accidents for mineworkers across the globe. Often times, these incidents turn out to be fatal because the victim didn’t receive immediate care from first responders. However, significant strides are being made to minimize the impact of these accidents by large, international organizations.

12 Reasons to Opt for Serverless Computing for Your Mid-sized Venture

Serverless has become something of a buzzword lately - but it’s much more than a trend. Its benefits have gained the attention of professionals and amateurs in the tech industry, mainly due to how much cloud vendors have raved about the design. For mid-sized ventures, the idea of serverless is exceptionally appealing, for those looking for budget-friendly, but useful pieces of technology. It’s genuinely breaking the ground and is sure to make even more moves in the tech industry.

What Enterprise IT Teams Can Learn from Google Cloud's June Outage: A Guide

The following first appeared in Cloud Tech News. In early June 2019, Google Cloud suffered a cascading set of faults that rendered multiple service regions unavailable for a number of hours. This by itself isn’t totally unprecedented; what made it significant was the way it propagated through the very software that was designed to contain it. Moreover, engineers’ initial attempts to correct the issue were thwarted by the failure of that same software architecture.

What Is Network Agility and Why Does It Matter?

In a 2019 Top Trends Transforming Network Operations survey, 34% of networking pros identified improving network agility as their top business goal for the year. The stat isn’t surprising. “Network agility” is considered the future of networking, but the term itself has become a bit of a buzzword. Everyone’s talking about it, but no one can agree on its definition. So what does network agility actually mean? We surfed the web and analyzed what (almost) everyone has to say.

Challenges of Monitoring and Troubleshooting in Kubernetes Environments

Kubernetes is great but complex! Whether to enable hybrid and multi-cloud, promote deeper specialization among development teams, enhance reliability, or simply stay ahead of the curve, organizations are reaping the varied benefits of this technology investment— but it comes at a cost. With each optimization, there are tradeoffs. With each layer of abstraction comes less visibility, resulting in more complexity when something goes wrong.

How to Prepare Your Staff for Hybrid Cloud

By Des Nnochiri Budgetary constraints are often a key factor in determining how an organization sets up its IT infrastructure. The hybrid cloud typically leads to cost savings of between 5% and 30% for enterprises that make the transition. Besides the monetary aspects, performance benefits and easier administration also inspire many organizations to consider moving to a hybrid cloud.

From Homegrown to Hosted: How The Trade Desk Migrated to a Modern Monitoring System with Grafana Cloud

When Patrick O’Brien interviewed to become a Site Reliability Engineer at The Trade Desk™, it was clear that taking the company’s monitoring system to the next level was the priority. “A chunk of my interview was about The Trade Desk’s previous monitoring system and how to scale it,” says O’Brien, who joined The Trade Desk more than two years ago. “I had a good feeling that would be an early task.”