Operations | Monitoring | ITSM | DevOps | Cloud

Why Andy Warhol would like - and dislike - AI

In a series of blog posts about AI, I’ve been looking at how intelligent ChatGPT is, how good ChatGPT and Bing are when you employ them as a technology writer, and how the engineering team at Redgate is using GitHub Copilot to aid with writing code. Now it’s time to take a look at image creation tools, and where better to start than Andy Warhol? I like Andy Warhol.

Unraveling AWS Lambda: Exploring Scalability and Applicability

In our previous blog, we shared our firsthand experience of implementing a tracing collector API using serverless components. Drawing parallels with Amazon Prime Video’s architectural redesign, we discussed the challenges we encountered, such as cold-start delays and increased costs, which prompted us to transition to a non-serverless architecture for more efficient solutions.

IT Event Correlation: Software, Techniques and Benefits

IT event correlation is the process of analyzing IT infrastructure events and identifying relationships between them to detect problems and uncover their root cause. Using an event correlation tool can help organizations monitor their systems and applications more effectively while improving their uptime and performance.

When Third-Party Plugins Go Wild

Every single day RapidSpike detects thousands of problems with website third-party plugins that are causing revenue and customer experience issues, and 90% of them are not just affecting our users; they are affecting every user of that third party. The difference is with RapidSpike, we tell them about it. In 2018, a major e-commerce website experienced a significant performance failure due to a third-party plugin.

Introducing the new Lumigo Live Tail

As developers, we understand the immense value of having real-time access to live traces. It significantly enhances our ability to identify, debug, and troubleshoot potential issues within applications, streamlining the development and deployment process. Today, we are excited to introduce the new and improved Live Tail feature at Lumigo, which enhances your observability experience to a whole other level.

CloudZero Launches Advanced Analytics For Deeper Visibility And Savings Insights

We built CloudZero for a simple reason: Bring business fundamentals to cloud-driven organizations without stifling innovation. It sounds simple, but for years, the intrinsic complexities of the cloud and extrinsic pressures to grab SaaS market share made it near-impossible for businesses to achieve this. Until CloudZero.

The Road Ahead: 4 Ways AIOps Will Build More Resilient IT Operations

This article is the final installment in a 4-part series on leveraging artificial intelligence and machine learning (ML) for IT operations (AIOps) to provide a more efficient, reliable, agile, cost-effective, and optimized IT infrastructure. Just as our roads and highways evolve overtime to meet the demands of the travelers who use them, AIOps will continue to transform how organizations build, use, and manage their infrastructures.

Benefits and challenges of containerization for IT operations

Your IT teams are critical to improving the efficiency of your operations and ensuring long-term business scalability. But as your organization grows and demands become more complex, the challenges of managing IT operations can become difficult, especially when managing multiple applications across various server environments. Containerization has become a popular solution for some of these challenges.

How a connected ecosystem can drive growth and lower costs

It’s an exciting time to be a service provider. The shift in enterprise digital buying preferences to everything as a service has opened vast opportunities to strengthen customer relationships and create new revenue streams in such things as robotics, autonomous fleets, analytics, drone-based inspection, and remote telemetry. A connected ecosystem can help you seize these opportunities, accelerate innovation, and grow revenue while reducing the cost to serve.

Exploring distributed vs centralized incident command models

Recently in our Better Incidents Slack channel, there’s been some chatter around how people structure dedicated incident commanders at their company: distributed or centralized. The way I see it, there are two types of commanders: the temporary, distributed role — a hat that an on-call engineer or an engineering manager puts on during an incident. Then there’s the centralized, full-time role, where someone is the designated incident commander (or one of a few) for all incidents.