Operations | Monitoring | ITSM | DevOps | Cloud

Blog

What Slack Downtime Costs, and What We Can Do About It

This morning, though, all of our backlogs were a little harder to sift through thanks to a Slack outage in Europe and the US. To calm down, some of us might have turned to our Google Home or Chromecast to unwind while the outage hours piled up, only to find those were down too! What a morning!Now that Slack is running again, let’s take a moment to reflect on what the outage means and what we can learn from it.

Metrics At Scale: How to Scale and Manage Millions of Metrics (Part 2)

With businesses collecting millions of metrics, let’s look at how they can efficiently scale and deal with these amounts. As covered in the previous article (A Spike in Sales Is Not Always Good News), analyzing millions of metrics for changes may result in alert storms, notifying users about EVERY change, not just the most significant ones. To bring order to this situation, Anodot groups correlated anomalies together, in a unified alert.

6 Things You Need in an IT Incident Management Platform

Your incident management process is greatly impacted by the tools you have available. And technology is key when it comes to gaining visibility and obtaining contextual data. You need tools to send alerts when incidents arise, as well as track activity for compliance reporting purposes. Whether you’re in healthcare, information technology or work at a small MSP – you need a robust incident management platform that gives you results and helps mitigate MTTR.

Don't Worry About Your (Con)figure, Have The PI !

Congratulations! You have Foglight installed and it is collecting meaningful performance data. There’s no doubt it is providing a ‘smorgasbord’ of actionable information about your mission-critical environment. Help yourself to delicious servings of baseline data, proactive alerts, and custom dashboards and reports. Now make room for dessert! Foglight is probably best known for its generous helping of PI. PI is an abbreviation for Performance Investigator.

Monitor Azure Kubernetes Service with Datadog

Microsoft recently released Azure Kubernetes Service (AKS), a managed service that helps you deploy, run, and scale Kubernetes clusters. We’re pleased to share that Datadog’s integrations with Kubernetes and Azure Monitor can give you comprehensive visibility into your AKS infrastructure with no additional configuration.

DevOps Redemption: Don't Let Outdated Data Analytics Tools Slow You Down

You know what’s not fun for DevOps engineers? Manually investigating and troubleshooting issues within their applications. It’s also no longer feasible in today’s highly complex and fast moving IT landscape. Gone are the days of using legacy on-premises tools for modern applications and infrastructures because they simply aren’t compatible.

How Bitbucket Data Center's largest customers scale with Git

Supporting a growing software team is a daunting challenge, and Git is often at the heart of that task. Ensuring developers can effectively collaborate requires user provisioning, tool permissions, and enough horsepower to support all of the load. If you support a distributed team, the factors become more complex. How do you ensure developers have a consistent experience across geographies and help productivity flourish?