Operations | Monitoring | ITSM | DevOps | Cloud

Introducing OpsRamp #OpsQ

With OpsRamp OpsQ, IT operations teams can analyze IT event streams in real-time, extract meaningful insights from events for continuous improvement, drive faster deployments and better collaboration, and reduce downtime with proactive detection. In this webinar you’ll get an overview of OpsQ, including demonstrations of features to drive greater efficiency within modern IT operational environments.

Have we discovered the secret sauce for successful offsites?

Offsite meetings can be great for getting things done. Being out of the office can clear the cobwebs, break down barriers, and lead to real breakthroughs. At BigPanda, the marketing team has started experimenting with how we run offsites, with the aim of trying to find a “secret sauce” that leads to success – maximizing both team building and task execution that we tackle in our offsites.

Unsung IT Ops and DevOps heroes are finally getting their due!

IT Ops and DevOps teams in every organization are capable of focusing on revenue-generating initiatives and projects. Unfortunately they’re held back by constant fire-fighting…which means they are reduced to supporting just the current state and existing/legacy applications and services.

Alert fatigue, part 4: alert consolidation

So far, we’ve covered alert reduction with Sensu filters and token substitution; automating triage; and remediation with check hooks and handlers (links above). In this post, I’ll cover alert consolidation via round robin subscriptions and JIT/proxy clients; aggregates; and check dependencies. These are all designed to help you cut through the “white noise” and focus on what’s important (especially in the middle of a major incident).

Monitoring that Monitors the Monitors of the Monitors

One way to break the cycle of alert fatigue is by improving the quality of the signals you monitor. That can mean greater resolution at which monitoring data is ingested and processed, smarter statistical methods for aggregating and correlating data across multiple services, or routing alerts through an escalation and incident management system.

This IS NOT Fine: Putting Out (Code) Fires

So the dumpster is on fire. Again. The site’s down. Your boss’s face is an ever-deepening purple. And you begin debating whether you should join the #incident channel or call an ambulance to deal with his impending stroke. Firefighters have clear procedures and a strong hierarchy. The first truck at a scene immediately begins assessing the situation.

Reducing Noise with Event Intelligence

Learn how Event Intelligence, the next-gen approach to Event Management and AIOps, helps teams to cut through the noise and operate at scale. This introductory session will walk through key best practices and requirements such as reducing noise via adaptive machine learning, accelerating triage via integrating machine data with human response, and much more.

Introducing Jira Ops: Respond Faster with Atlassian + PagerDuty

Atlassian’s mission is to unleash the potential of every team. Atlassian’s newest product, Jira Ops, is built on top of Jira with a direct connection to PagerDuty to ensure teams can be successful and respond quickly when things break. This session will cover how PagerDuty and JiraOps work together to help teams respond to incidents, quickly and in real-time.