Operations | Monitoring | ITSM | DevOps | Cloud

2019 Hurricane Season: Solidify a Business Continuity Plan With a Mass Notification Solution

Summer is typically synonymous with beach days, outdoor barbecues and fulfilling weekend getaways. Unfortunately, the summer months aren’t only about enjoyable moments and exciting vacations. It’s also tropical storm season, with higher risks of destruction, community displacement and business operation disruption. With this potential for human and business peril, it’s important for organizations to implement a business continuity plan, equipped with a robust communication strategy.

DevOps Adoption: Leading the Transformation

Digital transformation has been the catalyst for many organizations in their move to DevOps adoption, and many are finding early success in small, siloed DevOps implementations. How can those successes be realized as organizations scale their DevOps company-wide? This webinar will offer some tips to help organizations implement and manage DevOps on a larger scale, including some warning signs pointing to DevOps in crisis.

How to Monitor Website Changes

Before, to know detect a website changed its content or not, you had to manually visit the website and check by yourself. This is a thing from the past! Tools such as Hyperping can send instant alerts in case of changes. One way is making sure your API or marketing site returns the expected content. Some servers return an expected status code (200, OK) but can often return the wrong content in the response body, whether it is HTML content or JSON.

Bringing AIOps to Hybrid Cloud Monitoring and Management

Artificial intelligence for IT Operations is purpose-built to ingest large sources of data from infrastructure and point tools, and produce actionable insights on root-cause analysis and incident remediation. How do you bring these innovations to an enterprise ecosystem that’s also in the middle of cloud migration and overall digital transformation?

Best Practices for Managing Multiple On-Call Teams

Alerting has come a long way from the days of paging an on-call administrator in the middle of the night, to multiple on-call teams that run and manage incident response around the clock. This is because as organizations grow and scale, responding to incidents also gets more complex and you often need more than one team to get involved to successfully resolve an incident.

Mark Henderson from Stack Overflow shares his experience on being an SRE

Mark Henderson has been a Site Reliability Engineer at Stack Overflow since 2015. Before this he worked as the sole systems administrator at a small software company in Sydney, Australia. These days, he lives in South Australia and works from home with his wife and two children.

Serverless Event-Driven Workflows with PagerDuty and Amazon EventBridge

This week’s AWS Summit in New York was an exciting one for both AWS and PagerDuty. The AWS team rolled out Amazon EventBridge, a set of APIs for AWS CloudWatch Events that makes it easy for AWS SaaS partners to inject events for their customers to process in AWS. PagerDuty is excited to continue and deepen our long partnership with AWS by supporting EventBridge as a launch partner.

No CMDB? No problem. Not for BigPanda.

I hear it all the time when talking to future BigPanda customers; “I’m not sure BigPanda can really help me correlate all these alerts together because our CMDB is very immature.” Or sometimes, they don’t even have a CMDB, and incorrectly assume this disqualifies them from meaningful noise reduction and alert correlation. I’m happy to tell you the same thing I tell the folks who are looking at BigPanda for the first time. “No CMDB? No problem!”.

AI/ML - Are We Using It in the Right Context?

There used to be a distinct, technical separation between terms such as AI and machine learning (ML) – but only while these technologies remained largely theoretical. As soon as they became practical in the real world, and then commodifiable into products, the marketers stepped in. Widespread overuse of the terms AI/ML in marketing have managed to thoroughly confuse the meanings of these words.

Assessing the Per-Minute Cost of an Outage for YOUR Company

Software vendors and analysts love to rattle off scary numbers about how many thousands of dollars per minute or hour an infrastructure outage will cost the typical company. Those numbers can be scary indeed; for example, Gartner quotes $5,400 per minute as the cost borne by a medium to large-sized retailer. Your company, however, is most likely not identical to the “typical” company on which the numbers are based.