%term

Chaos Engineering: The Path to Reliability - Kolton Andrus

Oct 15, 2020 By Gremlin In Gremlin

We’re all here for the same purpose: to ensure the systems we build operate reliably. This is a difficult task, one that must balance people, process and technology during difficult conditions. We operate with incomplete information, assessing risks and dealing with emerging issues. We’ve found Chaos Engineering to be a valuable tool in addressing these concerns. Learn from real world examples what works, what doesn’t, and what the future holds.

View Video

Gremlin

Read more about Chaos Engineering: The Path to Reliability - Kolton Andrus

Identifying Hidden Dependencies - Liz Fong Jones

Oct 15, 2020 By Gremlin In Gremlin

You don't need to write automation or deploy on Kubernetes to gain benefits from resilience engineering! Learn how Honeycomb improved the reliability of our Zookeeper, Kafka, and stateful storage systems through terminating nodes on purpose. We'll discuss the initial manual experiments we ran, the bugs in our automatic replacement tools we uncovered, and what steps we needed to progress towards continuously running the experiments. Today, no node at Honeycomb lives longer than 12 months, and we automatically recycle nodes every week.

View Video

Gremlin

Read more about Identifying Hidden Dependencies - Liz Fong Jones

Enhancing Observability in DevSecOps

Oct 15, 2020 By Sumo Logic In Sumo Logic

Digital transformation often accelerates innovation at the expense of creating an intelligence gap with massive amounts of unanalyzed data. This is where Continuous Intelligence comes into play. Join Sumo Logic’s Systems Engineer, Suresh Govindachetty, as he demonstrates how Continuous Intelligence helps find and solve information gaps, and how a single platform approach allows organisations to combine devs, operations, and security in ways that ease the burden for all teams across the organisation.

View Video

Sumo Logic

Read more about Enhancing Observability in DevSecOps

Lessons from Incident Management and Postmortems at Atlassian - Jim Severino

Oct 15, 2020 By Gremlin In Gremlin

How do you run incidents and postmortems at a company with thousands of engineers spread across the globe? Jim Severino shares what worked (and didn't worked) for Atlassian.

View Video

Gremlin

Read more about Lessons from Incident Management and Postmortems at Atlassian - Jim Severino

Looking back on Chaos Conf 2020

Oct 15, 2020 By Andre Newman In Gremlin

It’s already been a week since we closed our third annual Chaos Conf! While we were forced to take the conference online, this meant that more of you could join us. Over 3,500 people signed up to help make this the world’s largest Chaos Engineering conference. That’s 5x more than 2019, and nearly 10x more than 2018! This is a testament to the growth of Chaos Engineering as a practice across many different industries and around the world.

Read Post

Gremlin

Read more about Looking back on Chaos Conf 2020

Monitoring in Pandora FMS with server plugins

Oct 15, 2020 By Pandora FMS In Pandora FMS

Do you know how to add a server plug-in? Find out in this video and take advantage of your monitoring.

View Video

Pandora FMS

Monitoring

Read more about Monitoring in Pandora FMS with server plugins

The Jfrog Platform - End-to-end Devops Solution

Oct 15, 2020 By JFrog In JFrog

Manage your DevOps pipeline from a single pane of glass. The JFrog Platform provides a universal end-to-end solution that integrates with your ecosystem to orchestrate and optimize all key processes in your CI / CD pipeline. Deliver fearless updates from code to the edge in self-managed, on-prem, hybrid, and multi-cloud environments.

View Video

JFrog

CI CD
DevOps

Read more about The Jfrog Platform - End-to-end Devops Solution

Incident Ready: How to Chaos Engineer Your Incident Response Process | FireHydrant

Oct 15, 2020 By FireHydrant In FireHydrant

We’re pretty sure using a real incident to test a new response process is not the best idea. So, how do you test your process ahead of time? In this video, FireHydrant CEO, Robert Ross, will share how FireHydrant customers leverage best practices to break, mitigate, resolve, and fireproof incident processes. We’ll show you how to use chaos engineering philosophies to stress test 3 critical parts of a great process.

View Video

FireHydrant

Read more about Incident Ready: How to Chaos Engineer Your Incident Response Process | FireHydrant

Lead Times and Psychological Safety within the Five Ideals - Gene Kim

Oct 15, 2020 By Gremlin In Gremlin

The biggest challenges engineering organizations face are not technical. They’re fundamental problems with how we think and go about doing work, and the environments that we work in. In this talk, Gene Kim will share the Five Ideals and how they relate to Chaos Engineering. He’ll also show how the Five Ideals help build stronger, better performing, and ultimately more reliable companies.

View Video

Gremlin

Read more about Lead Times and Psychological Safety within the Five Ideals - Gene Kim

Everything You Need to Know About DNS Monitoring

Oct 15, 2020 By Olga Burnaeva In VirtualMetric

In order to communicate, web pages, devices and applications need a common naming system which allows them to identify each other and send information. This is particularly important when the communication takes place over the Internet because of the large number of services and websites that need to be identified. This is why the Domain Name System (DNS) is so important for businesses. It matches website pages and devices to an IP address that can be traced by other devices.

Read Post

VirtualMetric

Read more about Everything You Need to Know About DNS Monitoring

Operations | Monitoring | ITSM | DevOps | Cloud

Chaos Engineering: The Path to Reliability - Kolton Andrus

Identifying Hidden Dependencies - Liz Fong Jones

Enhancing Observability in DevSecOps

Lessons from Incident Management and Postmortems at Atlassian - Jim Severino

Looking back on Chaos Conf 2020

Monitoring in Pandora FMS with server plugins

The Jfrog Platform - End-to-end Devops Solution

Incident Ready: How to Chaos Engineer Your Incident Response Process | FireHydrant

Lead Times and Psychological Safety within the Five Ideals - Gene Kim

Everything You Need to Know About DNS Monitoring

Monthly Archive

Follow Us