Operations | Monitoring | ITSM | DevOps | Cloud

Maintaining Operational Sanity Across 100+ AWS Accounts | Eric Mann / Ryan Tomac (Vacasa)

At Vacasa, AWS accounts represent the unit of isolation for distinct applications & services in our software ecosystem, providing security benefits and operational autonomy for our teams as we scale. Managing accounts at this scale requires strong DevOps practices to maintain security, operational sanity, and uniform observability across the system. In this talk, we’ll cover the benefits of such an approach, the practices that make it possible, and the important role Datadog plays.

Democratizing Delivery: Seamless Observability for Optimal Application Performance |Ekim Maurer(NS1)

When application delivery performance issues happen, observability is critical to diagnosing the problem at hand. The adage “it’s always DNS” means that observability must extend to the foundational layers of the application delivery and access networking stacks. Yet granting administrative access to core network services like DNS and DHCP may run contrary to an organization’s least-privileged access policies. In this session, attendees will learn how global internet companies and enterprises use NS1 and Datadog to provide democratized DNS observability and reach optimal application performance.

Observability for Service Organizations | Bart Scheltinga (RawWorks)

Observability is trending. Organizations that rely on cloud infrastructure and cloud applications prioritize observability initiatives to get control over their business’s applications. At the same time, we see the “gap” between the on-premises infrastructure and “non-cloud” infrastructure is becoming bigger. Examples are End User Computing (EUC) and Global networks (SD-WAN).

Metrics for Apache Kafka with Datadog and Aiven | Ryan Martin (Aiven)

Using managed services is all very well, but how do you get the data you need from the different services into Datadog so you can see it all in one place? This session will walk through the configuration for bringing your Aiven-managed Apache Kafka service metrics into your Datadog explorer. You’ll see how to filter the metrics to focus on specific topics or consumer groups, and how to use the Aiven client to create a repeatable, scriptable setup. This session is recommended for anyone living in the as-a-Service world who cares about data and is interested in using metrics to optimize their Kafka clusters.

Monitoring Open Source Success in Arduino | Silvano Cerza (Arduino)

Arduino is an open-source hardware and software company, project, and user community that designs and manufactures single-board microcontrollers and microcontroller kits for building digital devices. In the course of developing software downloaded and used by millions around the world, we have found it vitally important to be aware of the quality and performance of our software.

Adaptive Alerts: Easy, actionable alerts for noisy systems

Adaptive Alerts feature provides reliable, informative, and actionable notifications about unexpected issues in monitored applications and services. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

Rollbar Pro Tips: Launch Darkly Feature Flag

Enabling the Launch Darkly integration allows engineers to automate Feature Flag toggles based on errors captured in Rollbar. This means that if you ship a feature to users, only 1 user will see an error before Rollbar automatically toggles the feature flag for all subsequent users. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.