Operations | Monitoring | ITSM | DevOps | Cloud

Dash 2021: Guide to Datadog's newest announcements

Today at Dash 2021, we announced new products and features that give your team even greater visibility into the health and performance of your code, databases, CI/CD pipelines, and more. Now, you can monitor network devices, get visibility into your services' golden signal metrics without touching a single line of code, and integrate third-party tools into our platform with Datadog Apps. We expanded RUM to include iOS error tracking, Session Replay, and Watchdog Insights.

Announcing the Gremlin Chaos Engineering Professional Certificate Program

There’s a reason why thousands of Engineers, Testers, and other Reliability specialists signed up for Gremlin’s first Gremlin Certified Chaos Engineering Practitioner (GCCEP) certificate program: Chaos Engineering is in high demand, and the market is looking for professionals who know how to wield it well.

Adaptive Alerts: Easy, actionable alerts for noisy systems

Adaptive Alerts feature provides reliable, informative, and actionable notifications about unexpected issues in monitored applications and services. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

Rollbar Pro Tips: Launch Darkly Feature Flag

Enabling the Launch Darkly integration allows engineers to automate Feature Flag toggles based on errors captured in Rollbar. This means that if you ship a feature to users, only 1 user will see an error before Rollbar automatically toggles the feature flag for all subsequent users. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

The Nightmare Before Business: Stay Safe with Uptime.com Status Pages

We’re nearing Halloween and mischief night has stolen tricks from the holiday season. With online sales alone expected to creep up toward $3 billion before the next crescent moon, we’re offering you a solution to keep the angry mobs with pitchforks at bay by giving them a crystal ball into your real-time incident response with Uptime.com Status Pages.

VMworld 2021: "We're Proud To Announce..."

I've never seen so much news during VMworld! It began to seem comical that every speaker at the opening "General Session" and subsequent keynotes used the line "We are proud to announce." By the way, one of the most excellent General Sessions I've ever seen in terms of tempo, delivery, and rhetoric! From October 15, you will be able to find all content on-demand here.

Zero to hero: Enterprise multi-cloud application management from Day 0 to Day 2, on any substrate

Kubernetes and Operators have stormed the IT industry, but the enterprise landscape is still a mix of bare metal, virtual machines and Kubernetes, on-prem and on public clouds, all running different workloads. How can we manage the lifecycle of all those applications across all these environments, whithout having to learn new tools and frameworks as the industry is already moving to serverless? In this fast-flowing demo we will show you how the Charmed Operator Framework brings a unified approach for application management across clouds and infrastructure generations.

How We Use Sloth to do SLO Monitoring and Alerting with Prometheus

One of the most challenging tasks for Site Reliability Engineers is to align the reliability of the systems with the business goals. There is a constant battle between delivering more features—which increases the product’s value—and keeping the system reliable and maintainable. A significant ally to achieve both objectives is the Service Level Objective Framework.

Differences between Site Reliability Engineer Vs. Software Engineer Vs. Cloud Engineer Vs. DevOps Engineer

The evolution of Software Engineering over the last decade has lead to the emergence of numerous job roles. So how different is a Software Engineer, DevOps Engineer, Site Reliability Engineer and a Cloud Engineer from each other? In this blog, we drill down and compare the differences between these roles and their functions.