
October 2023

2023 State of DevOps Report Takeaways

Don: The debate is over. How should you structure your software teams? That question is now answered in the 2023 State of DevOps Report. Other questions answered include: How does AI affect my company and team performance? How can we quantify the impact of culture on performance and burnout? What even is culture in the first place? All of this is covered in the report. We have a very special guest, Eric Maxwell from the DORA group, to offer his takes on it.

Webinar: Streamlining Incident Management With Automation and Contextual Awareness

With distributed teams and complex digital infrastructure, a major incident that affects multiple teams and services can trigger a barrage of alerts. A carefully designed incident response strategy helps restore order, but responders also need effective tools that give them context and help them identify which alerts are actionable.

A Brief History of BGP Incidents

Kentik internet analysis expert Doug Madory discusses the most notable BGP incidents in the history of the internet, from traffic-disrupting leaks to recent crypto-stealing hijacks. Stretching back to the AS7007 leak of 1997, this webinar takes a historical perspective to explore two questions: what progress has been made, and what is the path to finally securing BGP?

Fighting DDoS at the Source

For decades, the scourge of distributed denial of service (DDoS) attacks has plagued the internet. Join Doug Madory, Director of Internet Analysis at Kentik, and Aaron Weintraub, Principal Engineer at Cogent Communications, as they explain how organizations can identify customer networks sending the spoofed traffic that leads to DDoS attacks.

What Should Your System Outage Notifications Say?

System outages: they are an inevitable problem that every single IT team will encounter at some point. Whether they come about due to technical issues, act-of-god natural disasters, or simply random human error, system outages happen to the best of us. Though the cause of system outages is not always in your control, you can control your team’s processes for response and resolution.

Set up Microsoft Teams alerts when a website changes

Website monitoring has grown in importance over the past decade for individuals and businesses all around the globe – and for different purposes. It became even more important in 2020 during the COVID-19 pandemic. As travel, events, and offices around the globe shut down rapidly, people relied on different tools and features to be kept up to speed regarding the ongoing situation.

Continuous profiling: The key to more efficient and cost-effective applications

Recently, Elastic Universal Profiling™ became generally available. It is the part of our Observability solution that allows users to do whole-system continuous profiling in production environments. If you're not familiar with continuous profiling, you are probably wondering what Universal Profiling is and why you should care. That's what we will address in this post.

Use Datadog Dynamic Instrumentation to add application logs without redeploying

Modern distributed applications are composed of potentially hundreds of disparate services, all containing code from different internal development teams as well as from third-party libraries and frameworks with limited external visibility. Instrumenting your code is essential for ensuring the operational excellence of all these different services. However, keeping your instrumentation up to date can be challenging when new issues arise outside the scope of your existing logs.