Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Mattermost Playbooks How-to: Release Management

Releasing software to users has become a sophisticated and intricate process that requires high levels of consistency and coordination. A release has to be built, brought together, documented, tested and deployed, which requires coordination of at least four separate teams and a generous handful of pipelines and other tools. Without a well-documented process things can get messy very quickly, causing stress for everyone involved.

Mattermost Playbooks How-to: Incident Resolution

Whether you’re part of a team managing SaaS products or a high-security digital workspace, sometimes Things Go Wrong and must be addressed with extreme care, professionalism, and predictability. For outages, data breaches, vulnerabilities and more, you and your team are juggling a variety of tools, processes, and rigid incident management systems. When the on-call pager goes off at 3 am almost no one has the ability to remember every step needed to kick off all the response workflows.

Site Reliability Engineering (SRE) Survey Now Open for 2022 - Calling All Reliability Practitioners and Leaders

In its fifth year, Catchpoint sponsors The SRE Survey, in partnership with Blameless, to uncover new trends and challenges for teams focused on advancing the reliability of digital products.

Annual Study: Hybrid IT Acceleration Has Increased Network Complexity and Lowered Tech Pros' Network Management Confidence

SolarWinds IT Trends Report 2022-Getting IT Right: Managing Hybrid IT Complexity examines the current state and areas of opportunity for technology professionals managing increased complexity as hybrid IT accelerates. The continued shift to hybrid IT drives increased levels of IT management complexity, but tech pros feel a lack of confidence in how to best manage it. Nearly half (44%) of tech pros said their organization manages hybrid IT complexity through training staff and adopting IT monitoring/management tools (37%)

Monitor and diagnose network performance issues with SNMP Traps

Monitoring your on-premise or hybrid infrastructure means keeping track of potentially thousands of devices, any one of which could be a point of failure. Additionally, silos between application and network teams can create visibility gaps that complicate troubleshooting. For network engineers investigating bottlenecks, being able to view real-time infrastructure health and performance data alongside application metrics is essential for ensuring their organizations meet key SLOs.

Podcast: Break Things on Purpose | Developer Advocacy and Innersource with Aaron Clark

In this episode, Jason chats with Aaron Clark, Director of Developer Advocacy at the Royal Bank of Canada. Aaron shares what it was like starting out as a developer at RBC and working in early cloud development, and then transitioning to his role as a developer advocate. Jason and Aaron talk about the value applying open source principles within organizations, or “innersource.” Their time ends with a discussion on continuing education and how to keep learning.

GitKraken Client v8.6 - Faster Git LFS and beyond!

We know that everyone’s code story may be a little different, but speedier repos are something everyone can get behind. No matter where your developer adventures take you, it is important to keep all your code, configuration, and media assets together, and never leave a file behind. That is why we have been working on a lot of performance improvements for Git LFS users and have added Bitbucket Workspace support for Bitbucket Server users!

Puppet and Government: Maintaining compliance in complex hybrid cloud environments

This blog is the third in a four-part series about how Puppet can help government agencies meet compliance and security requirements. Read the second post here. Government agency IT departments know that migrating applications to the cloud can improve efficiency, increase visibility, and reduce costs. They also recognize the value in keeping some operation resources on-premises.

Netdata Agent release v1.35

The latest Netdata Agent release v1.35 introduces massive improvements for the machine learning-powered Anomaly Advisor, Metric Correlations, Kubernetes monitoring, and much more. Anomaly Advisor & on-device Machine Learning This release features a launch of the flagship machine learning (ML) assisted troubleshooting Anomaly Advisor. Unsupervised ML models are trained for every metric, at the edge, on your devices, enabling real-time anomaly detection across all your systems and applications.