Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The Role of an Operations Manager in Enhancing IT Department Performance

You might not always be able to tell, but operations managers are like the quiet heroes working in the background to ensure that your IT department does not just function - but flourishes. They skillfully manage resources, perfect processes and confirm technology serves its real purpose: to make tasks simpler and results superior.

How To Find All Files Containing Specific String In Linux?

The grep command in Linux searches and matches text within files. It finds files containing a specific text string. The grep command syntax is: search_pattern is the text string you want to search for, and file_or_directory is the file or directory you want to search in. Some grep command examples are.

Improving our broken link tests

One of the most unique features Oh Dear offers is the broken links and mixed content crawler. We will crawl your site for all links, reporting any broken pages to your defined notification channels. Recently, we encountered degraded performance with our crawler service due to a breaking change in an underlying library called Guzzle. This caused HTTP 505 responses on the first page of each site, blocking further crawling and resulting in false positive reports.

6 Tips to Integrate Container Orchestration and APM Tools

Application performance monitoring (APM) setup and strategies vary based on the application’s infrastructure design. Containers managed by orchestration tools like Docker Swarm or Kubernetes are dynamic and ephemeral, significantly affecting monitoring strategies. Container development speeds up an organization’s ability to build, deploy and scale new features.

Scaling Monitoring & Observability for a Software Platform with Grafana Cloud | Builder.ai | Grafana

In this talk, Utsav and James from Builder.ai discuss their journey in scaling their composable software platform. Builder.ai empowers users, from entrepreneurs to enterprises, to build and innovate without dealing with technical complexities. The focus of the talk is on their Developer Service platform and the integration of Grafana Cloud for monitoring and observability.

How to Scale Observability with Grafana, Tempo, Loki, and Prometheus | Dojo | Grafana

In this talk, Roberto, a staff engineer at Dojo, outlines the company's journey toward achieving advanced observability, which has been crucial for their reliability efforts over the past three years. Dojo, a payments provider in the UK, has focused on evolving their observability practices, initially starting with basic monitoring and progressing towards comprehensive observability, encompassing metrics, traces, and logs.

Grafana Provisioned Alerting for Effective Observability

Implementing a consistent and reliable alerting system across a sprawling organization is a significant challenge for just about any engineering team. For example, diverse infrastructures across different teams and numerous team-specific customizations may not translate well when investigating specific incidents. Inconsistent alerting practices can eventually lead to fatigue, leading to triggering of alerts that may not be relevant or actionable.