Operations | Monitoring | ITSM | DevOps | Cloud

8 Simple Ways to Enhance Your Data Center Busway Deployment

Modern data centers and labs are growing in density and complexity as IT devices require more and more power to racks. As such, many data center and lab managers deploy overhead busways for their reduced installation costs and better flexibility, scalability, reliability, and aesthetics. However, without a software solution to monitor and manage your busways, you likely aren’t getting the full value out of them.

What causes repeated account lockouts and how to resolve them

A stringent account lockout policy is vital to derail password guessing and brute-force attacks but it also runs the risk of locking out legitimate users costing businesses valuable time, money, and effort. With password reset requests accounting for almost 30% of the total IT help desk tickets, resolving frequent account lockouts becomes an indispensable part of a sysadmin’s job.

Build a modern data compliance strategy with Datadog's Sensitive Data Scanner

Within distributed applications, data moves across many loosely connected endpoints, microservices, and teams, making it difficult to know when services are storing—or inadvertently leaking—sensitive data. This is especially true for governance, risk management, and compliance (GRC) or other security teams working for enterprises in highly regulated industries, such as healthcare, banking, insurance, and financial services.

Ecommerce monitoring strategy for Black Friday and Cyber Monday 2021

It’s nearly here. The annual mad rush at the wee hours of the morning. The stampede into retail stores to claim really deep discounts on the latest toys, electronics, and gadgets makes headline news every year. It begins the day after Thanksgiving and is usually two of the biggest shopping days of the year. Yes, we’re talking about Black Friday and Cyber Monday.

What can you learn from IoT with i2M - Part 3

In the last 2 installments (Part 1 & Part 2), we discussed the basics of IoT and an example of how the components can be connected and used to provide basic automation and alerting. These seemingly simple steps can build up to provide very advanced controls of all aspects of the physical world. The challenge can become managing situations that were not expected.

Podcast: Break Things on Purpose | Gustavo Franco, Senior Engineering Manager at VMWare

In this episode Jason is joined by Gustavo Franco, Senior Engineering Manager at VMWare, to chat about chaos in the Gustavo’s early days. Gustavo reflects on Googles early disaster recovery practices, to the contemporary SRE movement.

Top 5 Challenges of Monitoring Complex IT Infrastructures

When monitoring a large IT infrastructure, there are multiple aspects you need to keep under control. Doing things manually and relying on people to ensure the infrastructure reliability can be a wrong decision and mislead you when resolving issues or troubleshooting problems. All these complexities faced while managing a large ecosystem can seem hard to overcome, but in reality, they can be handled.

NGINX Monitoring: Best Tools and Key Metrics You Should Know About

NGINX is a popular web server featuring a wide range of capabilities, including reverse proxy, mail proxy, HTTP cache, and load balancing. It offers TLS offloading and a health check of the backends and supports gRPC, WebSocket, and HTTP/2. In short, NGINX is a one-stop solution for most of your web server needs. When using NGINX, monitoring its metrics is crucial for tackling issues.

How they SRE: Insights from the Cloudflare SRE team

Cloudflare is a global cloud services provider that is based all over the globe, from San Francisco, US to London, England to Sydney, Australia. Their mission, as stated front and center on their homepage, is to help build a better Internet. While that may read like hyperbole, their numbers are impressive - Cloudflare has over 126,000 paying customers and 95% of Internet Users in the developed world are within 50ms of their network.