Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

Elastic: Operationalizing machine learning for SIEM

Unsupervised machine learning (ML) is a core capability for most security operations teams looking to implement an advanced threat detection or insider threat program. However, the deployment of ML can present adoption challenges for security teams. Unless they have in-house data scientists to develop and tune threat models and skilled threat hunters to investigate alerts and manually follow up on interpreting anomalous behaviors, teams may find themselves struggling to gain useful insights and operational value out of ML tools.

Elastic: App Search web crawler Fundamentals: Ingestion, tuning & user interface

Join Elastic experts and learn how to deliver great search experiences based on crawled content to your website and applications. During this workshop, you'll learn about Elastic App Search web crawler - a maintenance-free crawler that keeps content current with intuitive customization features and robust analytics that provide full control over search relevance. See how quickly you can get set up based on these three main concepts: ingestion, tuning, and user interface.

Grafana: Observability with Loki 2.0

In just over two years, the Loki open source project for logging aggregation has seen a great uptick in adoption by users benefiting from its small index, ease of use, and cost effectiveness. Enterprises like Grofers and Paytm Insider are now using Loki in both Grafana Labs' hosted offering and on premise. And with the Loki 2.0 release last October, we've made big gains for improving performance through parallelization and query optimization.

What is Chaos Engineering and How to Implement It

Chaos Engineering is one of the hottest new approaches in DevOps. Netflix first pioneered it back in 2008, and since then it’s been adopted by thousands of companies, from the biggest names in tech to small software companies. In our age of highly distributed cloud-based systems, Chaos Engineering promotes resilient system architectures by applying scientific principles. In this article, I’ll explain exactly what Chaos Engineering is and how you can make it work for your team.

Best Practices for Writing Secure Java Code

Every Java developer should follow coding standards and best practices to develop secure Java code. It is critical your code is not vulnerable to exploits or malicious attacks. In recent times, even big organizations like eBay, the CIA, and the IRS have fallen victim to vulnerabilities in their applications that have been discovered and exploited by attackers. The following guidelines provide a solid foundation for writing secure Java code and applications.

10 Ways to Implement Effective IoT Log Management

The Internet of Things (IoT) has quickly become a huge part of how people live, communicate and do business. All kinds of everyday things make up this network – fridges, kettles, light switches – you name it. If it’s connected to WiFi, it’s part of the Internet of Things. IoT raises significant challenges that could stand in your way of fully realizing its potential benefits.

Snooze notifications until the next workday

When a site is down, Oh Dear sends a notification every hour. Since last year, our notifications can be snoozed for a fixed amount of time (5 minutes, 1 hour, 4 hours, one day). In the evenings and weekends, you might not want to receive repeated notifications. That's why we've added a nice human touch: all notifications can now be snoozed until the start of the next workday. You can choose this new options in the snooze settings of a check.

What are MTTR, MTBF, MTTF, and MTTA? A guide to Incident Management metrics

In the present fast-moving digital world, it has become critical for businesses to measure and track their service delivery performance especially the incident management metrics that monitor the uptime of systems, downtime due to outages, and how fast and efficiently issues are resolved because even a slight glitch in the system can cause disruption in the business processes costing millions of dollars.

Getting started with Kubernetes audit logs and Falco

As Kubernetes adoption continues to grow, Kubernetes audit logs are a critical information source to incorporate in your Kubernetes security strategy. It allows security and DevOps teams to have full visibility into all events happening inside the cluster. The Kubernetes audit logging feature was introduced in Kubernetes 1.11.