Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Elevating Engineering Excellence: The Imperative of Site Reliability for Every Engineer

In the ever-evolving landscape of technology, engineers are the architects of the digital world. Their expertise shapes the platforms, applications, and services that define our daily interactions with technology. Yet, in the pursuit of innovation and functionality, there's one crucial aspect that often takes a backseat—site reliability. Site reliability engineering (SRE) has emerged as a critical discipline in the realm of software development and operations.

Best practices for monitoring managed ML platforms

Machine learning (ML) platforms such as Amazon Sagemaker, Azure Machine Learning, and Google Vertex AI are fully managed services that enable data scientists and engineers to easily build, train, and deploy ML models. Common use cases for ML platforms include natural language processing (NLP) models for text analysis and chatbots, personalized recommendation systems for e-commerce web applications and streaming services, and predictive business analytics.

Densify Talks, Avoiding Sticker Shock with Gokul Naidu of SAP's SuccessFactors

On this episode of Densify Talks, we welcome Gokul Naidu, Senior Manager, Cloud Operations for SAP’s SuccessFactors Product Suite. Andrew and Gokul discuss an array of topics in the episode. The general theme of the discussion focuses on cost management and the importance of being prepared, aware, and executing the right planning in order to avoid sticker shock.

Universal Monitoring Agent: A Powerful, Flexible and Innovative Approach to Monitor Modern Apps

With the advent of microservices and cloud native, organizations are shifting how they approach software development and deployment to become more agile and respond quickly to continually evolving business needs. These changes result in fundamental transformation for IT.

Deploying AI Apps with GPUs on AWS EKS and Karpenter

As AI and machine learning workloads continue to grow in complexity and size, the need for efficient and scalable infrastructure becomes more important than ever. In this tutorial, I will show you how to deploy AI applications on AWS Elastic Kubernetes Service (EKS) with Karpenter from scratch, leveraging GPU resources for high-performance computing.

Introducing AI by Design: Principles for Responsible AI

Generative artificial intelligence (AI) represents a new frontier for transformative productivity. With over 300,000 customers worldwide harnessing our data-driven solutions, SolarWinds is well-positioned to leverage AI to enrich the lives of the IT professionals we serve. But for this exciting technology to yield real value over time, it’s crucial to build it sustainably.

From the Edge to the Cloud - How HPE and OpsRamp Can Help Power & Manage Your Hybrid IT Estate

You may have heard the phrase “from the edge to the cloud” but what does it really mean, how can your organization take advantage of it, and how can HPE and OpsRamp, a Hewlett Packard Enterprise company, help? Edge to cloud refers to the fact that enterprise data is no longer confined to the traditional data center. It is being generated and processed at the edge in ever-increasing amounts, then stored in the cloud, and used by an increasingly distributed global workforce.