Operations | Monitoring | ITSM | DevOps | Cloud

February 2022

Top 5 Interview Questions to Ask DevOps Candidates in 2022

DevOps plays a critical role in today’s business landscape, enabling organizations to automate and innovate swiftly at a time when digital transformation projects put a premium on those capabilities. The benefits of DevOps, though, can only be relied on when related security risk mitigation is considered and embedded into DevOps processes.

Rakuten Symphony agrees to acquire leading US-based cloud technology company Robin.io to deliver highly integrated telco-cloud for mobile

Focused on reliable high performance, cost efficiency, multi-domain automation. Delivering on today's mobile operator needs in the simplest way and preparing for the next generation of 5G and Enterprise cloud-native deployments & operations. Partha Seetala to lead Rakuten Symphony's Unified Cloud business unit.

How certificates work in Puppet

This video gives you a basic introduction to certificates and explains how they’re used to secure Puppet communications. Certificates help to provide secure connections between different parts of your infrastructure as those parts communicate with each other. When you run the agent for the first time, it submits a CSR (Certificate Signing Request) to the primary server. Then the CSR is reviewed by the Puppet administrator and either accepted or denied.

Monitoring system performance metrics with Graphite

In this article, we will explain what system performance metrics are and why you need to monitor them. Then we will look at Graphite and Grafana monitoring systems, which make it easy to collect, save and visualize metrics. Finally, we will consider why you should choose MetricFire to monitor your system’s metrics. If you would like to learn more about the benefits of MetricFire, book a demo with our experts or sign up for a free trial today.

Automate Deployments to Amazon EKS with Skaffold and GitHub Actions

Creating a DevOps workflow to optimize application deployments to your Kubernetes cluster can be a complex journey. I recently demonstrated how to optimize your local K8s development workflow with Rancher Desktop and Skaffold. If you haven’t seen it yet, you can watch it by viewing the video below. You might be wondering, “What happens next?” How do you extend this solution beyond a local setup to a real-world pipeline with a remote cluster?

Technical debt: how to measure and manage it with DevOps

Every technical team in the software industry is familiar with technical debt. That is because every software team incurs technical debt along the way. This article answers some critical questions about technical debt. It reviews what technical debt is and what its causes are, why it is essential to address technical debt, and how this debt accumulates.

Five tools to increase Kubernetes developer productivity

This article was inspired by our recent "5 tools to increase Kubernetes developer productivity" video, hosted by Saiyam Pathak and Kunal Kushwaha. Over the years Kubernetes has become the de facto orchestration platform, as such it's crucial that developers have the right set of tools to increase their productivity for development and operations. In this article, we take a look at five such tools that can help developers inprove productivity while when Kubernetes. Let’s jump in.

Kubernetes for the JavaScript Developer - Part Two - Deploy to Kubernetes

Continuing on from Part One where we went through a brief history of containers and Kubernetes then Dockerized a NodeJS application, now we are ready to deploy to Kubernetes. If this is your first or nth time deploying to Kubernetes, Shipa makes this simple. You don’t have to worry about authoring multiple Kubernetes manifests and templates to deploy your application, all you need is an image.

VirtualMetric Presents: Database Monitoring

VirtualMetric provides a powerful tool to observe your database performance and health. With VirtualMetric's tool, you can monitor #Database Transactions, Database Statistics, #PerformanceMetrics and #Inventory from a single dashboard. Drill down into detailed statistics to troubleshoot your database performance and optimize it. Ensure compliance and guarantee the security of stored data with advanced visibility across datasets.

OpenTelemetry (OTel) is opening new possibilities for developers

OpenTelemetry (OTel) is emerging as the industry standard for system observability and distributed tracing across cloud-native and distributed architectures. But where do developers fit in? With OTel’s main use case focusing on production monitoring and observability, I find that many developers are still not fully familiar with OTel. Others believe it is more of a tool for DevOps/SRE.

37 minutes to deploy a fullstack app on my new AWS account

Today, I was working on our Terraform Provider, and I noticed that I never tried to deploy an application from scratch on a new and clean AWS account. Meaning, an empty AWS account - with 0 resources created. No VPC, no EC2, no Load Balancer, nothing... just an IAM user to get access to my AWS account programmatically. This post explains what I did and how it took 37 minutes and 33 seconds to literally: 😅 Let's explain all of that!

Load test WordPress + nginx on Kubernetes

Why this combination you ask? Load testing is my passion, and I am partial to Kubernetes. I challenged myself to share a use case that many could relate to, focused on a business critical application. Websites came to mind and WordPress is the world’s most popular website management system. Of course, nginx is the most popular web server so let’s throw that into the mix. And Kubernetes? With more than 50% of corporations adopting Kubernetes in 2021, what better system to run in.

Wind River Studio Supports Intel SoCs for Real-Time and AI-Driven Intelligent Systems for Aerospace and Defense Edge Applications

Wind River today announced support for new Intel Xeon D processors. Part of a multiyear effort to optimize Wind River Studio for Intel IoT system-on-chip (SoC) offerings, the combined technologies address the challenges of enabling greater compute in the space- and power-constrained rugged environments of verticals such as aerospace and defense, in order to meet the demands of edge applications.

The Best AWS Elastic Beanstalk Alternatives for 2022

AWS Elastic Beanstalk is an AWS-managed service. It is used by startups, small & mid-sized businesses for web application development. As it comes pre-configured with EC2 server and is efficient at using automatic provisioning of services and resources, handling application code and environment configurations seems easier with this PaaS solution. AWS Elastic Beanstalk is a go-to option for various startups, small & mid-sized businesses.

Advanced pipeline orchestration with the circleback pattern

With multiple teams working on many projects, having a single pipeline for your software is just not enough. These projects need to be built and integrated before they can be tested and released. So how do dev teams handle this situation? Many teams approach the problem by breaking down software into smaller parts that do less, and are easier to maintain and build. This approach has resulted in the microservices architectures that are increasingly common in our industry.

What Is AWS MAP? (And How To Get The Most Out Of Your Migration)

In 2021, the most popular vendor in the cloud infrastructure services market, Amazon Web Services (AWS), controlled 32% of the entire market. To encourage more businesses to choose AWS, Amazon started a program to help accelerate migration — the AWS Migration Acceleration Program (AWS MAP). Below we’ll cover the details of the program, the potential financial benefits for companies looking to take part, and how to make sure you get all the financial benefits available.

Kubernetes for the JavaScript Developer - Part One - Create a Docker Image

Since its introduction in 2014 to the world, Kubernetes has been helping usher in the next generation of distributed workloads. As workloads started to be containerized, so did the need to manage the containers, thus the inception of container orchestrators. There have been a few container orchestrators out there before Kubernetes such as Docker Swarm and Apache Mesos. Though as a feature developer, Kubernetes can certainly feel like an 800-pound gorilla in the room.

How to Get Started Securing Your Internal Software Supply Chain

Defining, building, and delivering a secure software supply chain is challenging for many organizations. Software builds utilize many open source components, and the vast landscape of cloud native developer and platform tools grows more extensive and more diverse every day. Developers, operators, and security teams must work together to ensure software is delivered swiftly and securely to meet business and customer desires.

Create and navigate a documentation library with Notebooks

Datadog Notebooks enable your teams to create and manage key reports and documentation as they build out, monitor, and maintain their infrastructure. Notebooks can include both text and graphs of any telemetry data you have collected in Datadog, and they support collaborative editing so that multiple team members can edit and leave comments simultaneously.

Traditional vs Modern Incident Response

An incident is an event (network outage, system failure, data breach, etc.) that can lead to loss of, or disruption to, an organization's operations, services or functions. Incident Response is an organization’s effort to detect, analyze and correct the hazards caused due to an incident. In the most common cases, when an incident response is mentioned, it usually relates to security incidents. Sometimes incident response and incident management are more or less used interchangeably.

Outsmart your Business on moving artifacts from BizTalk to Logic Apps

With the emergence of the hybrid solutions, solutions such as BizTalk Server that were purely on-premises solutions are communicating with SaaS solutions and are being deployed in a hybrid model. There are many advantages to cloud integration. Organizations can take advantage of powerful features of the Azure Platform and other cloud-based services from Microsoft, such as Service Bus, Logic Apps, Power BI, and Event Hub.

Connecting and securing your microservices in one step using EnRoute

In this meetup, we welcome Chintan, founder of Saaras Inc, and Kunal Kushwaha, developer advocate at Civo, to discuss how to connect and secure your microservices in one step. Chintan’s talk, “Connecting and Securing your Microservices in One Step using EnRoute Kubernetes Ingress API Gateway on Civo”, walks you through the architecture of EnRoute OneStep API Gateway and OneStep Configuration without YAML.

Finding a pricing model that's just right

Getting your pricing right is critical to the success of any SaaS company, but finding a model that works can be tough. Price too high, you won’t close enough deals - your business will fail. Price too low, your business model will be unsustainable - your business will fail. To add to the complication, when you’re a new startup your goals are evolving.

Cloud-Native Infrastructure Automation - The Key to 5G Success

5G has proven to be a game-changer for several businesses. Given the advancements in O-RAN contributed by cloud-native design & 5G Core, telecommunication vendors, Communication Service Providers (CSPs), and enterprises are trying to deliver an extraordinary customer experience by leveraging 5G. This also presents a massive opportunity for service providers to simplify and enhance customer experience, fortify existing revenue streams, and tap into new markets.

Deploying a React application to Netlify

React, a front-end framework for building user interfaces, uses component-based architecture and non-opinionated design principles, making it a developer favorite. React has been widely adopted and has a large community of developers behind it. Netlify is a popular framework for hosting React applications, but it does not provide your team with the highest level of control over the deployment process. As a result, you are not able to perform important tasks like running automated tests.

Customizing the JFrog Xray Horizontal Pod Autoscaler

In cloud native computing (Kubernetes in our case), there is a requirement to automatically scale the compute resources used for performing a task. The autoscaling cloud computer strategy allows to dynamically adjust the active number of application servers and allocated resources instead of responding manually in real-time to traffic surges that necessitate more resources and instances.

GitLens 12 - Visual Studio Code for the Web Support & GitLens+

GitLens users rejoice! This release introduces exciting updates, including preview support for a browser-based editing experience in VS Code online, legendary new GitLens+ features like Worktrees and Visual File History, and more! Keep reading to see what’s new in GitLens 12.

Have a Worry-Free Upgrade

The waiting can be intensely stressful. You are mid-way through a critical production upgrade during the weekend. The schedule is tight. Suddenly there is an unexpected problem you aren’t able to resolve. You need help. So, you call in a support ticket. And that’s when the waiting starts. While you’re waiting for the support team to review and get back to you, questions race through your mind: How quickly will they respond to the ticket?

Incident severity and priority 101

Severity and priority can be challenging for a company to nail. When an incident is declared, it's essential to have a system to define the impact and how urgently it should be handled. Incident severity and priority are the two knobs teams can leverage to define scope and urgency, and eventually, the appropriate process to take action. But how should we define them, and what are the differences?

Shift Left Reliability Meetup February - Retooling your toolkit

Security and reliability have a lot in common. So much in fact, that the tools used for one are often well suited for the other. The only thing you need is the right mindset. In this talk Mika Boström will go over the principles, ideas and share real world examples. You may realise you've been doing both already.

Shift Left Reliability Meetup February - Implementing reliability for a post-pandemic future

Steve Wade will talk about his experiences to date empowering developers and importantly the wider business to care about the reliability of the applications they provide to customers. He will discuss the pillars that make up reliability, along with his hypothesis and results on implementing each of them. Steve will tap into his experiences working in a range of sectors including financial services on how he made companies make changes pre-pandemic, as well as the additional challenges organisations face in the future post pandemic. Steve aims to make sure attendees leave with a toolkit of ideas as well as lessons learnt so you don't make the same assumptions he did.

Why OpenTelemetry (OTel) is a game changer for troubleshooting your applications

Microservices are powerful architectures. Yet, they are complicated ones as well. Microservices enable engineering departments to scale faster than ever, but this speed comes at the price of developer confidence. When developing microservices, it is hard for developers to understand how different services interact with each other and why a certain event occurred when and where it did.

Sponsored Post

What Is a DevOps Toolchain and How Does It Work?

Picture yourself trying to resolve a code error when you notice an additional issue outside your realm of expertise that's making matters worse. Your instinct is to get in touch with the right contact as quickly as possible to resolve the issue so that there's no further impact on the system's uptime. But what if you can't get in touch with them immediately, or don't know who to contact? Instead of trying to solve the problem without support, a DevOps toolchain could have mitigated this chain reaction from the start.

Network Management In The Age Of AI

Change is critical to growth. Especially if you’re running a business in today’s volatile market. The silver lining is that we are at the peak of innovation, moving forward from a decade filled with disruptions, catalysing transformations. Over the years, enterprise IT has evolved to play a more significant role in business. Innovation, macro-economic factors, unexpected disruptions, and other internal and external factors have caused the change.

Should Your Startup Hire a DevOps?

Software development no more emphasizes “final delivery” or deployment of a project. It is more about “continuous delivery and integration” today. The market also demands rapid delivery and updates without missing out on elements like ‘quality’ and ‘innovation.’ So, instead of building a super-robust well-tested product at once, developers focus on faster and bug-free releases to create a reliable product over time.

Getting Started With GitOps and Argo CD

Today we are going to explore getting started using Argo CD. This post is going to assume you know a bit about containers, and that you already have an empty cluster in place (or know how to create one). If any of this is unfamiliar, head over to Understanding the Basics to get a bit of practice. Before we get started, let’s talk about GitOps.

Orchestrate Spark pipelines with Airflow on Ocean for Apache Spark

Running Apache Spark applications on Kubernetes has a lot of benefits, but operating and managing Kubernetes at scale has significant challenges for data teams. With the recent addition of Ocean for Apache Spark to Spot’s suite of Kubernetes solutions, data teams have the power and flexibility of Kubernetes without the complexities. A cloud-native managed service, Ocean Spark automates cloud infrastructure and application management for Spark-on-Kubernetes.

How to set up a Private, Remote and Virtual Go Registry

The simplest way to manage and organize your Go dependencies is with a Go Repository. You need reliable, secure, consistent and efficient access to your dependencies that are shared across your team, in a central location. Including a place to set up multiple registries, that work transparently with the Go client. With the JFrog free cloud subscription, including JFrog Artifactory, Xray and Pipelines, you can set up a free local, remote and virtual Go Registry in minutes.

FAQ - Netreo Azure and AWS Monitoring Capabilities

Netreo SaaS delivers a single solution for simplifying how IT organizations optimize today’s hybrid blend of on-premises, public and private clouds that are common in complex, global enterprise infrastructures. With the upcoming release of cloud monitoring enhancements coming in March, Netreo SaaS will provide even greater, multi-cloud monitoring capabilities and extended functionality for Microsoft Azure and AWS cloud customers.

SNMP Traps: The 90's Want Their Monitoring Technology Back

How do you monitor your network? There are a myriad of technologies and tools out there, each providing different benefits and challenges. Today we are going to focus on one specific area, Simple Network Management Protocol (SNMP) Traps. That’s right, we are going narrow here, not just focusing on SNMP but on one specific portion of the protocol: namely the ability of devices that support SNMP to send alert information to collectors.

Does A Multi-Cloud Strategy Mean Compromising on Application Performance?

Based on the Teneo customers I’ve spoken with in the last 12 months, adopting a ‘‘Cloud-First’, Multi-Cloud strategy is often a top priority for Infrastructure and Operations (I&O) teams. However, many organizations have multiple clouds and cloud services mixed with physical data centers and Co-Lo (co-locations). And as a result of running multiple applications and services stitched together, user experience suffers.

How We Defined The Pricing Model of Qovery

Pricing is a complex topic matter, and there’s no one-size-fits-all approach to pricing. Few things impact growth and revenue as much as your pricing. Finding the right balance between value and revenue will make or break your business. While most founders have a clear product vision and have thought through things like their go-to-market strategy or hiring plans, surprisingly, few have an idea about what their pricing should look like.

Golang Testing Frameworks for Every Type of Test

While Go provides a testing package and a go test command, the former only offers basic testing capabilities. The package also has some drawbacks, such as missing assertions and increasing repetition with large-scale tests. As a result, several Go testing frameworks have been created to augment it. Go testing frameworks consist of tools and resources for creating and designing tests. Some of these frameworks incorporate the testing package and go test command, while others take a different approach.

Code Manager improvements reduce deployment time and effort

Over the past few months, we set out to drastically reduce the amount of time Code Manager takes to deploy code and sand down some rough edges to make it more stable and robust. In order to understand what we were able to achieve, we need a quick primer on how code is deployed to a Puppet Server in the first place. There are three parts to a Code Manager code deployment: We’ve improved each of these three parts of the Code Manager code deployment.

Bare-metal or Cloud? The cost of performance and traffic

Bare metal servers are a valuable option for all sizes of businesses, including small, medium, and startups companies. When designing infrastructure it is important to manage cost. However, the decision on whether to run your application on bare metal or cloud provider should not be driven by the size of your company but by your infrastructure needs. Each approach has its own tradeoffs and complexities, especially since it is difficult to find two clouds with the same parameters.

Scaling Argo CD Securely in 2022

Last updated 2/22/2022 Argo CD is used by some of the largest and most secure companies on earth with sensitive and very important workloads. In 2022, it’s all the more critical to make sure Argo CD is running securely within your organization. As Argo continues the process of CNCF graduation, additional security audits and improvements to project security are underway.

Why Reliability Engineering Matters: an Analysis of Amazon's Dec 2021 US-East-1 Region Outage

In the field of Chaos Theory, there’s a concept called the Synchronization of Chaos—disparate systems filled with randomness will influence the disorder in other systems when coupled together. From a theoretical perspective, these influences can be surprising. It’s difficult to understand exactly how a butterfly flapping its wings could lead to a devastating tornado. But we often see the influences of seemingly unconnected systems play out in real life.

Podcast: Break Things on Purpose | Carissa Morrow: Learning to be Resilient

Being new in tech an be intimidating! Thankfully, folks like Carissa Morrow are shining examples of how to come into tech from the ground up. Carissa began with a career shift and just started coding, went through the Boise Codeworks bootcamp, and made the jump to tech. Carissa talks about the resilience it took in her early days, and how those experiences reinforced her attitude on continually learning.

Malware Civil War - Malicious npm Packages Targeting Malware Authors

The JFrog Security research team continuously monitors popular open source software (OSS) repositories with our automated tooling to avert potential software supply chain security threats, and reports any vulnerabilities or malicious packages discovered to repository maintainers and the wider community. Most recently we disclosed 25 malicious packages in the npm repository that were picked up by our automated scanning tools.

Why Kubernetes Is Worth Learning

Learning Kubernetes (K8s) can be intimidating. There are so many great tools to increase your use of K8s, it’s confusing to know where to begin. You learned how to walk by first learning to crawl. In the same way, to effectively integrate K8s into your software infrastructure, you need to build a foundation一a foundation of knowledge where you understand the capability of K8s and how it can improve your organization’s operations.

Logic App Schema Validation: DateTime Restrictions

I recently published a post regarding BizTalk Schema Validation: DateTime Restrictions on my personal blog. Still, I was curious to see if we have the same capability in Logic Apps Standard and Consumption. And by doing that, see how compatible these two technologies are regarding XML Schemas and validations.

Solving Shared Cost Allocation With Telemetry-Driven Cost Organization

If you’ve ever been in charge of your company’s cloud cost, you likely understand how painful it can be to accurately allocate shared cloud spend. In fact, in the 2021 State of FinOps report, “dealing with shared costs” was the second most common challenge faced by FinOps. Today, I’m excited to share that we’ve taken a huge step toward alleviating that pain.

Episode 3: Mooving to... Stability: The Role of Catastrophic Failure in Software Design

In this episode of Mooving to… Stability: The Role of Catastrophic Failure in Software Design, we had the opportunity to chat with Jeff Atwood, yes that Jeff Atwood of, Coding Horror, Stack Overflow, and Discourse (Chief Happiness Officer). Jeff started writing 911 software in Boulder, Colorado for a small company, which was a crash-course in writing code for software that has real consequences. With this unique and deep perspective, B.J.

Your guide to the key steps of capacity planning and management

If you’re going to move assets to Azure – or any public cloud, you’re going to need some help. As a cloud consulting firm with a top-notch infrastructure performance monitoring application, we help enterprises navigate obstacles on the path to the cloud all the time. We’ve also felt the pain of sizing and pricing in our cloud journey, too. That’s why we created Galileo Cloud Compass (or GCC as we sometimes call it).

6 Simple Steps to an Easy Cloud Migration

If you’re going to move assets to Azure – or any public cloud, you’re going to need some help. As a cloud consulting firm with a top-notch infrastructure performance monitoring application, we help enterprises navigate obstacles on the path to the cloud all the time. We’ve also felt the pain of sizing and pricing in our cloud journey, too. That’s why we created Galileo Cloud Compass (or GCC as we sometimes call it).

Service Level Objectives: Where do we start?

Most of us have heard about SLOs and what they mean but always found it hard to start adopting them across our teams. This video is a way to demystify the journey of adoption of SLOs, with examples of how several large companies like Disney adopted them. Whether you are new to the DevOps/SRE world or an experienced developer, you will learn a fresh approach to making software more reliable!

A guide to Google Cloud Platform regions

Google Cloud is underpinned by a global network of Google Cloud Platform (GCP) regions that help bring its services closer to users and improve reliability and speed. In this blog, we look at the rollout of GCP regions and locations worldwide, and explain the benefits of using a direct connection to access them.

Starting projects at incident.io

We’re a small startup (10 people at time of writing) with big ambitions, particularly when it comes to our product. With so many things we want to do, it’s important for us to be structured the way we approach our work, without being so process-driven that we lose all the benefits of being small and nimble. As we’re still new, and the team is growing all the time, very little is set in stone.

Build and test your code with a CI pipeline

This article is a part of our DevOps blog series inspired by our DevOps bootcamp live streams available to watch on our YouTube channel. As a developer constantly working with code, it’s only natural to feel the need to test your code frequently. Testing helps detect bugs and protect against any of the same in the future.

Everything you need to know about Squadcast and Microsoft Teams Integration

Microsoft Teams is one of the most versatile tools in terms of providing collaboration and chat solutions to numerous enterprises. We at Squadcast understand how important Microsoft Teams can be for your organization. Hence, we bring you this blog on Squadcast-Microsoft Teams integration that will tell you how this integration can help in improved incident management, effective collaboration and a lot more.

5 tools to increase Kubernetes developer productivity

Over the years Kubernetes has become the de facto orchestration platform and it becomes important that developers have the right set of tools to increase their productivity for development and operations. In this talk, Saiyam and Kunal from Civo will discuss the following tools: These tools not only accelerate the development workflow but also help to debug issues faster. You can improve your productivity by 10x using these tools and speakers will be showcasing demos for each one of them. In the end, they will also talk about their recommendations for working and developing with Kubernetes.

Announcing HAProxy Data Plane API 2.5

The focus of the 2.5 version was on expanding support for HAProxy configuration keywords, and that’s where most of the effort during this release cycle was spent. We will continue doing that during the next couple of versions to gain complete feature parity with both the HAProxy configuration and Runtime API so that you can use the Data Plane API as a full-featured way to configure HAProxy.

Building and running FIPS containers on Ubuntu 18.04

Whether running on the public cloud or a private cloud, the use of containers is ingrained in today’s devops oriented workflows. Having workloads set up to run under the mandated compliance requirements is thus necessary to fully exploit the potential of containers. This article focuses on how to build and run containers that comply with the US and Canada government FIPS140-2 data protection standard.

Monitor Ubuntu Advantage FIPS configurations

In regulated environments, some machines must adhere to strict cryptography requirements designed to protect systems from being cracked, altered, or tampered with. Using cryptographic modules that are FIPS certified or compliant ensure a systems’ encryption solutions adequately protect its digital assets. FIPS validated operating systems are a prerequisite for government agencies, their partners, and those wanting to conduct business with the federal government.

Crossing K8s Monitoring and Observability Gaps With Change Intelligence

Recently we had the privilege of being named a Gartner Cool Vendor in the Monitoring and Observability category. The funny thing is, while this is definitely the closest Gartner category for our solution, we aren’t really used to thinking about Komodor as a monitoring and observability tool.

Top 13 Site Reliability Engineer (SRE) Tools

The role and responsibilities of a site reliability engineer (SRE) may vary depending on the size of the organization, and as such, so do site reliability engineer tools. For the most part, a site reliability engineer is focused on multiple tasks and projects at one time, so for most SREs, the various tools they use reflect their eve-evolving responsibilities.

How to secure your CI pipeline

Many enterprises still struggle to get security right. To protect their business, it is critical they focus on security during the entire infrastructure and application lifecycle, including continuous integration (CI). Developers are becoming more autonomous as they transition to a DevOps way of working, with more people requiring access to production systems.

Azure Communication Services - an overview

There are such cloud services that can drastically accelerate your innovation and reduce your time to market by providing you with functionalities that are simple, ready-to-use and that allow you to add a real added-value to your users. Azure Communication Services is one of those. In this article, we’ll have an overview of what Azure Communication Services (“ACS” for short) is, what functionalities it provides and what are some of the use cases in which it can be leveraged.

LogicMonitor APM is now generally available to Enterprise customers

Over the past two years, we have been on a journey to provide the tools you need in order to achieve unified, end-to-end observability in real-time across your entire business. We believe that true observability gives you the confidence to embark on your cloud and digital transformation initiatives. LM APM empowers ITOps and DevOps teams with the context they need to continue delivering quality user experiences while seamlessly correlating all of this data in one easy-to-use platform.

Efforts to Secure OSS fired up after Log4Shell

Who would have thought software could rattle the White House? But a vulnerability in Log4J, a popular open source software project, exposed critical digital infrastructure to remote code execution attacks. This prompted the US Government to engage big tech, infosec professionals, and open source organizations to come together to help secure open source software.

Robin.io Partners with Lekha Wireless and Blue Arcus to accelerate highly scalable custom carrier-grade network solutions

Highly scalable custom carrier-grade network solutions will be accelerated, thanks to a new partnership. Robin.io, along with Lekha Wireless and Blue Arcus will offer Automation and Orchestration capabilities for the disaggregated 5G market.

Designing your incident severity levels

We wrote this article in response to a question asked in our Slack Community. Click here to join hundreds of technology leaders discussing best practices for incident response! ✨ We know a thing or two about incident response. As such, we're often asked to advise when companies are designing their incident response processes. A common question is "How do you design your incident severity levels?". It's a great question given how central they are to incident response!

Network Monitoring & Management Best Practices for Beginners

Looking to build your basic IT knowledge in 2022? In this post we'll cover some general network monitoring and network management best practices that MSPs and IT departments use to support their organizations. Any discussion about IT is eventually going to lead towards network monitoring and management. All told, one of the most well-understood roles of an IT department or IT provider is that of keeping a business’ essential systems up and running.

Using gRPC with Python

Microservice is now the architecture of choice for many developers when crafting cloud-native applications. A microservices application is a collection of loosely coupled services that communicate with each other, enhancing collaboration, maintainability, scalability, and deployment. There are several options for enabling this communication between microservices. REST is the most popular among developers, sometimes used synonymously with APIs. However, gRPC can be a better alternative to REST.

Monitor your GitHub Repos with Graphite and Grafana

In this article, we will explore the main metrics of a GitHub repository and why it is important to monitor them. We will learn how to get GitHub data in a convenient format, process, then visualize it for further analysis. Finally, we will analyze the main advantages of using data monitoring tools such as Hosted Graphite and Grafana by MetricFire.

Some Useful Linux Networking Commands

One of the most well-known and commonly used open-source operating systems is Linux. It's a clone of UNIX, however unlike UNIX, it's free to download and use. Despite being an open-source operating system, it has a safe and resilient architecture, and many individuals and companies rely on it. You can, interestingly, make your own Linux version. You can accomplish so by simply downloading Linux and making the necessary adjustments. In essence, we can distinguish them as different Linux versions.

Why and How SREs Can Benefit from Feature Flags

When you think of who uses feature flags, your mind most likely goes to developers. In general, feature flags are closely associated with software engineering. But Site Reliability Engineers, too, can benefit from feature flags. SREs may not be the ones to create feature flags, but they should work closely with developers to ensure that the applications their teams support include feature flags.

An easier way to create runbooks

Runbooks have been a game changer for many incident response teams, and we just made it easier for you to get up and running with them. Runbooks reduce toil for responders and ensure consistency in your incident management processes.In the thick of trying to resolve an issue, remembering things like emailing customers is likely the last thing on responders minds but yet forgetting to do so can be detrimental.

An overview of OpenStack storage

OpenStack storage is probably one of the most complex topics in OpenStack architecture right after networking. There are many different storage options, at least a few storage services, and tons of supported storage backends. It is very easy to get lost. But do not worry, there is hope. Since OpenStack was initially created as an open-source implementation of the Amazon Web Service Elastic Compute Cloud (AWS EC2), its storage architecture is quite similar to leading public clouds.

8 Ways to Ensure a Greener Data Center

After becoming increasingly popular in the mid-2000s, cloud computing has revolutionized the data center industry. Aside from providing cost-savings, security, mobility, and scalability benefits, cloud computing was also expected to be more environmentally friendly than other data storing and processing methods. By being highly efficient, cloud operators can reduce the use of electricity and other materials that typically increase a data center’s carbon footprint.

Announcing the General Availability of VMware Tanzu Kubernetes Grid 1.5

In a world where organizations are often defined by the digital services they can deliver, it’s crucial for underlying IT infrastructure to move as quickly as the business demands. To support our customers with getting the most out of a Kubernetes powered environment, we continue to make enhancements to VMware Tanzu Kubernetes Grid. In this post we’ll discuss some of the new capabilities our customers will benefit from using in Tanzu Kubernetes Grid 1.5.

Better Way To Write Async Function in Node/Express/Next - Handle catch(err) Only Once.

Avoid Writing a Lot of Try Catch by Catching The ‘catch()’ Just Once. How annoying it is to write a lot of try-catch for each async function in an express app? What if you never need to write a try catch block for all async functions and still be able to handle the errors?

Qoddi vs Heroku and AWS: what to choose as a Startup

You have many options when it comes to choosing a platform to deploy your app. In this article, we will compare AWS, Heroku, and Qoddi. Heroku is hosted on AWS, making Heroku, like Qovery for instance, nothing more than a management platform for AWS services. You can do everything you do on Heroku (or Qovery) directly with AWS for a lesser price but Heroku removes all the infrastructure management layer (or DevOps) you still need to have with AWS or any other cloud providers.

Dedicated hosts for macOS are now available

Dedicated hosts for macOS are now available on CircleCI. This new layer of support is built exclusively for macOS and offers Apple developers unprecedented storage, security, and scalability on CircleCI. By reserving a dedicated host, teams can unlock access to a bare metal instance that provides exclusive access to an entire host machine for 24 hours.

Why you don't just buy tools to make DevOps happen

Matt Gordon is a Microsoft Data Platform MVP and the Director of Data and Infrastructure at Rev.io. The sophisticated billing-as-a-service (BaaS) platform provides online billing software to communications companies, IoT businesses, and technology service providers. Matt’s role, first and foremost, involves keeping the lights on while at the same time overseeing many data-related projects, from architecture to performance tuning and DevOps implementation.

How To Detect and Prevent Zero-Day Vulnerabilities With Smart Infrastructure Monitoring Tool

“End of life, end of support, pandemic-induced shipping delays and remote work, scanning failures: It’s a recipe for a patching nightmare.”, federal cybersecurity CTO Matt Keller says. Ensuring a high level of security for your IT infrastructure and being sure you have not missed something is hard to arrange during these days. A zero-day exploit happens when hackers identify a software weakness or a security gap and take advantage of it to perform a cyberattack.

Should you use Kubernetes for your Startup?

Developers love containers for their portability and flexibility. Suitable for today’s cloud-native environment and agile development requirements, containers make it super-fast to develop, test, and run applications. Besides this, they are lightweight and can optimize the platform (and the host OS) they are deployed on. Now, for powerful applications with hundreds of containers, the platform should also be portable, flexible, extensible, and efficient.

Get Started with ChatOps with "7 Steps to ChatOps for Enterprise Teams"

The right tools enable your team to ship amazing code quickly. But between building, deploying, testing, monitoring, and maintaining software, all those great tools can create a lot of stuff to keep track of. Luckily, there’s a solution to this problem: ChatOps. ChatOps is a collective approach to running DevOps workflows and building a collaborative team culture.

Scale Your Infrastructure with Cloud Native Technology

When business is growing rapidly, the necessity to scale the processes is obvious. If your initial infrastructure hasn’t been thought through with scalability in mind, growing your infrastructure may be quite painful. The common tactic, in this case, is to transition to cloud native architecture. In this post, we will talk about what you need to know when you’re scaling up with the cloud so that you can weigh the pros and cons and make an informed decision.

Stupid Simple Service Mesh in Kubernetes

We covered the what, when and why of Service Mesh in a previous post. Now I’d like to talk about why they are critical in Kubernetes. To understand the importance of using service meshes when working with microservices-based applications, let’s start with a story. Suppose that you are working on a big microservices-based banking application, where any mistake can have serious impacts. One day the development team receives a feature request to add a rating functionality to the application.

Xray: New Year, New Security Features

As part of our ongoing efforts to offer you the most comprehensive and advanced SDLC protection capabilities, JFrog continues to boost the capabilities of our Xray security and compliance product. In this blog, we offer an overview of recent Xray improvements, all aimed at helping you fortify your software, reduce risk, scale security, streamline compliance and accelerate releases with confidence.

Getting Started with Skaffold for Kubernetes Deployments

Kubernetes has experienced rapid growth over the years, with a recent post from the Cloud Native Computing Foundation reporting a userbase increase of about 67% in just the past year. Kubernetes is a container orchestration platform that automates how containers are deployed, how they communicate, and how traffic is routed between them; it also scales configurations for both the containerized workloads and the underlying infrastructure that comprises the cluster.

Cloud Complexity - Bringing Resources together in Multi-cloud Environments

The world is still getting used to operating within the cloud. Moving to the cloud is challenging for many organizations. So why do we see a rise in the adoption of multicloud strategies? In this blog, we will explore why this trend is worth considering for your organization, as well as look at the challenges that it brings.

How Load Balancing Improves the Performance of Your Applications

Load balancing is an indispensable technique for improving a website’s performance. I’ll explain why. With Firefox’s Web Developer Tools open, I visited a popular retailer’s website to see how many HTTP requests my browser made when loading the site. In this case, I counted 119 requests needed to render the landing page.

Get powerful insights across your infrastructure with new data filters

Would your organization benefit from having powerful, yet easy-to-use filters to inspect your nodes? With our latest Continuous Delivery for Puppet Enterprise release, we’ve updated the filters in the user interface to support more advanced queries. SysAdmins, developers, DataOps, and IT managers will all benefit from having access to these powerful filters.

Continuous Build and Deployment of Go Applications with Google Cloud Build

We've gone through many iterations of ways to build, deploy and distribute applications written in Go at Cloud 66. Unlike Rails, Go applications can be web applications, daemons or CLIs and therefore have different requirements. I'll share some of what we've learned with you in this post.

Tanzu Talk: "The year Kubernetes crossed the chasm" - the 2021 CNCF survey - cooking black beans

Coté looks through the most recent CNCF kubernetes and cloud native surveys, finding multi-cloud usage, kubernetes in production, and what people find difficult about kubernetes. Also, join him on BeanCam as he monitors actual, real-life black beans being cooked.

5 Best RMM Software Tools to Use in 2022

As the world has shifted towards remote work in the wake of Covid-19, the need for remote management and monitoring tools has increased. Managed service providers (MSPs) use RMMs to provide off-site, automated and streamlined support to their clients for their IT needs. Having an efficient RMM can reduce workload and give MSPs a competitive edge over adversaries. So let’s take a more detailed look at what an RMM software can do and then a look at the best RMM software tools for MSPs in 2022.

mooving To...Stability

Join seasoned veteran, Jeff Atwood (yes, that Jeff Atwood of Stack Overflow and Discourse) as he discusses the role of catastrophic failure in software design. Users of modern apps require as close to 100% uptime as possible, which also means they require quick results. When these expectations aren't met, we need to learn from them to create better design. But what if your fault tolerance design ends up being the cause of your issues? Sean Molloy, and BJ Maldonado talk with Jeff about how you can learn from failure to improve your software.

Best Practices To Build & Manage a Strong DevOps Team

Looking to build or improve your DevOps Team? We will explain the roles and responsibilities of a DevOps Team within your organization, and how to start building one. What does a DevOps Team do? A DevOps team is made of professionals from development and operations that work closely together. These cross-functional teams are responsible for orchestrating the entire software development process.

New Year, New Features in Artifactory

Let’s start 2022 off the right with new features and updates that will extend JFrog Artifactory’s power and reach in addressing challenges with managing your binaries from development to production. Join JFrog’s Irena Guy Product Manager, Evgeny Karasik Senior Product Manager, Ben Ifrach Product Manager, and Eyal Ben Moshe Development Manager, Ecosystem. In this session, you'll learn about the new updates.

What to Know About Azure SQL Database Serverless Compute Tier

Over the past several years, I've helped numerous customers migrate SQL Server workloads to Azure SQL, including Azure SQL Database, Azure SQL Managed Instance, and Azure SQL Virtual Machines. In this article, I'll explain some of the challenges of optimizing the compute cost for an Azure SQL Database deployment and review how the serverless compute tier can greatly simplify it.

CVE-2021-44521 - Exploiting Apache Cassandra User-Defined Functions for Remote Code Execution

JFrog’s Security Research team recently disclosed an RCE (remote code execution) issue in Apache Cassandra, which has been assigned to CVE-2021-44521 (CVSS 8.4). This Apache security vulnerability is easy to exploit and has the potential to wreak havoc on systems, but luckily only manifests in non-default configurations of Cassandra.

8 Issues With AWS Tags And How To Overcome Them For Good

AWS resource tagging is fundamental for effective cloud cost management. By creating and allocating cost-related tags in AWS, you can organize and manage your resources according to keys and values that make sense to you. This helps you better understand your cloud costs and manage your spending. But proper tagging isn't easy. While AWS provides several useful resources, you may still run into some issues that require more involved solutions.

How to Simplify Monitoring for Complex Network Software

Digital transformation is causing the IT ecosphere to evolve, and the evolution is being accelerated by competitive necessity. Enterprises are using digital technology to increase revenue and lower costs. Failure to compete effectively will have devastating consequences. Digital transformation requires that IT evolve from a cost center to a value creator. FinOps and DevOps are processes that include the entire enterprise in value creation.

How We Define SRE Work

At the time of writing this post, I have officially been at Honeycomb for one year as a site reliability engineer (SRE). I had shared my initial experiences and impressions in this post and thought it would make sense to check back in now that I’ve had the opportunity to spend time learning about the team, the culture, and the code base more in depth.

AIOps in 2022 and Beyond: A Conversation with Gartner

Modern digital businesses adopt AIOps tools to enable continuous insights across an IT stack. These insights tell the full story of what’s happening behind systems, allowing IT teams to achieve the operational efficiencies and high availability that lead to customer satisfaction. Old siloed monitoring disciplines provide data specific to performance of the digital experience, IT infrastructure, application or network.

Improved routing for Jira Cloud and Jira Server tickets with multi-project support

If you love Jira then you probably love customization, and we’ve made your integration with Jira Cloud and Jira Server even better with multi-project support! You can now route your incident tickets and follow-up work to remediation teams' Jira projects directly from FireHydrant, saving you valuable time and clean-up work. Let’s take a look at what has changed and some additional use cases unlocked with this integration.

DevOps State of Mind Episode 8: What do DevSecOps and Formula 1 have in common?

Josh Minthorne is the co-founder and global technology director of Axcelinno, an IT technology consultancy and professional services company that helps organizations define and implement their DevSecOps adoption and cloud migration. Today, we're talking about why the security landscape has made companies hesitant to move to the cloud and what they can do to migrate with confidence.

Auto-Scaling is now available for everyone!

Worried that your app will not have enough resources as you grow? All apps on Qoddi can scale at any time and as part of Qoddi's new interface launch early this week, we made auto-scaling available for everyone after more than 6 months in Beta. Auto-Scaling, like Auto-Heal (another feature included with all Qoddi apps), is the guardian angel of your apps and will make sure your app continues to run whatever happens.

Why Edge Computing Will Overtake the Cloud

Compared to the previous generation, today’s generation of startups are increasingly cloud-centric. The previous generation of dotcoms had to suffer the economics and complexities of deploying, managing, and scaling their own servers, networks, and data centers. In contrast, today’s generation grew up in the just-in-time, pay-for-what-you-need, and scale-up-on-demand world that is cloud native.

Adding value to applications using the software testing life cycle

Software testing is important enough to have its own phase in the software development life cycle (SDLC). The software testing life cycle (STLC) is a step-by-step process that improves the quality of software by applying rigorous planning and analysis to the testing process. Testing is a development tool that adds value to your team’s applications. Embracing testing as a vital component of software development can save you and your team a lot of time debugging and fixing errors in the future.

GitKraken Client v8.3: Now 2x Faster for Apple Silicon Users

What do Olympic speed skaters and developers have in common? They have a need…a need for speed. 😏 Nobody likes moving slow, no matter if you’re competing for the gold, or just trying to deploy an awesome new feature. The GitKraken team has been hard at work making sure all GitKraken Client users, especially macOS users, have the speediest experience possible when leveraging our legendary Git client to collaborate with teams.

Puppet Enterprise installation and self-signed Intermediate CA

This article is about how to install Puppet Enterprise using your own self-signed Intermediate CA (Certificate Authority). In some environments, regulations require you to intercept and inspect all SSL traffic to detect malicious activities that could otherwise masquerade as legitimate encrypted traffic. This requires the ability to decrypt and re-encrypt the stream in real time, which can only be done with the proper certificates installed.

Surviving the Server Chip Shortage

The global chip shortage, which began in 2020, continues as demand for semiconductor chips continues to far outpace production. Intel CEO Pat Gelsinger recently forecast shortages to be sustained through at least the remainder of 2022. As a result, IT operations teams at almost every company we’ve talked with have felt the crunch in the form of skyrocketing prices and delays of up to a year for procurement of physical servers.

Virtualized Databases: Friend or Foe to Federal IT Pros?

Federal IT pros have historically had limited time and resources; this makes it crucial to find the source of application slowdowns or outages as quickly as possible. Database virtualization can help by allowing IT pros to separate the storage and application layers within the application stack more quickly, providing easier access to root-cause issues. An estimated 70% of databases are virtualized. Yet, the benefits of virtualized databases come with additional challenges.

JFrog Discloses 3 Remote Access Trojans in PyPI

The JFrog Security research team continuously monitors popular open source software (OSS) repositories with our automated tooling to detect and avert potential software supply chain security threats. After validating the findings, the team reports any security vulnerabilities or malicious packages discovered to repository maintainers and the wider community.

Low latency Linux kernel for industrial embedded systems - Part III

Welcome to the concluding chapter of this three-part blog series on the low latency Ubuntu kernel for industrial embedded systems. Each blog is standalone and can be read independently from the others, although you may want to start at the beginning for some continuity. If you need a quick refresher on userland and kernel space, we recommend you check Part I out first.

12-Factor Containerized Microservices: Leveraging VMware Tanzu and the Best of Kubernetes

At VMware, as we talk to enterprise customers about their application deployment patterns, challenges, and future requirements, we observe a common theme. Most of them are embarking on a modern application design and deployment path by using containers and Kubernetes as foundational technologies and by implementing their applications as microservices.

GitKraken Client v8.3 Release - Now 2x Faster for Apple Silicon Users

The GitKraken Client now runs natively on Apple Silicon with faster performance and lower power consumption. Additionally, a Mac Big Sur update means that those on MacOS Big Sur or later no longer need to run custom terminal commands to improve performance. If you are on a Mac with Apple Silicon, you will need to download the new client from our website and replace your existing client, but don’t worry. All of your settings, including profiles and integrations, will remain intact.

Why we started Helios: Making production-readiness developer-friendly

They say that co-founding a startup is like marriage. In our case, it’s probably a long overdue one, since we’ve already been collaborating together for 18 years. Ever since we first shared a bunk bed in the army, our joint journey has included a number of milestones, like military service, living as flatmates, university, and working together. Just over a year ago we knew it was the right time.

The Impact of Cloud Computing on Management

The rapid growth of the marketplace and increasing competition require companies to make significant changes in the way they offer services and products. Companies that wish to remain competitive must keep an eye on technology development and adapt to any new technology if it becomes available. Each new cloud service provider in cloud computing contributes fundamentally to promoting growth and competition at the same time.

The 5 best AWS Deployment Options to Consider in 2022

When we talk about various deployment and infrastructure provisioning choices on AWS, each option serves a particular set of users and needs. Some of Amazon's most common deployment services include Elastic Beanstalk, CloudFormation, and CodeDeploy. In containerization, there are options like ECS, EKS, Fargate, etc.

HPC workloads on Robin Cloud Native Platform (CNP) using Nvidia GPU (MIG A100)

In today’s world, graphics processing units or GPUs have attracted a lot of attention as the optimal vehicle to run artificial intelligence (AI), machine learning (ML) and deep learning (DL) workloads. These workloads require massive amounts of data, both ultra-high speed and parallel processing, along with flexibility and high availability. It is clear that high-performance computing (HPC) with graphics processing unit (GPU) systems are required to support cutting-edge workloads.

The State of Robotics - January 2022

What a way of starting the year! Setting milestones, helping those in need, and daring to dream. January 2022 starts with one of the biggest technological conferences — CES. So, in this piece, you will find a breakdown of three robots in our usual style. But there’s more… we also bring a story to inspire you all. It’s a great experience writing this blog, where every month news are abundant. Thank you all for contacting us and sharing your stories.

Datadog Serverless Monitoring for Amazon API Gateway, SQS, Kinesis, and more

Many organizations leverage AWS to build fully managed, event-driven applications, which break down complex workloads into APIs, event streams, and other decentralized services in order to improve performance and scalability. This type of architecture relies primarily on AWS Lambda functions to process synchronous and asynchronous requests as they move between a workload’s resources, such as Amazon API Gateway and Amazon Kinesis.

How to take action from Datadog Apps

Engineers who support production environments are tasked with resolving new issues as quickly and efficiently as possible. But as they look to carry out these responsibilities, their remediation workflows tend to take on the following pattern: For example, someone on your team might discover in a log analysis tool that a user is flooding a key service by making an abnormal number of requests.

The three pillars of great incident response

There’s no one-size-fits-all incident response process. Depending on your organisation’s shape and size, you’ll have different requirements and priorities. But the same three pillars form the core of any good process, whether it’s for the largest e-commerce giant or a scrappy SaaS startup.

List app registrations with credentials about to expire

App registrations is a mechanism in Azure AD allowing to work with an application and its permissions. It’s an object in Azure AD that represents the application, its redirect URI (where to redirect users after they have signed in), its logout URL (where to redirect users after they’ve signed out), API access and custom application roles for managing permissions to users and apps.

Server Uptime Monitoring: What, Why, and How?

In an earlier blog post, we had discussed how server performance monitoring is not just about monitoring CPU, memory, and disk resources anymore. There is more to server performance monitoring than just three resources or metrics. That blog post covered several key performance indicators (KPIs) that IT teams must track to ensure that their servers are performing well. In this blog post, we focus on another KPI – server uptime.

AWS Cost Management: The Complete Guide To Manage AWS Costs

Most organizations migrate to the Amazon Web Services (AWS) public cloud from on-premises data centers. But others switch from another Infrastructure-as-a-Service (IaaS) platform to AWS. The most common reason we hear for choosing AWS is cost savings. As the largest public cloud provider, AWS leverages economies of scale, making its cloud services more affordable than those a company would have to build itself. Yet not every organization can claim AWS has saved it money.

ICYMI: Achieving Visibility in Your CI/CD Pipeline With Honeycomb + CircleCI

Before continuous integration came to be, setting up builds was no fun because the complexity and overhead involved in a release cycle was compounded by inflexible, manual processes. The release cycle was slow and often resulted in breaking changes. Continuous integration and continuous delivery (CI/CD) has changed much of that through pipelines that automate how we build and test software—today, we can deploy, have builds fail, and resolve any errors faster than ever.

It's not ready for production until it has an Operational Readiness Checklist

Maintaining the reliability of complex services just got easier with Operational Readiness Checklists. Service owners and engineering leaders can now evaluate and maintain the production readiness of the services their users rely on every day: spot risks in your service dependencies before they cause incidents, and respond quickly if they do. Before you put a new service into production, readiness checklists help you dot-your-is and cross-your-ts.

Low latency Linux kernel for industrial embedded systems - Part II

Welcome to Part II of this three-part blog series on adopting the low latency Ubuntu kernel for your embedded systems. In case you missed it, check out Part I for a brief intro on preemptable processes in multiuser systems and memory split into kernel and user space. The low-latency Ubuntu kernel ships with a 1000 Hz tick timer granularity (CONFIG_HZ_1000) and the maximum preemption (CONFIG_PREEMPT) available in the mainline Linux kernel.

Canonical: a world leader in remote first working

Over the last two years much of the Global workforce has experienced remote working first-hand. Sound familiar? For many, this was a ‘career first’, changing their views on the effectiveness of remote working. The desire to be office based has reduced dramatically with people wanting to avoid time-consuming commutes. In a recent survey, a staggering 91% of US workers wanted home working to persist post pandemic.

VMware Expands Cloud Foundry Investments for Tanzu Application Service

VMware continues to heavily invest in Cloud Foundry and Tanzu Application Service, VMware’s distribution of Cloud Foundry, to ensure it remains the best place to run business-critical applications. Let’s dive a little deeper to see these exciting investments in action.

This Is Not a Predictions Article! What's on the Minds of Your Peers and Tech Leaders for 2022

You have to make lots of technical, architectural, and organizational choices. Knowing what your peers, analysts, and tech leaders are thinking about can help you make decisions about where and how to invest your time, money, and energy. That’s why we’ve compiled this roundup of ideas from tech decision makers, leaders, and analysts to help you focus.

Datadog acquires CoScreen

At Datadog, we’re dedicated to building a platform that helps teams detect, troubleshoot, and resolve issues in their applications and infrastructure. We know that our customers need to be able to debug issues, explore ideas, and manage incidents efficiently, and that means having access to tools that can help them seamlessly share information and leverage the expertise of their distributed teams.

Traceroute software-the troubleshooting tool your network needs

The need for in-depth network monitoring is growing exponentially as organizations expand in size and more companies are established. Increased monitoring needs demand a feature-rich tool to simplify networks and get a clear view of their underlying infrastructure. Diagnosing network faults and ensuring a well-balanced operation of all network devices is the primary task of a network admin any day.

Serverless vs Fully Managed Services: What Are They? What is the Difference Between Them?

One of the first questions you must ask yourself when deciding to construct an application in the cloud is whether your application will be built utilizing serverless or fully managed services. To begin, let me state that these are extremely loosely defined concepts and that there may be cloud services that fall somewhere in the middle, as well as others that are both serverless and fully managed services at the same time.

[Webinar] 5 Things We Learned Not to Ignore While Scaling Kubernetes

Using Kubernetes for orchestration? Great—we hope things are running smoothly. The thing about Kubernetes, though, is that it tends to surprise you—throwing curveballs just when you think you've finally mastered the art of container management. And those curveballs usually come at you when you try to scale up. So, how can you scale K8s without striking out due to speed and reliability (not to mention sanity) issues?

10 Problematic Challenges to Expect When You Scale Your Monitoring

You started the year with one Kubernetes cluster and now you have 100. How do you deal with that? This is a reality for many SREs, and as organizations scale their monitoring to address the growing complexity of their IT environments, SREs will inevitably encounter challenges. The key is to know what challenges to expect, so you can be prepared rather than surprised.

Introduction to Ceph and Rook | Kublr Webinar

Learn how the Ceph storage system can be deployed and managed quickly and reliably using Rook, Kubernetes (K8s) and Kublr in Azure & AWS. Learn how to use Ceph in heterogeneous hybrid & multi-cloud environments to enable data replication, mirroring and disaster recovery. Learn how Ceph and Rook provide cloud-native K8s applications with block and file storage & advanced capabilities like snapshots and volume cloning.

10 Challenges to Expect When You Scale Your Monitoring

You started the year with one Kubernetes cluster and now you have 100. How do you deal with that? This is a reality for many SREs, and as organizations scale their monitoring to address the growing complexity of their IT environments, SREs will inevitably encounter challenges. The key is to know what challenges to expect, so you can be prepared rather than surprised.

Low latency Linux for industrial embedded systems - Part I

Welcome to this mini blog series on the low latency Linux kernel for industrial embedded systems! The real-time patch, which is not fully upstream yet, has had many developers wonder about stable alternatives for their projects adopting an embedded Linux operating system (OS) with latency requirements in the milliseconds’ range. The low-latency Ubuntu Linux kernel from Canonical is less costly to maintain than real-time alternatives.

4 Reasons to Get Your Whole Team Involved in User Research

As a product designer at VMware Tanzu Labs, I’m often having conversations on the value of design in product development. I was discussing design with a client stakeholder one day and made the comment that “Nobody can tell who the designer is on my team.” At first, they were a bit confused by this statement. “Aren’t the designers the ones who create the designs of the product?” they said.

Interoperability a long way off as enterprises target multicloud

Clouds remain segmented, leaving businesses little recourse for how to best navigate complexity. The multicloud movement has flavors to it. Intentional or accidental, an enterprise can blend a kaleidoscope of infrastructure services with existing software tools. IaaS requirements can tag along with SaaS adoption, easily creating a multicloud environment before technology teams can consider the sprawl.

Lighting up the Rails: A Checklist for Modernizing Railway IP Optical Networks

Ah, the romance of the railroad. Conductors shouting “all aboard”, watching the countryside whizz by, dining cars and sleeper cars, boxcars, the Orient Express, bullet trains, chasm-spanning bridges, the clickety clack of the rails, clang clang at the crossings, and the engineer waving from the caboose. As we near the two hundred year mark of the first passenger train in 1825, we can observe that railways, our oldest modern means of mass transportation, are as strong as ever.

Squadcast Earns a Spot on G2's Top 50 Best Software Awards for IT Management Products 2022

We are thrilled to announce that G2 has recognized Squadcast as a High Performer in the Incident Management space and rated us as one of the Best Software for IT Management Products. Over the last three years, G2 has acknowledged our impact in the IT Incident Management space, which led to us being recognized as a Momentum Leader in the Incident Management and IT Alerting categories. Thanks to our learnings from customer feedback, we have been able to shape our product vision and grow further.

Tagging in a monitoring tool: what is it and how can it benefit your team?

As you start to have responsibility for more than a handful of SQL Server instances, you’ll need to get more organised. Everyone around you benefits if you’ve recorded basic things like what the server does and who is responsible for it, and we think that a great place to do this is in your monitoring tool (and, better still, if that’s SQL Monitor!).

Just Launched ValidKube. Here Are 7 Other K8s Open Source Projects We Love!

I am excited to share that we’ve just launched our first open source project called ValidKube. The idea behind Validkube is to fuse together the capabilities of three other popular open-source projects (kubeval, kubectl-neat and trivy by Aqua) and present them in a single view, providing users with a way to ensure YAML code hygiene and security, all at the same time and with just a few clicks of the button.

Financial Services Customer Maintains 99.99% Uptime With LogicMonitor

In this case study video, LogicMonitor is joined by Abrigo, a software company for financial institutions, to discuss the evolution of the financial technology space through the digital transformation era. From supporting PPP loans throughout the pandemic to consolidating a plethora of monitoring tools into one platform for greater visibility and ease of use – LogicMonitor provides Abrigo with the enterprise-grade SaaS monitoring solution it needs to support its customers 24/7, around the globe.

GCISD Accelerates Digital Transformation With LogicMonitor

In this case study video, LogicMonitor is joined by Grapevine-Colleyville Independent School District to discuss the evolution of the education space through the digital transformation era. From keeping tabs on thousands of devices, consolidating a plethora of monitoring tools into one platform for greater visibility and ease of use, and leveraging AI powered alerting and forecasting, LogicMonitor provides GCISD with the enterprise-grade SaaS monitoring solution it needs to support its students 24/7, wherever they are learning.

This is Why Everyone Else is Embracing Kubernetes

Now that conferences are finally coming back, what better way to emerge from uncertainty with a strategy marked for success? If you’re wondering which technology conferences and events to attend, how about starting with containers and Kubernetes? As the leading platform technology underlying containers, Kubernetes can help you build, deploy, and manage applications faster and at scale.

Shipa Now in the Civo Marketplace

Shipa is now for the first time in the Civo Marketplace. If you are unfamiliar with Civo, Civo is a Kubernetes-based cloud provider allowing for the rapid creation of Kubernetes clusters. The engineering efficiency and developer experience that Shipa brings can supercharge your Kubernetes experience on Civo. Now you can spin up a Shipa Control Plane e.g Shipa Self-Managed with a click of a button on Civo Cloud.

Canonical and CoreSpace Announce Partnership To Offer Organizations 'One-Stop Shopping' for Private Clouds

February, 8th 2022 — Canonical, the publisher of Ubuntu, and CoreSpace, a leading Infrastructure-as-a-Service provider, announced a partnership today that makes it easier and more economical for organizations to set up, customize, and manage private clouds. Hybrid or multi-cloud environments that allow organizations to run workloads where it makes the most sense have become common in the modern enterprise.

Slash MTTR, avoid costly downtime with improved cross-team Collaboration

Every second counts when IT teams are called upon to resolve business impacting issues. In modern enterprises, poor communication, fragmented toolchains and spiralling IT complexity can conspire to slow down incident response, putting service availability and ultimately customer satisfaction in peril.

Network automation tools

“Automation applied to an inefficient operation will magnify the inefficiency.” – Bill Gates A network, as we all know, is the connection of multiple devices to share information between them. While it’s a major task to manually manage every device connected to a network, a software-based feature called network automation can be utilized to help overcome this challenge.

Podcast: Break Things on Purpose | Gunnar Grosch: From user to hero to advocate

Reliability and serverless are at the forefront of today’s conversation. For this episode Gunnar Grosch, Senior Developer Advocate at AWS, is here to talk about Chaos Engineering, AWS Serverless, and the work that AWS is doing when it comes to reliability.

Use your words: the importance of clear writing in product development

The role of an engineer at a startup is a tangled web: as well as writing code, you have to be your own product manager, QA tester, customer support and designer. But there’s another hat that you have to wear which you might not have thought about: copywriter. All products have copy, from welcome messages to text on a submit button. At incident.io, we have to put on our copywriting hats every time we add a new feature.

Auto-generate Postman Collections from traffic

Postman is a great tool for API testing during development. It’s GUI is simple to learn and ubiquitous. However, manually writing test cases for local development gets tedious fast if you have a lot of endpoints. Meticulously entering every detail for every use case takes forever. Also, if you get one HTTP Header or parameter wrong, it can take hours to diagnose. And even when it’s done, the API tests are almost immediately out of date because the API contract changes.

New Year, New Features in Xray

Let’s start 2022 off the right with new features and updates that will extend JFrog Xray’s power and reach in addressing challenges with securing your binaries from development to production. Join Sarit Tager, VP Product Security as she discusses how Xray provides intelligent supply chain security and compliance at DevOps speed. JFrog Xray is a software composition analysis (SCA) solution that scans your open source software (OSS) dependencies for security vulnerabilities and license compliance issues.

Chimera: Painless OAuth for Plugin Frameworks

Plugins can help teams unlock the full potential of Mattermost, but they aren’t always ready to go out of the box. Learn how Chimera streamlines plugin configuration via an OAuth2 Proxy. One of the best aspects of any software offered in the Cloud is the ability to start using it in just a matter of minutes. The same is true for the Mattermost Cloud offering.

How One Company Accidently Autoscaled to 200 Nodes and Crashed The App

This article is based on a true story. The names of the company and people involved were changed to protect the innocent 🙂 . A few weeks ago, we were contacted by a pretty big e-commerce company. We can’t really share their name but, for the purpose of this story, let’s call them “KubeCorp Inc”. They reached out to us following an edge-case incident they had, which resulted in severe downtime.

Predefine values of custom pipeline variables

Recently, we introduced support for default values in custom pipeline variables. Today, we're happy to announce the ability to make pipeline variables configuration more flexible with predefined values. We added a property to predefine values that can be assigned to a variable. It helps avoid errors, and improves the user experience. Instead of typing a variable value, you can choose it from a dropdown.

Continuous EC2 Spot Market Prediction for Continuous Optimization

Using spot instances for mission-critical workloads always carried the risk of interruptions, making their use, while financially attractive, less than ideal from a reliability perspective. Spot by NetApp has made it possible for cloud consumers to use spot instances for dramatic cost savings while ensuring high availability for all kinds of workloads. Spot Availability Scores are core to our cloud infrastructure offerings, which are leveraged to provide maximum availability while mitigating risks.

5 Ways Scrum Teams Can Be More Efficient

With progressive delivery, DevOps, scrum, and agile methodologies, the software delivery process has become faster and more collaborative than ever before. Scrum has emerged as a ubiquitous framework for agile collaboration, instilling some basic meetings and roles into a team and enabling them to begin iterating on product increments quickly. However, as scrum teams grow and systems become more complex, it can be difficult to maintain productivity levels in your organization.

Manage automated test data with the PractiTest orb

The software testing data provided by CI/CD tools is valuable, but it is not always comprehensive enough to give managers the insights they need to make improvements. To make effective business decisions, managers need visibility into the entire testing process, in a way that will help them understand what needs to be done and how.

Azure Container Apps - an overview

Over the years, containerization has grown in popularity among organizations and will continue to grow. For example, containerization is slowly becoming the de facto approach when it comes to building applications following the microservices architecture. It is undeniable that containerization has a lot of advantages, but these advantages come at a cost.

CVE-2021-44142: Critical Samba Vulnerability Allows Remote Code Execution

Recently, a critical out-of-bounds vulnerability, assigned to CVE-2021-44142, was disclosed in Samba versions prior to 4.13.17. The Samba vulnerability carries a critical CVSS of 9.9 and allows attackers to remotely execute code on machines running a Samba server with a vulnerable configuration. The vulnerability was disclosed as part of the Pwn2Own Austin competition where researchers are challenged to exploit widely-used software and devices with unknown vulnerabilities.

Our Solution for Scalable Multi-Region SaaS Deployment

Just like many other production DevOps engineering teams, our JFrog team deploys new version releases several times a day to AWS, Azure and GCP, across more than 20 cloud regions. This process used to take us many hours and could have even failed if it was done alongside maintenance by other teams.

10 Best Practices to Get the Most Out of Your IT Infrastructure Monitoring

IT infrastructures are in a constant state of change. From centralized mainframe systems to distributed serverless multi-cloud environments, these changes have happened relatively quickly. And nothing is stopping it. Gartner predicts that by 2023, over 90% of IT organizations will have most of their staff working remotely. This is largely due to companies shifting to using more cloud services. IT operations teams have had to find ways to keep up by implementing effective IT infrastructure monitoring.

[Infographic] AWS RDS from a Serverless perspective

In this article, we’ll deep dive into all the basics to help you decide if AWS RDS is the right decision for your architecture and help you hit the ground running if you do end up AWS RDS. For many decades now, relational databases (RDS) have been the place to store your data. They are pretty flexible often use some kind of SQL dialect, which is one of the main languages taught in computer science classes, and widely understood by the average developer.

Your First Pulumi and Shipa Integration

Typically, Infrastructure-as-Code or IaCs have had their own languages to learn. For example, if leveraging Terraform most likely you came across Terraform’s native syntax, HCL. Though as software engineers we might be more familiar with other languages of choice. Using a general-purpose computer language vs a provider level syntax does unlock the power of the language; anything you can do in the computer language potentially can be additional methods, calls, etc.

Create and Manage Registry Secrets with VMware Tanzu Mission Control

Operators using VMware Tanzu Mission Control can now create and manage image registry secrets. This new feature of Tanzu Mission Control enables people to create image registry secrets in a single namespace and make them available for use by all namespaces in a cluster, providing a single place to manage all registry secrets for that cluster.

Progressive Updates to Cloud Native Apps Using Tanzu Service Mesh Traffic Management

Releasing new features seamlessly with no downtime in a rapidly evolving microservices-based application can be challenging. VMware Tanzu Service Mesh makes this process easier, removing much of the complexity involved with rolling out progressive updates to cloud native apps. Here we explain how it works.

3 reasons top-notch infrastructure performance is critical for service providers

Brocade, a Broadcom Company, named 2/8/2022 as End-of-Support (EOS) for Brocade Network Advisor (BNA), the collection mechanism for Brocade fabrics. Broadcom recommends Brocade SANnav as a replacement for BNA. To continue providing industry-leading infrastructure intelligence, Galileo’s new v2 agent for Brocade will use the REST functionality to collect all the configuration and performance metrics required. Read on for all the details you need to know.

7 Best CDN Providers 2022

Web page loading time, or website speed in a more technical phrase, is an important SEO component. It's also the most important aspect of the user experience. Modern internet content consumers have a short attention span and lack patience. You risk losing valuable traffic if your website does not load quickly enough. A CDN (Content Delivery Network) can help a website load faster.

The Role Of Hyperautomation In Network-as-a-Service (NaaS)

One of the key benefits of the Network-as-a-Service (NaaS) approach is the level of control network service providers can pass on to their enterprise customers and partners. In this blog, we look at the role that hyperautomation is playing in the provisioning and management of networks...

CLI Stands For.... A CLI Intro Series - Part 1

Intro to the CLI – Part 1: Why Learn to Use The Terminal and Some History Long-time fans of GitKraken Client have come to love the graphical user interface, or GUI, that allows you to click on a Git branch or commit to perform an action or even drag and drop a branch to start a pull request. Version 8.0 of GitKraken Client introduced the GitKraken CLI, allowing you to interact with your repositories, and the rest of your computer, from Terminal Tabs.

Ceph for Enterprise

In this webinar, we review the storage challenges faced by Enterprises, and how Ceph can solve many of them. We discuss how a single Ceph cluster can address block, file and object storage needs, and compliment proprietary and public cloud solutions. We will also demonstrate some of the key features of Ceph that provide solutions for disaster recovery and compliance requirements.

The 3 Most Common Questions We Get About Cloud Cost Intelligence

As cloud use has increased, so have the costs associated with it. In 2020, surveyed organizations reported being over budget for their cloud spend by an average of 23%. Most are spending more than they expected, and to top it off — they don’t feel they’re getting value from that spend.

The Top Public Sector Consideration for 2022: Kubernetes Adoption

Kubernetes is one of the most popular platforms for managing and deploying applications built on microservices and containers. For the public sector, deploying pure upstream Kubernetes in offline, air-gapped environments can be a big challenge. Especially when you’re dealing with strict security controls and limited bandwidth, processes, and resources in place to ramp up quickly.

Finserv hybrid cloud strategy - it starts with Linux

The future of financial services technology infrastructure is hybrid multi-cloud. Hybrid multi-cloud architecture provides financial institutions flexibility, portability, interoperability, and the control needed to consistently deploy and manage enterprise applications and workloads. By adopting hybrid cloud, finservs realise the benefits of effective cloud cost management, security, compliance, efficiency and agility. The hybrid cloud strategy starts with choosing the right enterprise Linux.

Top 6 Innovations in Data Center Cooling Technology

Cooling systems are one of the most important components of a data center. They often consume about half of all the data center’s total energy and are necessary to maintain a safe operating environment. The American Society of Heating, Refrigerating, and Air Conditioning Engineers (ASHRAE) thermal guidelines recommend that the ideal temperature for server inlets is between 64.4° F and 80.6° F with a relative humidity between 40% and 60%.

Secure Your Software Supply Chain with New VMware Tanzu Application Platform Capabilities

VMware Tanzu Application Platform is a modular, application-aware platform that gives developers a prepaved path to production for building and deploying software on any compliant public cloud or on-premises Kubernetes cluster. Designed to deliver a superior and secure developer experience, it makes the software supply chain even more secure with a suite of features, including vulnerability scanning, a software bill of materials, and image signing, and more.

How a team of 15 developers deploys 4200 times per Month using the Preview Environments

When the CTO of this growing company (freshly acquired by a billion-dollar company) contacted me, he was concerned by the ability of his team to deliver what they committed to for the current year. His main issue was 15 engineers working in the same development environment. Can you imagine developing on the same workstation? Things will get worst as they plan to quadruple their engineering team size in the next 18 months.

For Every Github Action...

On Nov 13, 2019 Github made it’s CI/CD solution GitHub Actions generally available to the world. Since then tens of thousands of shared workflows have been published. It is now the default for most Github projects given how easy it is to integrate with an existing repo. Projects of all sizes have adopted it from our homegrown Terraform module to the Docker Cli. This is why at Speedscale we’ve published a template for how to use Speedscale in conjunction with GitHub Actions.

6 ways your organization can benefit from a network management solution

In today’s world, businesses depend on the internet and networks for nearly all their operations. Most large-scale corporations from banks to IT services have their critical operations built around a network. With network types ranging from wired and wireless to virtual environments, network management has only become increasingly complex, and network administrators need all the help they can get.

Sponsored Post

Top 5 Kubernetes Load-Testing Tools and How They Compare

It's not for nothing that Kubernetes is a popular choice for running a cloud workload. It can be a powerful tool for orchestrating your applications. However, one thing that can often be a last thought in a production workflow, or maybe forgotten altogether, is load testing. It might be tempting to think that Kubernetes can handle it all. In many cases it can, but it's always smart to know how much your application can take. After reading this article, you'll be equipped to determine which tools would best serve you for load testing your application.

Use Datadog's Sourcegraph extension to navigate code and visualize service dependencies

Sourcegraph is a universal code search tool that enables you to easily navigate and understand all of your code, regardless of the number of repositories you have and where they’re hosted. Its built-in code intelligence feature lets you jump to the definition and references of functions and variables, helping you learn new codebases faster.

Load Balance an Infinite Number of Servers And Never Reload HAProxy

Every load balancer you’ll find on the market must deliver performance, reliability, scalability, and security, and do it better than its competitors. Each must solve complex programming challenges that address those needs—choices that will affect the direction of the project for years to come. HAProxy is no different. When evaluating whether you should choose HAProxy or something else, it helps to know how project contributors answered the big, architectural questions.

Using authentication decorators in Flask

Has your team worked on an API and wanted (somehow) to implement more powerful security features? If you are dissatisfied with the level of security in an API, there are solutions for improving it! In this tutorial, I will lead you through the process of creating API endpoints that are secured with authentication tokens. Using these endpoints, we will be able to make requests to the Flask API only for authenticated users.

Scaling based on the number of messages in an Azure Service Bus queue

One of the most notable advantages of the Cloud is the ability to scale resources to meet demand. We then scale out or up when the demand increases, and we scale in or down when the demand decreases. For the record, scaling out / in refers to increasing or decreasing the number of instances of a given resource, whereas scaling up / down refers to increasing or decreasing the capacity (CPU, Ram, Disk, I/O performance) or a given instance.

Don't Miss Out: Highlights from DevOps Cloud Days 2022

If you didn’t attend our recently concluded DevOps Cloud Days online conference, you missed a learning event that those who did called “fantastic” and “meaningful.” In written feedback, developers, operations staff, and security admins who attended described the presentations as “powerful,” “inspiring” and “excellent.” Fortunately, it wasn’t your last chance to share that fruitful experience with us.

5 Ways To Increase Engineering Velocity Without Skyrocketing Costs

It's something you know. Those who rely on or offer Software-as-a-Service (SaaS) solutions are under constant pressure to innovate. Often, this means quickly building new features and releasing them more frequently. Staying on budget and on time is also critical for staying competitive. Likewise, SaaS providers should also offer customers cost-effective solutions to their technology challenges. But that’s not all. You must also always release quality code that provides seamless user experiences.

From eBPF to CI/CD: 12 emerging trends in observability

As businesses accelerate digital transformations and cloud adoption to better serve customers and employees in the face of the global pandemic, operational complexity has also mounted. To untangle these complexities and enable executive visibility into IT ecosystem , business leaders are increasingly looking to observability solutions as a strategic investment.

The Power of Shipa CNAMEs

As a software engineer, I admit I am not the best at networking. Can’t connect to your app for some reason, one going joke is to “always blame DNS” e.g the Domain Name System. My personal DNS experience is usually editing a few records for my personal blog and connecting a few tools and that is it. Thanks to distributed systems, had to learn all about SRV records and some more DNS concepts.

NASSCOM Features CloudHedge in Emerge 50 for 2021 League of Top 10 Enterprises in the SaaS Award Category

CloudHedge’s OmniDeq™, worlds leading platform for automating App Modernization has been recognized by NASSCOM in the Emerge 50 Awards for 2021 and has also secured a spot in the League of Top 10 Enterprises under the “SaaS” Award Category.

Platform.sh implements new EU Standard Contractual Clauses (SCCs)

On June 4, 2021, the EU Commission released two new contract templates, both labeled Standard Contractual Clauses (SCCs). The first template is for standard contractual clauses between controllers and processors under Article 28 of the GDPR, and its adoption is optional. The second template is for module-based standard contractual clauses for personal data transfers to non-adequate countries, and its adoption is required. With GDPR compliance as our top priority, Platform.sh has adopted both.

Sponsored Post

What is MTTR? Resolve incidents faster through ops, alerting and documentation

When downtime strikes any distributed software deployment or platform, it's all hands on deck until the lights are green and service is restored. This process, from the recognition of a problem to a deployed solution, has most commonly been defined as MTTR - mean time to resolution. In just the last few years, DevOps and site reliability (SRE) professionals have developed sophisticated new models for how they work and audit their successes. In 2022, MTTR is one of the most widely-used software performance success metrics.

Introducing Datadog Application Security

Securing modern-day production systems is expensive and complex. Teams often need to implement extensive measures, such as secure coding practices, security testing, periodic vulnerability scans and penetration tests, and protections at the network edge. Even when organizations have the resources to deploy these solutions, they still struggle to keep pace with software teams, especially as they accelerate their release cycles and migrate to distributed systems and microservices.

The startup guide to sensible incident management

If you’re working at an early stage startup and looking to get some good incident management foundations in place without investing excessive time and effort, this guide is quite literally for you. There’s an enormous amount of content available for organisations looking to import ‘gold standard’ incident management best practices – things like the PagerDuty Response site, the Atlassian incident management best practices, and the Google SRE book.

CFEngine bootstrap with Ansible

CFEngine and Ansible are two complementary infrastructure management tools. Findings from our analysis show that they can be combined and used side by side with joint forces to handle all areas in the best possible way. Part of infrastructure management is hosts deployment, either when building a brand new infrastructure or when growing one by adding new hosts.

Managed Kubernetes Comparison: EKS vs Scaleway Kubernetes Kapsule

The container orchestration tool Kubernetes helps an increasing number of companies to automate, scale, and manage their containerized application deployments. According to the Cloud Native Computing Foundation, the open-source software foundation that hosts and maintains Kubernetes, adoption of the platform increased from seventy-eight percent in 2019 to eighty-three percent in 2020.

DevOps Roundtable with Transact Campus

Join us for an exclusive round table event where you will have the opportunity to ask Mrinal Virnave, Senior Director of Software Architecture at Transact Campus & JFrog’s own technical expert Bill Manning your most pressing DevOps questions, like: During our sit down, Mrinal Virnavewill elaborate on how his team increased productivity by transforming their developer experience and creating a centralized and secure process.

Integrating Log Analytics in Serverless360

Recently we launched features to provide support for Log Analytics in Serverless360. Log Analytics workspaces are used by a lot of different features within the Azure Monitor stack and by providing the ability to link a Log Analytics Workspace to a Business Application in Serverless360 we see that it provides a way you can allow a support user the ability to view and run queries against your log data without needing to be an Azure Expert.

Design Considerations for Software Distribution to Edge & IoT Applications

Make no mistake: You can’t overlook software distribution in DevOps. At risk are the reliability, security and speed of your software releases — and your business itself. This is especially true in enterprises that are releasing across numerous edge endpoints or IoT devices. As your releases’ cadence and payload grow, software distribution challenges multiply, particularly at the edge.

Now You can Invoke PagerDuty Rundeck Actions Within the PagerDuty Slack Integration

Last year, we released PagerDuty Rundeck Actions, a PagerDuty add-on product that connects responders to automated diagnostics and remediation for common problems directly in the PagerDuty incident response workflow. After working with our customers and listening to the community, we are excited to announce that PagerDuty Rundeck Actions now integrates with PagerDuty’s Slack integration.

The Question Isn't Whether You're Overspending in the Cloud, It's by How Much

Everyone is doing it. No, I am not talking about the latest Tik Tok challenge… The thing that everybody is doing—every company, that is—is that they are spending more money in the cloud than they need to. In fact, 82% of respondents in our own recent survey admitted that their organizations have incurred unnecessary cloud costs.

SRE: How the role is evolving

The growth of site reliability engineering (SRE) has demonstrated the need for SRE implementations is here to stay for the foreseeable future. LinkedIn voted SRE jobs as the second most promising positions in the US in 2019, and now as we head into 2022, you can be sure to see the evolution of SRE continue to grow and expand. Below, we’ll get into what SRE is, what SRE engineers do, and how SRE will continue to evolve into the future.

Announcing General Availability of Application Transformer for VMware Tanzu

Today, I’m delighted to announce that Application Transformer for VMware Tanzu is now generally available. Application Transformer for Tanzu is a tool that aids in the discovery, analysis, and containerization of legacy applications, thus helping customers to simplify and accelerate their app modernization journeys by targeting their re-platforming strategy on the well-known “5 R” modernization framework.

Get the most out of your Hyper-V infrastructure using ManageEngine OpManager

Virtualization is the technique of creating a software-based virtual version of something, whether that be computers, storage, networking, servers, or applications. Virtualization creates a virtual layer over the hardware, enabling the creation of virtual machines (VMs), which are virtual computers that you can run multiple of on a single piece of hardware.

What is Server Management? Tools and Best Practices

In the digital age, organizations depend more on IT than ever before. The foundation of many IT functions -- including data storage, website hosting, emails, and software -- is server management. Without reliable, functioning servers, most IT functionality would collapse. Many businesses have migrated internal IT to cloud services using servers located in remote data centers, but a significant number still have in-house servers or use a hybrid environment of in-house and cloud services.

Kubernetes Tips: How to find the Port of a Service with a DNS request

Last week I created a guide for our users to set up an NGINX service as an API Gateway with Qovery. The API gateway must redirect the incoming traffic to the appropriate service with the correct port. My problem is that the API Gateway does not know the ports exposed for every service. In this post, I will show you a quick tip on finding the port of a Kubernetes service with a single DNS request. Let's go!

AWS' Newest Official Partner: Speedscale

Speedscale is excited to join the AWS Partner Network (APN), the global community of partners who leverage Amazon Web Services (AWS) to build solutions and services for customers. AWS Partners are uniquely positioned to help businesses take full advantage of all that AWS has to offer and accelerate the journey to the cloud. As part of this achievement, Speedscale has completed the AWS Foundational Technical Review (FTR).

Difference Between Public, Private, and Hybrid Cloud

Cloud computing is vast. It encompasses a huge range of architectural styles, classifications, and types. This complex computing network has transformed the way we work and is a crucial part of our daily lives, both at home and at work. For organizations, there are many ways to “cloud”, but let’s start with the basics of cloud computing; the internet cloud.

Optimize your resource classes with the CircleCI resources dashboard

CircleCI cloud offers over 20 resource classes (varying CPU and RAM) across multiple execution environments. Finding the best resource class size for your job — not too big and not too small — can sometimes be a challenge. But now, you can view CPU and RAM usage for Docker executors within the UI. The new dashboard, found in the new Resources tab on the job details page, displays the CPU and RAM, for all parallel runs in your Docker job.

Let's talk engineering; building software by building community

For the past three years, I have been running and facilitating a community where folks from all levels and departments at CircleCI can come together to discuss diverse topics. We call it “Let’s Talk Engineering.” Some of the topics we’ve covered have been technical in nature, while others have focused more on leadership: how different teams operate, personal growth, and writing to name a few. Let’s Talk Engineering celebrates interdisciplinarity and multidisciplinarity.

How To Calculate Margin Analysis For SaaS (And Increase Profitability)

Profitability in SaaS can be tricky. A company's net earnings are based on its invested capital, assets, and equity. But its profit margin shows how much money it extracts from its total sales or revenue — its ability to turn revenue into profit. Performing a thorough margin analysis or profit margin analysis is a reliable way to assess the company’s financial health.

Security-Rich: How the D2iQ Kubernetes Platform Meets NSA/CISA Kubernetes Security Hardening Guidelines

Cybersecurity continues to be a thorny problem for businesses and government agencies as breaches, disruptions, and data thefts continue to escalate. To help ensure that the growing number of government and private organizations implementing Kubernetes solutions have the highest possible levels of security, the National Security Agency (NSA) and Cybersecurity and Infrastructure Security Agency (CISA) have issued guidelines for hardening the security of Kubernetes implementations.

Open source cloud platform: meet OpenStack

Are you looking for an open source cloud platform and you don’t know where to start? Are you getting lost in all the independent rankings and cloud platform comparison pages? Try OpenStack and get your open source cloud platform up and running today. OpenStack works at any scale: from a single workstation to thousands of nodes and installs in minutes. Sounds impossible? Give it a try or continue reading to explore where is it coming from.

Linux Server Management in 2022

Linux server management is an integration of cybersecurity and business objectives. Linux server management at scale is a vastly different activity from interacting with a terminal on one machine. The best Linux server management tools universally offer a server management GUI within a web browser. Implementation details matter, especially in a pay-for-compute world. Sysadmin tools that don’t have a lightweight footprint increase overall compute costs.

Platform.sh consolidates its management team with appointment of Ori Pekelman and Fabien Potencier as CSO and CPO

A major player in European cloud, created in 2015 with presence in Europe and the United States, Platform.sh is entering a new growth stage. With a team of more than 300 employees and 150 more to be added in 2022, Platform.sh is developing and consolidating its management team. Ori Pekelman, co-founder and previously Chief Product Officer (CPO), passes the torch to Fabien Potencier and becomes Chief Strategy Officer (CSO). In 2021, Platform.sh acquired Fabien Potencier's company Blackfire.io to consolidate its position as a leading platform for the management of web app fleets-making this move a logical next step.

Using GitOps, Multiple Argo Instances, and Environments with Argo CD at Scale

As open-source software evolves and grows, it’s important that organizations, both large and small, can scale to keep up with their end user’s needs. At Codefresh, we are announcing a new release of our platform, Codefresh Software Delivery Platform, powered by Argo (CSDP) which delivers a scalable deployment management platform with Argo. Some of the major new features include the following support: These are some of the major new features in Codefresh’s new platform.

Using Codefresh Workflows for GitOps deployments

One of the major components of the Codefresh Software Delivery Platform is the Workflows capability that allows you to define any kind of software process for creating artifacts, running unit tests, running security scans, and all other actions that are typically used in Continuous Integration (CI). At first glance, Codefresh Workflows might look like the typical pipelines that you would find in any popular CI product but if you look under the hood you will realize looks can be deceiving.

Introducing Codefresh Software Delivery Platform, Powered by Argo

Delivering new software is the single most important function of businesses trying to compete today. Many companies get stuck with flaky scripting, manual interventions, complex processes, and large unreliable tool stacks across diverse infrastructure. Software teams are left scrambling to understand their software supply chain and discover the root cause of failures. It’s time for a new approach.

The Top 7 Open Source Tools for Securing Your Kubernetes Cluster

This article explores how to secure production Kubernetes clusters with the help of open source tools. As a prerequisite, you’ll need to have basic beginner-level knowledge of Docker and Kubernetes. In a nutshell, Kubernetes is a container orchestration tool and Docker is a containerization platform. Some of the most famous Kubernetes clusters managed by cloud providers include AWS EKS, Azure AKS, and Google CKE.

Ready to run! Get Started with Spark on Kubernetes

The Apache Spark and Kubernetes integration was recently officially declared Generally Available and Production Ready, generating a lot of interest from the community. More and more companies choose to run their big data workloads on Kubernetes to benefit from containerization and a standard cloud-native ecosystem.

What Is a System Administrator? A Complete Guide to SysAdmin Roles and Responsibilities

System Administrators (SysAdmins) often represent the core of IT organizations. SysAdmins manage the organization's computing infrastructure, encompassing servers, virtualization, networking, and storage. For many years, the term System Administrator, or SysAdmin, was typically associated with Linux or UNIX systems.

The Impact of CVE-2022-0185 Linux Kernel Vulnerability on Popular Kubernetes Engines

Last week, a critical vulnerability identified as CVE-2022-0185 was disclosed, affecting Linux kernel versions 5.1 to 5.16.1. The security vulnerability is an integer underflow in the Filesystem Context module that allows a local attacker to run arbitrary code in the context of the kernel, thus leading to privilege escalation, container environment escape, or denial of service.

VMware Announces Availability of Terraform Provider for Tanzu Mission Control

As more customers start to see the benefits of Kubernetes in orchestrating their containerized applications, VMware Tanzu Mission Control continues to evolve with new features that meet operational challenges. With the addition of Terraform provider support, Tanzu Mission Control enables increased DevOps velocity by offering an additional route to consistent deployments and management of Kubernetes.