Operations | Monitoring | ITSM | DevOps | Cloud

What is Chronograf?

InfluxDB is an open-source time-series database, i.e. a database optimized for storing data points collected across an interval of time. Developed by InfluxData, InfluxDB is intended for fast, high-availability storage and retrieval of many different system metrics. The entire InfluxDB project, which is housed at influxdata.com, includes: Yet with all of these tools for collecting and processing time-series data, there's still one step missing—visualizing it. That's where Chronograf comes in.

Should you DIY your Opentelemetry Monitoring?

I recently read this thread in the CNCF slack from someone wanting to send metrics and traces directly to Postgres. Reasonable enough right? After all once your data is in postgres you can query it to your heart’s content. And isn’t the general culture of OpenTelemetry that you should be able to do all of Observability without resorting to SaaS tools? The thread, however, is pretty universally opposed to this approach; and I have to say that I agree.

How to choose an OS for software development in automotive

Automotive as an industry is constantly looking for ways to improve its processes and efficiency. The one common tool that is at the intersection of all software development activities is the operating system (OS). From advanced simulation tools to new vehicle designs, the OS has to be reliable and provide the best platforms for a maximum number of purposes.

Large language models (LLMs): what, why, how?

Large language models (LLMs) are machine-learning models specialised in understanding natural language. They became famous once ChatGPT was widely adopted around the world, but they have applications beyond chatbots. LLMs are suitable to generate translations or content summaries. This blog will explain large language models (LLMs), including their benefits, challenges, famous projects and what the future holds.

Canonical Kubernetes 1.28 pre-announcement

Kubernetes 1.28 is currently planned to be released on the 15th of August 2023. Canonical Kubernetes 1.28 is scheduled to release swiftly afterwards, with MicroK8s 1.28 expected to be generally available within 24 hours of upstream, and Charmed Kubernetes expected to follow within days. We consistently follow the upstream release cadence to provide our users and customers with the latest improvements and fixes, together with security maintenance and enterprise support for Kubernetes on Ubuntu.

Micro-Outages Uncovered: Exploring the Real Cost of Downtime for Your Business

Unplanned downtime is an eventuality every business tries to avoid but will face. In today’s digitally interconnected world, outages can be particularly damaging, especially if the business is unprepared. Not only can outages cause employee frustration and anger customers, leading to numerous intangible costs like lower satisfaction hurting a company’s reputation, but the loss of employee productivity caused by unplanned downtime can significantly affect the bottom line.

Q2 Round Up: Roadmap Review & Q3 2023 Look Ahead

Many thanks to everyone who joined us for our recent virtual meetup, during which we discussed some of our Q2 2023 highlights, including features highlights, the 2023 roadmap for VictoriaMetrics and of course: The launch of VictoriaLogs! In this blog post, we’d like to share a summary of these highlights.

Experience This! What is the Importance of Application Experience?

If you have ever worked in a kitchen, you know how tough it is to be short-staffed. Cooks have to work twice as hard and their performance suffers, leaving not-so-happy customers and comped meals for the complainers. It’s similar to how applications operate. Maybe an application is glitching out from poor coding on the backend or bogging from an influx of data coming in. In either case, the end-user application experience suffers.

MTTR vs. MTBF vs. MTTF: Understanding Failure Metrics

In the dynamic landscape of software and web applications, failures can have severe consequences, impacting user experience, business continuity, and overall performance. To proactively address these challenges, organizations rely on robust monitoring practices supported by failure metrics. Failure metrics, specifically tailored to software and web application monitoring, provide crucial insights into system health, reliability, and optimization opportunities.