Operations | Monitoring | ITSM | DevOps | Cloud

Operational Controls at the BBC

The BBC is the world's largest broadcaster, and is home to a wide range of popular services. Ensuring service availability is a key concern of the BBCs product teams, and they’ve invested in operational controls to help them achieve this. Their portfolio is comprised of thousands of services that communicate together to deliver live TV and radio, on-demand content, and a vast high-traffic website.

Serverless from Scratch

Openfit is a new fitness streaming service by Beachbody that streams hundreds of thousands of hours of video to tens of thousands of users each month with a 100% serverless architecture. From development to testing and production workloads, Reza Javidi (Director of DevOps & SRE) shares best practices his teams have developed for scaling and securing serverless workloads—both in terms of traffic and development velocity.

Building a Real Time Metrics Database at Datadog

In the course of its eight years of existence, Datadog has grown its real time metrics systems that collect, process, and visualize data to the point they now handle trillions of points per day. This has been based on an architecture combining open source technologies, such as Apache Cassandra, Kafka, and PostgreSQL, with a lot of in-house software for in-memory data storing and querying.

Monitor your customer data infrastructure with Segment and Datadog

This is a guest post by Noah Zoschke, Engineering Manager at Segment. Segment is the customer data infrastructure that makes it easy for companies to clean, collect, and control their first-party customer data. At Segment, our ultimate goal is to collect data from Sources (e.g., a website or mobile app) and route it to one or more Destinations (e.g., Google Analytics and AWS Redshift) as quickly and reliably as possible.

Monitor Apache Hive with Datadog

Apache Hive is an open source interface that allows users to query and analyze distributed datasets using SQL commands. Hive compiles SQL commands into an execution plan, which it then runs against your Hadoop deployment. You can customize Hive by using a number of pluggable components (e.g., HDFS and HBase for storage, Spark and MapReduce for execution). With our new integration, you can monitor Hive metrics and logs in context with the rest of your big data infrastructure.

Understand, explore, and collaborate with Dashboard Details

Dashboards provide critical visibility into the performance and health of your environment. But if your organization uses hundreds or thousands of dashboards, or if you’ve recently transitioned to a new company or different team, it’s not always easy to understand the full significance of the data shown on every single dashboard.

How to install Datadog on AWS hosts with Ansible dynamic inventories

Ansible is an automation tool for provisioning, managing, and deploying infrastructure and applications. When building large-scale applications, Ansible enables users to manage and configure their infrastructure across platforms like AWS. Whether you rely on temporary or dedicated hosts, you can use Ansible to create a repeatable process for configuring them with the Datadog Agent.