San Francisco, CA, USA
Jul 11, 2019   |  By James Burns
It’s the Thursday before a holiday weekend and you’ve got a cost crisis. Someone in finance has just noticed that this month’s AWS bill is trending 15% higher than last month’s. An all-hands meeting is called, and everyone is asked to shut down as much capacity as they can “safely.” All the work your team has been trying to push out before the end of the sprint is going to be delayed for days. Chances of an operational outage when someone shuts down something critical?
Jun 27, 2019   |  By Carlos Alberto Cortez
This year at KubeCon EMEA, the OpenTracing and OpenCensus teams announced that the two projects were merging to form OpenTelemetry. One of the main priorities for the merger is straightforward backwards compatibility with both OpenTracing and OpenCensus, via software bridges. In this post, we’ll discuss the OpenTracing to OpenTelemetry bridge.
Jun 17, 2019   |  By Danton Rodriguez
When there are unexpected issues in production, things can escalate quickly. Suddenly, your phone is buzzing and Slack notifications are everywhere. You’re actually contemplating picking up a phone call! In situations like these, how do you know where to start your investigation? Sure, sometimes there are clear indicators (AWS, GCP, or Azure are down). But what about when there aren’t? When you don’t even know where something is broken? Enter: Correlations.
Jun 13, 2019   |  By James Burns
LightStep Tracing is an easy way to start using distributed tracing without deploying your own distributed tracing system. Istio is a “batteries included” set of best practices for deploying and managing containerized software. Istio proxy provides an automatic service mesh, based on Envoy, so that you can understand and control how different services communicate with each other. Envoy, and therefore Istio, supports distributed tracing out of the box.
Jun 3, 2019   |  By James Burns
Previously, we’ve written about having hard conversations with cloud providers. On Sunday, June 2nd, Google Cloud Platform had an extended networking-based outage. There was significant disruption of commonly used services like YouTube and Gmail, as well as Google-hosted applications like Snapchat. The incident currently associated with the outage, 19009, indicates a start time of 12:53 US/Pacific and a resolution time of 16:56 US/Pacific.
May 2, 2018   |  By LightStep
Report Finds Record Growth in Microservices is Disrupting the Operational Landscape
Jun 25, 2019   |  By LightStep
In distributed systems, when things break (which they often do), it can be challenging to know where to start your investigation.
May 8, 2019   |  By LightStep
Developer Mode is an easy way to quickly get up and running with a local tracing solution. In as little as 30 seconds, you can launch a personal tracing sandbox, and see real-time traces whenever you execute code.
Apr 16, 2019   |  By LightStep
See how iOS engineer Parker Edwards solves performance issues in an ecommerce app — using LightStep's Service Diagrams. Service Diagrams are interactive, real-time, and dynamic overviews of system performance and architecture.
Apr 10, 2019   |  By LightStep
LightStep OSS engineer Austin Parker shows how fast it can be with LightStep Tracing and Developer Mode.
Apr 10, 2019   |  By LightStep
LightStep OSS engineer Austin Parker walks you through our newest feature: Developer Mode.