In this session, we will explore the challenges of diagnosing issues within large-scale systems made up of hundreds or even thousands of interconnected services. We’ll consider essential questions such as: What details are necessary to understand each service? How can we design dashboards that provide a clear and immediate overview of system health? We’ll also look at how to make the most of observability data—metrics, logs, traces, and profiles—by examining their specific roles and the unique perspectives they offer. Finally, we’ll discuss approaches to automating root cause analysis, weighing the benefits and limitations of both fully automated and partially automated solutions.
For more information about this event, please refer to this link:
https://devopsstage.com/speakers/nikolay-sivko/