Skip to main content

    Recommendation Service Memory Leak

    Failure scenario

    This performance issue is triggered by enabling the recommendationServiceCacheFailure flag in the Flagd config.

    When activated, this flag introduces a memory leak in the recommendation service.

    Root Cause Analysis by Coroot

    Coroot AI RCA analysis of recommendation service cache failure showing memory issues and OOM kills

    How it works

    1. Anomaly detection

    Coroot detected anomalies in the frontend-proxy service, identifying increased latency and error rates affecting user experience.

    2. eBPF-based metrics correlation

    Using eBPF metrics, Coroot traced the connection failures back to the recommendation service, connecting latency spikes across the entire service dependency chain to identify the source of ECONNREFUSED errors.

    3. Understanding the cause

    Coroot identified that memory problems in the recommendation service were causing frequent restarts. The analysis revealed OOM killer terminations due to memory exhaustion, with kubernetes events showing repeated container restarts and service unavailability. The cascading impact propagated through frontendfrontend-proxy, causing 500 errors and connection timeouts.

    Results

    2/3
    Detection of the problematic service:

    Successfully identified the recommendation service as the source of memory exhaustion and OOM kills

    Immediate fixes provided:

    Suggested increasing memory limits, but this doesn't address the root cause - the memory leak needs to be analyzed and fixed in the application code

    Additional details:

    Provided complete failure scenario analysis showing gRPC calls, connection attempts, and cascading latency through the entire service dependency chain

    LLM Usage Details

    Model: Anthropic Claude Sonnet 4 (claude-sonnet-4-20250514)
    Token usage: 17,409 input tokens • 627 output tokens
    Estimated cost: ~$0.06 USD

    Based on $3 per 1M input tokens, $15 per 1M output tokens

    Try Coroot's AI RCA in your environment

    See how Coroot finds root causes for your real production issues with a full enterprise trial.