Support setting an SLO threshold and then calculating goodput (the rate of requests that meet the SLO), rather than measuring only total throughput.
Why is this needed:
Currently, the vllm-benchmark reports only average, median, and P99 values for TTFT and TPOT. From those aggregates alone it is impossible to calculate statistics such as what percentage of 200 requests fall below the SLO threshold.
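To make the request concrete, here is a minimal sketch of the calculation I have in mind. It is not taken from the vLLM code base: `RequestLatency`, `slo_attainment`, `goodput`, and the threshold parameters are hypothetical names chosen for illustration, and it assumes the benchmark already records a TTFT and TPOT value per request.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class RequestLatency:
    """Per-request latencies (hypothetical record, not a vLLM type)."""
    ttft_ms: float  # time to first token, in milliseconds
    tpot_ms: float  # time per output token, in milliseconds


def count_good(requests: Sequence[RequestLatency],
               ttft_slo_ms: float, tpot_slo_ms: float) -> int:
    """Count requests whose TTFT and TPOT are both within the SLO."""
    return sum(
        1 for r in requests
        if r.ttft_ms <= ttft_slo_ms and r.tpot_ms <= tpot_slo_ms
    )


def slo_attainment(requests: Sequence[RequestLatency],
                   ttft_slo_ms: float, tpot_slo_ms: float) -> float:
    """Fraction of requests that meet the SLO (0.0 for an empty run)."""
    if not requests:
        return 0.0
    return count_good(requests, ttft_slo_ms, tpot_slo_ms) / len(requests)


def goodput(requests: Sequence[RequestLatency], duration_s: float,
            ttft_slo_ms: float, tpot_slo_ms: float) -> float:
    """SLO-compliant requests completed per second of benchmark time."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return count_good(requests, ttft_slo_ms, tpot_slo_ms) / duration_s


# Example with made-up numbers: 4 requests over a 2-second run.
sample = [
    RequestLatency(ttft_ms=100, tpot_ms=20),  # meets both thresholds
    RequestLatency(ttft_ms=300, tpot_ms=20),  # TTFT too high
    RequestLatency(ttft_ms=150, tpot_ms=60),  # TPOT too high
    RequestLatency(ttft_ms=180, tpot_ms=40),  # meets both thresholds
]
attainment = slo_attainment(sample, ttft_slo_ms=200, tpot_slo_ms=50)
good_rps = goodput(sample, duration_s=2.0, ttft_slo_ms=200, tpot_slo_ms=50)
```

Requiring both thresholds at once is one possible definition; checking TTFT and TPOT independently, or adding an end-to-end latency threshold, would be equally reasonable choices.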