Skip to content

Fix disaggregated per-GPU IO throughput comparability via cluster averages#912

Open
KOKOSde wants to merge 1 commit intoSemiAnalysisAI:mainfrom
KOKOSde:feat/disagg-cluster-avg-throughput
Open

Fix disaggregated per-GPU IO throughput comparability via cluster averages#912
KOKOSde wants to merge 1 commit intoSemiAnalysisAI:mainfrom
KOKOSde:feat/disagg-cluster-avg-throughput

Conversation

@KOKOSde
Copy link

@KOKOSde KOKOSde commented Mar 15, 2026

Fixes #299 by making disaggregated input/output throughput per GPU cluster-averaged (comparable with aggregated runs), while preserving role-specific decode/prefill throughput metrics for deeper analysis.

@KOKOSde KOKOSde requested a review from a team March 15, 2026 22:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

Presented input/output token throughput per GPU for disaggregated setups not usefully comparable to standard multi-gpu

1 participant