Skip to content

llm-scaler-vllm beta release 1.1-preview

Pre-release
Pre-release

Choose a tag to compare

@liu-shaojun liu-shaojun released this 29 Sep 07:53
· 36 commits to main since this release
1006351

Highlights

Resources

What’s new

  • vLLM:
    • Bug fix for sym_int4 online quantization on Multi-modal models