llm-scaler-vllm beta release 1.1-preview
Pre-release
Pre-release
·
36 commits
to main
since this release
Highlights
Resources
- Docker Image: intel/llm-scaler-vllm:1.1-preview
(functionally equivalent to intel/llm-scaler-vllm:0.10.0-b2)
What’s new
- vLLM:
- Bug fix for sym_int4 online quantization on Multi-modal models