You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* feat: Add backend gallery
This PR add support to manage backends as similar to models. There is
now available a backend gallery which can be used to install and remove
extra backends.
The backend gallery can be configured similarly as a model gallery, and
API calls allows to install and remove new backends in runtime, and as
well during the startup phase of LocalAI.
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Add backends docs
Signed-off-by: Ettore Di Giacinto <[email protected]>
* wip: Backend Dockerfile for python backends
Signed-off-by: Ettore Di Giacinto <[email protected]>
* feat: drop extras images, build python backends separately
Signed-off-by: Ettore Di Giacinto <[email protected]>
* fixup on all backends
Signed-off-by: Ettore Di Giacinto <[email protected]>
* test CI
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Tweaks
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Drop old backends leftovers
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Fixup CI
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Move dockerfile upper
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Fix proto
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Feature dropped for consistency - we prefer model galleries
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Add missing packages in the build image
Signed-off-by: Ettore Di Giacinto <[email protected]>
* exllama is ponly available on cublas
Signed-off-by: Ettore Di Giacinto <[email protected]>
* pin torch on chatterbox
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Fixups to index
Signed-off-by: Ettore Di Giacinto <[email protected]>
* CI
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Debug CI
* Install accellerators deps
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Add target arch
* Add cuda minor version
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Use self-hosted runners
Signed-off-by: Ettore Di Giacinto <[email protected]>
* ci: use quay for test images
Signed-off-by: Ettore Di Giacinto <[email protected]>
* fixups for vllm and chatterbox
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Small fixups on CI
Signed-off-by: Ettore Di Giacinto <[email protected]>
* chatterbox is only available for nvidia
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Simplify CI builds
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Adapt test, use qwen3
Signed-off-by: Ettore Di Giacinto <[email protected]>
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
Signed-off-by: Ettore Di Giacinto <[email protected]>
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Use reranker from llama.cpp in AIO images
Signed-off-by: Ettore Di Giacinto <[email protected]>
* Limit concurrent jobs
Signed-off-by: Ettore Di Giacinto <[email protected]>
---------
Signed-off-by: Ettore Di Giacinto <[email protected]>
Signed-off-by: Ettore Di Giacinto <[email protected]>
0 commit comments