How to set my value for the embedding size? #1200
Comments
This sounds like something we don't currently support (I've certainly never heard of it). Could you link some things in the main llama.cpp repo related to this? That'll help inform development for the next version.
I didn't find it. Qwen3 embedding and rerank:
You don't really set it; you just crop/ignore parts of the output. Matryoshka-tuned models have their embedding dimensions ordered by importance, so the leading ones carry most of the information.
I do not think so.
I see. Then it's probably possible to prompt the model to change the output, though I see no further mention of it anywhere.
Me too, which is weird :)
You were right: the vector is being clipped. https://huggingface.co/blog/matryoshka#why-would-you-use-%F0%9F%AA%86-matryoshka-embedding-models
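For anyone landing here later, a minimal sketch of the clipping discussed above, assuming the full embedding is already available as a plain list of floats from whatever binding is in use. `truncate_embedding` is a hypothetical helper, not part of llama.cpp or this project, and renormalizing after the cut is an assumption that only matters if the vectors are compared by dot product or cosine similarity.

```python
import math


def truncate_embedding(vec, dim):
    """Keep the first `dim` components of `vec` and L2-normalize the result.

    Hypothetical helper for illustration: it only makes sense for models
    trained with Matryoshka Representation Learning (MRL), where the
    leading dimensions carry most of the information.
    """
    if not 0 < dim <= len(vec):
        raise ValueError(f"dim must be between 1 and {len(vec)}, got {dim}")

    head = list(vec[:dim])  # crop: drop everything after the first `dim` values
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # nothing to normalize in an all-zero vector
    return [x / norm for x in head]


# Toy example with a 3-dimensional "embedding" cropped to 2 dimensions.
full = [3.0, 4.0, 12.0]
short = truncate_embedding(full, 2)
print(len(short), short)
```

The same crop has to be applied to every vector in the index and to every query vector, since embeddings of different lengths cannot be compared.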
Description
The model supports setting a custom length for the embedding vector. How do I set my value?
MRL Support indicates whether the embedding model supports custom dimensions for the final embedding.