Significant improvement in prompt processing speed in Vulcan after updating to 1.101.1 #1831
IntensivePorpoises
started this conversation in
General
Replies: 1 comment
-
|
Mostly due to upstream improvements in llama.cpp vulkan |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm using a Radeon 6700XT on Win11, and previously: 12b models at more than q5KS were just unbearably slow. Now I can get whole responses from q6K quants in well under a minute. I don't know what you guys changed, but it really helped a lot, thanks!
Beta Was this translation helpful? Give feedback.
All reactions