Description
I know this sort of defeats your minimal bindings methodology. But I am finding it crucial to know what llama.cpp set the context length to. in the situation where I set it to -1 and let it choose the context size. The reason for letting it choose is because it is becoming tedious to set these nowadays as they are all over the board. I need to know this context size for trimming the context window. All works of course if I explicitly set this. but only llama.cpp knows this value when reads the gguf's native size. I could potentially just start hacking away at the native bindings and do it myself. but then I break when you update. Possibly this falls under some very, very basic helper function category you could just include as a sort of extension.