Hey,
I’m trying to confirm whether constrained decoding is currently supported by the downstream inference engines used in this project. Specifically, I’m interested in the ability to enforce structured output formats, such as generating valid JSON or following a defined schema, directly during decoding rather than relying on post-processing.
This capability would greatly improve reliability for applications that require well-formed structured responses.
Thanks for your work on the project!
Alan
Hey,
I’m trying to confirm whether constrained decoding is currently supported by the downstream inference engines used in this project. Specifically, I’m interested in the ability to enforce structured output formats, such as generating valid JSON or following a defined schema, directly during decoding rather than relying on post-processing.
This capability would greatly improve reliability for applications that require well-formed structured responses.
Thanks for your work on the project!
Alan