docs(benchmarks): add generation benchmarks#239
docs(benchmarks): add generation benchmarks#239vetertann wants to merge 1 commit intotoon-format:mainfrom
Conversation
|
Hi there! Can you please enhance my benchmarks package with your code and share the tool results? As a hint, the final generation result that gets embedded in |
|
Oh, ok... I did this PR just because in your comment to the issue #207 you wrote: A write‑up or summary table we can link to, and |
|
I see, sorry, missed that. Could you add the generation benchmarks (tho in Python, no problem) to this repo as well? For the sake of reproducibility? Thanks. |
|
Sure, I’ll open a PR adding it under benchmarks/generation |
Linked Issue
Closes #207
Description
This PR adds Generation Benchmarks section to the documentation. It details the performance of TOON compared to JSON and JSON Structured Output (JSO) across 21 different LLMs, focusing on token efficiency, accuracy, and repair capabilities.
Type of Change
Changes Made
## 2. Generation benchmarkssection todocs/guide/benchmarks.md.SPEC Compliance
Testing
Pre-submission Checklist
Breaking Changes
Additional Context
Benchmarks were run via the Nebius API.