Finetuning Granite Speech #307

Open
avihu111 wants to merge 5 commits into main

Conversation

@avihu111 (Author)

What does this PR do?

This PR adds a notebook that shows how to finetune Granite Speech, an open-source model that leads the OpenASR leaderboard.
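For context, a minimal inference sketch is below, assuming the Granite Speech integration in Transformers; the checkpoint id, the `<|audio|>` prompt placeholder, and the processor call follow the model card example and may differ between releases.

```python
# Minimal inference sketch for Granite Speech (assumptions: checkpoint id,
# <|audio|> placeholder, and processor signature follow the model card example).
import torch
import torchaudio
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

model_id = "ibm-granite/granite-speech-3.3-2b"
device = "cuda" if torch.cuda.is_available() else "cpu"

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id).to(device)

# Granite Speech expects 16 kHz mono audio; resample if necessary.
wav, sr = torchaudio.load("sample.wav")
wav = torchaudio.functional.resample(wav, sr, 16000)

# The chat template injects the audio through an <|audio|> placeholder.
chat = [{"role": "user", "content": "<|audio|>can you transcribe the speech into a written format?"}]
prompt = processor.tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)

inputs = processor(prompt, wav, return_tensors="pt").to(device)
output_ids = model.generate(**inputs, max_new_tokens=200)
# Decode only the newly generated tokens, not the prompt.
transcript = processor.tokenizer.decode(
    output_ids[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(transcript)
```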

Who can review?

@merveenoyan @stevhliu can you give that a look? 🙏


@stevhliu (Member)

Hi, thanks for your contribution!

The cookbook recipes are more focused on applied use cases so it'd be awesome if you could tailor it more towards solving a specific problem or use case.

@jack-tol commented Jun 30, 2025

> The cookbook recipes are more focused on applied use cases so it'd be awesome if you could tailor it more towards solving a specific problem or use case.

Might not really be my place to say, but even though this script perhaps doesn't tackle a specific fine-tuning use case (e.g., domain-specific fine-tuning on medical audio), it is nevertheless very important to give the open-source community a script for fine-tuning a new open-source model on custom data. Maybe this is already in the works and I'm jumping the gun, but this contribution should surely exist somewhere in the cookbook, or in some other resource, until a better and more robust implementation is available. Just my thoughts.

@stevhliu (Member)

Absolutely, we're happy to have a link to it in the Granite Speech docs in Transformers if nothing else!

@avihu111 (Author) commented Jul 1, 2025

Hi @stevhliu, thanks for the feedback!
I expected (like @jack-tol) that the most common use case would be finetuning Granite Speech on custom data (e.g., a new language, unseen acoustic conditions, etc.).
My goal was to show the best way to run inference with and finetune the model, along with useful code snippets and a concrete (yet concise and easy-to-run) example.
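A hedged sketch of what that custom-data finetuning path could look like with PEFT LoRA; the target modules, rank, and other hyperparameters below are illustrative assumptions, not the notebook's actual settings.

```python
# Illustrative LoRA setup for parameter-efficient finetuning; target_modules
# and hyperparameters are assumptions, not values taken from the notebook.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForSpeechSeq2Seq

model = AutoModelForSpeechSeq2Seq.from_pretrained("ibm-granite/granite-speech-3.3-2b")

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # assumption: the LLM decoder's attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights remain trainable

# From here, train on (audio, transcript) pairs with the usual Trainer loop,
# building labels from the tokenized target transcripts.
```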

We can also finetune Granite Speech on an unseen task like spoken question answering, but I fear people won't find it as useful (a finetuning script was requested here and here).

I hope it will be suitable for the cookbook - I like that the Hugging Face webpage presents the notebook nicely. 🙏
If not, I assume the best approach is to add it to the Granite Speech docs.

@stevhliu (Member) commented Jul 1, 2025

I'm wondering if there is some way to apply your fine-tuning recipe to a more practical application. For example, you could fine-tune Granite Speech and build a Space that transcribes meeting notes, captions videos, etc. This would help you extend the notebook and demonstrate how to build an AI application with it.
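For illustration, a minimal sketch of such a Space with Gradio, reusing the inference pattern shown earlier; the checkpoint path is a placeholder for a finetuned model, and the prompt format follows the model card example.

```python
# Minimal Gradio Space sketch; "./granite-speech-finetuned" is a placeholder
# for a finetuned checkpoint, and the prompt format is an assumption taken
# from the model card example.
import gradio as gr
import torch
import torchaudio
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

ckpt = "./granite-speech-finetuned"  # placeholder: your finetuned checkpoint
device = "cuda" if torch.cuda.is_available() else "cpu"
processor = AutoProcessor.from_pretrained(ckpt)
model = AutoModelForSpeechSeq2Seq.from_pretrained(ckpt).to(device)

def transcribe(audio_path: str) -> str:
    # Load the uploaded/recorded clip and resample to the expected 16 kHz.
    wav, sr = torchaudio.load(audio_path)
    wav = torchaudio.functional.resample(wav, sr, 16000)
    chat = [{"role": "user", "content": "<|audio|>can you transcribe the speech into a written format?"}]
    prompt = processor.tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
    inputs = processor(prompt, wav, return_tensors="pt").to(device)
    output_ids = model.generate(**inputs, max_new_tokens=200)
    return processor.tokenizer.decode(
        output_ids[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )

demo = gr.Interface(
    fn=transcribe,
    inputs=gr.Audio(sources=["upload", "microphone"], type="filepath"),
    outputs="text",
    title="Granite Speech meeting transcriber",
)

if __name__ == "__main__":
    demo.launch()
```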

If you decide to keep it as fine-tuning only, then I think it's best to add it to the Granite Speech docs.

Thanks again and we really appreciate the time and effort you put into creating this notebook! 🤗

@avihu111 (Author) commented Jul 3, 2025

Thanks, @stevhliu.
Can you advise on the best way to add this to the Granite Speech docs?
Most of the examples I've seen are short code snippets. Do you have a docs page with an example notebook that you can share?
Any help would be greatly appreciated - thanks!

@stevhliu (Member) commented Jul 3, 2025

Yeah, you can open a PR on the Transformers repo and create a ## Resources section in the Granite Speech docs with a link to your notebook.
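For reference, a hypothetical entry in the Granite Speech docs page (the file path and link are placeholders, not confirmed by this thread) could look like this:

```markdown
## Resources

- [Finetuning Granite Speech](<link-to-your-notebook>): a notebook showing how to finetune the model on custom data.
```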
