Allow to use a plain-text transcription file as a reference #1284

njourdane · 2025-11-06T21:13:47Z

This PR adds the --text_file CLI option, which can be used to set a plain-text transcription file as a reference. It can be used for instance to generate karaoke subtitles, assuming you have the lyrics (usually available on the web).

This step is performed after the alignment. It's based on the new align_text function, which takes an aligned transcription result and a file path, and return an other aligned transcription result. It tries match words between synchronized transcription and plain-text transcription using the Python difflib module, acting similarly to a git diff. The diff is done on a slugified version oh each word (so Hëllo matches with hellô!).

Start and end-time are transferred as-is when possible, otherwise they are based on last/previous times and word lengths. Word scores are also transferred.

If the logger is set to DEBUG, it prints the details on how each word is converted, with colors to distinguish diff operations (equal / replace / insert / delete):

Here with an extract of Les filles, les meufs from french singer Marguerite.

It was quite a journey to work on this, I hope it will be useful for some people. :)

allow to use a plain-text transcription file as reference

f9b31c7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Allow to use a plain-text transcription file as a reference #1284

Allow to use a plain-text transcription file as a reference #1284

njourdane commented Nov 6, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Allow to use a plain-text transcription file as a reference #1284

Are you sure you want to change the base?

Allow to use a plain-text transcription file as a reference #1284

Conversation

njourdane commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

njourdane commented Nov 6, 2025 •

edited

Loading