[Examples] Add real-time transcription script and accent post-processing notebook #2696
+474
−0
This PR adds two practical, ready-to-run examples intended to make Whisper easier to learn and adopt, especially for non-English users.
New Files Added
examples/real_time_transcription.py
→ Live microphone streaming transcription using pyaudio
→ Works out-of-the-box with tiny/base models for near real-time demo
→ Includes overlap buffer to avoid cutting words
examples/accent_postprocessing.ipynb
→ Jupyter notebook demonstrating post-processing for Indian/accents
→ Uses spellchecker-py to correct common misrecognitions
→ Easy to extend with custom dictionaries
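For reviewers who want a feel for the two ideas without opening the files, here is a minimal sketch of an overlap buffer and a custom-dictionary correction pass. It is not the code from this PR: the function names `overlapping_chunks` and `apply_corrections`, the chunk sizes, and the sample dictionary are all hypothetical, and the correction step uses a plain lookup table rather than a spell-checking library.

```python
import re
from typing import Dict, Iterator, List, Sequence


def overlapping_chunks(
    samples: Sequence[float], chunk_size: int, overlap: int
) -> Iterator[List[float]]:
    """Yield windows of up to `chunk_size` samples.

    Each window starts `overlap` samples before the previous one ended,
    so a word that straddles a boundary appears whole in at least one window.
    (Hypothetical helper, not taken from the PR.)
    """
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    start = 0
    while start < len(samples):
        yield list(samples[start:start + chunk_size])
        if start + chunk_size >= len(samples):
            break  # the last window already reached the end of the audio
        start += step


def apply_corrections(text: str, corrections: Dict[str, str]) -> str:
    """Replace whole words found in `corrections` (keys are lowercase).

    Words that are not in the dictionary are left untouched.
    (Hypothetical helper; a real notebook might call a spell checker here.)
    """
    def fix(match: "re.Match[str]") -> str:
        word = match.group(0)
        return corrections.get(word.lower(), word)

    return re.sub(r"[A-Za-z']+", fix, text)


if __name__ == "__main__":
    # Ten fake samples, windows of four, one sample shared between neighbours.
    for window in overlapping_chunks(range(10), chunk_size=4, overlap=1):
        print(window)

    # Assumed example dictionary of common misrecognitions.
    custom = {"recieve": "receive", "pakage": "package"}
    print(apply_corrections("Please recieve the pakage", custom))
```

In a live setup each window would be passed to the model in turn, and the duplicated text produced by the shared samples would need to be merged afterwards; that merging step is left out of this sketch.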
README Updated
Added section "New Usage Examples" with links to both files.
These examples directly address the suggestions:
Helps beginners and non-English speakers get started quickly.