Releases: m-bain/whisperX

v3.4.0

24 Jun 14:23

What's Changed

  • chore: add lockfile check step to CI workflows by @Barabazs in #1130
  • docs: add common issue section for libcudnn dependencies in README by @Barabazs in #1161
  • feat: diarization model env config by @bgdnvk in #1101
  • docs: add missing torch import to Python usage example in README by @hammerill in #1168
  • feat: enhance diarization with optional output of speaker embeddings by @eek in #1085

Full Changelog: v3.3.4...v3.4.0

v3.3.4

03 May 09:39

What's Changed

  • feat: improve CLI loading speed by lazy loading public API by @Barabazs in #1128
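One common way to lazy load a public API is a module-level `__getattr__` (PEP 562) that imports each name only on first access. The sketch below illustrates that general technique with stand-in standard-library targets; it is not taken from the whisperX source, and `make_lazy_module` and `LAZY_ATTRS` are hypothetical names used only for illustration.

```python
import importlib
import types

# Hypothetical mapping: public name -> (module to import, attribute to fetch).
# Standard-library targets stand in for heavy dependencies.
LAZY_ATTRS = {
    "dumps": ("json", "dumps"),
    "sqrt": ("math", "sqrt"),
}


def make_lazy_module(name, lazy_attrs):
    """Build a module object whose attributes are imported on first access."""
    module = types.ModuleType(name)

    def __getattr__(attr):
        # Called only when normal attribute lookup on the module fails.
        try:
            target_module, target_attr = lazy_attrs[attr]
        except KeyError:
            raise AttributeError(
                f"module {name!r} has no attribute {attr!r}"
            ) from None
        value = getattr(importlib.import_module(target_module), target_attr)
        setattr(module, attr, value)  # cache so later lookups skip this hook
        return value

    module.__getattr__ = __getattr__
    return module


api = make_lazy_module("demo_api", LAZY_ATTRS)
```

Accessing `api.sqrt` triggers the import and caches the result on the module, so a command-line entry point that never touches a given name never pays for importing it.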

Full Changelog: v3.3.3...v3.3.4

v3.3.3

01 May 09:09

What's Changed

Full Changelog: v3.3.1...v3.3.3

v3.3.2

10 Apr 07:38

What's Changed

  • chore: update ctranslate2 version requirement to >=4.5.0

This patch release should resolve CUDNN related issues.
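To see whether an environment already satisfies the new requirement, a check along the following lines may help. It is a minimal sketch using only the standard library; the helper names (`parse_version`, `meets_minimum`, `installed_ctranslate2_ok`) are hypothetical, and the simple parser assumes plain `major.minor.patch` version strings.

```python
from importlib import metadata

MINIMUM = (4, 5, 0)  # from the release note: ctranslate2 >= 4.5.0


def parse_version(text):
    """Turn '4.5.0' into (4, 5, 0), dropping any non-numeric suffix."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)  # pad short versions such as '4.5'
    return tuple(parts)


def meets_minimum(version_text, minimum=MINIMUM):
    """Compare a version string against the minimum as integer tuples."""
    return parse_version(version_text) >= minimum


def installed_ctranslate2_ok():
    """Return True or False, or None if ctranslate2 is not installed."""
    try:
        return meets_minimum(metadata.version("ctranslate2"))
    except metadata.PackageNotFoundError:
        return None
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as treating "4.10.0" as older than "4.5.0".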

Full Changelog: v3.3.1...v3.3.2

v3.3.1

08 Jan 17:01

What's Changed

  • refactor: add type hints and fix import statement by @Barabazs in #975
  • feat: include speaker information in WriteTXT when diarizing by @Barabazs in #976
  • Bug Fix: Suppress Numerals dataclasses replace method by @jmt0221 in #981

Full Changelog: v3.3.0...v3.3.1

v3.3.0

02 Jan 13:09

What's Changed

  • Update faster-whisper to 1.0.2 to enable model distil-large-v3 by @moritzbrantner in #814
  • latest faster-whisper support added by @Hasan-Naseer in #875
  • Working version with pyannote:3.3.2 and faster-whisper:1.1.0 by @ibombonato in #936
  • Add utilization to verbose flag by @H4CK3Rabhi in #759
  • Added local_files_only option on whisperx.load_model for offline mode by @RoqueGio in #867
  • adding cache_dir to wav2vec2 by @bnitsan in #681
  • feat: add basic installation test flow & restrict python versions by @Barabazs in #965
  • chore: add build and release workflow by @Barabazs in #966
  • fix: update README image source and enhance setup.py for long description by @Barabazs in #968
  • docs: update installation instructions in README by @Barabazs in #969
  • fix: add UTF-8 encoding when reading README.md by @xigh in #970
  • chore: loosen ctranslate2 version restriction & bump whisperX version by @Barabazs in #971
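The `local_files_only` option on `whisperx.load_model` listed above could be used roughly as follows. This is a sketch, not an excerpt from the project's documentation: the wrapper name `load_offline_model` and its default arguments are assumptions, and it presumes the model files are already in the local cache.

```python
def load_offline_model(model_name="large-v2", device="cpu", compute_type="int8"):
    """Load a Whisper model without attempting any network download.

    Hypothetical wrapper: the defaults here are illustrative only.
    """
    # Imported inside the function so this snippet can be defined
    # even where whisperx is not installed.
    import whisperx

    return whisperx.load_model(
        model_name,
        device,
        compute_type=compute_type,
        local_files_only=True,  # option added in this release, per the note above
    )
```

If the requested model is not cached locally, the load is expected to fail rather than fall back to downloading.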

Full Changelog: v3.2.0...v3.3.0

v3.2.0

18 Dec 08:03

Device and Language Support

Bug Fixes and Stability Improvements

Documentation Updates

Miscellaneous Changes

Full Changelog: v3.1.1...v3.2.0

v3.1.1

13 May 11:19
d8a2b4f

  • translate functionality added
  • fix word timestamp bug (words no longer have consecutive timestamps)

v3.1.0

07 May 19:34
1caddfb

  • 70x real time transcription, <8GB gpu memory requirement ⚡️⚡️
  • each transcript segment is a sentence (using nltk.sent_tokenize)
  • diarization now assigned per sentence (and outputted to srt)
  • clean up on alignment logic

v3.0.2

04 May 19:53

Pre-release

torch 2.0, Python 3.10