Faster-Whisper-XXL Pro r3.256.1 MB-RoFormer Array Memory Error #529

This started happening with the new version; it was not a problem with the prior version. I am processing multiple files using one click.bat. When I resubmit the errored file through the same batch file, it works fine. I have 12 GB of VRAM on my NVIDIA card. I suspect that the VRAM is not released when the 1st file completes.

I am using this command:

"%dp%faster-whisper-xxl.exe" %file_list% -pp -o source --batch_recursive --check_files --standard -f json srt -m large-v2 --vad_method pyannote_v3 --realign --task translate --ff_vocal_extract mb-roformer --roformer_vram 8 --language ja

The error occurs on the next file, after the 1st file has completed successfully.

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00

Traceback (most recent call last):
  File "__main__.py", line 212, in ffmpeg_audio
  File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
  File "faster_whisper\roformer_infer.py", line 98, in demix_track
  File "<__array_function__ internals>", line 180, in nan_to_num
  File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool
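
As a sanity check on the numbers in the traceback, the requested size can be reproduced from the reported shape and dtype. This is only arithmetic on the values in the error message, not anything taken from the program's code. One thing it may help to keep in mind: the failing call is NumPy's nan_to_num, and NumPy arrays are allocated in system RAM rather than VRAM, so host memory may be worth checking alongside VRAM.

```python
import math

# Shape and dtype exactly as reported in the _ArrayMemoryError message.
shape = (1, 2, 225086400)
itemsize = 1  # NumPy stores a bool as one byte per element

n_bytes = math.prod(shape) * itemsize
mib = n_bytes / 2**20  # 1 MiB = 2**20 bytes

print(f"{n_bytes} bytes = {mib:.1f} MiB")
```

The result matches the "429. MiB" in the message, so the failed request is a single temporary boolean mask of that size, not the whole audio buffer.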

Standalone Faster-Whisper-XXL Pro r3.256.1 running on: CUDA


Starting to process: O:\Avant Browser\data\Video1


Audio filtering is in progress...

Processing Part 1 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00

Processing Part 2 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00

Processing Part 3 out of 3:

MB-RoFormer model running on CUDA: 100% | 590/590 | 03:53<<00:00

Starting sequential faster-whisper inference.

Matplotlib is building the font cache; this may take a moment.
100% | 13732/13732 | 18:41<<00:00 | 12.25 audio seconds/s

Transcription speed: 12.25 audio seconds/s


Starting realignment.
Realignment finished in:  0:00:20.931

Subtitles are written to 'O:\Avant Browser\data' directory.



Starting to process: O:\Avant Browser\data\Video2


Audio filtering is in progress...

Processing Part 1 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00

Traceback (most recent call last):
  File "__main__.py", line 212, in ffmpeg_audio
  File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
  File "faster_whisper\roformer_infer.py", line 98, in demix_track
  File "<__array_function__ internals>", line 180, in nan_to_num
  File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool
Press any key to continue . . .
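
Since resubmitting the failed file on its own works, one possible workaround (a sketch, not a confirmed fix) is to start a fresh process for every file so that all memory is returned to the OS between files. The flags below are copied from the command in this report; the executable path, the file list, and the assumption that the program exits with a nonzero code on failure are hypothetical and untested.

```python
import subprocess

# Flags copied verbatim from the command used in this report.
FLAGS = [
    "-pp", "-o", "source", "--batch_recursive", "--check_files",
    "--standard", "-f", "json", "srt", "-m", "large-v2",
    "--vad_method", "pyannote_v3", "--realign", "--task", "translate",
    "--ff_vocal_extract", "mb-roformer", "--roformer_vram", "8",
    "--language", "ja",
]


def build_command(exe, media_file):
    """Return the argument list for processing a single file."""
    return [str(exe), str(media_file), *FLAGS]


def run_one_per_process(exe, media_files):
    """Run the executable once per file; return the files that failed.

    Assumption: a failed run ends with a nonzero exit code.
    """
    failed = []
    for media_file in media_files:
        result = subprocess.run(build_command(exe, media_file))
        if result.returncode != 0:
            failed.append(media_file)
    return failed
```

Calling run_one_per_process with the path to faster-whisper-xxl.exe and a list of file paths would process the files one at a time and return the ones that need a retry.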

