-
Notifications
You must be signed in to change notification settings - Fork 144
Description
This started happening with the new version, not a problem with the prior version. processing multiple files using one click.bat. When I resubmit the erroed file in the oneclick it's fine I have 12gb VRAM on my nvidia card it works fine. I am suspisious that the the vram is not released when the 1st file completed.
I am using this command
"%dp%faster-whisper-xxl.exe" %file_list% -pp -o source --batch_recursive --check_files --standard -f json srt -m large-v2 --vad_method pyannote_v3 --realign --task translate --ff_vocal_extract mb-roformer --roformer_vram 8 --language ja
This is happening on the next file after the 1st file successfully completed.
MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00
Traceback (most recent call last):
File "main.py", line 212, in ffmpeg_audio
File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
File "faster_whisper\roformer_infer.py", line 98, in demix_track
File "<array_function internals>", line 180, in nan_to_num
File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool
Standalone Faster-Whisper-XXL Pro r3.256.1 running on: CUDA
Starting to process: O:\Avant Browser\data\Video1
Audio filtering is in progress...
Processing Part 1 out of 3:
MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00
Processing Part 2 out of 3:
MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00
Processing Part 3 out of 3:
MB-RoFormer model running on CUDA: 100% | 590/590 | 03:53<<00:00
Starting sequential faster-whisper inference.
Matplotlib is building the font cache; this may take a moment.
100% | 13732/13732 | 18:41<<00:00 | 12.25 audio seconds/s
Transcription speed: 12.25 audio seconds/s
Starting realignment.
Realignment finished in: 0:00:20.931
Subtitles are written to 'O:\Avant Browser\data' directory.
Starting to process: O:\Avant Browser\data\Video2
Audio filtering is in progress...
Processing Part 1 out of 3:
MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00
Traceback (most recent call last):
File "main.py", line 212, in ffmpeg_audio
File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
File "faster_whisper\roformer_infer.py", line 98, in demix_track
File "<array_function internals>", line 180, in nan_to_num
File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool
Press any key to continue . . .