Skip to content

Faster-Whisper-XXL Pro r3.256.1 MB-RoFormer Aray Memory Error #529

@billz2

Description

@billz2

This started happening with the new version, not a problem with the prior version. processing multiple files using one click.bat. When I resubmit the erroed file in the oneclick it's fine I have 12gb VRAM on my nvidia card it works fine. I am suspisious that the the vram is not released when the 1st file completed.

I am using this command

"%dp%faster-whisper-xxl.exe" %file_list% -pp -o source --batch_recursive --check_files --standard -f json srt -m large-v2 --vad_method pyannote_v3 --realign --task translate --ff_vocal_extract mb-roformer --roformer_vram 8 --language ja

This is happening on the next file after the 1st file successfully completed.

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00

Traceback (most recent call last):
File "main.py", line 212, in ffmpeg_audio
File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
File "faster_whisper\roformer_infer.py", line 98, in demix_track
File "<array_function internals>", line 180, in nan_to_num
File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool

Standalone Faster-Whisper-XXL Pro r3.256.1 running on: CUDA

Starting to process: O:\Avant Browser\data\Video1

Audio filtering is in progress...

Processing Part 1 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00

Processing Part 2 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:38<<00:00

Processing Part 3 out of 3:

MB-RoFormer model running on CUDA: 100% | 590/590 | 03:53<<00:00

Starting sequential faster-whisper inference.

Matplotlib is building the font cache; this may take a moment.
100% | 13732/13732 | 18:41<<00:00 | 12.25 audio seconds/s

Transcription speed: 12.25 audio seconds/s

Starting realignment.
Realignment finished in: 0:00:20.931

Subtitles are written to 'O:\Avant Browser\data' directory.

Starting to process: O:\Avant Browser\data\Video2

Audio filtering is in progress...

Processing Part 1 out of 3:

MB-RoFormer model running on CUDA: 100% | 851/851 | 05:37<<00:00

Traceback (most recent call last):
File "main.py", line 212, in ffmpeg_audio
File "faster_whisper\roformer_infer.py", line 248, in RoFormer_separator
File "faster_whisper\roformer_infer.py", line 98, in demix_track
File "<array_function internals>", line 180, in nan_to_num
File "numpy\lib\type_check.py", line 515, in nan_to_num
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 429. MiB for an array with shape (1, 2, 225086400) and data type bool
Press any key to continue . . .

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions