A transcription service (and GUI client) to use Faster-Whisper-XXL as a service #431
fwaris
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Wrote a small utility service to transcribe and diarize video (.mp4) files via Faster-Whisper-XXL.
The service is meant to run on a GPU-enabled box. A small GPU box is enough (e.g. a T1 instance on Azure).
Transcription 'jobs' can be submitted via a GUI client app. The app uploads local .mp4 files; triggers transcription; and downloads resulting text (.vtt) files containing the transcriptions. SSH connectivity is used for secure data exchange.
The app enables non-technical users to easily and securely use Faster-Whisper-XXL as-a-service.
Note: GPU infrastructure is meant to be shareable by multiple users. The jobs are executed serially so only one job runs at a time to keep the compute requirements low.
https://github.com/fwaris/TranscriptionAndDiarization
Beta Was this translation helpful? Give feedback.
All reactions