You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -238,6 +241,49 @@ You can also choose the theme you like best. It can be dark, light, or the one c
238
241
- Yoruba
239
242
</details>
240
243
244
+
### Supported File Types
245
+
246
+
<details>
247
+
<summary>Audio file formats</summary>
248
+
249
+
-`.mp3`
250
+
-`.mpeg`
251
+
-`.wav`
252
+
-`.wma`
253
+
-`.aac`
254
+
-`.flac`
255
+
-`.ogg`
256
+
-`.oga`
257
+
-`.opus`
258
+
</details>
259
+
260
+
<details>
261
+
<summary>Video file formats</summary>
262
+
263
+
-`.mp4`
264
+
-`.m4a`
265
+
-`.m4v`
266
+
-`.f4v`
267
+
-`.f4a`
268
+
-`.m4b`
269
+
-`.m4r`
270
+
-`.f4b`
271
+
-`.mov`
272
+
-`.avi`
273
+
-`.webm`
274
+
-`.flv`
275
+
-`.mkv`
276
+
-`.3gp`
277
+
-`.3gp2`
278
+
-`.3g2`
279
+
-`.3gpp`
280
+
-`.3gpp2`
281
+
-`.ogv`
282
+
-`.ogx`
283
+
-`.wmv`
284
+
-`.asf`
285
+
</details>
286
+
241
287
<!-- PROJECT STRUCTURE -->
242
288
243
289
### Project Structure
@@ -399,7 +445,6 @@ You can also choose the theme you like best. It can be dark, light, or the one c
399
445
source venv/Scripts/activate
400
446
```
401
447
5. Run `cat requirements.txt | xargs -n 1 pip install` to install the dependencies.
402
-
> [!WARNING]
403
448
>For some reason, `pip install -r requirements.txt` throws the error "Could not find a version that satisfies the requirement [PACKAGE_NAME]==[PACKAGE_VERSION] (from version: none)"
404
449
6. Run `python src/app.py` to start the program.
405
450
@@ -433,54 +478,66 @@ Once you open the **Audiotext** executable file (explained in the [getting start
433
478
434
479
### Transcribe From
435
480
436
-
You can transcribe from three audio sources:
481
+
You can transcribe from four sources:
437
482
438
483
-**File** (see image above): Click on the file explorer icon to select the file you want to transcribe. You can also manually enter the path to the file into the input field. You can transcribe audio from both audio and video files. Note that the file explorer has the `All supported files` option selected by default. To select only audio files or video files, click the combo box in the lower right corner of the file explorer to change the file type, as marked in red in the following image:
439
484
440
485

441
486
442
487

443
488
444
-
<details>
445
-
<summary>Supported audio file formats</summary>
446
-
447
-
-`.mp3`
448
-
-`.mpeg`
449
-
-`.wav`
450
-
-`.wma`
451
-
-`.aac`
452
-
-`.flac`
453
-
-`.ogg`
454
-
-`.oga`
455
-
-`.opus`
456
-
</details>
457
-
458
-
<details>
459
-
<summary>Supported video file formats</summary>
460
-
461
-
-`.mp4`
462
-
-`.m4a`
463
-
-`.m4v`
464
-
-`.f4v`
465
-
-`.f4a`
466
-
-`.m4b`
467
-
-`.m4r`
468
-
-`.f4b`
469
-
-`.mov`
470
-
-`.avi`
471
-
-`.webm`
472
-
-`.flv`
473
-
-`.mkv`
474
-
-`.3gp`
475
-
-`.3gp2`
476
-
-`.3g2`
477
-
-`.3gpp`
478
-
-`.3gpp2`
479
-
-`.ogv`
480
-
-`.ogx`
481
-
-`.wmv`
482
-
-`.asf`
483
-
</details>
489
+
-**Directory**: Click on the file explorer icon to select the directory with the files you want to transcribe. You can also manually enter the path to the directory into the input field. All supported video and audio files from the root of the directory and its subdirectories will be transcribed. Note that the `Autosave` option is checked and cannot be unchecked because each file's transcription will automatically be saved in the same path as the source file.
490
+
491
+
<picture>
492
+
<source
493
+
srcset="docs/light/from-directory.png"
494
+
media="(prefers-color-scheme: light)"
495
+
/>
496
+
<source
497
+
srcset="docs/dark/from-directory.png"
498
+
media="(prefers-color-scheme: dark)"
499
+
/>
500
+
<img
501
+
src="docs/light/from-directory.png"
502
+
alt="Main"
503
+
>
504
+
</picture>
505
+
506
+
For example, let's use this directory as a reference:
507
+
508
+
```
509
+
└───files-to-transcribe
510
+
│ paranoid-android.mp3
511
+
│ the-past-recedes.flac
512
+
│
513
+
└───movies
514
+
seul-contre-tous.mp4
515
+
mulholland-dr.avi
516
+
```
517
+
518
+
After transcribing the `files-to-transcribe` directory with subtitles, the folder structure will look like this:
519
+
520
+
```
521
+
└───files-to-transcribe
522
+
│ paranoid-android.mp3
523
+
│ paranoid-android.srt
524
+
│ paranoid-android.txt
525
+
│ paranoid-android.vtt
526
+
│ the-past-recedes.flac
527
+
│ the-past-recedes.srt
528
+
│ the-past-recedes.txt
529
+
│ the-past-recedes.vtt
530
+
│
531
+
└───movies
532
+
seul-contre-tous-1998.mp4
533
+
seul-contre-tous-1998.srt
534
+
seul-contre-tous-1998.txt
535
+
seul-contre-tous-1998.vtt
536
+
mulholland-dr-2001.avi
537
+
mulholland-dr-2001.srt
538
+
mulholland-dr-2001.txt
539
+
mulholland-dr-2001.vtt
540
+
```
484
541
485
542
-**Microphone**: To start recording, simply click the `Start recording` button to begin the process. The text of the button will change to `Stop recording` and its color will change to red. Click it to stop recording and generate the transcription.
486
543
@@ -491,8 +548,6 @@ You can transcribe from three audio sources:
-**YouTube video**: Enter the video URL in the upper input field. When finished, click on the `Generate transcription` button.
497
552
498
553
<picture>
@@ -512,9 +567,32 @@ You can transcribe from three audio sources:
512
567
513
568
### Save Transcription
514
569
515
-
Once the program has generated the transcription, you'll see a green `Save transcription` button below the text box. If you click on it, you'll be prompted for a file explorer where you can give the file a name and select the path where you want to save it. The file extension is `.txt` by default, but you can change it to any other text file type.
570
+
When you click on the `Save transcription` button, you'll be prompted for a file explorer where you can name the transcription file and select the path where you want to save it. The file extension is `.txt` by default, but you can change it to any other text file type.
571
+
572
+
If you use **WhisperX** to generate a transcription and check the `Generate subtitles` option, two files will also be saved along with the text file: a `.vtt` file and a `.srt` file. Both contain the subtitles for the transcribed file, as explained in the [Generate Subtitles](#generate-subtitles) section.
573
+
574
+
Please note that any text entered or modified in the textbox **WILL NOT** be included in the saved transcription.
575
+
576
+
#### Autosave
577
+
578
+
If checked, the transcription will automatically be saved in the root of the folder where the transcribed file is stored. If you check the `Generate subtitles` option, the subtitle files will also be saved automatically. If there are already existing files with the same name, they won't be overwritten. To do that, you'll need to check the `Overwrite existing files` option (see below).
516
579
517
-
If you used **WhisperX** to generate the transcription and checked the `Generate subtitles` option, you'll notice that two files are also saved along with the text file: a `.vtt` file and a `.srt` file. Both contain the subtitles for the transcribed file, as explained in the [Generate Subtitles](#generate-subtitles) section.
580
+
#### Overwrite Existing Files
581
+
582
+
This option can only be checked if the `Autosave` option is checked. If `Overwrite existing files` is checked, existing transcriptions in the root directory of the file to be transcribed will be overwritten when saving.
583
+
584
+
For example, let's use this directory as a reference:
585
+
586
+
```
587
+
└───audios
588
+
foo.mp3
589
+
foo.srt
590
+
foo.txt
591
+
```
592
+
593
+
If we transcribe the audio file `foo.mp3` with the `Generate subtitles`, `Autosave` and `Overwrite existing files` options checked, the files `foo.srt` and `foo.txt` will be overwritten and the file `foo.vtt` will be created.
594
+
595
+
On the other hand, if we transcribe the audio file `foo.mp3` with the options `Generate subtitles` and `Autosave` checked and the option `Overwrite existing files` unchecked, the file `foo.vtt` will still be created, but the files `foo.srt` and `foo.txt` will remain unchanged.
518
596
519
597
### Transcribe Using
520
598
@@ -548,22 +626,16 @@ The **WhisperX** options appear when the selected transcription method is **Whis
548
626
549
627
To translate the audio into English, simply check the `Translate to English` checkbox before generating the transcription, as shown in the video below.
However, there is another unofficial way to translate audio into any supported language by setting the `Audio language` to the target translation language. For example, if the audio is in English and you want to translate it into Spanish, you would set the `Audio language` to "Spanish".
0 commit comments