Can Noisy Cross-Utterance Contexts Help Speech-Recognition Error Correction? (IWSDS 2024)

This is the code to reproduce the CORAAL dataset that we processed and used in our paper.

Usage

Prerequisites: sox, wget installed

# Download and extract the CORAAL dataset
./download_coraal.sh coraal_download_list.txt .

# Truncate the dataset by utterance
python split_coraal.py ./extracted ./split

This will generate train / val / test jsonl files like below.

{
  "id": "ATL_se0_ag1_f_02_1",
  "utterances": [
    {
      "text": "okay",
      "asr": "",
      "audio": "split/ATL_se0_ag1_f_02_1/0000.wav",
      "start_time": 2210.787,
      "end_time": 2211.325
    },
    {
      "text": "my name is and",
      "asr": "",
      "audio": "split/ATL_se0_ag1_f_02_1/0004.wav",
      "start_time": 2210.787,
      "end_time": 2211.325
    },
    "..."
  ],
  "..."
}

You can use this splited utterances directly, or you can substitute all of your speech recognition results into the asr value of each utterance and use it as a Huggingface dataset like below.

import datasets

ds = datasets.load_dataset("../coraal", data_dir="../coraal", n_fronts=5, n_bodies=1, n_rears=5, oracle=True)

If oracle is true, the front and rear context becomes ground_truth.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
dataset_ids		dataset_ids
.DS_Store		.DS_Store
.gitignore		.gitignore
coraal.py		coraal.py
coraal_download_list.txt		coraal_download_list.txt
download_coraal.sh		download_coraal.sh
readme.md		readme.md
requirements.txt		requirements.txt
split_coraal.py		split_coraal.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Can Noisy Cross-Utterance Contexts Help Speech-Recognition Error Correction? (IWSDS 2024)

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

padomin/coraal-dataset

Folders and files

Latest commit

History

Repository files navigation

Can Noisy Cross-Utterance Contexts Help Speech-Recognition Error Correction? (IWSDS 2024)

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages