Commit f42716d

update README
1 parent 80cc33f commit f42716d

1 file changed: +11 -0 lines changed

README.md

Lines changed: 11 additions & 0 deletions
@@ -28,6 +28,7 @@ developers to train custom multimodal large language model (MLLM), focusing on <
 6. [Citation](#citation)
 
 # News
+- [Update Nov. 17, 2024] Recipes for [LLM-Based Contextual ASR](examples/contextual_asr/README.md) have been supported.
 - [Update Nov. 5, 2024] Recipes for [speech emotion captioning (SEC)](examples/sec_emotioncaps/README.md) with [emotion2vec](https://github.com/ddlBoJack/emotion2vec) as the encoder has been supported.
 - [Update Oct. 12, 2024] Recipes for [SLAM-AAC](examples/slam_aac/README.md) with [EAT](https://github.com/cwx-worst-one/EAT) as the encoder have been supported.
 - [Update Sep. 28, 2024] Recipes for [CoT-ST](examples/st_covost2/README.md) have been supported.
@@ -84,6 +85,7 @@ We provide reference implementations of various LLM-based speech, audio, and mus
 
 - Contextual Automatic Speech Recognition (CASR)
 - [ Mala-ASR](examples/mala_asr_slidespeech/README.md)
+- [LLM-Based Contextual ASR](examples/contextual_asr/README.md)
 
 - [Visual Speech Recognition (VSR)](examples/vsr_LRS3/README.md)
 - Speech-to-Text Translation (S2TT)
@@ -142,6 +144,15 @@ Mala-ASR:
 year={2024}
 }
 ```
+LLM-Based Contextual ASR:
+```
+@article{yang2024ctc,
+title={CTC-Assisted LLM-Based Contextual ASR},
+author={Yang, Guanrou and Ma, Ziyang and Gao, Zhifu and Zhang, Shiliang and Chen, Xie},
+journal={Proc. SLT},
+year={2024}
+}
+```
 CoT-ST:
 ```
 @article{du2024cot,
