You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+25-11Lines changed: 25 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,15 +28,17 @@ developers to train custom multimodal large language model (MLLM), focusing on <
28
28
6.[Citation](#citation)
29
29
30
30
# News
31
-
-[Update Jun. 12, 2024] Recipes for [MaLa-ASR](examples/mala_asr_slidespeech/README.md) has been supported.
31
+
-[Update Sep. 28, 2024] Recipes for [CoT-ST](examples/st_covost2/README.md) have been supported.
32
+
-[Update Sep. 25, 2024] Recipes for [DRCap](examples/drcap_zeroshot_aac/README.md) have been supported.
33
+
-[Update Jun. 12, 2024] Recipes for [MaLa-ASR](examples/mala_asr_slidespeech/README.md) have been supported.
32
34
-**[CALL FOR EXAMPLE]** We sincerely invite developers and researchers to develop new applications, conduct academic research based on SLAM-LLM, and pull request your examples! We also acknowledge engineering PR (such as improving and speeding up multi-node training).
33
35
-[Update May. 22, 2024] Please join [slack](https://join.slack.com/t/slam-llm/shared_invite/zt-2mc0pkhhs-5jjOi8Cwc8R1Xc8IQmykDA) or [WeChat group](./docs/Wechat.jpg). We will sync our updates and Q&A here.
34
-
-[Update May. 21, 2024] Recipes for [Spatial Audio Understanding](examples/seld_spatialsoundqa/README.md)has been supported.
35
-
-[Update May. 20, 2024] Recipes for [music caption (MC)](examples/mc_musiccaps/README.md)has been supported.
36
-
-[Update May. 8, 2024] Recipes for [visual speech recognition (VSR)](examples/vsr_LRS3/README.md)has been supported.
37
-
-[Update May. 4, 2024] Recipes for [zero-shot text-to-speech (TTS)](examples/vallex/README.md)has been supported.
38
-
-[Update Apr. 28, 2024] Recipes for [automated audio captioning (AAC)](examples/aac_audiocaps/README.md)has been supported.
39
-
-[Update Mar. 31, 2024] Recipes for [automatic speech recognition (ASR)](examples/asr_librispeech/README.md)has been supported.
36
+
-[Update May. 21, 2024] Recipes for [Spatial Audio Understanding](examples/seld_spatialsoundqa/README.md)have been supported.
37
+
-[Update May. 20, 2024] Recipes for [music caption (MC)](examples/mc_musiccaps/README.md)have been supported.
38
+
-[Update May. 8, 2024] Recipes for [visual speech recognition (VSR)](examples/vsr_LRS3/README.md)have been supported.
39
+
-[Update May. 4, 2024] Recipes for [zero-shot text-to-speech (TTS)](examples/vallex/README.md)have been supported.
40
+
-[Update Apr. 28, 2024] Recipes for [automated audio captioning (AAC)](examples/aac_audiocaps/README.md)have been supported.
41
+
-[Update Mar. 31, 2024] Recipes for [automatic speech recognition (ASR)](examples/asr_librispeech/README.md)have been supported.
40
42
41
43
# Installation
42
44
```bash
@@ -75,12 +77,24 @@ docker run -it --gpus all --name slam --shm-size=256g slam-llm:latest /bin/bash
75
77
## List of Recipes
76
78
We provide reference implementations of various LLM-based speech, audio, and music tasks:
Copy file name to clipboardExpand all lines: examples/st_covost2/README.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -59,10 +59,10 @@ bash infer.sh
59
59
## Citation
60
60
You can refer to the paper for more results.
61
61
```
62
-
@article{ma2024embarrassingly,
63
-
title={An Embarrassingly Simple Approach for LLM with Strong ASR Capacity},
64
-
author={Ma, Ziyang and Yang, Guanrou and Yang, Yifan and Gao, Zhifu and Wang, Jiaming and Du, Zhihao and Yu, Fan and Chen, Qian and Zheng, Siqi and Zhang, Shiliang and others},
65
-
journal={arXiv preprint arXiv:2402.08846},
62
+
@article{du2024cot,
63
+
title={CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought},
64
+
author={Yexing Du, Ziyang Ma, Yifan Yang, Keqi Deng, Xie Chen, Bo Yang, Yang Xiang, Ming Liu, Bing Qin},
0 commit comments