TEN Audio Testdata is a collection of speech data, containing various types of distortions appearing at different scenarios, especially at telecommunication and conferencing systems. These data are released to test speech algorithms, thereby helping to improve the quality of speech in conversational AI and enhance user experience.
TEN is a comprehensive open-source ecosystem for creating, customizing, and deploying real-time conversational AI agents with multimodal capabilities including voice, vision, and avatar interactions.
TEN includes TEN Framework, TEN Turn Detection, TEN VAD, TEN Agent, TMAN Designer, and TEN Portal.
Important
Star TEN Repositories ⭐️
Get instant notifications for new releases and updates. Your support helps us grow and improve TEN!
This dataset includes 530 test wavs, which are all 48kHz sampled, monophonic and with 16 bit-depth. Some from SIG-Challenge, some are real-recorded by ourselves. Developers can use this dataset to evaluate the speech enhancement algorithms they developed. SIGMOS and DNSMOS are the recommended objective evaluation tools for speech quality assessment. Of cource, these are just the tools estimating MOS score, if you would like to assess the speech quality subjectively, you may have to find some people to do the subjective listening test. Word accuracy rate is another evaluation metric, but we do not provide the transcript text and PRs for that is welcome.
| Project | Preview |
|---|---|
| 🏚️ TEN Framework TEN is an open-source framework for real-time, multimodal conversational AI. |
![]() |
| ️🔂 TEN Turn Detection TEN is for full-duplex dialogue communication. |
![]() |
| 🔉 TEN VAD TEN VAD is a low-latency, lightweight and high-performance streaming voice activity detector (VAD). |
![]() |
| 🎙️ TEN Agent TEN Agent is a showcase of TEN Framewrok. |
![]() |
| 🎨 TMAN Designer TMAN Designer is low/no code option to make a voice agent with easy to use workflow UI. |
![]() |
| 📒 TEN Portal The official site of TEN framework, it has documentation and blog. |
![]() |
This project is licensed under Apache 2.0 with certain conditions. Refer to the "LICENSE" file in the root directory for detailed information. Note that the test data contain audio from SIG-Challenge, which is MIT licensed, refer to the NOTICES file in the root directory for detailed information.






