Skip to content

TEN-framework/ten-audio-testdata

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

TEN Audio Testdata is a collection of speech data, containing various types of distortions appearing at different scenarios, especially at telecommunication and conferencing systems. These data are released to test speech algorithms, thereby helping to improve the quality of speech in conversational AI and enhance user experience.


What is TEN

TEN is a comprehensive open-source ecosystem for creating, customizing, and deploying real-time conversational AI agents with multimodal capabilities including voice, vision, and avatar interactions.

TEN includes TEN Framework, TEN Turn Detection, TEN VAD, TEN Agent, TMAN Designer, and TEN Portal.


Community Channel Purpose
Follow on X Follow TEN Framework on X for updates and announcements
Follow on LinkedIn Follow TEN Framework on LinkedIn for updates and announcements
Discord TEN Community Join our Discord community to connect with developers
Hugging Face Space Join our Hugging Face community to explore our spaces and models
WeChat Join our WeChat group for Chinese community discussions

Important

Star TEN Repositories ⭐️

Get instant notifications for new releases and updates. Your support helps us grow and improve TEN!


TEN star us gif


Dataset Description

This dataset includes 530 test wavs, which are all 48kHz sampled, monophonic and with 16 bit-depth. Some from SIG-Challenge, some are real-recorded by ourselves. Developers can use this dataset to evaluate the speech enhancement algorithms they developed. SIGMOS and DNSMOS are the recommended objective evaluation tools for speech quality assessment. Of cource, these are just the tools estimating MOS score, if you would like to assess the speech quality subjectively, you may have to find some people to do the subjective listening test. Word accuracy rate is another evaluation metric, but we do not provide the transcript text and PRs for that is welcome.


🌏 TEN Ecosystem

Project Preview
🏚️ TEN Framework
TEN is an open-source framework for real-time, multimodal conversational AI.

️🔂 TEN Turn Detection
TEN is for full-duplex dialogue communication.

🔉 TEN VAD
TEN VAD is a low-latency, lightweight and high-performance streaming voice activity detector (VAD).

🎙️ TEN Agent
TEN Agent is a showcase of TEN Framewrok.

🎨 TMAN Designer
TMAN Designer is low/no code option to make a voice agent with easy to use workflow UI.

📒 TEN Portal
The official site of TEN framework, it has documentation and blog.


License

This project is licensed under Apache 2.0 with certain conditions. Refer to the "LICENSE" file in the root directory for detailed information. Note that the test data contain audio from SIG-Challenge, which is MIT licensed, refer to the NOTICES file in the root directory for detailed information.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published