GitHub - TEN-framework/ten-audio-testdata

Introduction

TEN Audio Testdata is a collection of speech data containing various types of distortion that occur in different scenarios, especially in telecommunication and conferencing systems. The data are released for testing speech algorithms, to help improve speech quality in conversational AI and enhance the user experience.

What is TEN

TEN is a comprehensive open-source ecosystem for creating, customizing, and deploying real-time conversational AI agents with multimodal capabilities including voice, vision, and avatar interactions.

TEN includes TEN Framework, TEN Turn Detection, TEN VAD, TEN Agent, TMAN Designer, and TEN Portal.

Community Channel	Purpose
X	Follow TEN Framework on X for updates and announcements
LinkedIn	Follow TEN Framework on LinkedIn for updates and announcements
Discord	Join our Discord community to connect with developers
Hugging Face	Join our Hugging Face community to explore our spaces and models
WeChat	Join our WeChat group for Chinese community discussions

Important

Star TEN Repositories ⭐️

Get instant notifications for new releases and updates. Your support helps us grow and improve TEN!

Dataset Description

This dataset includes 530 test WAV files, all mono, sampled at 48 kHz with 16-bit depth. Some come from SIG-Challenge; others are real recordings we made ourselves. Developers can use this dataset to evaluate the speech enhancement algorithms they have developed. SIGMOS and DNSMOS are the recommended objective evaluation tools for speech quality assessment. Of course, these tools only estimate MOS scores; to assess speech quality subjectively, you will need to recruit listeners for a subjective listening test. Word accuracy rate is another evaluation metric, but we do not provide transcripts; PRs that add them are welcome.
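Before running an enhancement algorithm over the files, it can be useful to confirm that every file matches the format stated above. The following is a minimal sketch using only the Python standard library. The function names are hypothetical, and it assumes the files are plain PCM WAV files that the `wave` module can open and that they use a lowercase `.wav` extension.

```python
import wave
from pathlib import Path

# Format stated in the dataset description: 48 kHz, mono, 16-bit (2 bytes per sample).
EXPECTED = {"sample_rate": 48000, "channels": 1, "sample_width_bytes": 2}


def wav_format(path):
    """Return the sample rate, channel count and sample width of a PCM WAV file."""
    with wave.open(str(path), "rb") as wf:
        return {
            "sample_rate": wf.getframerate(),
            "channels": wf.getnchannels(),
            "sample_width_bytes": wf.getsampwidth(),
        }


def find_mismatches(root):
    """Map each .wav file under root whose format differs from EXPECTED to its actual format."""
    mismatches = {}
    for path in sorted(Path(root).rglob("*.wav")):
        fmt = wav_format(path)
        if fmt != EXPECTED:
            mismatches[str(path)] = fmt
    return mismatches
```

Calling `find_mismatches("test_data_for_developers")` from the repository root should return an empty dictionary if all files match the description. That the WAV files sit under that folder is an assumption; adjust the path to wherever the data was downloaded.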

🌏 TEN Ecosystem

Project	Preview
🏚️ TEN Framework	TEN Framework is an open-source framework for real-time, multimodal conversational AI.
🔂 TEN Turn Detection	TEN Turn Detection is for full-duplex dialogue communication.
🔉 TEN VAD	TEN VAD is a low-latency, lightweight and high-performance streaming voice activity detector (VAD).
🎙️ TEN Agent	TEN Agent is a showcase of TEN Framework.
🎨 TMAN Designer	TMAN Designer is a low/no-code option for building a voice agent with an easy-to-use workflow UI.
📒 TEN Portal	The official site of TEN Framework, with documentation and a blog.

License

This project is licensed under Apache 2.0 with certain conditions. Refer to the "LICENSE" file in the root directory for detailed information. Note that the test data contain audio from SIG-Challenge, which is MIT licensed; refer to the "NOTICES" file in the root directory for details.

Name
test_data_for_developers
LICENSE
NOTICES
README.md
