Skip to content

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.

License

Notifications You must be signed in to change notification settings

you-and-you/flm-audio

 
 

Repository files navigation

FLM-Audio

[Hugging Face]

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity. It simultaneously listens, speaks, and composes internal monologue, delivering low‑latency, duplex conversational responses in both English and Chinese. FLM‑Audio is robust to noise and user interruptions, prioritizing responsiveness and naturalness.

Model Card

  • Language(s): Chinese; English;

Technical Report

Motivation & Survey: Toward Embodied AGI: A Review of Embodied AI and the Road Ahead

FLM-Audio Research Paper: FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training

Omnimodal System Card: RoboEgo System Card: An Omnimodal Model with Native Full Duplexity

Bias, Risks, and Limitations

Despite extensive data cleaning, FLM‑Audio may still produce undesired content (e.g., biased or offensive language). Users should not disseminate unsafe outputs. Project authors are not responsible for misuse or harmful consequences.

Quick Start

Run the Server

# install dependencies
pip install -r requirements-server.txt
python -m flmaudio.server --port 8990

Start the Web UI

# install dependencies
pip install -r requirements-clientgui.txt
python -m flmaudio.client_gradio --url http://localhost:8990

Start the CLI

# install dependencies
pip install -r requirements-clientcli.txt
python -m flmaudio.client --url http://localhost:8990

License

FLM-Audio is licensed under the Apache License 2.0, except for python code under third_party/moshi, which is licensed under the MIT License. This project is intended for research use only in compliance with applicable laws. For commercial use, please contact us.

About

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%