FLM-Audio

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity. It simultaneously listens, speaks, and composes internal monologue, delivering low‑latency, duplex conversational responses in both English and Chinese. FLM‑Audio is robust to noise and user interruptions, prioritizing responsiveness and naturalness.

Model Card

Language(s): Chinese; English;

Technical Report

Motivation & Survey: Toward Embodied AGI: A Review of Embodied AI and the Road Ahead

FLM-Audio Research Paper: FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training

Omnimodal System Card: RoboEgo System Card: An Omnimodal Model with Native Full Duplexity

Bias, Risks, and Limitations

Despite extensive data cleaning, FLM‑Audio may still produce undesired content (e.g., biased or offensive language). Users should not disseminate unsafe outputs. Project authors are not responsible for misuse or harmful consequences.

Quick Start

Run the Server

# install dependencies
pip install -r requirements-server.txt
python -m flmaudio.server --port 8990

Start the Web UI

# install dependencies
pip install -r requirements-clientgui.txt
python -m flmaudio.client_gradio --url http://localhost:8990

Start the CLI

# install dependencies
pip install -r requirements-clientcli.txt
python -m flmaudio.client --url http://localhost:8990

License

FLM-Audio is licensed under the Apache License 2.0, except for python code under third_party/moshi, which is licensed under the MIT License. This project is intended for research use only in compliance with applicable laws. For commercial use, please contact us.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
flmaudio		flmaudio
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements-clientcli.txt		requirements-clientcli.txt
requirements-clientgui.txt		requirements-clientgui.txt
requirements-server.txt		requirements-server.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FLM-Audio

Model Card

Technical Report

Bias, Risks, and Limitations

Quick Start

Run the Server

Start the Web UI

Start the CLI

License

About

Uh oh!

Releases

Packages

Languages

License

you-and-you/flm-audio

Folders and files

Latest commit

History

Repository files navigation

FLM-Audio

Model Card

Technical Report

Bias, Risks, and Limitations

Quick Start

Run the Server

Start the Web UI

Start the CLI

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages