Break down any song into individual stems for vocals, drums, bass, guitar, piano, and more.
Convert audio and video to text with AI-powered transcription, subtitles, and lyrics generation.
Powered by Demucs and Whisper
from Facebook Research and OpenAI.
Features β’ How It Works β’ AI Models β’ Transcription β’ Screenshots β’ Tech Stack
| π― High Quality | Powered by Demucs and Whisper AI from Facebook Research and OpenAI |
| β‘ Fast Processing | No need to install anything β just upload your files |
| π¨ Multiple Options | Stem separation (2, 4, or 6 stems) and speech transcription |
| π Transcription | Convert speech to text, generate subtitles, or extract lyrics |
| π Privacy First | Files automatically deleted after 24 hours |
| π° Pay As You Go | No subscriptions β start with 3 free credits |
| π API Access | Integrate processing into your workflow (coming soon) |
|
Upload any audio or video file β MP3, WAV, FLAC, MP4, and more. We support all common formats. |
Select the AI model and processing type: stem separation, transcription, subtitles, or lyrics. |
Get your separated stems in MP3 or high-quality WAV format. |
Multiple spectrogram/waveform separation models are available through Demucs for different tasks.
Separate your music into:
- π€ Vocals
- π₯ Drums
- πΈ Bass
- πΉ Other instruments
Models: htdemucs, htdemucs_ft
Get even more control with:
- π€ Vocals
- π₯ Drums
- πΈ Bass
- πΈ Guitar
- πΉ Piano
- π΅ Other
Model: htdemucs_6s
Quick isolation of a single element:
- π€ Vocals + Everything Else
- π₯ Drums + Everything Else
- πΈ Bass + Everything Else
Models: htdemucs, htdemucs_ft, htdemucs_6s
Powered by OpenAI's Whisper model, Mux Minus now supports speech-to-text transcription with multiple output formats.
Convert speech from audio or video files to plain text:
- π Automatic language detection
- π Supports 99+ languages
- π Output: Plain text (.txt)
Get transcription with precise timestamps:
- β±οΈ Segment-level timestamps
- π JSON format with metadata
- π¬ Perfect for video chapters
Generate subtitle files for videos:
- πΊ SRT format (SubRip)
- π WebVTT format
- β Ready for video players
Extract timestamped lyrics from songs:
- π€ Two-step pipeline: vocals isolation + transcription
- π΅ LRC format with timestamps
- πΌ Better accuracy than transcribing full mix
- π Includes isolated vocals audio file
Supported Formats:
- Audio: MP3, WAV, FLAC, OGG, M4A, AAC
- Video: MP4, MKV, AVI, MOV, WebM
- File size limit: 5GB
Mux Minus is built on top of Demucs, an open-source audio source separation model created by Facebook AI Research (FAIR).
Demucs uses a hybrid approach combining waveform and spectrogram processing with deep learning to achieve state-of-the-art results in music source separation.
If you'd prefer to run Demucs locally on your own computer, you can! Here's how:
# Install with pip
pip install demucs
# Or with conda
conda install -c conda-forge demucs# Separate a song into 4 stems (vocals, drums, bass, other)
demucs your-song.mp3
# Use the 6-stem model (adds guitar and piano)
demucs --two-stems=vocals your-song.mp3 # Just vocals + accompaniment
demucs -n htdemucs_6s your-song.mp3 # Full 6-stem separationBy default, Demucs creates a separated folder with subfolders for each model
and track:
separated/
βββ htdemucs/
βββ your-song/
βββ vocals.wav
βββ drums.wav
βββ bass.wav
βββ other.wav
For full documentation, visit the Demucs GitHub repository.
Mux Minus is built with modern, production-ready technologies:
| Technology | Purpose |
|---|---|
| Django | Web framework & templating |
| Vanilla JS | Interactive components |
| WaveSurfer.js | Waveform visualization & audio playback |
| CSS3 | Modern styling with CSS variables |
| Technology | Purpose |
|---|---|
| Django | REST API, user management, administration |
| FastAPI | Internal backend service for job processing |
| Demucs | AI-powered audio separation |
| PostgreSQL | Production database |
| SQLite | Development database |
| Technology | Purpose |
|---|---|
| Docker | Containerization |
| Docker Compose | Multi-container orchestration |
| WhiteNoise | Static file serving |
| Traefik | Reverse proxy (production) |
| Technology | Purpose |
|---|---|
| Square | Payment processing |
This project is open source. See the LICENSE file for details.
Built with β€οΈ using Demucs and Copilot (Claude Opus 4.5)



