your voice is an overpowered and underrated tool for meditation and inner work.
aloud. is a meditation facilitator that listens and responds to your voice. it can be a partner for somatic exploration, parts work, and spaced noting. it uses your mic for voice input, whisper.cpp for speech recognition, an LLM to guide you, and speaks using text-to-speech.
aloud works on macOS, Linux, and Windows. choose your LLM — run fully local and private with ollama, use a claude subscription (may draw from extra-use), or connect any API provider (anthropic, openai, openrouter, venice, groq). all providers are configurable from the settings page. the app will also help you set up text-to-speech if necessary.
aloud has two modes: exploration and noting.
exploration: this is a dyadic meditation format where the meditator speaks about what they are experiencing in the moment and the facilitator asks brief questions to help the meditator explore.
in this mode, you optionally set an intention and then mix and match attention focuses (body, emotions, parts work) with vibes (playful, compassionate, loving, spacious, effortless, feel-good). presets give you quick starting points, or you can build your own style. there's a directiveness slider so you can dial in how much guidance you want. in my personal experience, this sort of exploration has been helpful in experiencing jhana states if approached with enough openheartedness.
thanks to Maija Haavisto and Jhourney for guiding me in similar practices.
noting: you specify what participants you'd like, if any — AIs, fixed phrases, or sound effects. then starting with you, each participant notes a sensation in their "awareness" (ideally 1–2 words) or plays their fixed phrase or sound. yes, AIs noting their experience seems kind of silly, but I've actually found it helpful to observe the mental and somatic processes that happen in the cycle of resting -> hearing my cue -> observing -> speaking. if there are no other participants, it'll just briefly introduce the method and then record what you note.
thanks to Vince Horn and again to Jhourney for inspiration.
grab the latest release for your platform below, or from releases:
| platform | download |
|---|---|
| macOS | aloud-1.0.2-macOS.dmg — open the DMG, drag aloud to Applications |
| Windows | aloud-1.0.2-Windows.exe — run the installer |
| Linux | aloud-1.0.2-Linux.AppImage — chmod +x, double-click or run from terminal |
all settings (LLM provider, voice, whisper model, display) are configurable from the settings page inside the app. whisper models download automatically on first launch. the app checks for updates on startup and will prompt you when a new version is available.
- macOS: TTS can use the
saycommand with access to all system voices. You can download better system voices by going to System Settings > Accessibility > Spoken Content, click the dropdown next to System Voice, select Manage Voices, and download Enhanced or Premium voices. - windows: if using browser mode, for best voice quality use Edge — it has access to Microsoft's natural voices (Ava, Jenny) through speechSynthesis.
- linux: your system probably doesn't have text-to-speech built in. I recommend installing piper-tts - you can do this directly on the settings page unless you're using something like Nix. otherwise TTS falls back to browser speechSynthesis. Note that some browsers don't have built in speech synthesis.
to use the Anthropic (Subscription) provider, install the Claude Code CLI first, then run claude once in a terminal to log in with your subscription — aloud shells out to the local claude command to route through your account, so no API key is needed. in my testing, this did not draw from extra-use credits, but there have been communications from Anthropic to the effect that it would do so. also note that extra-use credits on your Claude subscription are not the same as API credits; if you only have one or the other, be sure to use the appropriate provider setting.
- if speech recognition feels slow, try the
basewhisper model (faster, less accurate). - say something like "hold on a bit" during a session to enter silence mode. say "come back" or similar to resume.
- say "mute" to immediately turn off the microphone. click the mic button to resume.
- click the orb in the nav bar to enter kasina gazing mode during a session. click away from it to exit.
- the ember controls add floating particles. each level doubles the count and increases the size.
- click the voice name in the controls bar to open a voice/speed picker.
- sessions auto-save as JSON and plain text, with a short LLM-generated summary.
- you can continue any past session or access the saved sessions folder from the history page.
- the AI can hold silence when requested - the orb dims when it's doing so. just ask it to come back when you want it to speak.
- the AI gently checks in if you're quiet for a while. adjust timing or disable in settings.
- one running copy of aloud can be made accessible to anyone on your local network by setting network access mode to "LAN Access" in settings
- 🥚 there are a few easter eggs 🥚
git clone https://github.com/akrusz/aloud.git
cd aloud
./scripts/start.sh # bootstraps on first run, then launcheson first run, start.sh installs dependencies, creates a Python environment, and writes a default config. configure your LLM provider and other settings in the web UI. requires python 3.10+ and uv (installed automatically if missing).
there are also double-click launchers in scripts/ (Start-Mac.command, Start-Windows.bat, Start-Linux.desktop).
# macOS/Linux
curl -fsSL https://raw.githubusercontent.com/akrusz/aloud/main/scripts/setup.sh | bash# Windows (PowerShell)
irm https://raw.githubusercontent.com/akrusz/aloud/main/scripts/setup.ps1 | iexif you have nix with flakes enabled:
git clone https://github.com/akrusz/aloud.git
cd aloud
nix develop # browser-only (lighter): nix develop .#browser
./scripts/start.sh # auto-bootstraps config and launchesthe flake provides portaudio, ffmpeg, python, uv, and GTK/WebKit2 (for pywebview) via the nix binary cache. python packages are installed via uv into a local venv on first entry.
release builds are automated via GitHub Actions — creating a release tagged vX.X.X triggers builds for all three platforms and attaches the artifacts. see dev-docs/building.md for manual build instructions.