You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Mimic 3 is a privacy-focused open-source neural Text to Speech (TTS) engine that can run faster than real-time on low-end devices like the Raspberry Pi 4. In human terms that means it sounds great, it can run completely offline or in the cloud, and you can verifiably trust it.
Copy file name to clipboardExpand all lines: docs/mycroft-technologies/mimic-tts/mimic-3.md
+17-17Lines changed: 17 additions & 17 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,10 +6,10 @@ description: A fast, privacy-focused, open-source, neural Text to Speech (TTS) e
6
6
7
7
Mimic 3 is a neural text to speech engine that can run locally, even on low-end hardware like the Raspberry Pi 4. It is the default text to speech engine on the [Mark II](https://mycroft.ai/product/mark-ii/).
*[Listen to voice samples](https://mycroftai.github.io/mimic3-voices/)
11
-
*[See example use cases](coming-soon-mimic-3.md#usage)
12
-
*[Learn how it works](coming-soon-mimic-3.md#how-it-works)
11
+
*[See example use cases](mimic-3.md#usage)
12
+
*[Learn how it works](mimic-3.md#how-it-works)
13
13
14
14
## Installation
15
15
@@ -20,15 +20,15 @@ Mimic 3 was designed to run on the Raspberry Pi 4 (64-bit OS), but will also run
20
20
*`amd64`
21
21
* AMD/Intel-based desktops/laptops
22
22
* Tested:
23
-
* Very fast on Ryzen 9 5950X, [RTF](coming-soon-mimic-3.md#real-time-factor) less than 0.05
23
+
* Very fast on Ryzen 9 5950X, [RTF](mimic-3.md#real-time-factor) less than 0.05
24
24
*`arm64`
25
25
* Raspberry Pi 3/4 and Zero 2 with [64-bit Pi OS](https://www.raspberrypi.com/news/raspberry-pi-os-64-bit/)
26
26
* Tested:
27
-
* Usable on Pi 4, [RTF](coming-soon-mimic-3.md#real-time-factor) around 0.5
27
+
* Usable on Pi 4, [RTF](mimic-3.md#real-time-factor) around 0.5
28
28
*`armv7l`
29
29
* Raspberry Pi 1/2/3/4 and Zero 2 with 32-bit Pi OS
30
30
* Tested:
31
-
* Slow on Pi 3, [RTF](coming-soon-mimic-3.md#real-time-factor) around 1.3
31
+
* Slow on Pi 3, [RTF](mimic-3.md#real-time-factor) around 1.3
32
32
33
33
#### Real-Time Factor
34
34
@@ -188,9 +188,9 @@ Once installed, the following commands will be available in `.venv/bin`:
188
188
189
189
There are many ways to use Mimic 3, including:
190
190
191
-
*[From the command line](coming-soon-mimic-3.md#command-line-interface)
192
-
*[As a web server](coming-soon-mimic-3.md#web-server)
193
-
*[In a screen reader](coming-soon-mimic-3.md#speech-dispatcher)
191
+
*[From the command line](mimic-3.md#command-line-interface)
192
+
*[As a web server](mimic-3.md#web-server)
193
+
*[In a screen reader](mimic-3.md#speech-dispatcher)
194
194
195
195
### Voice Keys
196
196
@@ -200,7 +200,7 @@ Voices in Mimic 3 are keyed by a name with specific parts. These parts include t
200
200
201
201
The default voice is `en_UK/apope_low`
202
202
203
-
Voice models are automatically downloaded from [Github](https://github.com/MycroftAI/mimic3-voices) and stored in `${HOME}/.local/share/mycroft/mimic3` (technically `${XDG_DATA_HOME}/mycroft/mimic3`). You can also [manually download them](coming-soon-mimic-3.md#downloading-voices).
203
+
Voice models are automatically downloaded from [Github](https://github.com/MycroftAI/mimic3-voices) and stored in `${HOME}/.local/share/mycroft/mimic3` (technically `${XDG_DATA_HOME}/mycroft/mimic3`). You can also [manually download them](mimic-3.md#downloading-voices).
204
204
205
205
### Command-Line Interface
206
206
@@ -212,7 +212,7 @@ The `mimic3` command can be used to synthesize audio on the command line:
212
212
mimic3 --voice <voice> "<text>" > output.wav
213
213
```
214
214
215
-
where `<voice>` is a [voice key](coming-soon-mimic-3.md#voice-keys) like `en_UK/apope_low`. `<TEXT>` may contain multiple sentences, which will be combined in the final output WAV file. These can also be [split into separate WAV files](coming-soon-mimic-3.md#multiple-wav-output).
215
+
where `<voice>` is a [voice key](mimic-3.md#voice-keys) like `en_UK/apope_low`. `<TEXT>` may contain multiple sentences, which will be combined in the final output WAV file. These can also be [split into separate WAV files](mimic-3.md#multiple-wav-output).
216
216
217
217
#### SSML
218
218
@@ -274,7 +274,7 @@ The following SSML tags are supported:
274
274
*`lang` - set language for sentence
275
275
*`<w>` / `<token>` - word (disables automatic tokenization)
@@ -431,9 +431,9 @@ See `mimic3-server --help` for more options.
431
431
#### Endpoints
432
432
433
433
*`/api/tts`
434
-
*`POST` text or [SSML](coming-soon-mimic-3.md#ssml) and receive WAV audio back
435
-
* Use `?voice=` to select a different [voice/speaker](coming-soon-mimic-3.md#voice-keys)
436
-
* Set `Content-Type` to `application/ssml+xml` (or use `?ssml=1`) for [SSML](coming-soon-mimic-3.md#ssml) input
434
+
*`POST` text or [SSML](mimic-3.md#ssml) and receive WAV audio back
435
+
* Use `?voice=` to select a different [voice/speaker](mimic-3.md#voice-keys)
436
+
* Set `Content-Type` to `application/ssml+xml` (or use `?ssml=1`) for [SSML](mimic-3.md#ssml) input
437
437
*`/api/voices`
438
438
* Returns a JSON list of available voices
439
439
@@ -461,13 +461,13 @@ See `mimic3 --help` for more options.
461
461
462
462
Use the Mimic 3 web server as a drop-in replacement for [MaryTTS](http://mary.dfki.de/), for example with [Home Assistant](https://www.home-assistant.io/integrations/marytts/).
463
463
464
-
Make sure to use a Mimic 3 [voice key](coming-soon-mimic-3.md#voice-keys) like `en_UK/apope_low` instead of a MaryTTS voice name.
464
+
Make sure to use a Mimic 3 [voice key](mimic-3.md#voice-keys) like `en_UK/apope_low` instead of a MaryTTS voice name.
465
465
466
466
### Speech Dispatcher
467
467
468
468
Mimic 3 can be used with the [Orca screen reader](https://help.gnome.org/users/orca/stable/) for Linux via [speech-dispatcher](https://github.com/brailcom/speechd).
469
469
470
-
After [installing Mimic 3](coming-soon-mimic-3.md#installation), start the [web server](coming-soon-mimic-3.md#server). Next, make sure you have `speech-dispatcher` installed:
470
+
After [installing Mimic 3](mimic-3.md#installation), start the [web server](mimic-3.md#server). Next, make sure you have `speech-dispatcher` installed:
0 commit comments