Contenerized text-to-speech engine based on Coqui's TTS with web API
Jacek Kowalski
2024-08-01 55b8831d18635e59bb667080ebbbd8244d36d80a
Add README
1 files added
37 ■■■■■ changed files
README.md 37 ●●●●● patch | view | raw | blame | history
README.md
New file
@@ -0,0 +1,37 @@
# Text-to-speech API based on Coqui's TTS
Simple Python-based container with everything that is needed
to have a self-hosted web-based text-to-speech API.
## Using the container
Just run:
```
docker run -d -p 8000:8000 ghcr.io/jacekkow/docker-text-to-speech-api-tts:master
```
and then visit http://localhost:8000/docs
There is a simple `/sythesize` endpoint that expects a JSON and returns a wave file:
```
curl -o result.wav -X 'POST' \
  'http://localhost:8000/synthesize' \
  -H 'Content-Type: application/json' \
  -d '{"language": "en", "text": "Sample text."}'
```
## Adding languages
Currently only English and Polish models are included in the image.
To add additional languages you can simply add extra entries
in `src/config.py` file and rebuild the container.
Model identifiers are defined in the TTS repository:
https://github.com/idiap/coqui-ai-TTS/blob/dev/TTS/.models.json
Command line `tts --list_models` lists all the available models.
Note that the API does not support multi-speaker models yet!