New file |
| | |
| | | # Text-to-speech API based on Coqui's TTS |
| | | |
| | | Simple Python-based container with everything that is needed |
| | | to have a self-hosted web-based text-to-speech API. |
| | | |
| | | ## Using the container |
| | | |
| | | Just run: |
| | | |
| | | ``` |
| | | docker run -d -p 8000:8000 ghcr.io/jacekkow/docker-text-to-speech-api-tts:master |
| | | ``` |
| | | |
| | | and then visit http://localhost:8000/docs |
| | | |
| | | There is a simple `/sythesize` endpoint that expects a JSON and returns a wave file: |
| | | |
| | | ``` |
| | | curl -o result.wav -X 'POST' \ |
| | | 'http://localhost:8000/synthesize' \ |
| | | -H 'Content-Type: application/json' \ |
| | | -d '{"language": "en", "text": "Sample text."}' |
| | | ``` |
| | | |
| | | ## Adding languages |
| | | |
| | | Currently only English and Polish models are included in the image. |
| | | |
| | | To add additional languages you can simply add extra entries |
| | | in `src/config.py` file and rebuild the container. |
| | | |
| | | Model identifiers are defined in the TTS repository: |
| | | https://github.com/idiap/coqui-ai-TTS/blob/dev/TTS/.models.json |
| | | |
| | | Command line `tts --list_models` lists all the available models. |
| | | |
| | | Note that the API does not support multi-speaker models yet! |