Kokoro is an open-weight TTS model with 82 million parameters.

This demo showcases only English, but other languages are available if you use the model directly.

Voice

Quality and availability vary by language

Hardware

GPU is usually faster, but has a usage quota


💡 Customize pronunciation with Markdown link syntax and /slashes/ like [Kokoro](/kˈOkəɹO/)

💬 To adjust intonation, try punctuation ;:,.!?—…"()“” or stress ˈ and ˌ

⬇️ Lower stress [1 level](-1) or [2 levels](-2)

⬆️ Raise stress by 1 or 2 levels, for example [or](+2) for 2 levels (only works on less stressed, usually short words)
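
The tips above use two forms of the same link-style markup: [text](/phonemes/) for a pronunciation override and [text](+N) or [text](-N) for a stress shift. As a rough illustration of how such input could be recognized, here is a minimal Python sketch. The names `MARKUP`, `parse_annotation`, and `find_markup` are hypothetical helpers written for this example; this is not the parser Kokoro itself uses, and it only classifies the spans rather than synthesizing anything.

```python
import re

# Hypothetical pattern for the demo's inline markup: [text](annotation).
# Assumption: neither the text nor the annotation contains nested brackets.
MARKUP = re.compile(r"\[([^\]]+)\]\(([^)]+)\)")


def parse_annotation(annotation):
    """Classify an annotation as a phoneme override or a stress shift."""
    # /slashes/ around the annotation mark an explicit pronunciation.
    if len(annotation) > 2 and annotation.startswith("/") and annotation.endswith("/"):
        return ("phonemes", annotation[1:-1])
    # A signed integer such as +2 or -1 marks a stress adjustment.
    if re.fullmatch(r"[+-]\d+", annotation):
        return ("stress", int(annotation))
    return ("unknown", annotation)


def find_markup(text):
    """Return a (text, kind, value) tuple for every [text](annotation) span."""
    return [
        (m.group(1), *parse_annotation(m.group(2)))
        for m in MARKUP.finditer(text)
    ]


if __name__ == "__main__":
    sample = "[Kokoro](/kˈOkəɹO/) can lower stress by [1 level](-1)."
    for span in find_markup(sample):
        print(span)
```

Anything the sketch does not recognize is reported as "unknown" rather than guessed at, since the demo text documents only the two forms shown in the tips.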