Kokoro
Voice & Multimodal
7.2k stars
Kokoro is an open-weight text-to-speech model with 82 million parameters. It is optimized for fast inference on both CPU and GPU, generating speech in roughly 100 milliseconds, and it supports 8 languages with natural-sounding output.