Skip to main content

๐Ÿ—จ๏ธ Kokoro Web - Effortless TTS for Open WebUI

warning

This tutorial is a community contribution and is not supported by the Open WebUI team. It serves only as a demonstration on how to customize Open WebUI for your specific use case. Want to contribute? Check out the contributing tutorial.

What is Kokoro Web?โ€‹

Kokoro Web provides a lightweight, OpenAI-compatible API for the powerful Kokoro-82M text-to-speech model, seamlessly integrating with Open WebUI to enhance your AI conversations with natural-sounding voices.

๐Ÿš€ Two-Step Integrationโ€‹

1. Deploy Kokoro Web API (One Command)โ€‹

services:
kokoro-web:
image: ghcr.io/eduardolat/kokoro-web:latest
ports:
- "3000:3000"
environment:
# Change this to any secret key to use as your OpenAI compatible API key
- KW_SECRET_API_KEY=your-api-key
volumes:
- ./kokoro-cache:/kokoro/cache
restart: unless-stopped

Run with: docker compose up -d

2. Connect OpenWebUI (30 Seconds)โ€‹

  1. In OpenWebUI, go to Admin Panel โ†’ Settings โ†’ Audio
  2. Configure:
    • Text-to-Speech Engine: OpenAI
    • API Base URL: http://localhost:3000/api/v1
      (If using Docker: http://host.docker.internal:3000/api/v1)
    • API Key: your-api-key (from step 1)
    • TTS Model: model_q8f16 (best balance of size/quality)
    • TTS Voice: af_heart (default warm, natural english voice). You can change this to any other voice or formula from the Kokoro Web Demo

That's it! Your OpenWebUI now has AI voice capabilities.

๐ŸŒ Supported Languagesโ€‹

Kokoro Web supports 8 languages with specific voices optimized for each:

  • English (US) - en-us
  • English (UK) - en-gb
  • Japanese - ja
  • Chinese - cmn
  • Spanish - es-419
  • Hindi - hi
  • Italian - it
  • Portuguese (Brazil) - pt-br

Each language has dedicated voices for optimal pronunciation and natural flow. See the GitHub repository for the complete list of language-specific voices or use the Kokoro Web Demo to preview and create your own custom voices instantly.

๐Ÿ’พ Optimized Models for Any Hardwareโ€‹

Choose the model that fits your hardware needs:

Model IDOptimizationSizeIdeal For
model_q8f16Mixed precision86 MBRecommended - Best balance
model_quantized8-bit92.4 MBGood CPU performance
model_uint8f16Mixed precision114 MBBetter quality on mid-range CPUs
model_q4f164-bit & fp16 weights154 MBHigher quality, still efficient
model_fp16fp16163 MBPremium quality
model_uint88-bit & mixed177 MBBalanced option
model_q44-bit matmul305 MBHigh quality option
modelfp32326 MBMaximum quality (slower)

โœจ Try Before You Installโ€‹

Visit the Kokoro Web Demo to preview all voices instantly. This demo:

  • Runs 100% in your browser - No server required
  • Free forever - No usage limits or registration needed
  • Zero installation - Just visit the website and start creating
  • All features included - Test any voice or language immediately

Need More Help?โ€‹

For additional options, voice customization guides, and advanced settings, visit the GitHub repository.

Enjoy natural AI voices in your OpenWebUI conversations!