Text-to-Speech Demo

Advanced text normalization options

🎙️ Voice Cloning

Backend Status

API Endpoints

Python Example

from openai import OpenAI

# Create client pointing to local server
client = OpenAI(
    base_url="http://localhost:8880/v1",
    api_key="not-needed"
)

# Generate speech
response = client.audio.speech.create(
    model="qwen3-tts",
    voice="Vivian",
    input="Hello! This is Qwen3-TTS."
)

# Save to file
response.stream_to_file("output.mp3")
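The same request can also be made without the openai package. The sketch below uses only the Python standard library and assumes the local server accepts the standard OpenAI-style POST to /audio/speech under the base URL shown in the example, which is the route the OpenAI client calls. The helper names build_speech_request and synthesize are hypothetical and exist only for this illustration.

```python
import json
import urllib.request

# Base URL taken from the Python example; adjust if your server differs.
BASE_URL = "http://localhost:8880/v1"


def build_speech_request(text, voice="Vivian", model="qwen3-tts"):
    """Build a POST request for the OpenAI-style /audio/speech route.

    Hypothetical helper: the field names mirror the arguments passed to
    client.audio.speech.create() in the example.
    """
    if not text.strip():
        raise ValueError("input text must not be empty")
    payload = {"model": model, "voice": voice, "input": text}
    return urllib.request.Request(
        f"{BASE_URL}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # The example uses a placeholder key, so the same one is sent here.
            "Authorization": "Bearer not-needed",
        },
        method="POST",
    )


def synthesize(text, out_path="output.mp3", **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path


if __name__ == "__main__":
    # Requires the local server to be running.
    print(synthesize("Hello! This is Qwen3-TTS."))
```

Building the request separately from sending it makes the payload easy to inspect or log before any audio is generated.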

Acknowledgments

This project is built on the Qwen3-TTS model developed by the Qwen Team at Alibaba Cloud. We are deeply grateful for their outstanding work in advancing the state of the art in text-to-speech technology.

The Qwen3-TTS model represents a significant advancement in neural speech synthesis, offering:

  • 🎯 High-quality, natural-sounding voice synthesis
  • 🌍 Support for 10+ languages
  • 🎨 Advanced voice cloning and design capabilities
  • ⚡ Efficient inference for production deployments

📦 Qwen3-TTS GitHub | 🤗 Hugging Face Models | 📑 Research Paper