Sopro TTS is an open-source text-to-speech (TTS) project that implements a lightweight model capable of producing speech from text with zero-shot voice cloning, meaning it can mimic a speaker’s voice from only a few seconds of reference audio. Built with a 169 million-parameter architecture that uses dilated convolutions and cross-attention layers instead of large Transformer stacks, it achieves relatively fast real-time performance even on CPUs (about a 0.25 real-time factor measured on an M3 base). The model is designed to work with a small set of dependencies and to be accessible for developers who want offline TTS with customizable voice style, including options for streaming or non-streaming generation modes. Users can install it with standard Python tools, run a demo server locally, and experiment with CLI or Python API usage for producing synthetic speech.

Features

  • English text-to-speech generation
  • Zero-shot voice cloning from short reference audio
  • Lightweight model (~169M parameters)
  • Streaming and non-streaming modes
  • CLI and Python API usage
  • CPU-friendly real-time performance

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

Apache License V2.0

Follow Sopro TTS

Sopro TTS Web Site

Other Useful Business Software
Get full visibility and control over your tasks and projects with Wrike. Icon
Get full visibility and control over your tasks and projects with Wrike.

A cloud-based collaboration, work management, and project management software

Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Sopro TTS!

Additional Project Details

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2026-01-28