speech synthesis source code free download

Showing 16 open source projects for "speech synthesis source code"

View related business solutions

Multimedia Python Clear Filters & Widen Search

Best Visitor Management System
Instantly Notify Staff Of Deliveries And Guest Arrivals To Increase Your Efficiency

<p class="mb-4">Do stacks of paperwork pile up at the front desk area? Or are your receptionists constantly filing reports, guest log-in information and NDAs – taking them away from other important tasks? Not anymore! Our Visitor Management System automates all these processes, streamlining your workflow. Guests can complete inductions, sign NDAs, fill in their contact details and much more using the easy software. These records are then automatically filed and stored, making life easy for receptionists and the HR team. Claim your FREE 7-day trial and experience how VisitUs can transform your workplace!</p>

Try it Free
Odoo Business Applications
Odoo is a software suite to better manage and run your company.

Odoo is a fully integrated, customizable, open-source software packed with hundreds of expertly designed business applications. Odoo’s intuitive database is able to meet a majority of business needs, such as: CRM, Sales, Project, Manufacturing, Inventory, and Accounting, just to name a few.

Get Started
1

PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...

Downloads: 5 This Week

Last Update: 2026-03-02
See Project
2

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling...

Downloads: 9 This Week

Last Update: 2024-11-16
See Project
3

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to...

Downloads: 4 This Week

Last Update: 2026-01-29
See Project
4

SCAIL

Towards Studio-Grade Character Animation via In-Context Learning of 3D

SCAIL is a project developed by the ZAI Organization, focusing on AI-driven research initiatives. While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL...

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
Infor M3 ERP
Enterprise manufacturers and distributors requiring a solution to manage and execute complex processes

Efficiently executing the complex processes of enterprise manufacturers and distributors. Infor M3 is a cloud-based, manufacturing and distribution ERP system that leverages the latest technologies to provide an exceptional user experience and powerful analytics in a multicompany, multicountry, and multisite platform. Infor M3 and related CloudSuite™ industry solutions include industry-leading functionality for the chemical, distribution, equipment, fashion, food and beverage, and industrial manufacturing industries. Staying ahead of the competition means staying agile. Our new capabilities bring improved data-driven insights and streamlined workflows to help you make informed decisions and take quick action.

Learn More
5

SPPAS

SPPAS - the automatic annotation and analyses of speech

SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. ...

Downloads: 20 This Week

Last Update: 2026-04-06
See Project
6

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can...

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
7

DeepFaceLab

The leading software for creating deepfakes

DeepFaceLab is currently the world's leading software for creating deepfakes, with over 95% of deepfake videos created with DeepFaceLab. DeepFaceLab is an open-source deepfake system that enables users to swap the faces on images and on video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use; and yet also provides a flexible and loose coupling structure for those who want to strengthen their own pipeline with other features without having to write complicated boilerplate code.

1 Review

Downloads: 166 This Week

Last Update: 2023-09-07
See Project
8

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...

Downloads: 4 This Week

Last Update: 2016-03-15
See Project
9

pyespeak

Python to eSpeak speech synthesis

ctypes Python module for eSpeak http://espeak.sf.net speech synthesis

Downloads: 0 This Week

Last Update: 2017-10-28
See Project
Your go-to FinOps platform
Analyze, optimize, and govern your multi-cloud environment effortlessly with AI Agentic FinOps.

Unlike reporting-only FinOps tools, FinOpsly unifies cloud (AWS, Azure, GCP), data (Snowflake, Databricks, BigQuery), and AI costs into a single system of action — enabling teams to plan spend before it happens, automate optimization safely, and prove value in weeks, not quarters.

Learn More
10

Voice keyboard

Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word

Downloads: 4 This Week

Last Update: 2015-04-20
See Project
11

Speect

Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG. Speect is free and open...

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
12

nxAlpha

SuperCollider Code for Livecoding Experimental Sound

SuperCollider Code for Livecoding Experimental Sound

Downloads: 0 This Week

Last Update: 2013-04-24
See Project
13

ASTA - Auto. Subtitle Timing Annotator

A collection of scripts and programs to automatically annotate video/audio for subtitles. Basically relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in for detecting human voice in polyphonic recordings.

Downloads: 0 This Week

Last Update: 2014-04-24
See Project
14

cwtext text to morse code converter

Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.

7 Reviews

Downloads: 64 This Week

Last Update: 2013-03-22
See Project
15

Open Interface for Speech Synthesis

The Open Interface for Speech Synthesis (OISS) provides an interface to speech synthesis hardware and software for end-user applications under Unix.

Downloads: 0 This Week

Last Update: 2013-02-21
See Project
16

JackFX

JackFX is a python module for midi control and realtime audio effects processing built using the Jack Audio Connection Kit. Effects are stackable, and can be chained in any configuration with only a few lines of python code.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-19
See Project