Ask HN: Are there any good open source Text-to-Speech tools?

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

mimic3

24 990 0.0 Python

A fast local neural text to speech engine for Mycroft

I've been quite happy with Mimic3 lately (https://github.com/MycroftAI/mimic3), the engine that powers Mycroft. It also comes with an easy-to-install Docker image.

tortoise-tts

145 12,145 7.7 Jupyter Notebook

A multi-voice TTS system trained with an emphasis on quality

The best is probably tortoise but you have to run it yourself https://github.com/neonbjb/tortoise-tts

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
espeak-ng

28 3,798 7.1 C

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

I've had good luck with https://github.com/espeak-ng/espeak-ng (for very specific purposes, and I was willing to wrangle IPA)

larynx

18 788 0.0 Python

Discontinued End to end text to speech system using gruut and onnx

I've had good results with https://github.com/rhasspy/larynx

TTS

233 30,483 9.2 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

I'm not sure about the licensing of all the models/etc, but Coqui AI's 'TTS' python package is fairly good.
https://github.com/coqui-ai/TTS

pico-tts

1 36 2.6 C

Android PicoTTS w/C calling application using submodule
TTS

62 8,918 0.0 Jupyter Notebook

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)

I have heard good things about Mozilla's TTS: https://github.com/mozilla/TTS

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
dom-examples

88 3,230 7.7 JavaScript

Code examples that accompany various MDN DOM and Web API documentation pages

It's such an obvious answer perhaps is why nobody has commented it. But depending on the use, you might try web speech API synthesis. For example a Windows user might see a Cortana option whereas a Mac user might see Siri.
Demo Here: https://mdn.github.io/dom-examples/web-speech-api/speak-easy...
Read more here https://github.com/mdn/dom-examples/tree/main/web-speech-api

text-to-speech-ubuntu

1 21 4.3

🙊 Setup "selectable" text to speech / TTS on Ubuntu Linux 24.04 22.04 22.10 23.04 23.10 . Ideal for speed reading, programming, editing and writing.

https://github.com/gnat/text-to-speech-ubuntu

buzz

21 10,248 8.7 Python

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
whisper

346 62,242 6.0 Python

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI’s whisper[1] should do the job for you.
[1] - https://github.com/openai/whisper

tts

1 5 10.0 Jupyter Notebook

Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage. (by tslmy)

Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage.
https://github.com/tslmy/tts

wenet

5 3,779 9.6 Python

Production First and Production Ready End-to-End Speech Recognition Toolkit

For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet
They provide support for running in a Raspberry Pi and it runs in real-time. I have tried the desktop version and the quality is good enough when the audio is clean.

opentts

10 841 1.3 Python

Open Text to Speech Server

If your use case allows for a web API, I've had good experience running OpenTTS[0].
It packages several models, including Coqui AI's TTS which I tend to use the most. There's a handy Docker image, too.
[0] https://github.com/synesthesiam/opentts

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

What's the best text-to-speech free non-cloud software?

4 projects | /r/DataHoarder | 31 May 2023
Otter.ai has saved reporters hours transcribing interviews. Caveat emptor

4 projects | news.ycombinator.com | 17 Feb 2022
OpenAI deems its voice cloning tool too risky for general release

1 project | news.ycombinator.com | 31 Mar 2024
Base TTS (Amazon): The largest text-to-speech model to-date

3 projects | news.ycombinator.com | 14 Feb 2024
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper

9 projects | news.ycombinator.com | 17 Jan 2024

Ask HN: Are there any good open source Text-to-Speech tools?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Tts text-to-speech Pytorch Speech Deep Learning
Post date: 1 Jan 2023

mimic3

tortoise-tts

Scout Monitoring

espeak-ng

larynx

TTS

pico-tts

TTS

InfluxDB

dom-examples

text-to-speech-ubuntu

buzz

whisper

tts

wenet

opentts

SaaSHub

Related posts

What's the best text-to-speech free non-cloud software?

Otter.ai has saved reporters hours transcribing interviews. Caveat emptor

OpenAI deems its voice cloning tool too risky for general release

Base TTS (Amazon): The largest text-to-speech model to-date

WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper

Ask HN: Are there any good open source Text-to-Speech tools?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Tts text-to-speech Pytorch Speech Deep Learning Post date: 1 Jan 2023

Related posts

What's the best text-to-speech free non-cloud software?

Otter.ai has saved reporters hours transcribing interviews. Caveat emptor

OpenAI deems its voice cloning tool too risky for general release

Base TTS (Amazon): The largest text-to-speech model to-date

WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Tts text-to-speech Pytorch Speech Deep Learning
Post date: 1 Jan 2023