Ask HN: Are there any good open source Text-to-Speech tools?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • mimic3

    A fast local neural text to speech engine for Mycroft

  • I've been quite happy with Mimic3 lately (https://github.com/MycroftAI/mimic3), the engine that powers Mycroft. It also comes with an easy-to-install Docker image.

  • tortoise-tts

    A multi-voice TTS system trained with an emphasis on quality

  • The best is probably tortoise but you have to run it yourself https://github.com/neonbjb/tortoise-tts

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • espeak-ng

    eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

  • I've had good luck with https://github.com/espeak-ng/espeak-ng (for very specific purposes, and I was willing to wrangle IPA)

  • larynx

    Discontinued End to end text to speech system using gruut and onnx

  • I've had good results with https://github.com/rhasspy/larynx

  • TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

  • I'm not sure about the licensing of all the models/etc, but Coqui AI's 'TTS' python package is fairly good.

    https://github.com/coqui-ai/TTS

  • pico-tts

    Android PicoTTS w/C calling application using submodule

  • TTS

    :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)

  • I have heard good things about Mozilla's TTS: https://github.com/mozilla/TTS

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • dom-examples

    Code examples that accompany various MDN DOM and Web API documentation pages

  • It's such an obvious answer perhaps is why nobody has commented it. But depending on the use, you might try web speech API synthesis. For example a Windows user might see a Cortana option whereas a Mac user might see Siri.

    Demo Here: https://mdn.github.io/dom-examples/web-speech-api/speak-easy...

    Read more here https://github.com/mdn/dom-examples/tree/main/web-speech-api

  • text-to-speech-ubuntu

    🙊 Setup "selectable" text to speech / TTS on Ubuntu Linux 24.04 22.04 22.10 23.04 23.10 . Ideal for speed reading, programming, editing and writing.

  • https://github.com/gnat/text-to-speech-ubuntu

  • buzz

    Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

  • OpenAI’s whisper[1] should do the job for you.

    [1] - https://github.com/openai/whisper

  • tts

    Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage. (by tslmy)

  • Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage.

    https://github.com/tslmy/tts

  • wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

  • For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet

    They provide support for running in a Raspberry Pi and it runs in real-time. I have tried the desktop version and the quality is good enough when the audio is clean.

  • opentts

    Open Text to Speech Server

  • If your use case allows for a web API, I've had good experience running OpenTTS[0].

    It packages several models, including Coqui AI's TTS which I tend to use the most. There's a handy Docker image, too.

    [0] https://github.com/synesthesiam/opentts

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • What's the best text-to-speech free non-cloud software?

    4 projects | /r/DataHoarder | 31 May 2023
  • Otter.ai has saved reporters hours transcribing interviews. Caveat emptor

    4 projects | news.ycombinator.com | 17 Feb 2022
  • OpenAI deems its voice cloning tool too risky for general release

    1 project | news.ycombinator.com | 31 Mar 2024
  • Base TTS (Amazon): The largest text-to-speech model to-date

    3 projects | news.ycombinator.com | 14 Feb 2024
  • WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper

    9 projects | news.ycombinator.com | 17 Jan 2024