Base TTS (Amazon): The largest text-to-speech model to-date

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • bark

    An inference server for Bark (by SaladTechnologies)

  • Bark and Tortoise work fairly well. Bark does super fast inference[1] on my M1.

    [1] https://github.com/SaladTechnologies/bark

  • metavoice-src

    Foundational model for human-like, expressive TTS

  • Interesting. Just a couple of hours ago I came across MetaVoice-1B [0] (Demo [1]) and was amazed by the quality of their TTS in English (sadly no other languages available).

    If this year becomes the year when high quality Open Source TTS and ASR models appear that can run in real-time on an Nvidia RTX 40x0 or 30x0, then that would be great. On CPU even better.

    [0] https://github.com/metavoiceio/metavoice-src

    [1] https://ttsdemo.themetavoice.xyz/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • TTS

    πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

  • I've used coqui.ai's TTS models[0] and library[1] to great success. I was able to get cloned voice to be rendered in about 80% of the audio clip length, and I believe you can also stream the response. Do note the model license for XTTS, it is one they wrote themselves that has some restrictions.

    [0] https://huggingface.co/coqui/XTTS-v2

    [1] https://github.com/coqui-ai/TTS

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • OpenAI deems its voice cloning tool too risky for general release

    1 project | news.ycombinator.com | 31 Mar 2024
  • Coqui Is Shutting Down

    1 project | news.ycombinator.com | 11 Jan 2024
  • Coqui.ai Is Shutting Down

    4 projects | news.ycombinator.com | 3 Jan 2024
  • Hello guys, any selfhosted alternative to eleven labs?

    3 projects | /r/selfhosted | 11 Dec 2023
  • Demo of Anagnorisis - completely local recommendation system powered by Llama 2. Radio mode. Work in progress.

    2 projects | /r/LocalLLaMA | 11 Dec 2023