Seamless: Meta's New Speech Models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • seamless_communication

    Foundational Models for State-of-the-Art Speech and Text Translation

  • The license details are listed on the project GitHub

    https://github.com/facebookresearch/seamless_communication#l...

  • gpt-tutor

    Generate personalized audio lessons for learning languages with GPT and Azure AI speech.

  • I built just this a month ago with the Azure AI speech API, which is already pretty good at multilingual speech.https://github.com/adrianmfi/gpt-tutor I look forward to testing if switching to Seamless can improve it further. Seamless supporting nearly 100 languages is a nice improvement.

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • dragonfly

    Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx (by dictation-toolbox)

  • https://github.com/dictation-toolbox/dragonfly

  • I work on seamless and you can find sample code here: https://github.com/fairinternal/seamless_communication or in the HuggingFace space.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Sys Admins and AI

    1 project | news.ycombinator.com | 21 May 2024
  • Calling code with local LLM is a hoax

    1 project | dev.to | 20 May 2024
  • Show HN: ffmpeg-english "capture from /dev/video0 every 1 second to jpg files"

    9 projects | news.ycombinator.com | 19 May 2024
  • PaliGemma: Open-Source Multimodal Model by Google

    5 projects | news.ycombinator.com | 15 May 2024
  • Show HN: Pi-C.A.R.D, a Raspberry Pi Voice Assistant

    3 projects | news.ycombinator.com | 13 May 2024