Python voice-recognition

Open-source Python projects categorized as voice-recognition

Top 22 Python voice-recognition Projects

  • PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    PaddlePaddle/PaddleSpeech

  • speechbrain

    A PyTorch-based Speech Toolkit

  • Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • silero-vad

    Silero VAD: pre-trained enterprise-grade Voice Activity Detector

  • Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06

    >How do you detect speech starting and stopping?

    https://github.com/snakers4/silero-vad

  • WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

  • Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

    Everything runs locally, we use:

    - WhisperLive for the transcription - https://github.com/collabora/WhisperLive

  • Python-ai-assistant

    Python AI assistant 🧠

  • Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18

    There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant

    Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.

  • mycroft-precise

    A lightweight, simple-to-use, RNN wake word listener

  • rhino

    On-device Speech-to-Intent engine powered by deep learning (by Picovoice)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • speech-to-text-benchmark

    speech to text benchmark framework

  • Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16
  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  • picovoice

    On-device voice assistant platform powered by deep learning

  • leopard

    On-device speech-to-text engine powered by deep learning

  • Caster

    Dragonfly-Based Voice Programming and Accessibility Toolkit

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

  • gpt-voice-conversation-chatbot

    Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

  • chatgpt-voice-assistant

    A chatbot that uses speech to text for input, sends the text to OpenAI's ChatGPT text generation model and speaks the response using text to speech.

  • Project mention: ChatGPT Voice Assistant | news.ycombinator.com | 2023-06-13
  • M.I.L.E.S

    M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.

  • Project mention: Show HN: I made M.I.L.E.S, the worlds best voice assistant | news.ycombinator.com | 2024-01-06
  • gpt_chatbot

    This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

  • octopus

    On-device Speech-to-Index engine powered by deep learning (by Picovoice)

  • autosrt

    Offline srt producer gui with whisper.cpp

  • Universal-MacAssistant

    Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.

  • Project mention: Your AI MacOS Voice Assistant | /r/coolgithubprojects | 2023-07-03
  • ameli-ai

    Ameli, a cross platform personal voice assistant for Windows/Linux/MacOS/Android/iOS

  • hollow-knight-voice-commands

    A fun little python tool to play Hollow Knight with only voice commands

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python voice-recognition related posts

  • Speech-to-Text Benchmark

    1 project | news.ycombinator.com | 16 Jan 2024
  • New models and developer products announced at OpenAI DevDay

    8 projects | news.ycombinator.com | 6 Nov 2023
  • [Discussion] Video Translation Task

    2 projects | /r/MachineLearning | 13 Jul 2023
  • Your AI MacOS Voice Assistant

    1 project | /r/coolgithubprojects | 3 Jul 2023
  • Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency

    4 projects | /r/webdev | 9 Jun 2023
  • I made a simple gui to use whisper.cpp in python.

    2 projects | /r/Python | 13 Apr 2023
  • Automatic Speech Recognition with AWS Lambda and Leopard

    2 projects | dev.to | 1 Feb 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 20 May 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source voice-recognition projects in Python? This list will help you:

Project Stars
1 PaddleSpeech 10,233
2 speechbrain 7,948
3 silero-vad 2,935
4 WhisperLive 1,287
5 Python-ai-assistant 860
6 mycroft-precise 802
7 rhino 594
8 speech-to-text-benchmark 586
9 cheetah 558
10 picovoice 511
11 leopard 411
12 Caster 334
13 LiveWhisper 298
14 gpt-voice-conversation-chatbot 290
15 chatgpt-voice-assistant 107
16 M.I.L.E.S 89
17 gpt_chatbot 53
18 octopus 34
19 autosrt 22
20 Universal-MacAssistant 9
21 ameli-ai 6
22 hollow-knight-voice-commands 1

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com