SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Whisper Projects
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
-
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
-
chatgpt-telegram-bot
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python (by n3d1117)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
-
subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
-
whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
-
whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
-
whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
-
whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
-
agentchain
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
PaddlePaddle/PaddleSpeech
Project mention: Buzz: Transcribe and translate audio offline on your personal computer | news.ycombinator.com | 2024-03-21
Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06It uses this, which does support diarization: https://github.com/m-bain/whisperX
Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29Faster-whisper (https://github.com/SYSTRAN/faster-whisper)
Project mention: FunASR: Fundamental End-to-End Speech Recognition Toolkit | news.ycombinator.com | 2024-01-13
Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.
Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]
[0] https://github.com/linto-ai/whisper-timestamped
Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
Project mention: Porting CP/M to the Brother SuperPowerNote Z80 laptop thing [video] | news.ycombinator.com | 2023-12-13Adding Whisper subtitles was really easy and they're dramatically better than the automatic Google ones (I did it via https://github.com/abdeladim-s/subsai, which was really easy to use). So there is now a reasonably good transcript available in the video comments.
On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win
Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2
Project mention: Limitless: Personalized AI powered by what you've seen, said, and heard | news.ycombinator.com | 2024-04-15
Python Whisper related posts
-
Best Speech-to-text API with speaker diarization?
-
Easy video transcription and subtitling with Whisper, FFmpeg, and Python
-
SOTA ASR Tooling: Long-Form Transcription
-
Deploying whisperX on AWS SageMaker as Asynchronous Endpoint
-
Buzz: Transcribe and translate audio offline on your personal computer
-
Voxos.ai – An Open-Source Desktop Voice Assistant
-
Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram)
-
A note from our sponsor - SaaSHub
www.saashub.com | 23 May 2024
Index
What are some of the best open-source Whisper projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | PaddleSpeech | 10,271 |
2 | buzz | 10,177 |
3 | whisperX | 9,391 |
4 | faster-whisper | 9,278 |
5 | FunASR | 3,872 |
6 | distil-whisper | 3,236 |
7 | chatgpt-telegram-bot | 2,781 |
8 | inference | 2,871 |
9 | whisper-timestamped | 1,582 |
10 | yt-whisper | 1,320 |
11 | WhisperLive | 1,287 |
12 | auto-subtitle | 1,211 |
13 | subsai | 1,094 |
14 | whisper.api | 843 |
15 | truss | 843 |
16 | whisper-standalone-win | 842 |
17 | whisper-playground | 763 |
18 | whisper-ctranslate2 | 770 |
19 | Whisper-WebUI | 715 |
20 | AI-Waifu-Vtuber | 656 |
21 | whisper_mic | 639 |
22 | agentchain | 563 |
23 | Owl | 444 |
Sponsored