latex2speech
bark
latex2speech | bark | |
---|---|---|
1 | 9 | |
2 | 960 | |
- | - | |
2.9 | 8.7 | |
11 months ago | 7 months ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
latex2speech
bark
-
To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]
So I looked around and decided to use Bark Infinity. (Originally wanted to use Amazon Polly, but don't have a credit card) I tried around and found out that the female storyteller voice sounds quite decently. So I used that and a reference clip of Myne's voice as prompt (which I think might have helped a little... I don't get all that program's features) to generate a whole chapter. That worked quite well.
- Free/Affordable Text to Speech AI?
- Local and open-source equivalent to HeyGen Text-to-Speech (TTS) AI?
-
Whispers of Frostcliff Lodge
AI-generated voice. I'll have to try Bark Infinity and Speechify.
-
Bark: A transformer based text to audio system
I'll link my Bark fork with long audio generation and other features on the root thread, I suppose: https://github.com/JonathanFly/bark
There's going to be a big update this week with some new stuff I haven't talked about. And a bunch of amazing, clear voices, with a huge variety of styles, that blow the default Suno voices out of the water.
Don't get too attached though. I was just playing around and made a Bark fork and it got more popular than expected. But I wasn't thinking about the hours of unpaid support and maintenance in my future that I definitely can NOT afford, for software I don't even really have a personal use case for. I'm not generating my own audiobooks or anything, I wonβt be using it long term myself, I was just curious what Bark could do. (Turns out a LOT more than you might think at first glance, as you'll see this week.) So I'm trying to work out how I can elegantly wind this down and transition people somewhere else. But I'll keep it updated for at least a little while.
- Converting a Subreddit into a Podcast with GPT-4
- Ask a Text-To-Speech AI (Bark) to say "Why was six afraid of seven?" but ignore the "I'm done" token and force it to just keep talking.
- [R] πΆ Bark - Text2Speech...But with Custom Voice Cloning using your own audio/text samples ποΈπ
What are some alternatives?
encodec - State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
bark-with-voice-clone - π Text-prompted Generative Audio Model - With the ability to clone voices
bark - π Text-Prompted Generative Audio Model
crowdcast - Converts a subreddit into a podcast
audiolm-pytorch - Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
TTS - :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple