audiocraft
sd-webui-lobe-theme
audiocraft | sd-webui-lobe-theme | |
---|---|---|
37 | 77 | |
19,792 | 2,217 | |
2.5% | 7.3% | |
8.3 | 9.3 | |
21 days ago | 4 days ago | |
Python | TypeScript | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
audiocraft
- [N] MusicGen - Meta's response to Google's MusicLM for text-to-music is freely available for non-commercial usage
-
Open Source Libraries
facebookresearch/audiocraft/MUSICGEN: Music Generation
- Audiocraft: a library for audio processing and generation with deep learning.
- Audiocraft is a library for audio processing and generation with deep learning
-
Meta Open Sources AudioCraft: Generative AI for Audio
https://github.com/facebookresearch/audiocraft/blob/main/LIC...
-
This is not an infinite zoom.
I asked Audiocraft to make me a "chill hip hop beat", I used framesync.xyz to make keyframes for A1111 Deforum extension. Unfortunately, I don't have the settings file anymore, but it was pretty much just a 26s clip at 15fps (440 frames) with a single prompt "a surreal painting by Magritte" and the usual negative prompt magic voodoo. Then, for every clip I used the last frame of the previous clip as init frame. I render at 512x512 and then use ESRGAN4x to upscale to 2048x2048
-
[Frostveil Series] A monk channeling its inner Ønd
However, the music was 100% AI-generated by MusicGen.
Music was entirely generated by AI using MusicGen. Video was generated using PhotoVibrance.
- Try Meta's new MusicGen text-to-audio generator here, free, up to 30 seconds in length. | Text Prompt: Van Halen Style Catchy Electric Guitar Melody Hook for intro of song with distortion
-
I connected my Roland Digital Piano to GPT and MusicGen...
If you want to know more about MusicGen, https://github.com/facebookresearch/audiocraft
sd-webui-lobe-theme
-
Upscayl – Free and Open Source AI Image Upscaler
upscayl is very approachable, but lacked many features i needed. i ended up using https://github.com/AUTOMATIC1111/stable-diffusion-webui after upscaling became part of my regular workflow, but for someone who just needs a few images enhanced, it's an ideal tool.
-
The Basics of AI Image Generation: How to create your own AI-generated image using Stable Diffusion on your local machine.
For the Git alternative, simply right-click on the location you want to put the Stable Diffusion and select “Git Bash Here”, then paste this on the CLI: git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Stable Cascade
ComfyUI is similar to Houdini in complexity, but immensely powerful. It's a joy to use.
There are also a large amount of resources available for it on YouTube, GitHub (https://github.com/comfyanonymous/ComfyUI_examples), reddit (https://old.reddit.com/r/comfyui), CivitAI, Comfy Workflows (https://comfyworkflows.com/), and OpenArt Flow (https://openart.ai/workflows/).
I still use AUTO1111 (https://github.com/AUTOMATIC1111/stable-diffusion-webui) and the recently released and heavily modified fork of AUTO1111 called Forge (https://github.com/lllyasviel/stable-diffusion-webui-forge).
-
Show HN: I made a local wrapper for Automatic 1111
Seems like an interesting project. Regarding the name, is there permission to use something so similar to AUTOMATIC1111 [1]?
> Diffusers will Cuda out of memory/perform very slowly for huge generations, like 2048x2048 images, while Auto 1111 SDK won't.
Do we have some numbers on this? I have seen AUTOMATIC1111 fall-over whilst using only half the available of GPU VRAM - there seems to be some weirdness where it tries to allocate before de-allocating the last batch or something.
> You can use any of the 6 compatible RealEsrgran models/weights with our RealEsrgran pipeline for upscaling images. Here are the model ids:
I've previously had trouble trying to use AUTOMATIC1111 upscalers, it seems like it needs more GPU VRAM than just generating the image already upscaled.
[1] https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Stable Code 3B: Coding on the Edge
You might be thinking of Fooocus: https://github.com/lllyasviel/Fooocus
The Stable Diffusion web interface that got a lot of people's attention originally was Automatic1111: https://github.com/AUTOMATIC1111/stable-diffusion-webui
Fooocus is definitely more beginner friendly. It does a lot of the prompt engineering for you. Automatic1111 has a ton of plugins, most notably ControlNet which gives you fine grained control over the images, but there is a learning curve.
- Google Imagen 2
-
Free or "practically-free" Ai picture generator?
Stable Diffusion https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Things to do, to put my old PC to use?
Make it into a stable diffusion server!
-
GTA 6 trailer screencaps, photorealistic style
There's no link version, you have to run it locally. You install it from here
-
Automatic1111 v1.7.0-RC published
Repository: AUTOMATIC1111/stable-diffusion-webui · Tag: v1.7.0-RC · Commit: 48fae7c · Released by: AUTOMATIC1111
What are some alternatives?
llama - Inference code for Llama models
stable-diffusion-webui - Stable Diffusion web UI
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
ComfyUI - The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
audiocraft-infinity-webui
automatic - SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
gpt-producer
stable-diffusion-webui-amdgpu - Stable Diffusion web UI
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
stable-diffusion-webui-ux - Stable Diffusion web UI UX
Stable-Diffusion - Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
stable-diffusion-webui-colab - stable diffusion webui colab