sd-dynamic-thresholding
sd-dynamic-thresholding | ultimate-upscale-for-automatic1111 | |
---|---|---|
26 | 52 | |
1,031 | 1,512 | |
5.9% | - | |
7.2 | 4.2 | |
14 days ago | 3 months ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sd-dynamic-thresholding
-
ZeroDiffusion -- a clean zero terminal SNR training 1.5 base model + experimental inpainting model
For outputs to look right, you will need some form of CFG rescale or dynamic thresholding in order to correct for overexposure (A1111 extensions are linked -- I am told that ComfyUI has nodes available for these functions). A good starting point for CFG rescale is 0.7, as recommended in the paper. I strongly suspect that CFG rescale is not an ideal solution and leaves a substantial training-inference gap, and when using zero terminal SNR models I find that Dynamic Thresholding can give better outputs that are closer to what I expect from the data without the brownout often caused by CFG rescale. A potential starting point for Dynamic Thresholding would be: Restart sampler, 15 CFG scale, Mimic CFG scale 15 7.5, Sawtooth on both scale schedulers, 6 for both minimum values, scheduler value 4, do not separate feature channels, ZERO, STD. You will likely have to experiment a lot with Dynamic Thresholding. (edit: small correction to DT settings)
-
Dynamic Thresholding for comfyui?
Recently switched from A1111 and i love it so far, flexibility to orchestrate complex workflows automatically instead of manual operations is a life changer. Anyhow, one extension i like on A1111 was this one: https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
-
How do I implement Dynamic Thresholding (CFG scale fix) in ComfyUI?
In the Automatic1111 webui, there is a Dynamic Thresholding (CFG scale fix) extension that:
-
How to diffuse better faces?
Ive found using ADetailer (https://github.com/Bing-su/adetailer, using their reccomended advanced settings and face_yolov8n.pt) and Dynamic Thresholding (CFG set to 12 and Mimic to 7) has vastly improved my face renders. (https://github.com/mcmonkeyprojects/sd-dynamic-thresholding) GL!
-
Kohya UI settings as asked (style+character training)
The output LoRA works best with CFG at 4, because at 7 it gets that gasoline colors and contrast of overbaking, but I guess this is a tradeoff of that many steps in total (5200) since the earlier snapshots were not that good in style and with character details. You can use a workaround like the Dynamic Trescholding extention: https://github.com/mcmonkeyprojects/sd-dynamic-thresholding.git - helps a lot in many cases when you want a high CFG but the model/lora overbakes them (it mimics a lower CFG while keeping the high CFG details and prompt alignment).
-
Does anyone know how to create this type of hyper realistic pic?
Use sd-dynamic-thresholding extension (set CFG scale to 12 or more and mimic CFG scale to 7): https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
- ControlNet Reference-Only problems
-
What's your favorite small tweaks to make? I'll go first
Tweak this up or down for small changes. Too far and you’ll get a different image. Extensions like Dynamic Thresholding can let you go much higher without the overexposed look.
-
Blurred/Low quality/Low details images
Turn CFG scale down or maybe use this extension, I've never used Dynamic Thresholding before but I think its what you want
- Dynamic threshold & Offset noise - The answer to oversaturated images?
ultimate-upscale-for-automatic1111
-
Ultimate Upscale for A1111 BUG
So I have this problem while trying to use "Ultimate Upscale for automatic1111" plugin in A1111, but I cannot find any information about it on any github issue, reddit post or other help platforms.
-
Mass generate images?
If you don't already have it, install the Ultimate SD Upscale script. It's in the Automatic1111 available extensions list, or you can install it from the URL. It gives you the option to choose from whatever upscalers you have installed.
- Adventure Girl
- Can't use the SD Upscale script... errors every time. Suggestions?
-
I love the Tile ControlNet, but it's really easy to overdo. Look at this monstrosity of tiny detail I made by accident.
Basically you'd select the "tile" Controlnet (both preprocessor and Controlnet model), and then you'd use either tiled diffusion or ultimate SD upscaler to create a tile upscale.
- Upscale photos and artwork in A1111
- ControlNet Reference-Only problems
-
Animals as Aztec/Mayan warriors
These were made over multiple iterations, but the workflow basically was - Generate seed image using either lyriel, edge of realism, absolute reality, rpgv4 - One I get an actual animal, use in control net reference_only, regenerate with photography prompt - Upscale with ultimate scale https://github.com/Coyote-A/ultimate-upscale-for-automatic1111
- Some 4k wallpaper i made while trying out a few extensions
- Controlnet : Reference only test
What are some alternatives?
stable-diffusion-webui-anti-burn - Extension for AUTOMATIC1111/stable-diffusion-webui for smoothing generated images by skipping a few very last steps and averaging together some images before them.
chaiNNer - A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.
Stable-Diffusion - Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
multidiffusion-upscaler-for-automatic1111 - Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
adetailer - Auto detecting, masking and inpainting with detection model.
InvokeAI - InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
sd_webui_SAG
automatic - SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
sd-dynamic-prompts - A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
stable-diffusion-webui - Stable Diffusion web UI