Top 23 Pytorch Open-Source Projects

stable-diffusion-webui

2,808 131,121 9.9 Python

Stable Diffusion web UI

Project mention: Show HN: I made an app to use local AI as daily driver | news.ycombinator.com | 2024-02-27

* LLaVA model: I'll add more documentation. You are right Llava could not generate images. For image generation I don't have immediate plans, but checkout these projects for local image generation.
- https://diffusionbee.com/
- https://github.com/comfyanonymous/ComfyUI
- https://github.com/AUTOMATIC1111/stable-diffusion-webui

transformers

180 126,170 10.0 Python

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Project mention: Reading list to join AI field from Hugging Face cofounder | news.ycombinator.com | 2024-05-18

Not sure what you are implying. Thomas Wolf has the second highest number of commits on HuggingFace/transformers. He is clearly competent & deeply technical
https://github.com/huggingface/transformers/

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Keras

79 61,044 9.9 Python

Deep Learning for humans

Project mention: Side Quest #3: maybe the real Deepfakes were the friends we made along the way | dev.to | 2024-05-20

def batcher_from_directory(batch_size:int, dataset_path:str, shuffle=False,seed=None) -> tf.data.Dataset: """ Return a tensorflow Dataset object that returns images and spectrograms as required. Partly inspired by https://github.com/keras-team/keras/blob/v3.3.3/keras/src/utils/image_dataset_utils.py Args: batch_size: The batch size. dataset_path: The path to the dataset folder which must contain the image folder and audio folder. shuffle: Whether to shuffle the dataset. Default to False. seed: The seed for the shuffle. Default to None. """ image_dataset_path = os.path.join(dataset_path, "image") # create the foundation datasets og_dataset = tf.data.Dataset.from_generator(lambda: original_image_path_gen(image_dataset_path), output_signature=tf.TensorSpec(shape=(), dtype=tf.string)) og_dataset = og_dataset.repeat(None) # repeat indefinitely ref_dataset = tf.data.Dataset.from_generator(lambda: ref_image_path_gen(image_dataset_path), output_signature=(tf.TensorSpec(shape=(), dtype=tf.string), tf.TensorSpec(shape=(), dtype=tf.bool))) ref_dataset = ref_dataset.repeat(None) # repeat indefinitely # create the input datasets og_image_dataset = og_dataset.map(lambda x: tf.py_function(load_image, [x, tf.convert_to_tensor(False, dtype=tf.bool)], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) masked_image_dataset = og_image_dataset.map(lambda x: tf.py_function(load_masked_image, [x], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) ref_image_dataset = ref_dataset.map(lambda x, y: tf.py_function(load_image, [x, y], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) audio_spec_dataset = og_dataset.map(lambda x: tf.py_function(load_audio_data, [x, dataset_path], tf.float64), num_parallel_calls=tf.data.AUTOTUNE) unsync_spec_dataset = ref_dataset.map(lambda x, _: tf.py_function(load_audio_data, [x, dataset_path], tf.float64), num_parallel_calls=tf.data.AUTOTUNE) # ensure shape as tensorflow does not accept unknown shapes og_image_dataset = og_image_dataset.map(lambda x: tf.ensure_shape(x, IMAGE_SHAPE)) masked_image_dataset = masked_image_dataset.map(lambda x: tf.ensure_shape(x, MASKED_IMAGE_SHAPE)) ref_image_dataset = ref_image_dataset.map(lambda x: tf.ensure_shape(x, IMAGE_SHAPE)) audio_spec_dataset = audio_spec_dataset.map(lambda x: tf.ensure_shape(x, AUDIO_SPECTROGRAM_SHAPE)) unsync_spec_dataset = unsync_spec_dataset.map(lambda x: tf.ensure_shape(x, AUDIO_SPECTROGRAM_SHAPE)) # multi input using https://discuss.tensorflow.org/t/train-a-model-on-multiple-input-dataset/17829/4 full_dataset = tf.data.Dataset.zip((masked_image_dataset, ref_image_dataset, audio_spec_dataset, unsync_spec_dataset), og_image_dataset) # if shuffle: # full_dataset = full_dataset.shuffle(buffer_size=batch_size * 8, seed=seed) # not sure why buffer size is such # batch full_dataset = full_dataset.batch(batch_size=batch_size) return full_dataset

Real-Time-Voice-Cloning

96 50,951 0.0 Python

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

nn

26 48,933 7.7 Jupyter Notebook

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
yolov5

129 47,375 8.8 Python

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Project mention: จำแนกสายพันธ์ุหมากับแมวง่ายๆด้วยYoLoV5 | dev.to | 2024-04-15

Ref https://www.youtube.com/watch?v=0GwnxFNfZhM https://github.com/ultralytics/yolov5 https://dev.to/gfstealer666/kaaraich-yolo-alkrithuemainkaartrwcchcchabwatthu-object-detection-3lef https://www.kaggle.com/datasets/devdgohil/the-oxfordiiit-pet-dataset/data

Made-With-ML

51 36,004 6.8 Jupyter Notebook

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Project mention: [D] How do you keep up to date on Machine Learning? | /r/learnmachinelearning | 2023-08-13

Made With ML

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
GFPGAN

93 34,737 2.7 Python

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Project mention: Ask HN: What is the state of the art in AI photo enhancement? | news.ycombinator.com | 2024-01-24

ComfyUI

125 35,239 9.9 Python

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Project mention: ComflowySpace: An open-source version of better ComfyUI | news.ycombinator.com | 2024-03-08

The non standard licensing puts me off in contributing or using this. It is frustrating how the phrase opensource has been diluted in the AI/ML community. ComfyUI has a GPL license [1] while this project uses this [2]. I honestly don't know where I stand since this is a legal document using non-standard phrasing to describe how the rights around the source code.
This is a project that uses a custom license with less rights provided than the ComfyUI project it self-describes as improving. Am not sure the title is reflective of the project.
[1] - https://github.com/comfyanonymous/ComfyUI/blob/master/LICENS...

MockingBird

9 34,002 5.8 Python

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
DeepSpeed

51 33,018 9.8 Python

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Project mention: Can we discuss MLOps, Deployment, Optimizations, and Speed? | /r/LocalLLaMA | 2023-12-06

DeepSpeed can handle parallelism concerns, and even offload data/model to RAM, or even NVMe (!?) . I'm surprised I don't see this project used more.

Ray

43 31,414 10.0 Python

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Project mention: Ray: Unified framework for scaling AI and Python applications | news.ycombinator.com | 2024-05-03

pytorch-image-models

35 30,008 9.4 Python

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Project mention: FLaNK AI Weekly 18 March 2024 | dev.to | 2024-03-18

TTS

233 29,831 9.4 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Project mention: Show HN: Pi-C.A.R.D, a Raspberry Pi Voice Assistant | news.ycombinator.com | 2024-05-13

When I did a similar thing (but with less LLM) I liked https://github.com/coqui-ai/TTS but back then I needed to cut out the conversion step from tensor to a list of numbers to make it work really nicely.

fairseq

89 29,402 6.0 Python

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Project mention: Sequence-to-Sequence Toolkit Written in Python | news.ycombinator.com | 2024-03-30

pytorch-tutorial

3 29,218 0.0 Python

PyTorch Tutorial for Deep Learning Researchers
mmdetection

23 28,036 8.4 Python

OpenMMLab Detection Toolbox and Benchmark
pytorch-lightning

9 27,064 9.9 Python

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Project mention: SB-1047 will stifle open-source AI and decrease safety | news.ycombinator.com | 2024-04-29

It's very easy to get started, right in your Terminal, no fees! No credit card at all.
And there are cloud providers like https://replicate.com/ and https://lightning.ai/ that will let you use your LLM via an API key just like you did with OpenAI if you need that.
You don't need OpenAI - nobody does.

Real-ESRGAN

131 26,384 2.7 Python

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Project mention: AI-Powered Nvidia RTX Video HDR Transforms Standard Video into HDR Video | news.ycombinator.com | 2024-01-24

It's not exactly what you're after, as it's anime specific and you need to process the video yourself (eg disassemble to frames, run the upscaler, then assemble back to a movie file), but Real-ESRGAN is really good:
https://github.com/xinntao/Real-ESRGAN/
It's pretty brilliant for cleaning up very old, low resolution anime.

netron

36 26,355 9.9 JavaScript

Visualizer for neural network, deep learning and machine learning models

Project mention: Google Edge AI Model Explorer | news.ycombinator.com | 2024-05-14

fastai

9 25,691 8.0 Jupyter Notebook

The fastai deep learning library
ultralytics

27 23,574 9.8 Python

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Project mention: The CEO of Ultralytics (yolov8) using LLMs to engage with commenters on GitHub | news.ycombinator.com | 2024-02-12

Yep, I noticed this a while ago. It posts easily identifiable ChatGPT responses. It also posts garbage wrong answers which makes it worse than useless. Totally disrespectful to the userbase.
https://github.com/ultralytics/ultralytics/issues/5748#issue...

JARVIS

52 23,136 7.2 Python

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Project mention: FLaNK Stack 26 February 2024 | dev.to | 2024-02-26

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Pytorch related posts

Reading list to join AI field from Hugging Face cofounder

1 project | news.ycombinator.com | 18 May 2024
Llama3.np: pure NumPy implementation of Llama3

10 projects | news.ycombinator.com | 16 May 2024
Apple to Power AI Features with M2 Ultra Servers

2 projects | news.ycombinator.com | 10 May 2024
AlphaFold 3 predicts the structure and interactions of all of life's molecules

6 projects | news.ycombinator.com | 8 May 2024
XLSTM: Extended Long Short-Term Memory

2 projects | news.ycombinator.com | 8 May 2024
Intel Arc A770: Arrays larger than 4GB crashes

2 projects | news.ycombinator.com | 7 May 2024
Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024

2 projects | dev.to | 2 May 2024
A note from our sponsor - SaaSHub
www.saashub.com | 20 May 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Pytorch projects? This list will help you:

	Project	Stars
1	stable-diffusion-webui	131,121
2	transformers	126,170
3	Keras	61,044
4	Real-Time-Voice-Cloning	50,951
5	nn	48,933
6	yolov5	47,375
7	Made-With-ML	36,004
8	GFPGAN	34,737
9	ComfyUI	35,239
10	MockingBird	34,002
11	DeepSpeed	33,018
12	Ray	31,414
13	pytorch-image-models	30,008
14	TTS	29,831
15	fairseq	29,402
16	pytorch-tutorial	29,218
17	mmdetection	28,036
18	pytorch-lightning	27,064
19	Real-ESRGAN	26,384
20	netron	26,355
21	fastai	25,691
22	ultralytics	23,574
23	JARVIS	23,136