Top 23 Python Open-Source Projects

system-design-primer

380 257,707 0.0 Python

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Project mention: 10 GitHub repositories that every developer must follow | dev.to | 2024-02-21

✅ donnemartin/system-design-primer: https://github.com/donnemartin/system-design-primer

awesome-python

86 207,283 7.0 Python

An opinionated list of awesome Python frameworks, libraries, software and resources.

Project mention: Top 10 GitHub Repositories for Python and Java Developers | dev.to | 2024-05-03

8. Vinta/Awesome-python - This repository is a curated list of top Python frameworks, libraries, and tools for a variety of purposes. It's a must-visit for any developer looking to expand their Python skills and discover new resources. https://github.com/vinta/awesome-python

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
tensorflow

223 182,857 10.0 C++

An Open Source Machine Learning Framework for Everyone

Project mention: Side Quest Devblog #1: These Fakes are getting Deep | dev.to | 2024-04-29

# L2-normalize the encoding tensors image_encoding = tf.math.l2_normalize(image_encoding, axis=1) audio_encoding = tf.math.l2_normalize(audio_encoding, axis=1) # Find euclidean distance between image_encoding and audio_encoding # Essentially trying to detect if the face is saying the audio # Will return nan without the 1e-12 offset due to https://github.com/tensorflow/tensorflow/issues/12071 d = tf.norm((image_encoding - audio_encoding) + 1e-12, ord='euclidean', axis=1, keepdims=True) discriminator = keras.Model(inputs=[image_input, audio_input], outputs=[d], name="discriminator")

TheAlgorithms

62 180,485 9.7 Python

All Algorithms implemented in Python

Project mention: Top 10 GitHub Repositories for Python and Java Developers | dev.to | 2024-05-03

3. TheAlgorithms/Python - For those interested in algorithms and data structures, this repository offers Python implementations for a wide range of algorithms. It's a great way to deepen understanding of algorithmic learning with Python. https://github.com/TheAlgorithms/Python

project-based-learning

198 171,454 0.0

Curated list of project-based tutorials

Project mention: Top 10 GitHub Repositories for Python and Java Developers | dev.to | 2024-05-03

9. Practical-tutorials/project-based-learning - This repository provides links to project-based tutorials for various programming languages, with a focus on Python. It's a great way to gain practical experience and build your developer portfolio. https://github.com/practical-tutorials/project-based-learning

AutoGPT

181 162,161 9.9 JavaScript

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Project mention: Calling code with local LLM is a hoax | dev.to | 2024-05-20

AutoGPT is a framework that seems nice. It has a cool CLI and a flutter UI to create agents from the browser. Its main purpose is to work with your local stuff (documents, audio, videos, etc)

stable-diffusion-webui

2,808 131,121 9.9 Python

Stable Diffusion web UI

Project mention: Show HN: I made an app to use local AI as daily driver | news.ycombinator.com | 2024-02-27

* LLaVA model: I'll add more documentation. You are right Llava could not generate images. For image generation I don't have immediate plans, but checkout these projects for local image generation.
- https://diffusionbee.com/
- https://github.com/comfyanonymous/ComfyUI
- https://github.com/AUTOMATIC1111/stable-diffusion-webui

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
transformers

180 126,170 10.0 Python

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Project mention: Reading list to join AI field from Hugging Face cofounder | news.ycombinator.com | 2024-05-18

Not sure what you are implying. Thomas Wolf has the second highest number of commits on HuggingFace/transformers. He is clearly competent & deeply technical
https://github.com/huggingface/transformers/

30-seconds-of-code

37 119,530 9.8 JavaScript

Short code snippets for all your development needs

Project mention: 30-seconds-of-code: Short code snippets for all your development needs | news.ycombinator.com | 2023-12-25

HelloGitHub

1 85,663 6.5 Python

:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
thefuck

305 83,068 4.2 Python

Magnificent app which corrects your previous console command.

Project mention: Control Linux based distros using hand gestures using OpenCV, GTK, Mediapipe | news.ycombinator.com | 2024-04-14

Are you by chance interested in a command named after the four-letter word, which automatically fixes and reruns the last command: https://github.com/nvbn/thefuck

Pytorch

341 78,436 10.0 Python

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Project mention: PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed | news.ycombinator.com | 2024-05-10

Django

486 77,104 9.8 Python

The Web framework for perfectionists with deadlines.

Project mention: Using Google Cloud Firestore with Django's ORM | dev.to | 2024-05-20

Django has long been the most popular Python framework for developing web applications. One of its most powerful features is its built in object-relational mapper (ORM) which is designed to flexibly and safely interact with SQL databases in an abstract way.

fastapi

471 71,659 9.8 Python

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Project mention: Python FastAPI: Integrating OAuth2 Security with the Application's Own Authentication Process | dev.to | 2024-05-13

yt-dlp

2,361 72,048 9.8 Python

A feature-rich command-line audio/video downloader

Project mention: Open Source YouTube to MP3 Downloader | news.ycombinator.com | 2024-05-12

I think the "or other url" is important here. I didn't realise for a long time that I could put a reddit, Twitter or other URLs in there to download videos. You can find a complete list of supported sites here:
https://github.com/yt-dlp/yt-dlp/blob/master/supportedsites....

Home Assistant

1,411 69,033 10.0 Python

:house_with_garden: Open source home automation that puts local control and privacy first.

Project mention: Do not buy a Hisense TV (or at least keep them offline) | news.ycombinator.com | 2024-04-20

Apparently the same issue has been reported with Philips TV [1] and Fritz!Box [2] as well.
[1] https://github.com/home-assistant/core/issues/73643#issuecom...
[2] https://forum.openwrt.org/t/minidlna-creates-new-media-serve...

ML-For-Beginners

28 67,267 7.6 HTML

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Project mention: Good coding groups for black women? | news.ycombinator.com | 2024-01-13

- https://github.com/microsoft/ML-For-Beginners
Also check out this list Pitt puts out every year:

Flask

135 66,538 8.7 Python

The Python micro framework for building web applications.

Project mention: Ask HN: High quality Python scripts or small libraries to learn from | news.ycombinator.com | 2024-04-19

I'd suggest Flask or some of the smaller projects in the Pallets ecosystem:
https://github.com/pallets/flask

devops-exercises

37 63,930 7.0 Python

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

Project mention: DevOps Exercises | news.ycombinator.com | 2023-08-07

Ansible

391 61,353 9.8 Python

Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

Project mention: Ansible Basics: Your First HelloWorld Playbook 🚀 | dev.to | 2024-04-27

Ansible is an open-source IT automation tool that simplifies application deployment, cloud provisioning, and configuration management across diverse environments. It uses a declarative language to describe the desired state of the system, and then takes the necessary actions to achieve that state. Ansible has become incredibly popular due to its simplicity, agentless architecture, and extensive community support. Document: ansible.com, ansible basics

Keras

79 61,044 9.9 Python

Deep Learning for humans

Project mention: Side Quest #3: maybe the real Deepfakes were the friends we made along the way | dev.to | 2024-05-20

def batcher_from_directory(batch_size:int, dataset_path:str, shuffle=False,seed=None) -> tf.data.Dataset: """ Return a tensorflow Dataset object that returns images and spectrograms as required. Partly inspired by https://github.com/keras-team/keras/blob/v3.3.3/keras/src/utils/image_dataset_utils.py Args: batch_size: The batch size. dataset_path: The path to the dataset folder which must contain the image folder and audio folder. shuffle: Whether to shuffle the dataset. Default to False. seed: The seed for the shuffle. Default to None. """ image_dataset_path = os.path.join(dataset_path, "image") # create the foundation datasets og_dataset = tf.data.Dataset.from_generator(lambda: original_image_path_gen(image_dataset_path), output_signature=tf.TensorSpec(shape=(), dtype=tf.string)) og_dataset = og_dataset.repeat(None) # repeat indefinitely ref_dataset = tf.data.Dataset.from_generator(lambda: ref_image_path_gen(image_dataset_path), output_signature=(tf.TensorSpec(shape=(), dtype=tf.string), tf.TensorSpec(shape=(), dtype=tf.bool))) ref_dataset = ref_dataset.repeat(None) # repeat indefinitely # create the input datasets og_image_dataset = og_dataset.map(lambda x: tf.py_function(load_image, [x, tf.convert_to_tensor(False, dtype=tf.bool)], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) masked_image_dataset = og_image_dataset.map(lambda x: tf.py_function(load_masked_image, [x], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) ref_image_dataset = ref_dataset.map(lambda x, y: tf.py_function(load_image, [x, y], tf.float32), num_parallel_calls=tf.data.AUTOTUNE) audio_spec_dataset = og_dataset.map(lambda x: tf.py_function(load_audio_data, [x, dataset_path], tf.float64), num_parallel_calls=tf.data.AUTOTUNE) unsync_spec_dataset = ref_dataset.map(lambda x, _: tf.py_function(load_audio_data, [x, dataset_path], tf.float64), num_parallel_calls=tf.data.AUTOTUNE) # ensure shape as tensorflow does not accept unknown shapes og_image_dataset = og_image_dataset.map(lambda x: tf.ensure_shape(x, IMAGE_SHAPE)) masked_image_dataset = masked_image_dataset.map(lambda x: tf.ensure_shape(x, MASKED_IMAGE_SHAPE)) ref_image_dataset = ref_image_dataset.map(lambda x: tf.ensure_shape(x, IMAGE_SHAPE)) audio_spec_dataset = audio_spec_dataset.map(lambda x: tf.ensure_shape(x, AUDIO_SPECTROGRAM_SHAPE)) unsync_spec_dataset = unsync_spec_dataset.map(lambda x: tf.ensure_shape(x, AUDIO_SPECTROGRAM_SHAPE)) # multi input using https://discuss.tensorflow.org/t/train-a-model-on-multiple-input-dataset/17829/4 full_dataset = tf.data.Dataset.zip((masked_image_dataset, ref_image_dataset, audio_spec_dataset, unsync_spec_dataset), og_image_dataset) # if shuffle: # full_dataset = full_dataset.shuffle(buffer_size=batch_size * 8, seed=seed) # not sure why buffer size is such # batch full_dataset = full_dataset.batch(batch_size=batch_size) return full_dataset

superset

138 59,473 9.9 TypeScript

Apache Superset is a Data Visualization and Data Exploration Platform

Project mention: Show HN: Open-source BI and analytics for engineers | news.ycombinator.com | 2024-05-15

We are looking at moving our Power BI stuff to Apache Superset [1]. How does this compare to Superset?
[1] https://superset.apache.org/

scikit-learn

82 58,344 9.9 Python

scikit-learn: machine learning in Python

Project mention: How to Build a Logistic Regression Model: A Spam-filter Tutorial | dev.to | 2024-05-05

Online Courses: Coursera: "Machine Learning" by Andrew Ng edX: "Introduction to Machine Learning" by MIT Tutorials: Scikit-learn documentation: https://scikit-learn.org/ Kaggle Learn: https://www.kaggle.com/learn Books: "Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow" by Aurélien Géron "The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani, and Jerome Friedman By understanding the core concepts of logistic regression, its limitations, and exploring further resources, you'll be well-equipped to navigate the exciting world of machine learning!

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python related posts

Calling code with local LLM is a hoax

1 project | dev.to | 20 May 2024
Pyright, but Based

1 project | news.ycombinator.com | 20 May 2024
Using Google Cloud Firestore with Django's ORM

3 projects | dev.to | 20 May 2024
FLaNK-AIM: 20 May 2024 Weekly

28 projects | dev.to | 20 May 2024
Setting up a standalone SQLAlchemy 2.0 ORM application

4 projects | dev.to | 20 May 2024
Reverse-engineered Shazam audio signature generator

1 project | news.ycombinator.com | 20 May 2024
Show HN: ffmpeg-english "capture from /dev/video0 every 1 second to jpg files"

9 projects | news.ycombinator.com | 19 May 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 20 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Python projects? This list will help you:

	Project	Stars
1	system-design-primer	257,707
2	awesome-python	207,283
3	tensorflow	182,857
4	TheAlgorithms	180,485
5	project-based-learning	171,454
6	AutoGPT	162,161
7	stable-diffusion-webui	131,121
8	transformers	126,170
9	30-seconds-of-code	119,530
10	HelloGitHub	85,663
11	thefuck	83,068
12	Pytorch	78,436
13	Django	77,104
14	fastapi	71,659
15	yt-dlp	72,048
16	Home Assistant	69,033
17	ML-For-Beginners	67,267
18	Flask	66,538
19	devops-exercises	63,930
20	Ansible	61,353
21	Keras	61,044
22	superset	59,473
23	scikit-learn	58,344