Top 23 Python Gpt Projects

gpt4free

44 57,799 9.9 Python

The official gpt4free repository | various collection of powerful language models

Project mention: gpt4-openai-api VS gpt4free - a user suggested alternative | libhunt.com/r/gpt4-openai-api | 2024-01-04

I cant install

MetaGPT

32 39,707 10.0 Python

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Project mention: Can AI replace a co-founder? | news.ycombinator.com | 2024-01-07

https://github.com/geekan/MetaGPT :
> MetaGPT takes a one line requirement as input and outputs user stories / competitive analysis / requirements / data structures / APIs / documents, etc.
https://news.ycombinator.com/item?id=29141796 ; "Co-Founder Equity Calculator"
"Ask HN: What are your go to SaaS products for startups/MVPs?" (2020) https://news.ycombinator.com/item?id=23535828 ; FounderKit, StackShare
> USA Small Business Administration: "10 steps to start your business." https://www.sba.gov/starting-business/how-start-business/10-...
>> "Startup Incorporation Checklist: How to bootstrap a Delaware C-corp (or S-corp) with employee(s) in California" https://github.com/leonar15/startup-checklist

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
MindsDB

78 21,424 10.0 Python

The platform for customizing AI from enterprise data

Project mention: What’s the Difference Between Fine-tuning, Retraining, and RAG? | dev.to | 2024-04-08

Check us out on GitHub.

LLaMA-Factory

3 21,791 9.9 Python

Unify Efficient Fine-Tuning of 100+ LLMs

Project mention: FLaNK-AIM Weekly 06 May 2024 | dev.to | 2024-05-06

vllm

31 19,344 9.9 Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Project mention: AI leaderboards are no longer useful. It's time to switch to Pareto curves | news.ycombinator.com | 2024-04-30

I guess the root cause of my claim is that OpenAI won't tell us whether or not GPT-3.5 is an MoE model, and I assumed it wasn't. Since GPT-3.5 is clearly nondeterministic at temp=0, I believed the nondeterminism was due to FPU stuff, and this effect was amplified with GPT-4's MoE. But if GPT-3.5 is also MoE then that's just wrong.
What makes this especially tricky is that small models are truly 100% deterministic at temp=0 because the relative likelihoods are too coarse for FPU issues to be a factor. I had thought 3.5 was big enough that some of its token probabilities were too fine-grained for the FPU. But that's probably wrong.
On the other hand, it's not just GPT, there are currently floating-point difficulties in vllm which significantly affect the determinism of any model run on it: https://github.com/vllm-project/vllm/issues/966 Note that a suggested fix is upcasting to float32. So it's possible that GPT-3.5 is using an especially low-precision float and introducing nondeterminism by saving money on compute costs.
Sadly I do not have the money[1] to actually run a test to falsify any of this. It seems like this would be a good little research project.
[1] Or the time, or the motivation :) But this stuff is expensive.

best-of-ml-python

16 15,633 7.8 Python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
DocsGPT

35 14,208 9.8 Python

GPT-powered chat for documentation, chat with your documents

Project mention: You can earn free shirt by contributing to DocsGPT | /r/hacktoberfest | 2023-10-03

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
RWKV-LM

84 11,747 8.8 Python

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Project mention: Do LLMs need a context window? | news.ycombinator.com | 2023-12-25

https://github.com/BlinkDL/RWKV-LM#rwkv-discord-httpsdiscord... lists a number of implementations of various versions of RWKV.
https://github.com/BlinkDL/RWKV-LM#rwkv-parallelizable-rnn-w... :
> RWKV: Parallelizable RNN with Transformer-level LLM Performance (pronounced as "RwaKuv", from 4 major params: R W K V)
> RWKV is an RNN with Transformer-level LLM performance, which can also be directly trained like a GPT transformer (parallelizable). And it's 100% attention-free. You only need the hidden state at position t to compute the state at position t+1. You can use the "GPT" mode to quickly compute the hidden state for the "RNN" mode.
> So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding (using the final hidden state).
> "Our latest version is RWKV-6,*

dolly

41 10,787 7.2 Python

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Project mention: "[D]" Using data from Alpaca for a commercial version of a Open LLM | /r/MachineLearning | 2023-07-02

h2ogpt

28 10,686 10.0 Python

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Project mention: Ask HN: How do I train a custom LLM/ChatGPT on my own documents in Dec 2023? | news.ycombinator.com | 2023-12-24

As others have said you want RAG.
The most feature complete implementation I've seen is h2ogpt[0] (not affiliated).
The code is kind of a mess (most of the logic is in an ~8000 line python file) but it supports ingestion of everything from YouTube videos to docx, pdf, etc - either offline or from the web interface. It uses langchain and a ton of additional open source libraries under the hood. It can run directly on Linux, via docker, or with one-click installers for Mac and Windows.
It has various model hosting implementations built in - transformers, exllama, llama.cpp as well as support for model serving frameworks like vLLM, HF TGI, etc or just OpenAI.
You can also define your preferred embedding model along with various other parameters but I've found the out of box defaults to be pretty sane and usable.
[0] - https://github.com/h2oai/h2ogpt

awesome-chatgpt-zh

2 10,014 7.8 Python

ChatGPT 中文指南🔥，ChatGPT 中文调教指南，指令指南，应用开发指南，精选资源清单，更好的使用 chatGPT 让你的生产力 up up up! 🚀
AudioGPT

4 9,796 3.7 Python

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
petals

98 8,730 8.3 Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Project mention: Mistral Large | news.ycombinator.com | 2024-02-26

So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/

promptflow

5 8,249 9.9 Python

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Project mention: A suite of tools designed to streamline the development cycle of LLM-based apps | news.ycombinator.com | 2024-04-12

text-generation-inference

29 7,995 9.6 Python

Large Language Model Text Generation Inference

Project mention: FLaNK AI-April 22, 2024 | dev.to | 2024-04-22

VALL-E-X

2 7,249 8.8 Python

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

GPTCache

43 6,481 7.7 Python

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Project mention: Ask HN: What are the drawbacks of caching LLM responses? | news.ycombinator.com | 2024-03-15

Just found this: https://github.com/zilliztech/GPTCache which seems to address this idea/issue.

awesome-open-gpt

3 5,129 4.5 Python

Collection of Open Source Projects Related to GPT，GPT相关开源项目合集🚀、精选🔥🔥

Project mention: The best free ChatGPT alternatives | /r/ArtificialInteligence | 2023-06-20

Extract from awesome-open-gpt

marvin

17 4,825 9.9 Python

✨ Build AI interfaces that spark joy

Project mention: I'm puzzled how anyone trusts ChatGPT for code | news.ycombinator.com | 2024-05-08

I've never tried it myself, but Prefect does have something like this with their Marvin AI library for Python.
https://github.com/PrefectHQ/marvin?tab=readme-ov-file#-buil...

TaskingAI

1 4,837 9.4 Python

The open source platform for AI-native application development.

Project mention: TaskingAI: AI-native app development platform | news.ycombinator.com | 2024-01-30

awesome-pretrained-chinese-nlp-models

1 4,279 8.9 Python

Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合
marqo

114 4,177 9.3 Python

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Project mention: Are we at peak vector database? | news.ycombinator.com | 2024-01-25

We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo

Baichuan2

1 3,960 7.3 Python

A series of large language models developed by Baichuan Intelligent Technology

Project mention: Baichuan 2 | news.ycombinator.com | 2023-10-12

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Gpt related posts

I'm puzzled how anyone trusts ChatGPT for code

4 projects | news.ycombinator.com | 8 May 2024
Agents of Change: Navigating the Rise of AI Agents in 2024

8 projects | dev.to | 2 May 2024
Open-source SDK for adding custom code interpreters to AI apps

2 projects | news.ycombinator.com | 2 May 2024
AI leaderboards are no longer useful. It's time to switch to Pareto curves

1 project | news.ycombinator.com | 30 Apr 2024
FLaNK AI Weekly for 29 April 2024

44 projects | dev.to | 29 Apr 2024
Show HN: Open-source SDK for creating custom code interpreters with any LLM

5 projects | news.ycombinator.com | 19 Apr 2024
A suite of tools designed to streamline the development cycle of LLM-based apps

1 project | news.ycombinator.com | 12 Apr 2024
A note from our sponsor - SaaSHub
www.saashub.com | 17 May 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Gpt projects in Python? This list will help you:

	Project	Stars
1	gpt4free	57,799
2	MetaGPT	39,707
3	MindsDB	21,424
4	LLaMA-Factory	21,791
5	vllm	19,344
6	best-of-ml-python	15,633
7	DocsGPT	14,208
8	RWKV-LM	11,747
9	dolly	10,787
10	h2ogpt	10,686
11	awesome-chatgpt-zh	10,014
12	AudioGPT	9,796
13	petals	8,730
14	promptflow	8,249
15	text-generation-inference	7,995
16	VALL-E-X	7,249
17	GPTCache	6,481
18	awesome-open-gpt	5,129
19	marvin	4,825
20	TaskingAI	4,837
21	awesome-pretrained-chinese-nlp-models	4,279
22	marqo	4,177
23	Baichuan2	3,960

Python Gpt

Top 23 Python Gpt Projects

Python Gpt related posts

I'm puzzled how anyone trusts ChatGPT for code

Agents of Change: Navigating the Rise of AI Agents in 2024

Open-source SDK for adding custom code interpreters to AI apps

AI leaderboards are no longer useful. It's time to switch to Pareto curves

FLaNK AI Weekly for 29 April 2024

Show HN: Open-source SDK for creating custom code interpreters with any LLM

A suite of tools designed to streamline the development cycle of LLM-based apps

Index