SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python large-language-model Projects
-
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
-
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
-
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeruIPS'2023) https://www.camel-ai.org (by camel-ai)
-
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
I recently managed to manually install the AnkiBrain addon, utilizing my personal ChatGPT API key. I'd like to extend my appreciation for creating such a useful tool. However, I've noticed a significant difference in speed compared to a local GUI, similar to what's offered by GPT Academic.
I'd like to share with you today the Chinese-Alpaca-Plus-13B-GPTQ model, which is the GPTQ format quantised 4bit models of Yiming Cui's Chinese-LLaMA-Alpaca 13B for GPU reference.
Project mention: Haystack DB – 10x faster than FAISS with binary embeddings by default | news.ycombinator.com | 2024-04-28I was confused for a bit but there is no relation to https://haystack.deepset.ai/
Qwen: https://github.com/QwenLM/Qwen
Project mention: Run 70B LLM Inference on a Single 4GB GPU with This New Technique | news.ycombinator.com | 2023-12-03
Here’s another one - it’s older but has some interesting charts and graphs.
https://arxiv.org/abs/2303.18223
So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/
The model license:
https://github.com/01-ai/Yi/blob/main/MODEL_LICENSE_AGREEMEN...
1) Your use of the Yi Series Models must comply with the Laws and Regulations as
Project mention: Show HN: FileKitty – Combine and label text files for LLM prompt contexts | news.ycombinator.com | 2024-05-01
Project mention: Baichuan 7B reaches top of LLM leaderboard for it's size (New foundation model 4K tokens) | /r/LocalLLaMA | 2023-06-17GitHub: baichuan-inc/baichuan-7B: A large-scale 7B pretraining language model developed by BaiChuan-Inc. (github.com)
Depending on your use case, https://openchat.team/ might be woth looking into
We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo
Python large-language-models related posts
-
Show HN: Generate a Quiz from Any Url
-
TimesFM (Time Series Foundation Model) for time-series forecasting
-
Financial Market Applications of LLMs
-
Implementation for Mini-Gemini
-
News DataStax just bought our startup Langflow
-
Show HN: I made a library for LLM prompt injection/exploit/jailbreak detection
-
Mini-Gemini: Mining the Potential of Multi-Modality Vision Language Models
-
A note from our sponsor - SaaSHub
www.saashub.com | 20 May 2024
Index
What are some of the best open-source large-language-model projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | gpt_academic | 58,363 |
2 | LLaMA-Factory | 22,453 |
3 | Chinese-LLaMA-Alpaca | 17,539 |
4 | ChatGLM2-6B | 15,546 |
5 | haystack | 13,883 |
6 | MOSS | 11,823 |
7 | Qwen | 11,430 |
8 | ml-engineering | 9,928 |
9 | FlexGen | 9,022 |
10 | LLMSurvey | 9,037 |
11 | petals | 8,730 |
12 | nebuly | 8,363 |
13 | deeplake | 7,751 |
14 | Yi | 7,250 |
15 | txtai | 7,111 |
16 | PentestGPT | 6,475 |
17 | Baichuan-7B | 5,646 |
18 | openchat | 4,996 |
19 | camel | 4,504 |
20 | awesome-pretrained-chinese-nlp-models | 4,279 |
21 | marqo | 4,189 |
22 | Baichuan2 | 3,960 |
23 | AutoGPTQ | 3,875 |
Sponsored