Top 23 Jupyter Notebook large-language-model Projects

llm-course

6 29,486 8.1 Jupyter Notebook

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Project mention: Ask HN: People who switched from GPT to their own models. How was it? | news.ycombinator.com | 2024-02-26

This is a very nice resource: https://github.com/mlabonne/llm-course

LLMs-from-scratch

9 16,129 9.6 Jupyter Notebook

Implementing a ChatGPT-like LLM from scratch, step by step

Project mention: Finetuning an LLM-Based Spam Classifier with LoRA from Scratch | news.ycombinator.com | 2024-05-11

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
DeepLearningExamples

7 12,660 6.1 Jupyter Notebook

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
FinGPT

11 11,586 9.5 Jupyter Notebook

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Project mention: GPT-4, without specialized training, beat a GPT-3.5 class model that cost $10B | news.ycombinator.com | 2024-03-24

There is also the open source FinGPT, that is claimed to beat GPT4 in some benchmarks at a fine tuning cost of $17.25.
https://github.com/AI4Finance-Foundation/FinGPT

Promptify

29 3,046 8.5 Jupyter Notebook

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Project mention: Promptify 2.0: More Structured, More Powerful LLMs with Prompt-Optimization, Prompt-Engineering, and Structured Json Parsing with GPT-n Models! 🚀 | /r/ArtificialInteligence | 2023-07-31

First up, a huge Thank You for making Promptify a hit with over 2.3k+ stars on Github ! 🌟

ReAct

1 1,597 4.8 Jupyter Notebook

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models (by ysymyth)
EasyEdit

6 1,412 9.8 Jupyter Notebook

An Easy-to-use Knowledge Editing Framework for LLMs.

Project mention: ChatGPT provides false information about people, and OpenAI can't correct it | news.ycombinator.com | 2024-04-29

> The article talks about OpenAI being unwilling to correct errors. But they just can’t.
There are actually several algorithms intended to allow fact editing in LLMs: https://github.com/zjunlp/EasyEdit?tab=readme-ov-file#curren...
They don't work perfectly (e.g. "Tim Cook is CEO of Apple" and "The CEO of Apple is Tim Cook" for some reason have to be edited separately) but there are certainly techniques available.

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
alpaca_eval

4 1,134 9.6 Jupyter Notebook

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Project mention: UltraLM-13B reaches top of AlpacaEval leaderboard | /r/LocalLLaMA | 2023-06-28

Alpaca Eval is open source and was developed by the same team who trained the alpaca model afaik. It is not like what you said in the other comment

Get-Things-Done-with-Prompt-Engineering-and-LangChain

18 966 8.2 Jupyter Notebook

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

Project mention: Get-Things-Done-with-Prompt-Engineering-and-LangChain: NEW Data - star count:617.0 | /r/algoprojects | 2023-12-10

ontogpt

2 512 9.8 Jupyter Notebook

LLM-based ontological extraction tools, including SPIRES

Project mention: GPT-based ontological extraction tools, including SPIRES | news.ycombinator.com | 2023-07-24

xmtf

2 496 5.9 Jupyter Notebook

Crosslingual Generalization through Multitask Finetuning
fromage

2 458 6.3 Jupyter Notebook

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
PIXIU

6 406 8.9 Jupyter Notebook

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Project mention: PIXIU: NEW Data - star count:172.0 | /r/algoprojects | 2023-08-15

llm-search

2 381 8.5 Jupyter Notebook

Querying local documents, powered by LLM

Project mention: Querying local documents, powered by LLM | news.ycombinator.com | 2023-11-07

hyde

2 362 10.0 Jupyter Notebook

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels (by texttron)

Project mention: Show HN: Hacker Search – A semantic search engine for Hacker News | news.ycombinator.com | 2024-05-02

HyDE apparently means “Hypothetical Document Embeddings”, which seems to be a kind of generative query expansion/pre-processing
https://arxiv.org/abs/2212.10496
https://github.com/texttron/hyde
From the abstract:
Given a query, HyDE first zero-shot instructs an instruction-following language model (e.g. InstructGPT) to generate a hypothetical document. The document captures relevance patterns but is unreal and may contain false details. Then, an unsupervised contrastively learned encoder~(e.g. Contriever) encodes the document into an embedding vector. This vector identifies a neighborhood in the corpus embedding space, where similar real documents are retrieved based on vector similarity. This second step ground the generated document to the actual corpus, with the encoder's dense bottleneck filtering out the incorrect details.

KG_RAG

5 340 9.7 Jupyter Notebook

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

Project mention: A list of system prompts used for biomedical RAG (KG-RAG) using LLM | news.ycombinator.com | 2024-01-10

datablations

6 295 6.9 Jupyter Notebook

Scaling Data-Constrained Language Models

Project mention: Gemini is only 1x Chinchilla, so it undertrained for production | /r/singularity | 2023-12-07

1x chinchilla means it's not really undertrained but that more could be squeezed without excessive difficulty https://arxiv.org/abs/2305.16264

ToolQA

1 210 6.1 Jupyter Notebook

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Project mention: 🔍📊 Exciting development in the AI world: Introducing ToolQA, a new dataset that evaluates how well Large Language Models (LLMs) can use external tools for question answering. | /r/machinelearningnews | 2023-07-01

langforge

1 163 6.6 Jupyter Notebook

A Toolkit for Creating and Deploying LangChain Apps
localLLM_guidance

3 147 4.2 Jupyter Notebook

Local LLM ReAct Agent with Guidance

Project mention: LlamaCPP and LangChain Agent Quality | /r/LocalLLaMA | 2023-05-31

If you refer to React agent, you can check this link

FastLoRAChat

2 119 7.2 Jupyter Notebook

Instruct-tune LLaMA on consumer hardware with shareGPT data
seemore

2 58 8.0 Jupyter Notebook

From scratch implementation of a vision language model in pure PyTorch

Project mention: A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file | news.ycombinator.com | 2024-05-05

augmented-interpretable-models

1 37 7.4 Jupyter Notebook

Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook large-language-models related posts

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch

1 project | news.ycombinator.com | 11 May 2024
A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

1 project | news.ycombinator.com | 5 May 2024
Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes

1 project | news.ycombinator.com | 3 May 2024
ChatGPT provides false information about people, and OpenAI can't correct it

1 project | news.ycombinator.com | 29 Apr 2024
Insights from Finetuning LLMs for Classification Tasks

1 project | news.ycombinator.com | 28 Apr 2024
Comparing 5 ways to implement Multihead Attention in PyTorch

1 project | news.ycombinator.com | 8 Mar 2024
Large Language Model Course

1 project | news.ycombinator.com | 23 Jan 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 13 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source large-language-model projects in Jupyter Notebook? This list will help you:

	Project	Stars
1	llm-course	29,486
2	LLMs-from-scratch	16,129
3	DeepLearningExamples	12,660
4	FinGPT	11,586
5	Promptify	3,046
6	ReAct	1,597
7	EasyEdit	1,412
8	alpaca_eval	1,134
9	Get-Things-Done-with-Prompt-Engineering-and-LangChain	966
10	ontogpt	512
11	xmtf	496
12	fromage	458
13	PIXIU	406
14	llm-search	381
15	hyde	362
16	KG_RAG	340
17	datablations	295
18	ToolQA	210
19	langforge	163
20	localLLM_guidance	147
21	FastLoRAChat	119
22	seemore	58
23	augmented-interpretable-models	37

Jupyter Notebook large-language-models

Top 23 Jupyter Notebook large-language-model Projects

Jupyter Notebook large-language-models related posts

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch

A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes

ChatGPT provides false information about people, and OpenAI can't correct it

Insights from Finetuning LLMs for Classification Tasks

Comparing 5 ways to implement Multihead Attention in PyTorch

Large Language Model Course

Index