Implementing a ChatGPT-like LLM from scratch, step by step

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

LLMs-from-scratch

9 16,129 9.6 Jupyter Notebook

Implementing a ChatGPT-like LLM from scratch, step by step

The attention mechanism we implement in this book* is specific to LLMs in terms of the text inputs, but it's fundamentally the same attention mechanism that is used in vision transformers. The only difference is that in LLMs, you turn text into tokens, and convert these tokens into vector embeddings that go into an LLM. In vision transformers, instead of regarding images as tokens, you use an image patch as a token and turn those into vector embeddings (a bit hard to explain without visuals here). In both text or vision context, it's the same attention mechanism, and it both cases it receives vector embeddings.
(*Chapter 3, already submitted last week and should be online in the MEAP soon, in the meantime the code along with the notes is also available here: https://github.com/rasbt/LLMs-from-scratch/blob/main/ch03/01...)

machine-learning-book

2 2,898 6.8 Jupyter Notebook

Code Repository for Machine Learning with PyTorch and Scikit-Learn

Sorry, in that case I would rather recommend a dedicated RL book. The RL part in LLMs will be very specific to LLMs, and I will only cover what's absolutely relevant in terms of background info. I do have a longish intro chapter on RL in my other general ML/DL book (https://github.com/rasbt/machine-learning-book/tree/main/ch1...) but like others said, I would recommend a dedicated RL book in your case.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: Scikit-LLM seamlessly integrate LLMs into scikit-learn

1 project | news.ycombinator.com | 25 Aug 2023
Treebomination: Convert a scikit-learn decision tree into a Keras model

3 projects | news.ycombinator.com | 11 Jun 2023
Scikit-LLM: Sklearn Meets Large Language Models

1 project | news.ycombinator.com | 23 May 2023
Scikit-LLM: Sklearn Meets Large Language Models

1 project | news.ycombinator.com | 22 May 2023
Facebook LLAMA is being openly distributed via torrents

15 projects | news.ycombinator.com | 3 Mar 2023

Implementing a ChatGPT-like LLM from scratch, step by step

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Pytorch Machine Learning chatgpt scikit-learn Gpt
Post date: 27 Jan 2024

LLMs-from-scratch

machine-learning-book

InfluxDB

Related posts

Show HN: Scikit-LLM seamlessly integrate LLMs into scikit-learn

Treebomination: Convert a scikit-learn decision tree into a Keras model

Scikit-LLM: Sklearn Meets Large Language Models

Scikit-LLM: Sklearn Meets Large Language Models

Facebook LLAMA is being openly distributed via torrents

Implementing a ChatGPT-like LLM from scratch, step by step

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Pytorch Machine Learning chatgpt scikit-learn Gpt Post date: 27 Jan 2024

LLMs-from-scratch

machine-learning-book

InfluxDB

Related posts

Show HN: Scikit-LLM seamlessly integrate LLMs into scikit-learn

Treebomination: Convert a scikit-learn decision tree into a Keras model

Scikit-LLM: Sklearn Meets Large Language Models

Scikit-LLM: Sklearn Meets Large Language Models

Facebook LLAMA is being openly distributed via torrents

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Pytorch Machine Learning chatgpt scikit-learn Gpt
Post date: 27 Jan 2024