LLMs-from-scratch
java-snapshot-testing
LLMs-from-scratch | java-snapshot-testing | |
---|---|---|
9 | 2 | |
16,129 | 104 | |
- | -1.9% | |
9.6 | 3.4 | |
1 day ago | 20 days ago | |
Jupyter Notebook | Java | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
LLMs-from-scratch
- Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes
- Insights from Finetuning LLMs for Classification Tasks
-
Ask HN: Textbook Regarding LLMs
https://www.manning.com/books/build-a-large-language-model-f...
- Comparing 5 ways to implement Multihead Attention in PyTorch
- FLaNK Stack 29 Jan 2024
-
Implementing a ChatGPT-like LLM from scratch, step by step
The attention mechanism we implement in this book* is specific to LLMs in terms of the text inputs, but it's fundamentally the same attention mechanism that is used in vision transformers. The only difference is that in LLMs, you turn text into tokens, and convert these tokens into vector embeddings that go into an LLM. In vision transformers, instead of regarding images as tokens, you use an image patch as a token and turn those into vector embeddings (a bit hard to explain without visuals here). In both text or vision context, it's the same attention mechanism, and it both cases it receives vector embeddings.
(*Chapter 3, already submitted last week and should be online in the MEAP soon, in the meantime the code along with the notes is also available here: https://github.com/rasbt/LLMs-from-scratch/blob/main/ch03/01...)
java-snapshot-testing
- FLaNK Stack 29 Jan 2024
-
📸 Snapshot Testing with Kotlin
In this PoC I will use origin-energy/java-snapshot-testing and as stated in "the testing framework loved by lazy productive devs" I use it whenever I find myself manually saving test expectations as text files 😅
What are some alternatives?
s4 - Structured state space sequence models
pong-wars
reor - Private & local AI personal knowledge management app.
finagg - A Python package for aggregating and normalizing historical data from popular and free financial APIs.
datatrove - Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
JavaGuide - 「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
Deep_Object_Pose - Deep Object Pose Estimation (DOPE) – ROS inference (CoRL 2018)
langroid - Harness LLMs with Multi-Agent Programming
llm-classifier - Classify data instantly using an LLM
kafkaflow - Apache Kafka .NET Framework to create applications simple to use and extend.
ml-engineering - Machine Learning Engineering Open Book
OutRun - OutRun is an open-source, privacy oriented, outdoor fitness tracker.