Top 23 Jupyter Notebook AI Projects
-
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/
-
h4cker
This repository is primarily maintained by Omar Santos (@santosomar) and includes thousands of resources related to ethical hacking, bug bounties, digital forensics and incident response (DFIR), artificial intelligence security, vulnerability research, exploit development, reverse engineering, and more.
-
dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
-
generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI (by GoogleCloudPlatform)
-
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles. (by JoePenna)
-
machine-learning-experiments
Interactive Machine Learning experiments: models training + models demo
-
vertex-ai-samples
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
-
imodels
Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible).
-
tensor-house
A collection of reference Jupyter notebooks and demo AI/ML applications for enterprise use cases: marketing, pricing, supply chain, smart manufacturing, and more.
-
Deep-Learning-In-Production
Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.
-
chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
-
bark
BARK INFINITY GUI CMD: powered-up Bark text-prompted generative audio model (by JonathanFly)
-
PConv-Keras
Unofficial implementation of "Image Inpainting for Irregular Holes Using Partial Convolutions". Try at: www.fixmyphoto.ai
Generative AI For Beginners: a collection of resources to learn about Generative AI, including tutorials, code samples, and more.
Project mention: Show HN: Next-token prediction in JavaScript - build fast LLMs from scratch | news.ycombinator.com | 2024-04-10
People on here will be happy to say that I do a similar thing, though my sequence length is dynamic because I also use a second data structure. To use pretentious academic speak: I use a simple bigram LM (2-gram) for single next-word likelihood and, separately, a trie that models all words and phrases (so, n-gram). I'm not sure how many total nodes there are because sentence lengths vary in the training data, but with about 200,000 entry points (keys) there are probably 2-10 million total nodes in the default setup.
"Constructing 7-gram LM": They likely started with bigrams (what I use), which only tell you the next word based on one word given, then thought to increase accuracy by modeling more words in a sequence, and eventually let the user (developer) pass in any amount they want to model (https://github.com/google-research/google-research/blob/5c87...).
I thought of this too at first, but I actually got more accuracy (and speed) out of keeping them as bigrams and making a totally separate structure that models an n-gram of all phrases (it could be a 24-token sequence or 100+ tokens; I model it all), and if that phrase is found, I just take the bigram assumption of the last token of the phrase.
This works better when the training data is diverse (for a very generic model), but theirs would probably outperform mine on accuracy when the training data has many nearly identical sentences that only change wildly toward the end. I don't find this pattern in typical data, though maybe certain coding and other tasks have it. But because theirs isn't dynamic and makes you provide that number, even a low one (any phrase longer than 2 words), it will always have to do more lookup work than simple bigrams, and it's also limited by that fixed number as far as accuracy goes. I wonder how scalable that is: if I need to train on occasional ~100-word sentences but also (and mostly) ~3-word sentences, I guess I set this to 100 and have a mostly "undefined" trie.
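The bigram-plus-phrase-trie scheme the commenter describes can be sketched roughly as follows. This is a minimal illustration only: the actual library is JavaScript, and the class and method names here are invented for the sketch.

```python
from collections import defaultdict

class BigramTrieLM:
    """Sketch of the commenter's approach: a plain bigram table for
    single-step next-word prediction, plus a separate trie that models
    whole phrases of any length. When the context walks off the trie,
    prediction backs off to the bare bigram of the last token."""

    def __init__(self):
        self.bigrams = defaultdict(lambda: defaultdict(int))
        self.trie = {}  # nested dict, one level per token

    def train(self, tokens):
        # Bigram counts: next-word likelihood given exactly one word.
        for a, b in zip(tokens, tokens[1:]):
            self.bigrams[a][b] += 1
        # The trie models the full token sequence ("an n-gram of all phrases").
        node = self.trie
        for tok in tokens:
            node = node.setdefault(tok, {})

    def predict(self, context):
        # Walk the trie along the context to check for a known phrase;
        # either way, the final step is the bigram of the last token.
        node = self.trie
        for tok in context:
            if tok not in node:
                break  # unknown phrase: back off to the bigram alone
            node = node[tok]
        candidates = self.bigrams.get(context[-1])
        return max(candidates, key=candidates.get) if candidates else None

lm = BigramTrieLM()
lm.train("the cat sat on the mat".split())
print(lm.predict(["the", "cat"]))  # sat
```

Note the design trade-off the comment points at: the phrase trie costs memory, but lookup stays cheap because the final prediction is always a single bigram step.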
I also thought of the name "LMJS", theirs is "jslm" :) but I went with simply "next-token-prediction" because that's what it ultimately does as a library. I don't know what theirs is really designed for other than proving a concept. Most of their code files are actually comments and hypothetical scenarios.
I recently added a browser example showing simple autocomplete using my library: https://github.com/bennyschmidt/next-token-prediction/tree/m... (video)
Next I'm implementing 8-dimensional embeddings, converted to normalized vectors between 0 and 1, to see if doing math on them does anything useful beyond similarity. Right now they look like this:
[nextFrequency, prevalence, specificity, length, firstLetter, lastLetter, firstVowel, lastVowel]
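The comment names eight per-token features but not how each is computed, so the formulas below are illustrative guesses only (the original code is JavaScript and these definitions are not published). A Python sketch that squashes each feature into [0, 1]:

```python
VOWELS = "aeiou"

def letter01(ch):
    # Map 'a'..'z' onto [0, 1]; non-letters map to 0.
    return (ord(ch) - ord("a")) / 25 if ch.isalpha() else 0.0

def embed(word, corpus):
    """Illustrative 8-d embedding. Feature names come from the comment;
    every formula here is an assumption made for the sketch."""
    w = word.lower()
    counts = {}
    for tok in corpus:
        counts[tok] = counts.get(tok, 0) + 1
    vowel_positions = [i for i, c in enumerate(w) if c in VOWELS]
    n = len(w)
    return [
        # nextFrequency: share of bigrams in which this word is the second token
        sum(1 for a, b in zip(corpus, corpus[1:]) if b == w) / max(len(corpus) - 1, 1),
        # prevalence: share of corpus tokens that are this word
        counts.get(w, 0) / len(corpus),
        # specificity: inverse prevalence (rarer = more specific)
        1 - counts.get(w, 0) / len(corpus),
        min(n, 20) / 20,                # length, capped at 20 characters
        letter01(w[0]),                 # firstLetter
        letter01(w[-1]),                # lastLetter
        (vowel_positions[0] + 1) / n if vowel_positions else 0.0,  # firstVowel position
        (vowel_positions[-1] + 1) / n if vowel_positions else 0.0, # lastVowel position
    ]

corpus = "the cat sat on the mat".split()
vec = embed("cat", corpus)
print(len(vec))  # 8
```

Because every component lands in [0, 1], vector arithmetic (distances, averages) stays well-scaled across features without a separate normalization pass.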
https://github.com/microsoft/AI-For-Beginners
https://microsoft.github.io/AI-For-Beginners/
Deci's YOLO-NAS Pose: redefining pose estimation! Elevating healthcare, sports, tech, and robotics with precision and speed. GitHub and blog links below! Repo: https://github.com/spmallick/learnopencv/tree/master/YOLO-NAS-Pose
Project mention: The Era of 1-bit LLMs: ternary parameters for cost-effective computing | news.ycombinator.com | 2024-02-28
https://github.com/Stability-AI/StableLM?tab=readme-ov-file#...
Project mention: [D] Where can I find a list of the foundational academic papers in RL/ML/DL and what are your go-to places to find new academic papers in RL/ML/DL? | /r/MachineLearning | 2023-07-07
Labml.ai stopped working in May. I like https://github.com/dair-ai/ML-Papers-of-the-Week
I used code based on similar examples from GitHub [1]. According to the docs [2], imagegeneration@005 was released on the 11th, so I guessed it's Imagen 2, though there is no confirmation.
[1] https://github.com/GoogleCloudPlatform/generative-ai/blob/ma...
[2] https://console.cloud.google.com/vertex-ai/publishers/google...
Project mention: Will there be comprehensive tutorials for fine-tuning SD XL when it comes out? | /r/StableDiffusion | 2023-07-01
Tons of stuff here, no? https://github.com/JoePenna/Dreambooth-Stable-Diffusion/
Project mention: Gemini 1.5 outshines GPT-4-Turbo-128K on long code prompts, HVM author | news.ycombinator.com | 2024-02-18
Project mention: To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5] | /r/HonzukiNoGekokujou | 2023-10-19
So I looked around and decided to use Bark Infinity. (I originally wanted to use Amazon Polly, but I don't have a credit card.) I experimented and found that the female storyteller voice sounds quite decent. So I used that, plus a reference clip of Myne's voice as a prompt (which I think may have helped a little; I don't understand all of the program's features), to generate a whole chapter. That worked quite well.
Jupyter Notebook AI related posts
-
Ask HN: Why all these GitHub fake accounts starring my project
-
Alternative Chunking Methods
-
Machine Learning and AI Beyond the Basics Book
-
Google Research website is down
-
GPT-4, without specialized training, beat a GPT-3.5 class model that cost $10B
-
FREE AI Course By Microsoft: ZERO to HERO!
-
Building an Open Source Decentralized E-Book Search Engine
Index
What are some of the best open-source AI projects in Jupyter Notebook? This list will help you:
# | Project | Stars
---|---|---
1 | generative-ai-for-beginners | 43,780 |
2 | google-research | 32,991 |
3 | AI-For-Beginners | 31,684 |
4 | learnopencv | 20,471 |
5 | h4cker | 16,717 |
6 | StableLM | 15,853 |
7 | stable-diffusion-webui-colab | 15,290 |
8 | dopamine | 10,378 |
9 | ML-Papers-of-the-Week | 8,943 |
10 | generative-ai | 5,640 |
11 | nlpaug | 4,252 |
12 | ArtLine | 3,531 |
13 | Dreambooth-Stable-Diffusion | 3,170 |
14 | examples | 2,465 |
15 | clip-retrieval | 2,163 |
16 | machine-learning-experiments | 1,607 |
17 | vertex-ai-samples | 1,384 |
18 | imodels | 1,293 |
19 | tensor-house | 1,179 |
20 | Deep-Learning-In-Production | 1,073 |
21 | chameleon-llm | 1,020 |
22 | bark | 960 |
23 | PConv-Keras | 893 |