GPT-3.5 Turbo fine-tuning and API updates

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

chroma

32 12,771 9.8 Rust

the AI-native open-source embedding database

https://www.trychroma.com
It works as a library, you can require the PIP package and get started immediately.

runpodctl

8 220 9.4 Go

🧰 | RunPod CLI for pod management
Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
text-generation-webui

877 37,401 9.9 Python

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

It depends on your needs. For instance, do you want to host an API or do you want to have a front end like chatGPT? Chances are, text-generation-webui [1] should get you pretty close to hosting it yourself. You simply clone the repo, download the model from huggingface using the included helper (download-model.py) and fire up the server with server.py. You can connect to it by SSH port tunneling on port 7860 (there's other way like Ngrok but SSH tunneling is the easiest and secure).
As for hosting, I found that runpod [2] has been the cheapest (not affiliated, just a user). All the other services tend to add up more than them when you include bandwidth and storage. There's some tutorials online [3] but a lot of them use the quantized version. You should be able to fit the original 70B with "load_in_8bit" on one A100 80GB.
[1] https://github.com/oobabooga/text-generation-webui

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Chroma – the open-source embedding database

1 project | news.ycombinator.com | 11 Jan 2024
Show HN: Embeddings Solution for Personal Journal

2 projects | news.ycombinator.com | 1 Nov 2023
Chroma DB Random Seg Faults

1 project | news.ycombinator.com | 5 Sep 2023
🔥 Create GPT-3 bot out of your own PDFs using Python

3 projects | dev.to | 8 Aug 2023
Why your brain is 3 million times more efficient than GPT-4 - introduction to Vector Databases and their comparison

1 project | dev.to | 23 Jul 2023

GPT-3.5 Turbo fine-tuning and API updates

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Embeddings document-retrieval llms
Post date: 22 Aug 2023

chroma

runpodctl

Scout Monitoring

text-generation-webui

Related posts

Chroma – the open-source embedding database

Show HN: Embeddings Solution for Personal Journal

Chroma DB Random Seg Faults

🔥 Create GPT-3 bot out of your own PDFs using Python

Why your brain is 3 million times more efficient than GPT-4 - introduction to Vector Databases and their comparison

GPT-3.5 Turbo fine-tuning and API updates

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Embeddings document-retrieval llms Post date: 22 Aug 2023

chroma

runpodctl

Scout Monitoring

text-generation-webui

Related posts

Chroma – the open-source embedding database

Show HN: Embeddings Solution for Personal Journal

Chroma DB Random Seg Faults

🔥 Create GPT-3 bot out of your own PDFs using Python

Why your brain is 3 million times more efficient than GPT-4 - introduction to Vector Databases and their comparison

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Embeddings document-retrieval llms
Post date: 22 Aug 2023