GPT-3.5 Turbo fine-tuning and API updates

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • chroma

    the AI-native open-source embedding database

  • https://www.trychroma.com

    It works as a library: you can install the pip package and get started immediately.

  • runpodctl

    🧰 | RunPod CLI for pod management

  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  • It depends on your needs. For instance, do you want to host an API, or do you want a front end like ChatGPT? Chances are text-generation-webui [1] will get you pretty close to hosting it yourself. You simply clone the repo, download the model from Hugging Face using the included helper (download-model.py), and start the server with server.py. You can connect to it by SSH port tunneling on port 7860 (there are other ways, such as ngrok, but SSH tunneling is the easiest and is secure).

    As for hosting, I have found runpod [2] to be the cheapest (not affiliated, just a user). Other services tend to cost more once you include bandwidth and storage. There are some tutorials online [3], but a lot of them use the quantized version. You should be able to fit the original 70B with "load_in_8bit" on one A100 80GB.

    [1] https://github.com/oobabooga/text-generation-webui
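
The single-GPU claim in the comment above can be sanity-checked with rough arithmetic. The sketch below is only an estimate under stated assumptions: it treats "70B" as exactly 70 billion parameters, assumes 8-bit loading stores about one byte per parameter (versus two for fp16), and counts weights only, ignoring activations, the KV cache, and framework overhead. The function and constant names are illustrative and do not come from any library.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for the model weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 70e9    # assumption: "70B" taken as exactly 70 billion parameters
GPU_MEMORY_GB = 80   # one A100 80GB, as in the comment above

# Assumption: storage cost per parameter for each precision.
BYTES_PER_PARAM = {"fp16": 2, "int8": 1}


def fits(precision: str) -> bool:
    """True if the weights alone are smaller than the GPU's memory."""
    needed = weight_memory_gb(NUM_PARAMS, BYTES_PER_PARAM[precision])
    return needed < GPU_MEMORY_GB


for precision in BYTES_PER_PARAM:
    needed = weight_memory_gb(NUM_PARAMS, BYTES_PER_PARAM[precision])
    print(f"{precision}: ~{needed:.0f} GB of weights, fits on one GPU: {fits(precision)}")
```

Under these assumptions the fp16 weights come to about 140 GB and do not fit, while the 8-bit weights come to about 70 GB and leave roughly 10 GB of headroom. That margin is thin, so long prompts or large batches may still run out of memory in practice.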

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • Chroma – the open-source embedding database

    1 project | news.ycombinator.com | 11 Jan 2024
  • Show HN: Embeddings Solution for Personal Journal

    2 projects | news.ycombinator.com | 1 Nov 2023
  • Chroma DB Random Seg Faults

    1 project | news.ycombinator.com | 5 Sep 2023
  • 🔥 Create GPT-3 bot out of your own PDFs using Python

    3 projects | dev.to | 8 Aug 2023
  • Why your brain is 3 million times more efficient than GPT-4 - introduction to Vector Databases and their comparison

    1 project | dev.to | 23 Jul 2023