Why is ChatGPT 3.5 API 10x cheaper than GPT3?

GPTQ-for-LLaMa

Mentions: 75 | Stars: 2,933 | Activity: 8.6 | Language: Python

4 bits quantization of LLaMA using GPTQ

You've probably heard, but LLaMA was just released, and its 13B-parameter model outperforms GPT-3 on most metrics (because they trained it on a lot more data). Someone has already quantized it to 4 and 3 bits, and it performs virtually the same. It also apparently runs well on CPUs (several words per second on a 7900X). Running something equivalent to GPT-3.5 on a phone is not that far off.
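To make the quantization claim concrete, here is a rough sketch of what storing weights in 4 bits means and what it does to the memory footprint of a 13B-parameter model. It uses plain round-to-nearest quantization, which is an assumption made for illustration and is not the GPTQ algorithm (GPTQ additionally corrects for the error each rounding step introduces). The function names are hypothetical, and the memory estimate ignores the small overhead of storing scales and offsets.

```python
def quantize_rtn(weights, bits=4):
    """Round-to-nearest quantization of a list of floats (illustrative, not GPTQ)."""
    levels = (1 << bits) - 1              # 15 steps for 4 bits, i.e. codes 0..15
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels
    if scale == 0:                        # all weights equal: avoid dividing by zero
        scale = 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def dequantize(codes, scale, lo):
    """Map integer codes back to approximate float weights."""
    return [c * scale + lo for c in codes]


def weight_memory_gb(n_params, bits):
    """Approximate weight storage in GB, ignoring scale/offset overhead."""
    return n_params * bits / 8 / 1e9


if __name__ == "__main__":
    weights = [-0.5, -0.1, 0.0, 0.2, 0.7]   # made-up example values
    codes, scale, lo = quantize_rtn(weights, bits=4)
    restored = dequantize(codes, scale, lo)
    print("codes:", codes)
    print("max error:", max(abs(a - b) for a, b in zip(weights, restored)))
    print("13B at 16 bits:", weight_memory_gb(13e9, 16), "GB")
    print("13B at 4 bits:", weight_memory_gb(13e9, 4), "GB")
```

Under these assumptions the weights of a 13B model shrink from about 26 GB at 16 bits to about 6.5 GB at 4 bits, which is why CPU and phone inference start to look plausible.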

llama-cpu

Mentions: 9 | Stars: 775 | Activity: 3.1 | Language: Python

Fork of Facebook's LLaMa model to run on CPU

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

La Criptografia en l'Era de la Computació Quàntica i de la IA (Cryptography in the Era of Quantum Computing and AI)

2 projects | dev.to | 1 Jun 2024
Show HN: 5x faster depth map generation using tensorrt inside comfyui

1 project | news.ycombinator.com | 1 Jun 2024
Meltdown

1 project | news.ycombinator.com | 1 Jun 2024
Napster Sparked a File-Sharing Revolution 25 Years Ago

1 project | news.ycombinator.com | 1 Jun 2024
HuggingFace hacked – Space secrets leak disclosure

1 project | news.ycombinator.com | 1 Jun 2024