Multi-GPU questions

exllama

Mentions: 64 | Stars: 2,632 | Activity: 9.0 | Language: Python

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

ExLlama, for example, allocates buffers on each card, which reduces the amount of VRAM available for the model and context. See https://github.com/turboderp/exllama/issues/121
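To make the effect of per-card buffers concrete, here is a rough arithmetic sketch. The card sizes and the buffer size are hypothetical placeholders, not measured ExLlama figures, and the helper `usable_vram_gb` is invented for illustration; the real overhead depends on the model and settings discussed in the linked issue.

```python
def usable_vram_gb(card_sizes_gb, buffer_gb_per_card):
    """Return the VRAM left on each card for weights and context
    after subtracting a fixed per-card buffer (never below zero)."""
    return [max(size - buffer_gb_per_card, 0.0) for size in card_sizes_gb]


# Hypothetical setup: two 24 GB cards, each losing 1.5 GB to buffers.
cards = [24.0, 24.0]
buffer_gb = 1.5  # assumed value for illustration only

usable = usable_vram_gb(cards, buffer_gb)
print(usable)       # per-card VRAM left for model and context
print(sum(usable))  # total across cards
```

Because the buffer is paid once per card, splitting a model across more GPUs costs more total overhead than the same amount of VRAM on a single card would.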

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA. Post date: 9 Jul 2023
