Activation-Aware Weight Quantization for LLM Compression Outperforms GPTQ

llm-awq

7 1,954 7.9 Python

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Better quantization would have a direct and meaningful impact for everyone running local LLMs. The technique has already been applied to both Vicuna and the multimodal LLaMA variant LLaVA.
https://github.com/mit-han-lab/llm-awq

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com. Post date: 2 Jun 2023
