TinyChat: Large Language Model on the Edge

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

llm-awq

7 1,954 7.9 Python

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

TinyChat is an efficient, lightweight, Python-native serving framework for 4-bit LLMs by AWQ. It delivers 2.3x generation speed up on RTX4090.
Code: https://github.com/mit-han-lab/llm-awq/tree/main/tinychat

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

I Created a Password Manager with AI: Powered by GPT-4

1 project | dev.to | 2 Jun 2024
Scout: Scalable Cognitive Operations Unified Team

1 project | news.ycombinator.com | 1 Jun 2024
Membuat Project Python yang mudah untuk dimaintain

1 project | dev.to | 1 Jun 2024
Make Maintainable Python Project

1 project | dev.to | 1 Jun 2024
Download Paul Graham essays in ePub format

1 project | news.ycombinator.com | 1 Jun 2024

TinyChat: Large Language Model on the Edge

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Post date: 8 Dec 2023

llm-awq

Scout Monitoring

Related posts

I Created a Password Manager with AI: Powered by GPT-4

Scout: Scalable Cognitive Operations Unified Team

Membuat Project Python yang mudah untuk dimaintain

Make Maintainable Python Project

Download Paul Graham essays in ePub format