Rust language-model

Open-source Rust projects categorized as language-model

Top 7 Rust language-model Projects

  • tokenizers

    đź’Ą Fast State-of-the-Art Tokenizers optimized for Research and Production

  • Project mention: HF Transfer: Speed up file transfers | /r/rust | 2023-07-07

    Hugging Face seems to like Rust. They also wrote Tokenizers in Rust.

  • aici

    AICI: Prompts as (Wasm) Programs

  • Project mention: Google Gemini: Context Caching | news.ycombinator.com | 2024-05-16

    To me, context caching is only a subset of what is possible with full control over the model. I consider this a more complete list: https://github.com/microsoft/aici?tab=readme-ov-file#flexibi...

    Context caching only gets you “forking generation into multiple branches” (i.e. sharing work between multiple generations)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Nuclia DB

    NucliaDB, The AI Search database for RAG

  • Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20

    You have also NucliaDB that is built on top of tantivy and addresses vector search for documents and video search.

  • rusty

    AI-powered CLI tool to help you remember bash commands.

  • llama-dfdx

    LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

  • smolrsrwkv

    A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit evaluation. It can also directly load PyTorch RWKV models.

  • bytepiece-rs

    The Bytepiece Tokenizer Implemented in Rust.

  • Project mention: A more general tokenizer | /r/rust | 2023-09-25
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Rust language-model related posts

  • HF Transfer: Speed up file transfers

    2 projects | /r/rust | 7 Jul 2023
  • LLM custom dictionary

    1 project | /r/learnmachinelearning | 7 May 2023
  • Introducing repugnant-pickle, a crate for scraping Python Pickle files in a basic way. Notable, it can deal with (some) PyTorch model files.

    2 projects | /r/rust | 6 Apr 2023
  • Is GPT-4 still just a language model trying to predict text?

    1 project | /r/artificial | 5 Apr 2023
  • [D] What's going to be the dominant language for machine learning in 5 years?

    1 project | /r/MachineLearning | 9 Feb 2021
  • substitute for tokenizer in torchtext

    1 project | /r/LanguageTechnology | 31 Jan 2021
  • A note from our sponsor - SaaSHub
    www.saashub.com | 1 Jun 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source language-model projects in Rust? This list will help you:

Project Stars
1 tokenizers 8,538
2 aici 1,797
3 Nuclia DB 585
4 rusty 324
5 llama-dfdx 94
6 smolrsrwkv 91
7 bytepiece-rs 14

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com