Top 23 Rust Search Projects
-
ripgrep
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
-
MeiliSearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
-
sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
-
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
postgresml
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
-
lnx
⚡ Insanely fast, 🌟 Feature-rich searching. lnx is the adaptable, typo-tolerant deployment of the tantivy search engine.
-
nix-index
Quickly locate nix packages with specific files [maintainers=@bennofs @figsoda @raitobezarius]
-
DataSurgeon
Quickly Extracts IP's, Email Addresses, Hashes, Files, Credit Cards, Social Security Numbers and a lot More From Text
-
trieve
All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
-
lolcate-rs
Lolcate -- A comically fast way of indexing and querying your filesystem. Replaces locate / mlocate / updatedb. Written in Rust.
There were quite a few occurrences of this component across the project, so just searching for base-input was not going to cut it for me. Instead, I decided to use regular expressions (regex) with ripgrep. Once installed, ripgrep provides the rg command-line tool.
We needed to introduce a new service for search. As we settled on using meilisearch, we needed a way to sync updates on our models with the records in meilisearch. We could've continued to use callbacks but we needed something better.
Project mention: Level Up Your Dev Workflow: Conquer Web Development with a Blazing Fast Neovim Setup (Part 1) | dev.to | 2024-03-16
ripgrep: A super-fast file searcher. You can install it using your system's package manager (e.g., brew install ripgrep on macOS).
fd: Another blazing-fast file finder. Installation instructions can be found here: https://github.com/sharkdp/fd
Sonic - a project written in Rust that uses a custom network protocol for fast communication between the client and the server.
Project mention: How to Build a Chat App with Your Postgres Data using Agent Cloud | dev.to | 2024-05-13
AgentCloud uses Qdrant as the vector store to efficiently store and manage large sets of vector embeddings. For a given user query, the RAG application fetches relevant documents from the vector store by analyzing how similar their vector representations are to the query vector.
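To make the "how similar their vector representation is" step concrete, here is a minimal sketch in Rust of ranking documents by cosine similarity to a query vector. It is a brute-force illustration only: the function names (`cosine_similarity`, `top_k`) and the toy two-dimensional vectors are assumptions made for this example, and it is not how Qdrant or AgentCloud implement retrieval (a real vector database would typically use an approximate nearest-neighbor index rather than scanning every document).

```rust
/// Cosine similarity between two vectors of equal length.
/// Returns None if the lengths differ or either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> Option<f32> {
    if a.len() != b.len() {
        return None;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return None;
    }
    Some(dot / (norm_a * norm_b))
}

/// Scores every document against the query (brute force) and returns
/// the `k` best (index, score) pairs, most similar first.
fn top_k(query: &[f32], docs: &[Vec<f32>], k: usize) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = docs
        .iter()
        .enumerate()
        .filter_map(|(i, d)| cosine_similarity(query, d).map(|s| (i, s)))
        .collect();
    // Sort by descending similarity.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}

fn main() {
    // Hypothetical toy "embeddings"; real ones have hundreds of dimensions.
    let docs = vec![vec![1.0, 0.0], vec![0.0, 1.0], vec![0.7, 0.7]];
    let query = [1.0, 0.1];
    for (index, score) in top_k(&query, &docs, 2) {
        println!("doc {index}: similarity {score:.3}");
    }
}
```

In a RAG setup, the indices returned by something like `top_k` would be used to look up the original document text, which is then passed to the language model as context.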
It is a custom pretrained NN with very few nodes, the full source code is here: https://github.com/cantino/mcfly/blob/master/src/network.rs
It's unclear what the superpowers would be? Video doesn't show anything I can't do with an IDE or decent code editor, and there I also have refactoring tools, metadata like indicators for usages that can be used for navigating and so on.
Reminds me of UML-like diagrams over relational databases, except that it's generated one piece at a time. In practice I generate diagrams showing cyclomatic complexity much more often, and for code exploration outside the IDE I'd use ast-grep.
https://ast-grep.github.io/
Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20
I don't think you have an active project that addresses all those use cases. There was an attempt in Rust with Toshi that is built on top of tantivy, but the project seems to have stalled.
Project mention: Ask HN: What's the best way to add search to my website? | news.ycombinator.com | 2023-10-08
If your content is mostly static, you might want to consider pre-building an index and shipping it as a whole. You could look into something like
* https://stork-search.net/ (Rust/WASM)
* tinysearch: https://github.com/tinysearch/tinysearch (Rust/WASM)
* https://lunrjs.com/ (JS, simple, stable)
* http://elasticlunr.com/ - based on the former, slightly more sophisticated tuning options
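The "pre-build an index" idea can be sketched as a small inverted index: at build time, every page is tokenized and each token is mapped to the set of pages containing it; at query time, only lookups and set intersections are needed. The Rust sketch below is an illustration under stated assumptions, not the approach taken by stork, tinysearch, or Lunr: the function names (`tokenize`, `build_index`, `search`) and sample pages are hypothetical, and the serialization step needed to actually ship the index to the browser is left out.

```rust
use std::collections::{BTreeSet, HashMap};

/// Lowercases the text and splits it on non-alphanumeric characters.
fn tokenize(text: &str) -> Vec<String> {
    text.split(|c: char| !c.is_alphanumeric())
        .filter(|t| !t.is_empty())
        .map(|t| t.to_lowercase())
        .collect()
}

/// Build step: maps each token to the ids (positions in `pages`)
/// of the pages whose body contains it.
fn build_index(pages: &[(&str, &str)]) -> HashMap<String, BTreeSet<usize>> {
    let mut index = HashMap::new();
    for (id, (_url, body)) in pages.iter().enumerate() {
        for token in tokenize(body) {
            index.entry(token).or_insert_with(BTreeSet::new).insert(id);
        }
    }
    index
}

/// Query step: returns ids of pages containing every query token (AND
/// semantics), in ascending order. An empty query returns no results.
fn search(index: &HashMap<String, BTreeSet<usize>>, query: &str) -> Vec<usize> {
    let mut result: Option<BTreeSet<usize>> = None;
    for token in tokenize(query) {
        let postings = index.get(&token).cloned().unwrap_or_default();
        result = Some(match result {
            None => postings,
            Some(acc) => acc.intersection(&postings).copied().collect(),
        });
    }
    result.unwrap_or_default().into_iter().collect()
}

fn main() {
    // Hypothetical static pages as (url, body) pairs.
    let pages = [
        ("/a", "Fast search in Rust"),
        ("/b", "Search with JavaScript"),
        ("/c", "Rust and WASM"),
    ];
    let index = build_index(&pages);
    for id in search(&index, "rust search") {
        println!("match: {}", pages[id].0);
    }
}
```

Because the index depends only on the page content, it can be rebuilt whenever the site is regenerated, which is why this approach suits mostly static content.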
Project mention: Ask HN: I have been unable to land a job in two years, does anybody | news.ycombinator.com | 2024-05-03
I've issued a few bounties on my repo [0], and it's worked fairly well from my perspective. I'm both the bounty assigner and the maintainer in this case.
Keep in mind that open source is always a bit messy, and a lot of people on all projects (even without bounties) never actually follow through with contributions that they plan to do, for a whole host of reasons. The way I've approached it is that once someone either has a track record of contributing or makes substantial (visible) progress on a PR, I assign the bounty to them.
[0] https://github.com/getgrit/gritql/issues?q=is%3Aopen+is%3Ais...
Project mention: YaCy, a distributed Web Search Engine, based on a peer-to-peer network | news.ycombinator.com | 2024-03-05
There are already many projects about search:
- https://www.marginalia.nu/
- https://searchmysite.net/
- https://lucene.apache.org/
- elastic search
- https://presearch.com/
- https://stract.com/
- https://wiby.me/
I think that all of these projects are fun. I would like to see one succeed at reaching a mainstream level of attention.
I have also been gathering link metadata for some time. Maybe I will use it to feed an eventual self-hosted search engine, or language model, if I decide to experiment with that.
- domains for seed https://github.com/rumca-js/Internet-Places-Database
- bookmarks seed https://github.com/rumca-js/RSS-Link-Database
- links for year https://github.com/rumca-js/RSS-Link-Database-2024
lnx - a young but promising project that uses tantivy as a backend.
nix-index can be used to provide this functionality, and to automate this process you can use nix-index-database (setup instructions are in the README).
Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20
There is also NucliaDB, which is built on top of tantivy and addresses vector search for documents and video search.
Recently I have also used [ambr](https://github.com/dalance/amber) which can do both search (ambs) and replace (ambr) recursively in your codebase. The only problem as of yet is that it does not support globbing so I cannot filter on certain filetypes only.
Rust Search related posts
-
Finding all HTML tags in a project not being self-closed
-
Ripgrep: Recursively Searches Directories for a Regex
-
Code Search Is Hard
-
SeekStorm VS tantivy - a user suggested alternative
2 projects | 22 Mar 2024
-
Open-source Rust-based RAG
-
Ripgrep
-
Open Source Search Engine as an Alternative to Google Built in Spare Time
-
Index
What are some of the best open-source Search projects in Rust? This list will help you:
# | Project | Stars |
---|---|---|
1 | ripgrep | 45,287 |
2 | MeiliSearch | 43,577 |
3 | fd | 31,910 |
4 | sonic | 19,476 |
5 | qdrant | 18,129 |
6 | mcfly | 6,641 |
7 | ast-grep | 5,988 |
8 | postgresml | 5,483 |
9 | Toshi | 4,130 |
10 | stork | 2,711 |
11 | gritql | 2,402 |
12 | stract | 1,690 |
13 | orange | 1,515 |
14 | lnx | 1,155 |
15 | aho-corasick | 955 |
16 | nix-index | 747 |
17 | DataSurgeon | 704 |
18 | trieve | 668 |
19 | Nuclia DB | 576 |
20 | amber | 453 |
21 | lolcate-rs | 283 |
22 | summa | 213 |
23 | parui | 147 |