Search Engines

Open-source projects categorized as Search Engines

Top 16 Search Engine Open-Source Projects

  • the-book-of-secret-knowledge

    A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

  • Project mention: Cyber Security iPhone Application Idea | /r/iOSDevelopment | 2023-07-03

    8. Security Knowledge Base:
       - Utilize resources like The-book-of-secret-knowledge (e.g., https://github.com/trimstray/the-book-of-secret-knowledge) and Awesome-Hacking (e.g., https://github.com/Hack-with-Github/Awesome-Hacking) to build a knowledge base.
       - Extract relevant security information and create a structured knowledge base within SecurIoT.
       - Implement functionality to query and retrieve security information from the knowledge base.
       - Thoroughly test the knowledge base integration, ensuring accurate retrieval of security knowledge.

  • MeiliSearch

    A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

  • Project mention: Publish/Subscribe with Sidekiq | dev.to | 2024-02-21

    We needed to introduce a new service for search. As we settled on using meilisearch, we needed a way to sync updates on our models with the records in meilisearch. We could've continued to use callbacks but we needed something better.
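The publish/subscribe idea in the mention above can be sketched in a few lines. This is a minimal, synchronous illustration only: `EventBus`, the topic names, and the `search_index` dictionary standing in for the search service are all hypothetical, and it does not use Sidekiq or the Meilisearch client that the original post is built on.

```python
from collections import defaultdict


class EventBus:
    """Toy in-process publish/subscribe bus (hypothetical, not Sidekiq)."""

    def __init__(self):
        # Maps a topic name to the list of handlers subscribed to it.
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        self._subscribers[topic].append(handler)

    def publish(self, topic, payload):
        # Call every handler registered for this topic, in subscription order.
        for handler in self._subscribers[topic]:
            handler(payload)


# Stand-in for the external search service: record id -> indexed document.
search_index = {}


def index_record(record):
    """Add or update the record in the stand-in search index."""
    search_index[record["id"]] = record


def remove_record(record):
    """Drop the record from the stand-in search index if present."""
    search_index.pop(record["id"], None)


bus = EventBus()
bus.subscribe("article.saved", index_record)
bus.subscribe("article.deleted", remove_record)

# The model layer only publishes events; it does not know about the index.
bus.publish("article.saved", {"id": 1, "title": "Hello"})
bus.publish("article.saved", {"id": 2, "title": "World"})
bus.publish("article.deleted", {"id": 1})
```

The point of the design is that model code publishes an event and stays unaware of who consumes it, which is what distinguishes it from calling the search service directly inside a model callback. A production setup would typically run the handlers as background jobs rather than inline.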

  • Typesense

    Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

  • Project mention: FlowDiver: The Road to SSR - Part 1 | dev.to | 2024-05-03

    Disregarding props-drilling technique in favor of a more reliable and elegant solution we looked for inspiration elsewhere. Another project of ours .find was using Typesense/Algolia components, which looked a bit like black-box/magic, but at the same time provided a clean approach to build complex and highly customizable solutions.

  • qdrant

    Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

  • Project mention: Hindi-Language AI Chatbot for Enterprises Using Qdrant, MLFlow, and LangChain | dev.to | 2024-05-02

    Great. Now that we have the embeddings, we need to store them in a vector database. We will be using Qdrant for this purpose. Qdrant is an open-source vector database that allows you to store and query high-dimensional vectors. The easiest way to get started with Qdrant is to run it with Docker.
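To make "store and query high-dimensional vectors" concrete, here is a toy in-memory sketch of the idea. It is not Qdrant's API: `TinyVectorStore`, `upsert`, `search`, and `cosine` are names invented for this illustration, and the search is a brute-force scan where a real vector database would use an approximate nearest-neighbour index.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class TinyVectorStore:
    """Hypothetical brute-force vector store, for illustration only."""

    def __init__(self):
        # Each entry is (item_id, vector, payload).
        self._items = []

    def upsert(self, item_id, vector, payload=None):
        # Replace any existing entry with the same id, then append.
        self._items = [it for it in self._items if it[0] != item_id]
        self._items.append((item_id, list(vector), payload))

    def search(self, query, limit=3):
        # Score every stored vector and return the best matches first.
        scored = [
            (cosine(query, vector), item_id, payload)
            for item_id, vector, payload in self._items
        ]
        scored.sort(key=lambda entry: entry[0], reverse=True)
        return [(item_id, score, payload) for score, item_id, payload in scored[:limit]]


store = TinyVectorStore()
store.upsert("a", [1.0, 0.0], {"text": "first"})
store.upsert("b", [0.0, 1.0], {"text": "second"})
store.upsert("c", [1.0, 1.0], {"text": "third"})

results = store.search([1.0, 0.1], limit=2)
```

Here `results` holds the two stored items whose vectors point in the direction closest to the query, each with its similarity score and payload.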

  • Yacy

    Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

  • Project mention: New ways we're tackling spammy, low-quality content on Search | news.ycombinator.com | 2024-03-07
  • Gigablast

    Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.

  • OnionSearch

    OnionSearch is a script that scrapes urls on different .onion search engines.

  • Project mention: Launching Osint Industries: Discover Your Digital Footprint in Realtime | news.ycombinator.com | 2023-08-09

    Greetings, HN community. We are excited to share OSINT Industries, a platform dedicated to real-time open-source intelligence (OSINT) pertaining to phone numbers and emails.

    About OSINT Industries:

    Realtime Analysis: We provide an up-to-the-moment enrichment tool for emails, and phone numbers.

    Real-Time Intelligence: We refrain from using databases. Every piece of data is fetched in real-time, ensuring its accuracy and timeliness. None of the queries or results are stored.

    Extensive Reach: Our tool can identify associated accounts linked to a particular email or phone number from over 200 websites.

    Detailed Insights: Beyond basic association, our system can pull additional data points, such as images, map locations, and more.

    Pedigree: Our foundation is built upon proven tools our team made in the past like Holehe (https://github.com/megadose/holehe), GHunt (https://github.com/mxrch/GHunt), and onionsearch (https://github.com/megadose/OnionSearch).

    User Base: Within 3 months of our inception, we've got over 350k registered users.

    Trust & Reliability: Our tool has been integrated by various global law enforcement agencies, showcasing its reliability and utility.

    Try the tool for free to discover the digital footprint of your email and phone number. The first 5 searches are free: https://osint.industries

    We offer API access to enterprises, if you're interested in that contact me on [email protected].

    As our tool deals with data that some may view as sensitive, I think it is also important to link our policies here which govern our ethics, and data processing.

    Trust & Safety (our ethics): https://osint.industries/trust

  • sist2

    Lightning-fast file system indexer and search tool

  • Project mention: Better option then filebrowser to share files | /r/OpenMediaVault | 2023-06-11

    Quickly Googling for a docker indexer and search app I turned up Sist2, that on the surface looks like might fit your needs. I don't have an appropriate data store to run it against, so I can't speak to its indexing speed or efficacy. However, the developer does have an accessible demo to try, and the front end at least appears to function well.

  • domains

    World’s single largest Internet domains dataset

  • Project mention: There are only 2 .yahoo Internet domains | news.ycombinator.com | 2023-06-13
  • dark-web-osint-tools

    OSINT Tools for the Dark Web

  • Nuclia DB

    NucliaDB, The AI Search database for RAG

  • Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20

    You have also NucliaDB that is built on top of tantivy and addresses vector search for documents and video search.

  • SmartImage

    Reverse image search tool (SauceNao, IQDB, Ascii2D, trace.moe, and more)

  • tinyvector

    A tiny embedding database in pure Rust.

  • Project mention: Tinyvector - a tiny embedding database in pure Rust | /r/aiengineer | 2023-07-11
  • Seeks

    Seeks is a decentralized p2p websearch and collaborative tool.

  • artadosearch

    Artado Search is an open-source, private, and highly customizable search engine

  • multiSearchHome

    Local standalone HTML homepage for searching 175 search engines (duckduckgo, youtube, twitter, wikipedia, etc.)

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Search Engine projects? This list will help you:

Rank Project Stars
1 the-book-of-secret-knowledge 131,491
2 MeiliSearch 43,472
3 Typesense 18,107
4 qdrant 18,036
5 Yacy 3,265
6 Gigablast 1,518
7 OnionSearch 1,128
8 sist2 769
9 domains 643
10 dark-web-osint-tools 623
11 Nuclia DB 576
12 SmartImage 526
13 tinyvector 340
14 Seeks 262
15 artadosearch 152
16 multiSearchHome 4
