SemanticSlicer

A recursive text chunker that attempts to preserve context. (by drittich)

SemanticSlicer Alternatives

Similar projects and alternatives to SemanticSlicer based on common topics and language

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better SemanticSlicer alternative or higher similarity.

SemanticSlicer reviews and mentions

Posts with mentions or reviews of SemanticSlicer. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-06.
  • Pg_vectorize: The simplest way to do vector search and RAG on Postgres
    6 projects | news.ycombinator.com | 6 Mar 2024
    I wrote a C# library to do this, which is similar to other chunking approaches that are common, like the way langchain does it: https://github.com/drittich/SemanticSlicer

    Given a list of separators (regexes), it goes through them in order and keeps splitting the text by them until the chunk fits within the desired size. By putting the higher-level separators first (e.g., for HTML, splitting on a higher-level tag before a lower-level one), it's a pretty good proxy for maintaining context.
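
    The approach described in the post can be sketched roughly as follows. This is a minimal Python illustration, not SemanticSlicer's actual C# API: the function name `slice_text`, the separator list, and the choice to measure chunk size in characters are all assumptions made for the example.

```python
import re


def slice_text(text, separators, max_len):
    """Recursively split text until every chunk is at most max_len characters.

    `separators` is an ordered list of regex patterns (highest level first).
    Hypothetical helper for illustration; size is counted in characters here.
    """
    text = text.strip()
    if len(text) <= max_len:
        return [text] if text else []

    for i, sep in enumerate(separators):
        # Drop empty or whitespace-only pieces produced by the split.
        parts = [p for p in re.split(sep, text) if p.strip()]
        if len(parts) > 1:
            chunks = []
            for part in parts:
                # Pieces that are still too long fall through to the
                # remaining, lower-level separators.
                chunks.extend(slice_text(part, separators[i + 1:], max_len))
            return chunks

    # No separator matched: fall back to a hard cut at max_len.
    return [text[j:j + max_len] for j in range(0, len(text), max_len)]


# Example separators for plain text: paragraphs, lines, sentences, words.
separators = [r"\n\n+", r"\n", r"(?<=[.!?])\s+", r"\s+"]
text = "Title\n\nFirst sentence. Second sentence.\nAnother line."
for chunk in slice_text(text, separators, 20):
    print(repr(chunk))
```

    Because each recursive call only receives the separators below the one that just matched, a chunk is broken at the coarsest boundary that still lets it fit, which is what keeps related text together.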

Stats

Basic SemanticSlicer repo stats
1
7
7.5
5 months ago

drittich/SemanticSlicer is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of SemanticSlicer is C#.

