Python Dedupe

Open-source Python projects categorized as Dedupe

Top 5 Python Dedupe Projects

  • BorgBackup

    Deduplicating archiver with compression and authenticated encryption.

  • Project mention: Ask HN: Open-source Windows 11 backup solutions | news.ycombinator.com | 2024-04-04

    i use - and recommend - "borgbackup": for example with the "vorta" graphical frontend

    * https://www.borgbackup.org/

    * https://vorta.borgbase.com/install/windows/

    just my 0.02€

  • dedupe

    A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

  • Project mention: Using deep learning for Fuzzy Matching | /r/datascience | 2023-07-06
  • imgdupes

    Identifying and removing near-duplicate images using perceptual hashing.

  • dduper

    Fast block-level out-of-band BTRFS deduplication tool.

  • Deduper

    The goal of this project is to make a deduper program that anybody can run on their computer to save storage space. (by ThatOneShortGuy)

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Dedupe related posts

  • What do you use for VPS backup? Would improved borg setup - pull mode - be enough? Or, do you use something else?

    1 project | /r/selfhosted | 5 Dec 2023
  • Borg CVE fix requires migration

    1 project | news.ycombinator.com | 10 Oct 2023
  • Using deep learning for Fuzzy Matching

    1 project | /r/datascience | 6 Jul 2023
  • disc space is not freeing

    1 project | /r/openSUSE | 25 Jun 2023
  • I installed Arch today!

    1 project | /r/linux4noobs | 12 Jun 2023
  • Ask HN: What is the most cost effective way to do backups?

    1 project | news.ycombinator.com | 8 Jun 2023
  • How to best configure an HDD for cold storage?

    1 project | /r/DataHoarder | 7 Jun 2023

Index

What are some of the best open-source Dedupe projects in Python? This list will help you:

#  Project     Stars
1  BorgBackup  10,643
2  dedupe      3,992
3  imgdupes    336
4  dduper      163
5  Deduper     0
