bees
duperemove
bees | duperemove | |
---|---|---|
21 | 16 | |
603 | 669 | |
- | - | |
4.0 | 9.0 | |
about 1 month ago | 19 days ago | |
C++ | C | |
GNU General Public License v3.0 only | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bees
-
Converted ext4 to btrfs, tried defrag and ran out of space
Btrfs defrag 'will break up the reflinks of COW data' and 'may cause considerable increase of space usage depending on the broken up reflinks'. To try to fix this, I would run bees to try and deduplicate the now duplicate reflinks. It may be worth doing this from e.g. a livedisk though as out of space errors can cause things to break (so don't upgrade packages till you fix this).
-
Introducing Pins: Permanent Nix Binary Storage
Figuring out which paths are needed outside gcroots'ed closures is pretty complicated. If you're using flakes, the main issue is duplicates, so store optimization and bees may help. With channels, once you update a channel you might as well gc everything else.
-
rule
bees
- Should you remove duplicate files?
-
Poke holes in my git-annex + ZFS offline storage system
I felt more confident with the code/developer/docs. The author knows his stuff regarding btrfs. Like, look at this, it's amazing: https://github.com/Zygo/bees/blob/master/docs/btrfs-kernel.md
-
Anyone running Bees? Or deduping data some other way?
I have some time again and wondering if anyone's got Bees, https://github.com/Zygo/bees, running on their Synology.
-
The goal: Use Fedora 37 with Snapper to get a "riceable" Linux desktop that can be rolled back like a time machine (and some comments on why I don't use Silverblue)
Even if NixOS doesn't support sending deduplicating syscalls to the kernel, you could use the Btrfs deduping daemon called bees to slowly save space over time. There might be an equivalent for ZFS, too.
-
Questions Regarding BTRFS, Suspend, and Data Integrity
This isn't much different than ext4. 0 length files can happen after a crash. You can avoid this by mounting with flushoncommit for the future. See here for details.
-
Compression
Maybe BEES can help you to dedup any blocks, not file.
- Is Bees a after-solution to BTRFS defragmentation breaking reflinks ?
duperemove
-
fdupes: Identify or Delete Duplicate Files
Very useful for identifying files that may need to get deduplicate or that can be removed entirely. Unfortunately, I don't think this will also find identical directories.
If deleting files isn't what you want, I'd suggest looking into deduplicating tools.
ZFS has its own de duplicator built in, which is nice. It should just deduplicate files and individual extents of files by itself once you enable it. Probably not a good idea on very write-heavy disks, but it's an option.
Other file systems with extent level deduplication can use https://github.com/markfasheh/duperemove to not only deduplicaye files, but also deduplicate individual extents. This can be very useful for file systems that store a lot of duplicate content, like different WINE prefixes. For filesystems without extent deduplication, duperemove should try hard linking files to make them take up practically no disks space.
-
Bcachefs Merged into the Linux 6.7 Kernel
ZFS now has reflink support, which doesn't require lots of RAM, but isn't done automatically while writing. You need to run something like https://github.com/markfasheh/duperemove
- ZFS 2.2.0 (RC): Block Cloning merged
- Craziest thing I ever used SQLite for: partial file deduplication
-
Deduplication on EXT4
Then duperemove
-
What is a wineprefix and should I make a new one every time I add a new game to lutris?
Filesystems like Btrfs and XFS have support for deduplication, you can use a program like duperemove to save space.
-
File deduplication report?
Maybe you could use a file deduplication instead of a block based? https://github.com/markfasheh/duperemove
-
Anyone running Bees? Or deduping data some other way?
If not bees, do you run other programs for deduping? I see jdupes has support for BTRFS, https://github.com/jbruchon/jdupes, and also duperemove, https://github.com/markfasheh/duperemove.
-
Is switching to BTRFS useful for my use-cases?
It's a good filesystem, I use it with a special setup that needs a filesystem with snapshots. It's been stable for me, I run a duperemovehttps://github.com/markfasheh/duperemove) occasionally and that's about all the maintenance it needs.
-
With Proton being as good as it is now, do we still need separate prefixes for every game?
With Btrfs or XFS you can easily deduplicate the data with tools like duperemove, potentially saving a lot of space if you've installed many small games.
What are some alternatives?
dduper - Fast block-level out-of-band BTRFS deduplication tool.
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
btrbk - Tool for creating snapshots and remote backups of btrfs subvolumes
yarn-deduplicate - Deduplication tool for yarn.lock files
btrfs - WinBtrfs - an open-source btrfs driver for Windows
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
snap-sync - Use snapper snapshots to backup to external drive
compsize - btrfs: find compression type/ratio on a file or set of files
dedupe - :id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
bees-docker - docker container for zygo/bees