I recently benchmarked current versions of
bupstash for various scenarios to evaluate these tools.
I will likely update the report a couple times as the tools improve over time.
bupstash, I’ve implemented (but not benchmarked yet) a WIP for per-directory parallel
stat()ing, which is its current bottleneck for my use case.
kopia, my main request is to allow to runtime-configure the number of threads it uses (instead of having it hardcoded to 16), as my networked file system would benefit a lot from that. There are also a couple issues I found (and linked).
I would also appreciate that if you find some answers to the open questions in there (e.g. why my
kopia run didn’t deduplcate the data within the first run on the “4 GB, small files” dataset), please answer them here or file an issue in my report’s repo.
Thanks, and happy backuping!