
any offhand ideas about data deduplication?

Posted: 2018/11/09 06:32:30
by iwishitwouldwork
I can tell I need to decide something about my backups RSN.

I'm planning to get an M-disc drive and use that occasionally. So, as
a side consideration, are there any gotchas using M-disc drives on
CentOS?

ANYWAY -- before I digress any more -- I know I have duplicated
files on my discs. I'd like to eliminate or at least reduce that. What
are your thoughts on data dedup s/w? I'm not willing to spend any money
on it: I'd like to clean it up, but I don't need to. Hence, open-source
software. (I suspect I don't have too much duplication.)

Thx.

(Hmm, is this question appropriate here?)

j.

Re: any offhand ideas about data deduplication?

Posted: 2018/11/09 09:22:34
by TrevorH
There's a package in EPEL called fdupes that can be used on a one-off basis to find and optionally clean up duplicate files. I presume you don't create duplicates so often that you need something running all the time to deal with them.
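
For anyone curious what a one-off duplicate scan involves, here is a rough Python sketch of the general idea: group files by size first, then hash only the files that share a size. This is not fdupes's actual implementation, just an illustration; the names find_duplicates and file_digest are made up for this example, and the choice of SHA-256 is an assumption.

```python
import hashlib
import os
import sys
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted lists of paths under root with identical contents."""
    # Pass 1: bucket regular files by size (cheap, no file reads needed).
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue  # skip symlinks and special files
            by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only files whose size matches at least one other file.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)


if __name__ == "__main__":
    # Report only; nothing is deleted.
    start = sys.argv[1] if len(sys.argv) > 1 else "."
    for group in find_duplicates(start):
        print("\n".join(group), end="\n\n")
```

Run it with a directory as the only argument and it prints each set of identical files separated by a blank line. It only reports and never deletes anything. Note that hard links to the same file will show up as duplicates of each other, so check before removing anything by hand.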

Re: any offhand ideas about data deduplication?

Posted: 2018/11/09 15:44:44
by iwishitwouldwork
Aha! I will look into that. I suspect it will do all that I need, since
I don't genuinely need very much.

Thanks.

j.