any offhand ideas about data deduplication?

Issues related to applications and software problems
iwishitwouldwork
Posts: 47
Joined: 2014/02/08 14:56:39

Post by iwishitwouldwork » 2018/11/09 06:32:30

I can tell I need to decide something about my backups RSN.

I'm planning to get an M-disc drive and use that occasionally. So, as
a side consideration, are there any gotchas using M-disc drives on
CentOS?

ANYWAY -- before I digress any more -- I know I have duplicate
files on my discs. I'd like to eliminate or at least reduce them.
What are your thoughts on data dedup s/w? I'm not willing to spend
any money on it: cleaning this up would be nice, but it isn't
essential. Hence, open source software. (I suspect I don't have too
much duplication.)

Thx.

(Hmm, is this question appropriate here?)

j.

TrevorH
Forum Moderator
Posts: 23841
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: any offhand ideas about data deduplication?

Post by TrevorH » 2018/11/09 09:22:34

There's a package in EPEL called fdupes that can be used on a one-off basis to find and optionally clean up duplicate files. I presume you don't create so many of these duplicates all the time that you need to have something running to do it for you.
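
For anyone curious what a one-off duplicate scan involves, here is a minimal Python sketch of the general idea: group files by size first, then hash only the files whose sizes collide. This is an illustration under my own assumptions, not fdupes' actual implementation; the names find_duplicates and file_digest are made up for the example, and SHA-256 is simply a convenient choice of hash. It only reports duplicates and never deletes anything.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group regular files under root that have identical content.

    Returns a list of groups; each group is a sorted list of paths
    whose contents hash to the same digest. Symlinks are skipped.
    """
    # Pass 1: bucket files by size, since files of different sizes
    # cannot be duplicates and size is cheap to obtain.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only the files that share a size with another file.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return groups
```

Calling find_duplicates on a directory returns the groups of matching paths, which you can print and review by hand before deciding what to remove. For real cleanup work the packaged tool is the safer bet.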

iwishitwouldwork
Posts: 47
Joined: 2014/02/08 14:56:39

Re: any offhand ideas about data deduplication?

Post by iwishitwouldwork » 2018/11/09 15:44:44

Aha! I will look into that. I suspect it will do all that I need, since
I don't genuinely need very much.

Thanks.

j.
