any offhand ideas about data deduplication?

Issues related to applications and software problems
iwishitwouldwork
Posts: 88
Joined: 2014/02/08 14:56:39

Post by iwishitwouldwork » 2018/11/09 06:32:30

I can tell I need to decide something about my backups RSN.

I'm planning to get an M-disc drive and use that occasionally. So, as
a side consideration, are there any gotchas using M-disc drives on
CentOS?

ANYWAY -- before I digress any more -- I know I have duplicated
files on my discs. I'd like to end/reduce that. What are your
thoughts on data dedup s/w? I'm not willing to spend any money
on it -- I'd like to clean it up, but I don't need to.
Hence, open software. (I suspect I don't have too much duplication.)

Thx.

(Hmm, is this question appropriate here?)

j.

TrevorH
Site Admin
Posts: 33202
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: any offhand ideas about data deduplication?

Post by TrevorH » 2018/11/09 09:22:34

There's a package in EPEL called fdupes that can be used on a one-off basis to find and optionally clean up duplicate files. I presume you don't create duplicates so often that you need something running all the time to deal with them.
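For anyone curious what a one-off duplicate scan involves, here is a minimal Python sketch of the general idea: group files by size first, then hash only the files whose sizes collide. This is an illustration under my own assumptions, not fdupes' actual code; the names `file_digest` and `find_duplicates` are made up for the example, and it only reports duplicates, it never deletes anything.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted groups of regular files under root with identical content."""
    # Pass 1: bucket files by size; a file with a unique size has no duplicate.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_size[os.path.getsize(path)].append(path)

    # Pass 2: within each size bucket, group files by content hash.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)


if __name__ == "__main__":
    import sys

    # Hypothetical usage: pass a directory to scan, defaulting to the current one.
    for group in find_duplicates(sys.argv[1] if len(sys.argv) > 1 else "."):
        print("\n".join(group))
        print()
```

In practice fdupes is the easier route; as far as I know, `fdupes -r <dir>` recurses into subdirectories and lists the duplicate sets, but check the man page on your system before using any of its delete options.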

iwishitwouldwork
Posts: 88
Joined: 2014/02/08 14:56:39

Re: any offhand ideas about data deduplication?

Post by iwishitwouldwork » 2018/11/09 15:44:44

Aha! I will look into that. I suspect it will do all that I need, since
I don't genuinely need very much.

Thanks.

j.
