NAME

File::Find::Duplicates - Find duplicate files

SYNOPSIS

use File::Find::Duplicates;

my @dupes = find_duplicate_files('/basedir1', '/basedir2');

foreach my $dupeset (@dupes) { 
  printf "Files %s (of size %d) hash to %s\n",
    join(", ", @{$dupeset->files}), $dupeset->size, $dupeset->md5;
}

DESCRIPTION

This module provides a way of finding duplicate files on your system.

FUNCTIONS

find_duplicate_files

my %dupes = find_duplicate_files('/basedir1', '/basedir2');

When passed a base directory (or list of such directories) it returns a list of objects with the following methods:

files

A listref of the names of the duplicate files.

size

The size of the duplicate files.

md5

The md5 sum of the duplicate files.

TODO

Check the contents of tars, zipfiles etc to ensure none of these also exist elsewhere (if so requested).

SEE ALSO

File::Find.

AUTHOR

Tony Bowden

BUGS and QUERIES

Please direct all correspondence regarding this module to: bug-File-Find-Duplicates@rt.cpan.org

COPYRIGHT AND LICENSE

Copyright (C) 2001-2005 Tony Bowden.

This program is free software; you can redistribute it and/or modify it under
the terms of the GNU General Public License; either version 2 of the License,
or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
FOR A PARTICULAR PURPOSE.