NAME
md5_finddup.pl - find duplicate files in XML/md5 type files.
COPYRIGHT
Copyright (C) 2001, 2002 Mark Veltzer; All rights reserved.
LICENSE
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111, USA.
DETAILS
MANIFEST: md5_finddup.pl
PROJECT: meta
VERSION: 0.13
SYNOPSIS
md5_finddup.pl [options]
DESCRIPTION
This script processes an XML/md5 type file and looks for duplicate MD5 sums (which, in high probability, indicate that the same files are involved...), and prints out the files which have the same MD5 sums. The user gets to select what to do with the file: remove the file, keep the file etc... This is an interactive program.
OPTIONS
- help (type: bool, default: 0)
-
display help message
- pod (type: bool, default: 0)
-
display pod options snipplet
- man (type: bool, default: 0)
-
display manual page
- quit (type: bool, default: 0)
-
quit without doing anything
- gtk (type: bool, default: 0)
-
run a gtk ui to get the parameters
- license (type: bool, default: 0)
-
show license and exit
- copyright (type: bool, default: 0)
-
show copyright and exit
- description (type: bool, default: 0)
-
show description and exit
- history (type: bool, default: 0)
-
show history and exit
- input (type: file, default: /tmp/file.xml)
-
what input file to use ?
no free arguments are allowed
BUGS
None.
AUTHOR
Name: Mark Veltzer
Email: mailto:veltzer@cpan.org
WWW: http://www.veltzer.org
CPAN id: VELTZER
HISTORY
0.00 MV books XML into database
0.01 MV md5 project
0.02 MV database
0.03 MV perl module versions in files
0.04 MV graph visualization
0.05 MV thumbnail user interface
0.06 MV more thumbnail issues
0.07 MV website construction
0.08 MV improve the movie db xml
0.09 MV web site automation
0.10 MV SEE ALSO section fix
0.11 MV move tests to modules
0.12 MV finish papers
0.13 MV more pdmt stuff
SEE ALSO
MIME::Base64(3), Meta::Utils::File::Remove(3), Meta::Utils::Opts::Opts(3), Meta::Utils::Output(3), Meta::Utils::System(3), Term::ReadKey(3), XML::Parser(3), strict(3)
TODO
-fix problem with the parser that I have to do the hack for (the parser doenst seem to give me the whole character data in the handle_char callback...).