NAME
File::Xcopy - copy files after comparing them.
SYNOPSIS
use File::Xcopy;
my $fx = new File::Xcopy;
$fx->from_dir("/from/dir");
$fx->to_dir("/to/dir");
$fx->fn_pat('(\.pl|\.txt)$'); # files with pl & txt extensions
$fx->param('s',1); # search recursively to sub dirs
$fx->param('verbose',1); # search recursively to sub dirs
$fx->param('log_file','/my/log/file.log');
my ($sr, $rr) = $fx->get_stat;
$fx->xcopy; # or
$fx->execute('copy');
# the same with short name
$fx->xcp("from_dir", "to_dir", "file_name_pattern");
DESCRIPTION
The File::Xcopy module provides two basic functions, xcopy
and xmove
, which are useful for coping and/or moving a file or files in a directory from one place to another. It mimics some of behaviours of xcopy
in DOS but with more functions and options.
The differences between xcopy
and copy
are
xcopy
searches files based on file name pattern if the pattern is specified.xcopy
compares the timestamp and size of a file before it copies.xcopy
takes different actions if you tell it to.
The Constructor new(%arg)
Without any input, i.e., new(), the constructor generates an empty object with default values for its parameters.
If any argument is provided, the constructor expects them in the name and value pairs, i.e., in a hash array.
xcopy($from, $to, $pat, $par)
Input variables:
$from - a source file or directory
$to - a target directory or file name
$pat - file name match pattern, default to {.+}
$par - parameter array
log_file - log file name with full path
Variables used or routines called:
get_stat - get file stats
output - output the stats
execute - execute a action
How to use:
use File::Xcopy;
my $obj = File::Xcopy->new;
# copy all the files with .txt extension if they exists in /tgt/dir
$obj->xcopy('/src/files', '/tgt/dir', '\.txt$');
use File:Xcopy qw(xcopy);
xcopy('/src/files', '/tgt/dir', '\.txt$');
Return: ($n, $m).
$n - number of files copied or moved.
$m - total number of files matched
syscopy($from, $to)
Input variables:
$from - a source file or directory
$to - a target directory or file name
Variables used or routines called:
How to use:
use File::Xcopy;
syscopy('/src/file_a', '/tgt/dir/file_b'); # copy to a file
syscopy('/src/file_a', '/tgt/dir'); # copy to a dir
syscopy('/src/dir_a', '/tgt/dir_b'); # copy a dir to a dir
Return: none
xmove($from, $to, $pat, $par)
Input variables:
$from - a source file or directory
$to - a target directory or file name
$pat - file name match pattern, default to {.+}
$par - parameter array
log_file - log file name with full path
Variables used or routines called:
get_stat - get file stats
output - output the stats
execute - execute a action
How to use:
use File::Xcopy;
my $obj = File::Xcopy->new;
# move the files with .txt extension if they exists in /tgt/dir
$obj->xmove('/src/files', '/tgt/dir', '\.txt$');
Return: ($n, $m).
$n - number of files copied or moved.
$m - total number of files matched
execute ($act)
Input variables:
$act - action:
report|test - test run
copy|CP - copy files from source to target only if
1) the files do not exist or
2) newer than the existing ones
This is default.
overwrite|OW - copy files from source to target only if
1) the files exist and
2) no matter is older or newer
move|MV - same as in copy except it removes from source the
following files:
1) files are exactly the same (size and time stamp)
2) files are copied successfully
update|UD - copy files only if
1) the file exists in the target and
2) newer in time stamp
Variables used or routines called: None
How to use:
use File::Xcopy;
my $obj = File::Xcopy->new;
# update all the files with .txt extension if they exists in /tgt/dir
$obj->get_stat('/src/files', '/tgt/dir', '\.txt$');
my ($n, $m) = $obj->execute('overwrite');
Return: ($n, $m).
$n - number of files copied or moved.
$m - total number of files matched
get_stat($from, $to, $pat, $par)
Input variables:
$from - a source file or directory
$to - a target directory or file name
$pat - file name match pattern, default to {.+}
$par - parameter array
log_file - log file name with full path
I currently only implemented /S paramter. Here is an example on how to use the module:
package main;
my $self = bless {}, "main";
use File::Xcopy;
use Debug::EchoMessage;
my $xcp = File::Xcopy->new;
my $fm = '/opt/from/dir';
my $to = '/opt/to/dir';
my %p = (s=>1); # or $xcp->param('s',1);
my ($a, $b) = $xcp->get_stat($fm, $to, '\.sql$', \%p);
# $self->disp_param($a);
# $self->disp_param($b);
$xcp->output($a,$b);
$xcp->param('verbose',1);
my ($n, $m) = $xcp->execute('cp');
# $self->disp_param($xcp->param());
print "Total number of files matched: $m\n";
print "Number of files copied: $n\n";
I will implement the following parameters gradually:
source Specifies the file(s) to copy.
destination Specifies the location and/or name of new files.
/A Copies only files with the archive attribute set,
doesn't change the attribute.
/M Copies only files with the archive attribute set,
turns off the archive attribute.
/D:m-d-y Copies files changed on or after the specified date.
If no date is given, copies only those files whose
source time is newer than the destination time.
/EXCLUDE:file1[+file2][+file3]...
Specifies a list of files containing strings.
When any of the strings match any part of the absolute
path of the file to be copied, that file will be
excluded from being copied. For example, specifying a
string like \obj\ or .obj will exclude all files
underneath the directory obj or all files with the
.obj extension respectively.
/P Prompts you before creating each destination file.
/S Copies directories and subdirectories except empty ones.
/E Copies directories and subdirectories, including empty
ones. Same as /S /E. May be used to modify /T.
/V Verifies each new file.
/W Prompts you to press a key before copying.
/C Continues copying even if errors occur.
/I If destination does not exist and copying more than one
file,
assumes that destination must be a directory.
/Q Does not display file names while copying.
/F Displays full source and destination file names while
copying.
/L Displays files that would be copied.
/H Copies hidden and system files also.
/R Overwrites read-only files.
/T Creates directory structure, but does not copy files.
Does not include empty directories or subdirectories.
/T /E includes empty directories and subdirectories.
/U Copies only files that already exist in destination.
/K Copies attributes. Normal Xcopy will reset read-only
attributes.
/N Copies using the generated short names.
/O Copies file ownership and ACL information.
/X Copies file audit settings (implies /O).
/Y Suppresses prompting to confirm you want to overwrite an
existing destination file.
/-Y Causes prompting to confirm you want to overwrite an
existing destination file.
/Z Copies networked files in restartable mode.
Variables used or routines called:
from_dir - get from_dir
to_dir - get to_dir
fn_pat - get file name pattern
param - get parameters
find_files - get a list of files from a dir and its sub dirs
list_files - get a list of files from a dir
file_stat - get file stats
fmtTime - format time
How to use:
use File::Xcopy;
my $obj = File::Xcopy->new;
# get stat for all the files with .txt extension
# if they exists in /tgt/dir
$obj->get_stat('/src/files', '/tgt/dir', '\.txt$');
use File:Xcopy qw(xcopy);
xcopy('/src/files', '/tgt/dir', 'OW', '\.txt$');
Return: ($sr, $rr).
$sr - statistic hash array ref with the following keys:
OK - the files are the same in size and time stamp
txt - "The Same size and time"
cnt - count of files
szt - total bytes of all files in the category
NO - the files are different either in size or time
txt - "Different size or time"
cnt - count of files
szt - total bytes of all files in the category
OLD{txt|cnt|szt} - "File does not exist in FROM folder"
NEW{txt|cnt|szt} - "File does not exist in TO folder"
EX0{txt|cnt|szt} - "File is older or the same"
EX1{txt|cnt|szt} - "File is newer and its size bigger"
EX2{txt|cnt|szt} - "File is newer and its size smaller"
STAT
max_size - largest file in all the selected files
min_size - smallest file in all the selected files.
max_time - time stamp of the most recent file
min_time - time stamp of the oldest file
The sum of {OK} and {NO} is equal to the sum of {EX0}, {EX1} and {EX2}.
$rr - result hash array ref with the following keys {$f}{$itm}:
{$f} - file name relative to from_dir or to_dir
file - file name without dir parts
pdir - parent directory
prop - file stat array
rdir - relative file name to the $dir
path - full path of the file
type - file status: NEW, OLD, EX1, or EX2
f_pdir - parent dir for from_dir
f_size - file size in bytes from from_dir
f_time - file time stamp from from_dir
t_pdir - parent dir for to_dir
t_size - file size in bytes from to_dir
t_time - file time stamp from to_dir
tmdiff - time difference in seconds between the file
in from_dir and to_dir
szdiff - size difference in bytes between the file
in from_dir and to_dir
action - suggested action: CP, OW, SK
The method also sets the two parameters: stat_ar, file_ar and you can get it using this method:
my $sr = $self->param('stat_ar');
my $rr = $self->param('file_ar');
output($sr,$rr, $out, $par)
Input variables:
$sr - statistic hash array ref from xcopy
$rr - result hash array ref containing all the files and their
properties.
$out - output file name. If specified, the log_file will not be used.
$par - array ref containing parameters such as
log_file - log file name
Variables used or routines called:
from_dir - get from_dir
to_dir - get to_dir
fn_pat - get file name pattern
param - get parameters
action - get action name
format_number - format time or size numbers
How to use:
use File::Xcopy;
my $fc = File::Xcopy->new;
my ($s, $r) = $fc->get_stat($fdir, $tdir, 'pdf$')
$fc->output($s, $r);
Return: None.
If $out or log_file parameter is provided, then the result will be outputed to it.
format_number($n,$t)
Input variables:
$n - a numeric number
$t - number type:
size - in bytes or
time - in seconds
Variables used or routines called: None.
How to use:
use File::Xcopy;
my $fc = File::Xcopy->new;
# convert bytes to KB, MB or GB
my $n1 = $self->format_number(10000000); # $n1 = 9.537MB
# convert seconds to DDD:HH:MM:SS
my $n2 = $self->format_number(1000000,'time'); # $n2 = 11D13:46:40
Return: formated time difference in DDDHH:MM:SS or size in GB, MB or KB.
find_files($dir,$re)
Input variables:
$dir - directory name in which files and sub-dirs will be searched
$re - file name pattern to be matched.
Variables used or routines called: None.
How to use:
use File::Xcopy;
my $fc = File::Xcopu->new;
# find all the pdf files and stored in the array ref $ar
my $ar = $fc->find_files('/my/src/dir', '\.pdf$');
Return: $ar - array ref and can be accessed as ${$ar}[$i]{$itm}, where $i is sequence number, and $itm are
file - file name without dir
pdir - parent dir for the file
path - full path for the file
This method resursively finds all the matched files in the directory and its sub-directories. It uses finddepth
method from File::Find(1) module.
list_files($dir,$re)
Input variables:
$dir - directory name in which files will be searched
$re - file name pattern to be matched.
Variables used or routines called: None.
How to use:
use File::Xcopy;
my $fc = File::Xcopu->new;
# find all the pdf files and stored in the array ref $ar
my $ar = $fc->list_files('/my/src/dir', '\.pdf$');
Return: $ar - array ref and can be accessed as ${$ar}[$i]{$itm}, where $i is sequence number, and $itm are
file - file name without dir
pdir - parent dir for the file
path - full path for the file
This method only finds the matched files in the directory and will not search sub directories. It uses readdir
to get file names.
file_stat($dir,$ar)
Input variables:
$dir - directory name in which files will be searched
$ar - array ref returned from C<find_files> or C<list_files>
method.
Variables used or routines called: None.
How to use:
use File::Xcopy;
my $fc = File::Xcopu->new;
# find all the pdf files and stored in the array ref $ar
my $ar = $fc->find_files('/my/src/dir', '\.pdf$');
my $br = $fc->file_stat('/my/src/dir', $ar);
Return: $br - hash array ref and can be accessed as ${$ar}{$k}{$itm}, where $k is rdir
and the $itm are
size - file size in bytes
time - modification time in Perl time
file - file name
pdir - parent directory
This method also adds the following elements additional to 'file', 'pdir', and 'path' in the $ar array:
prop - file stat array
rdir - relative file name to the $dir
The following lists the elements in the stat array:
file stat array - ${$far}[$i]{prop}:
0 dev device number of filesystem
1 ino inode number
2 mode file mode (type and permissions)
3 nlink number of (hard) links to the file
4 uid numeric user ID of file's owner
5 gid numeric group ID of file's owner
6 rdev the device identifier (special files only)
7 size total size of file, in bytes
8 atime last access time in seconds since the epoch
9 mtime last modify time in seconds since the epoch
10 ctime inode change time (NOT creation time!) in seconds
sinc e the epoch
11 blksize preferred block size for file system I/O
12 blocks actual number of blocks allocated
This method converts the array into a hash array and add additional elements to the input array as well.
fmtTime($ptm, $otp)
Input variables:
$ptm - Perl time
$otp - output type: default - YYYYMMDD.hhmmss
1 - YYYY/MM/DD hh:mm:ss
5 - MM/DD/YYYY hh:mm:ss
11 - Wed Mar 31 08:59:27 1999
Variables used or routines called: None
How to use:
# return current time in YYYYMMDD.hhmmss
my $t1 = $self->fmtTime;
# return current time in YYYY/MM/DD hh:mm:ss
my $t2 = $self->fmtTime(time,1);
Return: date and time in the format specified.
CODING HISTORY
Version 0.01
04/15/2004 (htu) - Initial coding
Version 0.02
04/16/2004 (htu) - laid out the coding frame
Version 0.06
06/19/2004 (htu) - added the inline document
Version 0.10
06/25/2004 (htu) - finished the core coding and passed first testing.
Version 0.11
06/28/2004 (htu) - fixed the mistakes in documentation and populated internal variables.
Version 0.12
12/15/2004 (htu) - fixed a bug in the execute method.
12/26/2004 (htu) - added syscopy method to replace methods in File::Copy module. The copy method in File::Copy does not reserve the attributes of a file.
12/29/2004 (htu) - tested on Solaris and Win32 operating systems
FUTURE IMPLEMENTATION
add directory structure checking
Check whether the from_dir and to_dir have the same directory tree.
add advanced parameters
Ssearch file by a certain date, etc.
add syncronize action
Make sure the files in from_dir and to_dir the same by copying new files from from_dir to to_dir, update exisitng files in to_dir, and move files that do not exist in from_dir out of to_dir to a temp directory.
AUTHOR
Copyright (c) 2004 Hanming Tu. All rights reserved.
This package is free software and is provided "as is" without express or implied warranty. It may be used, redistributed and/or modified under the terms of the Perl Artistic License (see http://www.perl.com/perl/misc/Artistic.html)