NAME
Archive::Probe - A generic library to search for files within archives
SYNOPSIS
    use Archive::Probe;

    my $tmpdir   = '<temp_dir>';
    my $base_dir = '<directory_of_archive_files>';
    my $probe    = Archive::Probe->new();
    $probe->working_dir($tmpdir);
    $probe->add_pattern(
        '<your_pattern_here>',
        sub {
            my ($pattern, $file_ref) = @_;
            # do something with result files
        }
    );
    # A true second argument extracts the matched files to the working directory
    $probe->search($base_dir, 1);
DESCRIPTION
Archive::Probe is a generic library to search for files within archives.
It makes it easy to find a particular file, by name or by content, inside deeply nested archives of mixed types. It supports common archive types such as .tar, .tgz, .bz2, .rar, .zip and .7z, as well as Java archives such as .jar, .war and .ear. If the target archive contains another archive of the same or a different type, this module extracts the embedded archive to fulfill the inquiry. The level of embedding is unlimited.
This module depends on unzip, unrar, 7za and tar, which are assumed to be present in PATH. 7za is part of the 7-Zip utility. It is the preferred tool for .zip archives because it runs faster and handles meta characters better than unzip.
7-Zip is open source software; you can download and install it from www.7-zip.org, or install the binary package p7zip with your favorite package management software. unrar is freeware and can be downloaded from http://www.rarlab.com/rar_add.htm.
METHODS
constructor new()
Creates a new Archive::Probe object.
add_pattern(regex, coderef)
Register a file pattern to search for within the archive file(s), together with the callback code that handles the matched files. The callback is passed two arguments:
- $pattern
This is the pattern that the files matched.
- $file_ref
This is a reference to an array of the files that matched the pattern. The files are extracted, and hence exist on disk, only if the second argument of the search() method evaluates to true.
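The callback contract described above can be sketched as follows. This is a minimal illustration only: the sub name report_matches and the sample paths are hypothetical, and the sub is called directly here so that the snippet runs without any archive on disk.

```perl
use strict;
use warnings;

# Hypothetical callback: report every matched file and return the count.
sub report_matches {
    my ($pattern, $file_ref) = @_;
    my @files = @{ $file_ref || [] };
    print "Pattern '$pattern' matched ", scalar(@files), " file(s)\n";
    print "  $_\n" for @files;
    return scalar @files;
}

# Direct call with made-up paths, only to show the shape of the arguments.
my $count = report_matches(
    'web\.xml$',
    [ 'app1/WEB-INF/web.xml', 'app2/WEB-INF/web.xml' ],
);

# With the module installed, the same sub would be registered like this:
#   $probe->add_pattern('web\.xml$', \&report_matches);
```

Passing a named sub by reference instead of an anonymous sub keeps the callback testable on its own, as the direct call shows.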
search(base_dir, extract_matched)
Search for files matching the registered patterns under 'base_dir' and invoke the corresponding callbacks. It requires two arguments:
- $base_dir
This is the directory containing the archive file(s), or the archive file itself.
- $extract_matched
Extract or copy the matched files to the working directory if this parameter evaluates to true. This is useful when you need to search files by their content, not just by name.
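A content-based search could look roughly like the sketch below. The helper files_containing is hypothetical, and the snippet assumes that the entries of $file_ref are paths that can be opened directly once search() has been called with a true second argument. Temporary files stand in for the extracted matches so that the example runs on its own.

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);
use File::Spec;

# Hypothetical helper: return the files whose content matches $needle.
sub files_containing {
    my ($needle, $file_ref) = @_;
    my @hits;
    for my $file (@{$file_ref}) {
        next unless -f $file;            # skip entries that do not exist on disk
        open my $fh, '<', $file or next;
        while (my $line = <$fh>) {
            if ($line =~ $needle) {
                push @hits, $file;
                last;                    # one matching line is enough
            }
        }
        close $fh;
    }
    return @hits;
}

# Stand-ins for files that search() would have extracted (assumption).
my $dir     = tempdir(CLEANUP => 1);
my %content = (
    'a.properties' => "db.host=localhost\n",
    'b.properties' => "cache.size=10\n",
);
my @files;
for my $name (sort keys %content) {
    my $path = File::Spec->catfile($dir, $name);
    open my $out, '>', $path or die "Cannot write $path: $!";
    print {$out} $content{$name};
    close $out;
    push @files, $path;
}

my @hits = files_containing(qr/^db\.host=/, \@files);
print "$_\n" for @hits;

# Inside a real callback this would be called as:
#   my @hits = files_containing(qr/^db\.host=/, $file_ref);
```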
extract(base, to_dir, recursive, flat)
Extract an archive to the given destination directory. It accepts the following arguments:
- $base
This is the path to the archive file or the base archive directory.
- $to_dir
The destination directory.
- $recursive
Recursively extract all embedded archive files in the master archive if this parameter evaluates to true. It defaults to true.
- $flat
If this parameter evaluates to true, Archive::Probe extracts embedded archives into the same folder as their containing folder in recursive mode. Otherwise, it extracts the content of each embedded archive into its own directory, so that files with the same name from different embedded archives do not overwrite each other. The default is false.
reset_matches()
Reset the matched files list.
ACCESSORS
working_dir([directory])
Set or get the working directory where the temporary files will be created.
show_extracting_output([BOOL])
Enable or disable the output of the command-line archive tools.
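One possible way to use these accessors is with a throwaway working directory from File::Temp, as sketched below. The use of File::Temp is a suggestion, not something the module requires, and the snippet skips the Archive::Probe calls when the module is not installed.

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);

# Temporary directory that is removed automatically when the program exits.
my $tmpdir = tempdir(CLEANUP => 1);

if (eval { require Archive::Probe; 1 }) {
    my $probe = Archive::Probe->new();
    $probe->working_dir($tmpdir);        # setter form
    $probe->show_extracting_output(0);   # keep the archive tools quiet
    print "Working directory: ", $probe->working_dir(), "\n";   # getter form
}
else {
    print "Archive::Probe is not installed; skipping the accessor calls\n";
}
```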
HOW IT WORKS
Archive::Probe provides the plumbing and boilerplate code needed to search for files in nested archive files. It does the heavy lifting of extracting only the minimal set of files necessary to fulfill the inquiry.
SOURCE AVAILABILITY
This code is hosted on GitHub:
https://github.com/schnell18/archive-probe
BUG REPORTS
Please report bugs or other issues to <schnell18@rt.cpan.org>.
AUTHOR
This module is developed by Justin Zhang <fgz@cpan.org>.
COPYRIGHT
Copyright (C) 2013 by Justin Zhang
This library is free software; you may redistribute and/or modify it under the same terms as Perl itself.