NAME
Datahub::Factory - A conveyor belt which transports data from a data source to a data sink.
SYNOPSIS
dhconveyor [ARGUMENTS] [OPTIONS]
DESCRIPTION
Datahub::Factory is a command line conveyor belt which automates three tasks:
- Data is fetched automatically from a local or remote data source.
- Data is converted to an exchange format.
- The output is pushed to a data sink.
Datahub::Factory fetches data from several sources as specified by the Importer settings, executes a Fix and sends it to a data sink, set by Exporter. Several importer and exporter modules are supported.
Datahub::Factory contains Log4perl support to monitor conveyor belt operations.
Note: This toolset is not a generic tool. It has been tailored towards the functional requirements of the Flemish Art Collection use case.
CONFIGURATION
Command line options
All commands share the following switches:
--log_level
-
Set the log_level. Takes a numeric parameter. Supported levels are: 1 (WARN), 2 (INFO), 3 (DEBUG). WARN (1) is the default.
--log_output
-
Selects an output for the log messages. By default, it will send them to STDERR (pass
STDERR
as parameter), but STDOUT (STDOUT
) and a log file (logs/import_-date-.log
) (STATISTICS
) are also supported.
COMMANDS
help COMMAND
Documentation about command line options.
It is possible to provide either all importer and/or exporter options on the command line, or to create a pipeline configuration file that sets those options.
transport [OPTIONS]
Fetch data from a local or remote source, convert it to an exchange format and export the data.
Datahub::Factory::Command::transport
AUTHORS
- Pieter De Praetere <pieter@packed.be>
- Matthias Vandermaesen <matthias.vandermaesen@vlaamsekunstcollectie.be>
COPYRIGHT
Copyright 2016 - PACKED vzw, Vlaamse Kunstcollectie vzw
LICENSE
This library is free software; you can redistribute it and/or modify it under the terms of the GPLv3.