NAME

dw-load - generate a simple load script

SYNOPSIS

dw-load [options]

OPTIONS

--dsn
--db_user
--db_password
--fact

The name of the fact table.

--dimension

The name of one or more dimensions.

EXAMPLE

dw-load \
    --dsn='dbi:SQLite:dbname=test.db' \
    --fact='sales' \
    --dimension='user' \
    --dimension='product'

DESCRIPTION

dw-etl will create the base of a simple load script.

The load script, by default, will read from a tab-separated file containing all the necessary columns for fact and dimensions.

ETL

The load script is the last step of the ETL (extract/transform/load) process.

By now, we expect that the data was extracted from source systems, cleaned and exported to a tab-separated file. (Or any other format you think is convenient.)

The load script will read the input file and load the Data Warehouse database.

SLOWLY CHANGING DIMENSIONS

One of the most important tasks of the load script is to take care of "slowly changing dimensions".

Any dimensional attributes marked with "KEEP_HISTORY" flag will be checked against live data. Any changes to these fields will lead to the creation of a new record.

If you believe that one attribute will change often, consider if it shouldn't have its own dimension?

AUTHOR

Nelson Ferraz, <nferraz at gmail.com>

ACKNOWLEDGEMENTS

LICENSE AND COPYRIGHT

Copyright 2010 Nelson Ferraz.

This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.

See http://dev.perl.org/licenses/ for more information.