NAME
Data::Hopen::G::DAG - A hopen build graph
SYNOPSIS
This class encapsulates the DAG for a particular set of one or more goals. It is itself a Data::Hopen::G::Op so that it can be composed into other DAGs.
ATTRIBUTES
goals
Arrayref of the goals for this DAG.
default_goal
The default goal for this DAG.
_graph
The actual Graph. If you find that you have to use it, please open an issue so we can see about providing a documented API for your use case!
_final
The node to which all goals are connected.
_init_graph
A separate Graph of operations that will run before all the operations in "_graph". This is because I don't want to add an edge to every single node just to force the topological sort to work out.
_init_first
The first node to be run in _init_graph.
FUNCTIONS
_run
Traverses the graph. The DAG is similar to a subroutine in this respect. The outputs from all the goals of the DAG are aggregated and provided as the outputs of the DAG. The output is a hash keyed by the name of each goal, with each goal's outputs as the values under that name. Usage:
my $hrOutputs = $dag->run([-context=>$scope][, other options])
$scope
must be a Data::Hopen::Scope or subclass if provided. Other options are as "run" in Data::Hopen::Runnable.
ADDING DATA
goal
Creates a goal of the DAG. Goals are names for sequences of operations, akin to top-level Makefile targets. Usage:
my $goalOp = $dag->goal('name')
Returns a passthrough operation representing the goal. Any inputs passed into that operation are provided as outputs of the DAG under the corresponding name.
The first call to goal()
also sets "default_goal".
connect
- C<DAG:connect(<op1>, <out-edge>, <in-edge>, <op2>)>:
connects output C<< out-edge >> of operation C<< op1 >> as input C<< in-edge >> of
operation C<< op2 >>. No processing is done between output and input.
- C<< out-edge >> and C<< in-edge >> can be anything usable as a table index,
provided that table index appears in the corresponding operation's
descriptor.
- C<DAG:connect(<op1>, <op2>)>: creates a dependency edge from C<< op1 >> to
C<< op2 >>, indicating that C<< op1 >> must be run before C<< op2 >>.
Does not transfer any data from C<< op1 >> to C<< op2 >>.
- C<DAG:connect(<op1>, <Link>, <op2>)>: Connects C<< op1 >> to
C<< op2 >> via L<Data::Hopen::G::Link> C<< Link >>.
TODO return the name of the edge? The edge instance itself? Maybe a fluent interface to the DAG for chaining connect
calls?
add
Add a regular node to the graph. An attempt to add the same node twice will be ignored. Usage:
my $node = Data::Hopen::G::Op->new(name=>"whatever");
$dag->add($node);
Returns the node, for the sake of chaining.
init
Add an initialization operation to the graph. Initialization operations run before all other operations. An attempt to add the same initialization operation twice will be ignored. Usage:
my $op = Data::Hopen::G::Op->new(name=>"whatever");
$dag->init($op[, $first]);
If $first
is truthy, the op will be run before anything already in the graph. However, later calls to init()
with $first
set will push operations even before $op
.
Returns the node, for the sake of chaining.
ACCESSORS
empty
Returns truthy if the only nodes in the graph are internal nodes. Intended for use by hopen files.
OTHER
BUILD
Initialize the instance.
IMPLEMENTATION
Each DAG has a hidden "root" node. All outputs have edges from the root node. The traversal order is reverse topological from the root node, but is not constrained beyond that. Generators can ask for the nodes in root-first or root-last order.
The DAG is built backwards from the outputs toward the inputs, although calls to "output" and "connect" can appear in any order in the hopen
file as long as everything is hooked in by the end of the file.
The following is in flux:
- C<DAG>: A class representing a DAG. An instance called C<main> represents
what will be generated.
- C<DAG:set_default(<goal>)>: make C<< goal >> the default goal of this DAG
(default target).
- C<DAG:inject(<op1>,<op2>[, after/before'])>: Returns an operation that
lives on the edge between C<op1> and C<op2>. If the third parameter is
false, C<'before'>, or omitted, the new operation will be the first
operation on that edge. If the third parameter is true or C<'after'>,
the new operation will be the last operation on that edge. Any number
of operations can be injected on any edge.