NAME

Lucy::Index::SegWriter - Write one segment of an index.

DESCRIPTION

SegWriter is a conduit through which information fed to Indexer passes. It manages Segment and Inverter, invokes the Analyzer chain, and feeds low level DataWriters such as PostingListWriter and DocWriter.

The sub-components of a SegWriter are determined by Architecture. DataWriter components which are added to the stack of writers via add_writer() have Add_Inverted_Doc() invoked for each document supplied to SegWriter’s add_doc().

METHODS

register

$seg_writer->register(
    api       => $api        # required
    component => $component  # required
);

Register a DataWriter component with the SegWriter. (Note that registration simply makes the writer available via fetch(), so you may also want to call add_writer()).

  • api - The name of the DataWriter api which writer implements.

  • component - A DataWriter.

fetch

my $obj = $seg_writer->fetch($api);

Retrieve a registered component.

  • api - The name of the DataWriter api which the component implements.

add_writer

$seg_writer->add_writer($writer);

Add a DataWriter to the SegWriter’s stack of writers.

add_doc

$seg_writer->add_doc(
    doc   => $doc    # required
    boost => $boost  # default: 1.0
);

Add a document to the segment. Inverts doc, increments the Segment’s internal document id, then calls Add_Inverted_Doc(), feeding all sub-writers.

add_segment

$seg_writer->add_segment(
    reader  => $reader   # required
    doc_map => $doc_map  # default: undef
);

Add content from an existing segment into the one currently being written.

  • reader - The SegReader containing content to add.

  • doc_map - An array of integers mapping old document ids to new. Deleted documents are mapped to 0, indicating that they should be skipped.

finish

$seg_writer->finish();

Complete the segment: close all streams, store metadata, etc.

INHERITANCE

Lucy::Index::SegWriter isa Lucy::Index::DataWriter isa Clownfish::Obj.