NAME

Algorithm::Bayesian - Bayesian Spam Filtering Algorithm

SYNOPSIS

use Algorithm::Bayesian;
use Tie::Foo;

my %storage;
tie %storage, 'Tie:Foo', ...;
my $b = Algorithm::Bayesian->new(\%storage);

$b->spam('spamword1', 'spamword2', ...);
$b->ham('hamword1', 'hamword2', ...);

my $pr = $b->test('word1', 'word2', ...);

DESCRIPTION

Algorithm::Bayesian provide an easy way to handle Bayesian spam filtering algorithm.

SUBROUTINES/METHODS

new

my $b = Algorithm::Bayesian->new(\%hash);

Constructor. Simple hash would be fine. You can use Tie::DBI to store data to RDBM, or other key-value storage.

getHam

my $num = $b->getHam($word);

Get $word count in Ham.

getSpam

my $num = $b->getSpam($word);

Get $word count in Spam.

ham

$b->ham(@words);

Train @words as Ham.

spam

$b->spam(@words);

Train @words as Spam.

test

my $pr = $b->test(@words);

Calculate the spam probability of @words. The range of $pr will be in 0 to 1.

testWord

my $pr = $b->testWord($word);

Calculate the spam probability of $word.

The range of $pr will be in 0 to 1. For non-existence word, it will be 0.5.

AUTHOR

Gea-Suan Lin, <gslin at gslin.org>

LICENSE AND COPYRIGHT

Copyright 2010 Gea-Suan Lin.

This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.

See http://dev.perl.org/licenses/ for more information.