Changes for version 1.02

  • Removed defined on hash.
  • Minor improvements to the documentation.
  • Fixed encoding and parsing errors.
    • Use HTML::Encoding to find encoding.

Documentation

Script to update CNN news article corpus.

Modules

Make a corpus of CNN documents for research.
Parse CNN article for research.