Changes for version 0.008 - 2013-10-06

  • BREAKING: Switch to libreoffice instead of abiword. abiword ceased to wrap footnotes in handy classes that we could identify. This means that any post-processing scripts you may have written will now need to look for different classnames of footnotes, i.e., sdfootnoteanc for footnote refs in the text, and sdfootnotesym for footnote contents at the bottom.
  • Use file extension to hint which extractor to use.
  • Switch to Dist::Zilla.
  • Remove pernicious "_GoBack" ids in Word docx-xml derived HTML.
  • Delete invalid XML characters before passing the text to tidy.

Changes for version 0.008 - 2013-10-06

Modules

Take old or new format Word files and spit out extremely clean HTML.

Provides

in lib/MSWord/ToHTML/Doc.pm
in lib/MSWord/ToHTML/DocX.pm
in lib/MSWord/ToHTML/HTML.pm
in lib/MSWord/ToHTML/Roles/HasHTML.pm
in lib/MSWord/ToHTML/Types/Library.pm