NAME

diceware_modhex_wordlist.pl

DESCRIPTION

Generates a list of words for Diceware using only ModHex characters.

USAGE

diceware_modhex_wordlist.pl [--language|-l nl_NL] [--machine-friendly] [--no-rolls] [--min 5] [--max 10] [--shorter] [--update|u] [--version] [--verbose|-v] [--help]

REQUIRED ARGUMENTS

None.

OPTIONS

  • --language -l

    The language of the words in the wordlist as an ICU locale like da_DK. The chosen language should have a word frequency list on https://github.com/hermitdave/FrequencyWords and have support in Hunspell installed. Default is nl_NL because en_US doesn't yield enough words for a complete list.

  • --machine-friendly

    Generate a list of 8192 words instead of 7776 words. Diceware uses 7776 words to map the result of a roll with five 6-sided dice - 6^5 = 7776 - to a word, but if no dice are used a computer could use a list of 2^13 = 8192 words. This is also how a word could be selected using 13 coinflips per word so the roll is then shown as the result of 13 coinflips. Default it tries to generate 7776 words in a list for use with five dice.

  • --no-rolls

    Do not prepend the words with the set of dice rolls or coin flips. When the list is not used to manually map the rolls to words they might be omitted if software using the list only handles the words. Default the rolls are included.

  • --min

    The minimum length of the words selected for the list. Short words in a list could weaken a passphrase when a combination of short words lead to a short passphrase that can be cracked more easily than the used entropy of the Diceware method to generate the passphrase suggests. Default 5.

  • --max

    The maximum length of the words added to the list. Longer words are more cumbersome to use in a passphrase but to get enough words in the list we have to accept them up to some limit. Default 10.

  • --shorter

    Prefer shorter words over more frequently used words.

  • --update

    Update the wordlist used in the module instead of printing to standard output.

  • --version

    Print the version of this script.

  • --verbose

    Be more verbose.

  • --help

    Show help.

DIAGNOSTICS

EXIT STATUS

CONFIGURATION

To check if a word on a list is valid for the selected language it is checked by the Hunspell spellchecker that must be installed on the system and have support for the selected language installed.

DEPENDENCIES

Convert::ModHex, English, File::Basename, Getopt::Long, HTTP::Status, HTTP::Tiny, Lingua::EN::Inflexion, List::Uniq, Log::Log4perl, Math::Base::Convert, Pod::Usage::CommandLine, Pod::Usage, Readonly, String::Pad, Text::Hunspell

INCOMPATIBILITIES

Currently only the languages Danish (da_DK), Hungarian (hu_HU), Dutch (nl_NL), Turkish (tr_TR) and their regional variants provide enough words containing only ModHex characters to generate a list with enough words. Danish and Dutch require a maximum length of 10 to provide enough words, Hungarian and Turkish require only a maximum length of 9.

BUGS AND LIMITATIONS

Because the ModHex requirement already limits the number of allowed words significantly it usually isn't possible to limit the list further to words that aren't composites of each other and still get a list of 7776 words as a result. Therefore the words in these wordlists require the use of spaces between the words in a passphrase to avoid weakening the passphrase.

AUTHOR

Roland van Ipenburg <roland@rolandvanipenburg.com>

LICENSE AND COPYRIGHT

Copyright (c) 2025 Roland van Ipenburg.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself. The full text of this license can be found in the LICENSE file included with this module.

The wordlists generated by this program are based on FrequencyWords content Copyright (c) 2016 Hermit Dave https://github.com/hermitdave/FrequencyWords CC-by-sa-4.0 https://creativecommons.org/licenses/by-sa/4.0/deed.en