NAME
diceware_modhex_wordlist.pl
DESCRIPTION
Generates a list of words for Diceware using only ModHex characters.
USAGE
diceware_modhex_wordlist.pl [--language|-l nl_NL] [--machine-friendly] [--no-rolls] [--min 5] [--max 10] [--shorter] [--update|u] [--version] [--verbose|-v] [--help]
REQUIRED ARGUMENTS
None.
OPTIONS
--language -l
The language of the words in the wordlist as an ICU locale like
da_DK
. The chosen language should have a word frequency list on https://github.com/hermitdave/FrequencyWords and have support in Hunspell installed. Default isnl_NL
becauseen_US
doesn't yield enough words for a complete list.--machine-friendly
Generate a list of 8192 words instead of 7776 words. Diceware uses 7776 words to map the result of a roll with five 6-sided dice - 6^5 = 7776 - to a word, but if no dice are used a computer could use a list of 2^13 = 8192 words. This is also how a word could be selected using 13 coinflips per word so the roll is then shown as the result of 13 coinflips. Default it tries to generate 7776 words in a list for use with five dice.
--no-rolls
Do not prepend the words with the set of dice rolls or coin flips. When the list is not used to manually map the rolls to words they might be omitted if software using the list only handles the words. Default the rolls are included.
--min
The minimum length of the words selected for the list. Short words in a list could weaken a passphrase when a combination of short words lead to a short passphrase that can be cracked more easily than the used entropy of the Diceware method to generate the passphrase suggests. Default 5.
--max
The maximum length of the words added to the list. Longer words are more cumbersome to use in a passphrase but to get enough words in the list we have to accept them up to some limit. Default 10.
--shorter
Prefer shorter words over more frequently used words.
--update
Update the wordlist used in the module instead of printing to standard output.
--version
Print the version of this script.
--verbose
Be more verbose.
--help
Show help.
DIAGNOSTICS
EXIT STATUS
CONFIGURATION
To check if a word on a list is valid for the selected language it is checked by the Hunspell spellchecker that must be installed on the system and have support for the selected language installed.
DEPENDENCIES
Convert::ModHex, English, File::Basename, Getopt::Long, HTTP::Status, HTTP::Tiny, Lingua::EN::Inflexion, List::Uniq, Log::Log4perl, Math::Base::Convert, Pod::Usage::CommandLine, Pod::Usage, Readonly, String::Pad, Text::Hunspell
INCOMPATIBILITIES
Currently only the languages Danish (da_DK), Hungarian (hu_HU), Dutch (nl_NL), Turkish (tr_TR) and their regional variants provide enough words containing only ModHex characters to generate a list with enough words. Danish and Dutch require a maximum length of 10 to provide enough words, Hungarian and Turkish require only a maximum length of 9.
BUGS AND LIMITATIONS
Because the ModHex requirement already limits the number of allowed words significantly it usually isn't possible to limit the list further to words that aren't composites of each other and still get a list of 7776 words as a result. Therefore the words in these wordlists require the use of spaces between the words in a passphrase to avoid weakening the passphrase.
AUTHOR
Roland van Ipenburg <roland@rolandvanipenburg.com>
LICENSE AND COPYRIGHT
Copyright (c) 2025 Roland van Ipenburg.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself. The full text of this license can be found in the LICENSE file included with this module.
The wordlists generated by this program are based on FrequencyWords content Copyright (c) 2016 Hermit Dave https://github.com/hermitdave/FrequencyWords CC-by-sa-4.0 https://creativecommons.org/licenses/by-sa/4.0/deed.en