Generating word forms

Have you ever wanted to create a list of possible words in a language you are working on? Have you started creating a dictionary but now need to find words that are not yet recorded? This could be the app for you. Word Generator is a free web service that lets you upload a list of words that you know, together with a list of consonants and vowels, like this:

Consonants: b, rd, d, k, g, j, rl, l, lh, ly, m, n, nh, ng, ny, rn, yh, r, rr, n, ng, y, th, w
Vowels: a, aa, i, ii, u, uu

alardi
arinji
arlibala
[ … ]

Word Generator will generate a list of possible words based on this information. Its settings let you adjust how probable the generated words should be, how many words are produced, and how long they are. You can then ask speakers to look through the list to help them think of words that are not already in the dictionary, and the list may also prompt useful discussion about other forms and meanings.
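To give a feel for how a tool like this might work, here is a minimal Python sketch. It is not Word Generator's actual algorithm, which this post does not describe. It assumes a very simple approach: split each known word into symbols from the inventory (longest match first, so that "rd" is read as one consonant), record which symbol follows which, and then chain those attested transitions into new strings that are not already in the list. The function names, the "^" and "$" boundary markers, and the max_len and seed parameters are all invented for this illustration.

```python
import random
from collections import defaultdict

# Inventory and word list copied from the example above (repeated entries dropped).
CONSONANTS = ["b", "rd", "d", "k", "g", "j", "rl", "l", "lh", "ly", "m", "n",
              "nh", "ng", "ny", "rn", "yh", "r", "rr", "y", "th", "w"]
VOWELS = ["a", "aa", "i", "ii", "u", "uu"]
KNOWN_WORDS = ["alardi", "arinji", "arlibala"]


def segment(word, inventory):
    """Split a word into inventory symbols, trying the longest symbols first."""
    symbols = sorted(set(inventory), key=len, reverse=True)
    out, i = [], 0
    while i < len(word):
        for s in symbols:
            if word.startswith(s, i):
                out.append(s)
                i += len(s)
                break
        else:
            raise ValueError(f"cannot segment {word!r} at position {i}")
    return out


def train(words, inventory):
    """Record every attested symbol-to-symbol transition, with word boundaries."""
    follows = defaultdict(list)
    for w in words:
        seq = ["^"] + segment(w, inventory) + ["$"]  # hypothetical boundary markers
        for a, b in zip(seq, seq[1:]):
            follows[a].append(b)
    return follows


def generate(follows, known, count, max_len=8, seed=0, max_tries=10000):
    """Chain attested transitions into new candidate words not in `known`."""
    rng = random.Random(seed)
    seen, found = set(known), []
    for _ in range(max_tries):
        if len(found) >= count:
            break
        cur, out = "^", []
        while len(out) <= max_len:
            cur = rng.choice(follows[cur])  # frequent transitions are picked more often
            if cur == "$":
                break
            out.append(cur)
        else:
            continue  # candidate grew past max_len symbols; discard it
        word = "".join(out)
        if word and word not in seen:
            seen.add(word)
            found.append(word)
    return found


if __name__ == "__main__":
    model = train(KNOWN_WORDS, CONSONANTS + VOWELS)
    for candidate in generate(model, KNOWN_WORDS, count=5):
        print(candidate)
```

With only three known words the candidates will be few and repetitive. A real word list gives the model far more transitions to draw on, and a real tool would presumably also score candidates, which is what a probability setting suggests.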

Please try Word Generator and post any feedback here or by email to me.

Word Generator is being written by Andreas Scherbakov as part of a project funded by ARC Future Fellowship FT140100214.

1 thought on “Generating word forms”

  1. A new version of Word Generator was uploaded yesterday (14/12/15). It adds a vowel confidence level calculation and also some explanatory material for the algorithms it uses.
