The word list itself contains 69,903 words, and takes up 665,681 bytes (that's about two-thirds of a megabyte). There are also 69,903 lines in the file, since each word is on a line by itself. I.e, the file (which is called wordlist ) is big and long, and so are most of the words in it. This in turn means that many if not most of the words are rare, obsolete, or simply weird. Furthermore, the entire file is in lower case, including all the proper names. However, you can look up the strange words at 1911 version of Roget's Thesaurus,
I made this list from these sources as an exercise in applying Unix tools,
and as something
to give my students something to play with. That's all it is.
It may be useful to you, which is why I put it here. Or it may not.
No warrantees, implied or expressed.
You can do whatever you like with it; it's your language, after all. For instance, if you were using Unix (or Linux), and you were curious about the English words that ended in -ically, or the ones that started with antid-, then the egrep (or grep) commands
To save this list on your disk, go to the "File" menu of your Web browser after it's done downloading and you've looked at it all you want, and choose Save As. Then you can move the file wordlist to wherever you'd like.