The following lists some helpful data files to feed to some of the example programs. See also the projects and all the code listings.
Exploration 14: Use explore14.txt as sample input to the word-counting programs.
Exploration 51: Words, words, words. Use sorted-words.txt and shuffled-words.txt as input to Listing 51-2. Compare the speed of the program using these two files, which differ only in the order. These files are from wordlist on SourceForge.
Exploration 52: palindrome input files. UTF-8 is a Unicode encoding, so it can represent all the character sets, so palindrome-utf8.txt lists all the palindromes. The ISO character sets can express only one particular extended character set, e.g., Greek, Cryllic. Thus, russian-8859-5.txt lists only the Russian palindrome because it cannot represent the Greek or Western European characters. The koi8 character set is more widely-used than the ISO standard 8859-5. The koi8-r is the Russian variant of koi8.