...document source of corpus and post-processing

http://www.speech.cs.cmu.edu/cgi-bin/cmudict

File: http://svn.code.sf.net/p/cmusphinx/code/trunk/cmudict/cmudict-0.7b
