Random texts exhibit Zipfapos;s-law-like word frequency distribution
Li, W.
Information Theory, IEEE Transactions on
Volume 38, Issue 6, Nov 1992 Page(s):1842 - 1845
Digital Object Identifier 10.1109/18.165464
Summary:It is shown that the distribution of word frequencies for randomly
generated texts is very similar to Zipf's law observed in natural
languages such as English. The facts that the frequency of occurrence of
a word is almost an inverse power law function of its rank and the
exponent of this inverse power law is very close to 1 are largely due to
the transformation from the word's length to its rank, which stretches
an exponential function to a power law function
View citation and abstract |