Deducing linguistic structure from the statistics of large corpora | IEEE Conference Publication | IEEE Xplore