Improving n-gram probability estimates by compound-head clustering | IEEE Conference Publication | IEEE Xplore