Romanized urdu Corpus development (RUCD) model: Edit-distance based most frequent unique unigram extraction approach using real-time interactive dataset | IEEE Conference Publication | IEEE Xplore