Representing Character Sequences as Sets : A simple and intuitive string encoding algorithm for NLP data cleaning | IEEE Conference Publication | IEEE Xplore