Abstract:
We present a novel language adaptable spell checking system that detects spelling errors and suggests context-sensitive corrections in real-time. We show that our system ...Show MoreMetadata
Abstract:
We present a novel language adaptable spell checking system that detects spelling errors and suggests context-sensitive corrections in real-time. We show that our system can be extended to new languages with minimal language-specific processing. Available literature majorly discusses spell checkers for English but there are no publicly available systems that can be extended to work for other languages out of the box. Most of the systems do not work in real-time. We explain the process of generating a language's word dictionary and n-gram probability dictionaries using Wikipedia-articles data and manually curated video subtitles. We present the results of generating a list of suggestions for a misspelled word. We also propose three approaches to create noisy channel datasets of real-world typographic errors. Finally, we show the effectiveness of language adaptability of our proposed system by extending it to 24 languages.
Date of Conference: 03-05 February 2020
Date Added to IEEE Xplore: 12 March 2020
ISBN Information:
Print on Demand(PoD) ISSN: 2325-6516