Turkish spelling error detection and correction by using word n-grams

2 Author(s)
Dalkiliç, G. ; Comput. Eng. Dept., Dokuz Eylul Univ., Izmir, Turkey ; Cebi, Y.

N-grams can be used for spell checking and correction. The first step in using n-grams is to extract the language-specific n-grams from a corpus. However, no corpus can be large enough to contain all possible word n-grams. Back-off smoothing is one technique for estimating the frequency of n-grams that do not occur in the corpus. Using the back-off technique and the Minimum Edit Distance (MED) algorithm, a program was developed to detect spelling errors and suggest corrections in a sentence typed in Turkish. The results were compared with those of the Microsoft Word 2003 proofing tools and were found to be much better.
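
The combination described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' program: the toy corpus, the function names (`train`, `backoff_score`, `suggest`), the fixed back-off weight `alpha`, and the distance threshold `max_dist` are all assumptions made for illustration. The back-off shown is a simplified fixed-weight variant rather than any particular smoothing scheme from the paper, and only word bigrams are used.

```python
from collections import Counter

# Hypothetical toy corpus; a real system would use a large Turkish corpus.
CORPUS = [
    "ben eve gidiyorum",
    "ben okula gidiyorum",
    "sen eve geliyorsun",
]

def edit_distance(a, b):
    """Minimum edit distance (insert, delete, substitute; each costs 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # substitution or match
        prev = cur
    return prev[-1]

def train(sentences):
    """Count word unigrams and bigrams in whitespace-tokenized sentences."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        words = sentence.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams

def backoff_score(prev, word, unigrams, bigrams, alpha=0.4):
    """Score `word` after `prev`; back off to the unigram if the bigram is unseen.

    Assumption: a fixed weight `alpha` stands in for a proper back-off discount.
    """
    if prev is not None and bigrams[(prev, word)] > 0:
        return bigrams[(prev, word)] / unigrams[prev]
    return alpha * unigrams[word] / sum(unigrams.values())

def suggest(prev, word, unigrams, bigrams, max_dist=2):
    """Return `word` if known; otherwise the best-scoring nearby vocabulary word."""
    if word in unigrams:
        return word
    candidates = [w for w in unigrams if edit_distance(word, w) <= max_dist]
    if not candidates:
        return word  # nothing close enough; leave the word unchanged
    return max(candidates,
               key=lambda w: (backoff_score(prev, w, unigrams, bigrams),
                              -edit_distance(word, w)))

unigrams, bigrams = train(CORPUS)
print(suggest("eve", "gidiyorm", unigrams, bigrams))  # prints "gidiyorum"
```

In this sketch, MED narrows the vocabulary to plausible corrections, and the back-off score ranks them by how well they fit the preceding word, falling back to plain word frequency when the bigram was never observed.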

Published in:

Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control, 2009. ICSCCW 2009. Fifth International Conference on

Date of Conference:

2-4 Sept. 2009