By Topic

OCRMPD: OCR system for Myanmar printed document image with a novel segmentation method and hierarchical classification scheme

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Win, H.P.P. ; Univ. of Comput. Studies, Yangon, Myanmar ; Khine, P.T.T. ; Tun, K.N.N.

As large quantity of document images is getting archived by the digital libraries, an efficient strategy that can convert Myanmar document image into machine understandable text format is needed. And Myanmar language contains many words, and most of them are similar, especially for small fonts, the accuracy of the Optical Character Recognition, OCR system for Myanmar may be low. Therefore, this paper designs an OCR system for Myanmar Printed Document (OCRMPD) with several proposed methods that can automatically convert Myanmar printed text to machine understandable text. In order to get more accurate system, enhance the input image by removing noise and making some correction on variants. A method for isolation of the character image is proposed by using connected component analysis for wrongly segmented characters produced by projection only. Finally, hierarchical mechanism is used for SVM classifier for recognition of the character image. The proposed algorithms have been tested on a variety of Myanmar printed documents and the results of the experiments indicate that the methods can increase the segmentation accuracy as well as recognition rates.

Published in:

Intelligent Computer Communication and Processing (ICCP), 2011 IEEE International Conference on

Date of Conference:

25-27 Aug. 2011