By Topic

Part of speech Tagging for Myanmar using Hidden Markov Model

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Khine Khine Zin ; University of Computer Studies, Yangon, Myanmar ; Ni Lar Thein

Part-Of-Speech (POS) Tagging is the process of assigning the words with their categories that best suits the definition of the word as well as the context of the sentence in which it is used. In this paper, we describe a machine learning algorithm for Myanmar Tagging using a corpus-based approach. In order to tag Myanmar language, we need to take part word segmentation, part of speech tagging using HMM and several Tag-sets. Thus, this paper deals with a combination of supervised and un-supervised learning which use pre-tagged and untagged corpus respectively. To assign to each word with the correct tag, we describe Supervised POS Tagging by using the class labels in terms of predictor features on manually tagged corpus and also describe Unsupervised POS Tagging for automatically training without using a manually tagged corpus. By experiments, the best configuration is investigated on different amount of training data and the accuracy is 97.56%.

Published in:

2009 International Conference on the Current Trends in Information Technology (CTIT)

Date of Conference:

15-16 Dec. 2009