Loading [MathJax]/extensions/MathMenu.js
Keyword extraction based on statistical information for cyrillic Mongolian script | IEEE Conference Publication | IEEE Xplore
Scheduled Maintenance: On Monday, 30 June, IEEE Xplore will undergo scheduled maintenance from 1:00-2:00 PM ET (1800-1900 UTC).
On Tuesday, 1 July, IEEE Xplore will undergo scheduled maintenance from 1:00-5:00 PM ET (1800-2200 UTC).
During these times, there may be intermittent impact on performance. We apologize for any inconvenience.

Keyword extraction based on statistical information for cyrillic Mongolian script


Abstract:

We present a keyword extraction system for Mongolian documents using word co-occurrence statistical information which used in for English, Chinese and other languages. Th...Show More

Abstract:

We present a keyword extraction system for Mongolian documents using word co-occurrence statistical information which used in for English, Chinese and other languages. This method based on extracting top frequent words and building the co-occurrence matrix showing the occurrence of each frequent word. The biasness degree of the words and the set of frequent words are measured using CHI-Square Method (χ2). Also, the weight of the words and the set of frequent words are measured using word frequency - inverted word frequency (WF-IWF). Therefore words with high χ2 values and high WF-IWF values are likely to be keywords. The adopted χ2 method in this study is compared with another one method based on WF-IWF which tested for Mongolian. Two different documents were used to evaluate the system performance. We evaluate the effectiveness of χ2 method and WF-IWF method. Results show that the χ2 method is better than WF-IWF.
Date of Conference: 28-30 May 2017
Date Added to IEEE Xplore: 17 July 2017
ISBN Information:
Electronic ISSN: 1948-9447
Conference Location: Chongqing, China

Contact IEEE to Subscribe

References

References is not available for this document.