
Text2arff: Automatic feature extraction software for Turkish texts


Abstract:

Which features matter most for text classification tasks? Several studies in the automatic text categorization area seek to answer this question. This paper presents Text2arff, a feature extraction tool for Turkish texts. The toolbox automatically extracts several kinds of features, such as word and n-gram frequencies, word clustering, and latent semantic indexing. The extracted features are saved in the ARFF (WEKA) file format, so the resulting files can be used directly with the WEKA machine learning library.
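
To make the idea of writing text features to an ARFF file concrete, the following is a minimal sketch in Python. It is not the Text2arff implementation: the function names `tokenize` and `to_arff`, the `w_` attribute prefix, the whitespace tokenizer, and the two sample documents are all assumptions made for illustration. It covers only raw word counts, not n-grams, word clustering, or latent semantic indexing.

```python
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    # Real Turkish text would need locale-aware casing (dotted/dotless i)
    # and probably morphological analysis, which this sketch omits.
    return text.lower().split()


def to_arff(docs, labels, relation="texts"):
    """Return an ARFF string with one numeric word-count attribute per word."""
    counts = [Counter(tokenize(d)) for d in docs]
    vocab = sorted(set().union(*counts))   # all distinct words, sorted
    classes = sorted(set(labels))          # nominal class values
    lines = [f"@relation {relation}", ""]
    # Assumption: words are plain tokens; names containing spaces or
    # special characters would need quoting in a real ARFF file.
    lines += [f"@attribute w_{w} numeric" for w in vocab]
    lines.append("@attribute class {" + ",".join(classes) + "}")
    lines += ["", "@data"]
    for c, label in zip(counts, labels):
        row = [str(c[w]) for w in vocab]   # Counter yields 0 for absent words
        lines.append(",".join(row + [label]))
    return "\n".join(lines)


# Illustrative input only; not data from the paper.
sample_docs = ["gol top gol", "borsa faiz borsa"]
sample_labels = ["spor", "ekonomi"]
print(to_arff(sample_docs, sample_labels))
```

The string it returns can be saved with an `.arff` extension and opened in WEKA. A dense word-count matrix like this grows quickly with vocabulary size, so a practical tool would likely use ARFF's sparse row format or prune rare words.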
Date of Conference: 22-24 April 2010
Date Added to IEEE Xplore: 03 December 2010
ISBN Information:
Print ISSN: 2165-0608
Conference Location: Diyarbakir, Turkey
