Skip to Main Content
This research proposes NTSO (neural text self organizer) as the approach to text clustering and sets inverted index as the basis for execution of the NTSO. For using one of traditional approaches, documents should be encoded into numerical vectors and encoding so causes the two main problems: the huge dimensionality and the sparse distribution. This research proposes that documents should be encoded into string vectors as the alternative structured forms to numerical vectors and NTSO should be used as the approach to text clustering. By solving the two main problems, the proposed approach is expected to improve the performance of text clustering. By comparing the proposed approach with other approaches, we will validate the text clustering performance of the proposed approach as the results of solving the problems.