A New Hybrid Farsi Text Summarization Technique Based on Term Co-Occurrence and Conceptual Property of the Text | IEEE Conference Publication | IEEE Xplore

Abstract:

The importance of text summarization grows rapidly as the amount of information increases exponentially. This paper presents a new hybrid summarization technique that combines statistical properties of documents with Farsi linguistic features. The originality of the technique lies in its use of the term co-occurrence property of the text, which allows it to detect the number of subjects in a document. The proposed technique summarizes the document in proportion to the subjects treated in it. The algorithm also considers the conceptual property of the text and, based on word synonymy, prevents similar sentences from being included in the summary. It also preserves the cohesion of the summarized text. Our results show better performance in comparison with FarsiSum, a well-known Farsi summarizer, which is based only on the heuristic properties of the text and does not address the challenges specific to Farsi.
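
The abstract does not give the algorithm itself, so the following is only a minimal sketch of two of the ideas it names: giving each subject a share of the summary in proportion to its size, and using word synonymy to keep near-duplicate sentences out. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the largest-remainder rounding, the Jaccard overlap measure, the 0.6 threshold, and the toy synonym map. English sentences stand in for Farsi text, and the sentences are assumed to be already grouped by subject and ranked.

```python
from math import floor


def allocate_quota(subject_sizes, summary_len):
    """Split summary_len sentence slots across subjects in proportion to size.

    Uses largest-remainder rounding (an assumption, not the paper's rule).
    Expects a non-empty list with a positive total.
    """
    total = sum(subject_sizes)
    raw = [summary_len * size / total for size in subject_sizes]
    quota = [floor(r) for r in raw]
    leftover = summary_len - sum(quota)
    # Hand the remaining slots to the subjects with the largest fractional parts.
    by_remainder = sorted(range(len(raw)), key=lambda i: raw[i] - quota[i], reverse=True)
    for i in by_remainder[:leftover]:
        quota[i] += 1
    return quota


def normalize(sentence, synonyms):
    """Map each word to a canonical synonym so that paraphrases look alike."""
    return {synonyms.get(word, word) for word in sentence.lower().split()}


def jaccard(a, b):
    """Overlap between two word sets, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def select_sentences(ranked_by_subject, quotas, synonyms, threshold=0.6):
    """Pick up to quota sentences per subject, skipping near-duplicates."""
    chosen = []
    for sentences, quota in zip(ranked_by_subject, quotas):
        taken = 0
        for sentence in sentences:
            if taken == quota:
                break
            words = normalize(sentence, synonyms)
            # Keep the sentence only if it is dissimilar to everything chosen so far.
            if all(jaccard(words, normalize(c, synonyms)) < threshold for c in chosen):
                chosen.append(sentence)
                taken += 1
    return chosen


# Hypothetical input: two subjects, sentences pre-ranked within each subject.
synonyms = {"car": "automobile"}
ranked = [
    ["the car is fast", "the automobile is fast", "roads are wide"],
    ["prices rose sharply"],
]
summary = select_sentences(ranked, [2, 1], synonyms)
```

In this toy input the second sentence of the first subject is dropped because, after synonym mapping, it is identical to the first, so the next-ranked sentence fills the slot instead.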
Date of Conference: 06-08 August 2008
Date Added to IEEE Xplore: 03 September 2008
Print ISBN: 978-0-7695-3263-9
Conference Location: Phuket, Thailand
