Loading [MathJax]/extensions/MathMenu.js
Identifying Author in Bengali Literature by Bi-LSTM with Attention Mechanism | IEEE Conference Publication | IEEE Xplore

Identifying Author in Bengali Literature by Bi-LSTM with Attention Mechanism


Abstract:

Authorship Attribution is the task of determining the author of an unknown text using one’s writing patterns. It is a well-established task for high-resource languages li...Show More

Abstract:

Authorship Attribution is the task of determining the author of an unknown text using one’s writing patterns. It is a well-established task for high-resource languages like English, but it is challenging for low-resource languages like Bengali. In this paper, we propose a Bi-directional Long Short Term Memory(Bi-LSTM) model with self-attention mechanism to address this problem. GloVe embedding vectors encode the semantic and syntactic knowledge of words, which are then fed into the Bi-LSTM models. Moreover, attention mechanism enhances the model’s ability to learn the complex linguistics patterns through learnable parameters, which gives lower weights to common words and higher weights to keywords that capture an author’s stylistic components. It improves performance extract contextual features. We evaluate our model on multiple datasets and experiment with various architectures. Our proposed model outperforms the state-of-the-art model by 12.14%-20.24% in the BAAD6 author dataset, 1.05% - 7.34% in the BAAD16 author dataset, with best performance accuracy of 97.99%. The experimental results demonstrate that the Bi-LSTM model’s attention mechanism notably boosts performance. (The source code are shared as free tools at https://github.com/IbrahimAlAzhar/AuthorshipAttribution)
Date of Conference: 18-20 December 2021
Date Added to IEEE Xplore: 28 January 2022
ISBN Information:
Conference Location: Dhaka, Bangladesh

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.