Loading [MathJax]/extensions/MathMenu.js
Zero-Shot Hate to Non-Hate Text Conversion Using Lexical Constraints | IEEE Journals & Magazine | IEEE Xplore

Zero-Shot Hate to Non-Hate Text Conversion Using Lexical Constraints


Abstract:

Systems meant for tackling hate speech have been increasing in demand with the rapid growth of social media platforms. One way of controlling hate speech in texts is to t...Show More

Abstract:

Systems meant for tackling hate speech have been increasing in demand with the rapid growth of social media platforms. One way of controlling hate speech in texts is to transform the text into its non-hate version while preserving the rest of the contents. Without the use of parallel data, unsupervised back-translation-based text style transfer is a common method of tackling such problems. In this article, we propose a zero-shot style-transfer technique that does effective unsupervised hate to non-hate conversion without using any hate domain text for training. While decoding the outputs produced by the system, we define an additional step of introducing lexical constraints, for better preservation of contents. Detailed empirical evaluation shows that the zero-shot method outperforms classical unsupervised style-transfer methods while at the same time reducing the data required while training.
Published in: IEEE Transactions on Computational Social Systems ( Volume: 10, Issue: 5, October 2023)
Page(s): 2479 - 2488
Date of Publication: 24 May 2022

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.