Abstract:
Considering the colossal amount of user-generated content on social media, it has become increasingly difficult to monitor hateful content being published on public onlin...Show MoreMetadata
Abstract:
Considering the colossal amount of user-generated content on social media, it has become increasingly difficult to monitor hateful content being published on public online spaces, especially during the electioneering periods, particularly in Kenya. In this regard, it is crucial to automate the identification of hate speech in order to manage the volume, variety, veracity and velocity of this content. In this research, we postulate a supervised machine learning approach whereby annotation of the training data set is critical in determining the performance of the trained classifier. Therefore, we develop an annotation framework based on Sternberg's (2003) hate theory and test its performance in classifying about 5k tweets using 3 human annotators per tweet. Preliminary results indicate an intercoder reliability score of 0.5027 based on Krippendorff's alpha.
Published in: 2019 IST-Africa Week Conference (IST-Africa)
Date of Conference: 08-10 May 2019
Date Added to IEEE Xplore: 18 July 2019
ISBN Information:
ISSN Information:
References is not available for this document.
Select All
1.
Silva, L. A. ; Mondal, M. ; Correa, D. ; Benevenuto, F. ; and Weber, I. 2016. Analyzing the targets of hate in online social media. In ICWSM, 687–690.
2.
Burnap, P., and Williams, M. L. 2015. Cyber hate speech on twitter: An application of machine classification and statistical modelling for policy and decision making. Policy ( 2 ): 223–242.
3.
Burnap, P., and Williams, M. L. 2016. Us and them: identifying cyber hate on twitter across multiple protected characteristics. EPJ Data Science, 5 ( 1 ): 11.
4.
Sternberg, R. J. 2003. A duplex theory of hate: Development and application to terrorism, massacres, and genocide. Review of General Psychology. 7 ( 3 ), 299–328.
5.
T. Davidson, D. Warmly, M. Macy and I. Weber, “Automated Hate Speech Detection and the Problem of Offensive Language ”, in International AAAI Conference on Web and Social Media, 2017.
6.
S. Liu and T. Forss. 2015. “New classification models for detecting Hate and Violence web content,” 2015 7th IC3K, Lisbon, 2015, pp. 487–495.
7.
C. Nobata, J. Tetreault, A. Thomas, Y. Mehdad and Y. Chang. 2016. “Abusive Language Detection in Online User Content ”, in Proc. of 25th IC3W.
8.
Z. Waseem and D. Hovy, “Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter ”, Proceedings of the NAACL Student Research Workshop, 2016.
9.
Krippendorff, K. ( 2011 ). Computing Krippendorff’s Alpha-Reliability. Retrieved from https://bit.ly/2CdWcoV on 1 / 8 / 2019.
10.
William Warner and Julia Hirschberg. 2012. Detecting hate speech on the world wide web. In Proc. of the 2 nd Workshop LSM ACL pg. 19–26.
11.
Z. Waseem. 2016. Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter. EMNLP Workshop on NLP and CSS, pg 138–142.
12.
Schmidt, Anna Wiegand, Michael. 2017. A Survey on Hate Speech Detection using Natural Language Processing. 1–10. 10.18653/v1/W17-1101.
13.
Coupland, Nikolas ( 2010 ), “Other ” representation, Society and Language Use 7, pp. 241–260, John Benjamins Publishing.
14.
UMATI ( 2013 ). Final Report. https://bit.ly/2rc6t0D Retrieved November 3rd, 2018.
15.
Warner, W., and Hirschberg, J. 2012. Detecting hate speech on the world wide web. In Proc. of LSM.
16.
Gitari, N. D. ; Zuping, Z. ; Damien, H. ; and Long, J. 2015. A lexicon-based approach for hate speech detection. International Journal of Multimedia and Ubiquitous Engineering 10 : 215–230.
17.
Dinakar K, Jones B, Havasi C, Lieberman H, Picard R ( 2012 ) Common sense reasoning for detection, prevention, and mitigation of cyberbullying. ACM Trans Interact Intell Syst 2 ( 3 ): 18.
18.
Chen Y, Zhou Y, Zhu S, Xu H ( 2012 ) Detecting offensive language in social media to protect adolescent online safety. In: Proceedings of the fourth ASE/IEEE international conference on social computing (SocialCom 2012), September 3-6, Amsterdam.
19.
Ellen Spertus. 1997. Smokey: Automatic recognition of hostile Messages. IAAI-97 Proceedings.
20.
Klaus Krippendorff ( 2004 ). Content Analysis, an Introduction to Its Methodology, 2nd Edition. Thousand Oaks, CA : Sage Publications–pages 211–256.
21.
Haslam, N. ( 2006 ) ‘ Dehumanization: An integrative review ’, Personality and Social Psychology Review, 10, 252–64.
22.
Kwok, I., and Wang, Y. 2013. Locate the hate: Detecting tweets against blacks. In AAAI.