Artificial IntelligenceNo Access

Hate and Aggression Analysis in NLP with Explainable AI

Shatakshi Raman

Department of Computer Science & Engineering, Bharati Vidyapeeth’s College of Engineering, New Delhi 110063, India

E-mail Address: raman.shat1612@gmail.com

Search for more papers by this author

Vedika Gupta

Jindal Global Business School, O. P. Jindal Global University, Sonipat, Haryana 131001, India

E-mail Address: vgupta2@jgu.edu.in

Search for more papers by this author

Preeti Nagrath

Department of Computer Science & Engineering, Bharati Vidyapeeth’s College of Engineering, New Delhi 110063, India

E-mail Address: preeti.nagrath@bharatividyapeeth.edu

Search for more papers by this author

, and

KC Santosh

https://orcid.org/0000-0003-4176-0236

Applied AI Research Lab, Computer Science Department, University of South Dakota, Vermillion, SD 57069, USA

E-mail Address: santosh.kc@ieee.org

Corresponding author.

Search for more papers by this author

https://doi.org/10.1142/S0218001422590364Cited by:8 (Source: Crossref)

Abstract

Social platforms such as Twitter and Facebook have now become only media to express their thoughts, and due to lack of censorship, it often embellishes themselves as an abode for hate towards minorities. People of color, Asian people, Muslims, women, transgenders, and LGBTQ+ communities are often the target of such online hate and aggression. Though several companies have incorporated considerable algorithms on their platforms, nevertheless due to being rather hard to often detect such comments still make it to the platforms, creating a negative space towards targeted people. This research involves the study and comparison of different hate and aggression detection algorithms with intent on two languages, i.e. English and German including machine learning models (linear SVC, logistic regression, multinomial naive Bayes and random forests) with their variations with feature engineering and bag of words and deep learning (CNN-GRU static, TCN static, Seq2Seq) with their variations vis-à-vis Word2Vec embedding. CNN+GRU static + Word2Vec embedding has outperformed all the other techniques with an accuracy of 68.29%.

The given study involves racial slurs, aggravated and use of harmful words targeted especially towards women and people of color. However, given the nature of the study they cannot be overlooked. The paper is solely for research purposes and does not support hate and aggressive speech in any manner towards anyone.

Keywords:

References

1. S. Abro, S. Shaikh, Z. H. Khand, A. Zafar, S. Khan and G. Mujtaba , Automatic hate speech detection using machine learning: A comparative study, Int. J. Adv. Comput. Sci. Appl. 11(8) (2020) 484–491. Google Scholar
2. S. Agarwal and A. Sureka , Using KNN and SVM based one-class classifier for detecting online radicalization on Twitter, in Distributed Computing and Internet Technology, ICDCIT 2015, Lecture Notes in Computer Science, Vol 8956 (Springer, Cham, 2015). Crossref, Google Scholar
3. S. Agarwal and A. Sureka, Characterizing linguistic attributes for automatic classification of intent based racist/radicalized posts on tumblr micro-blogging website (2017). Google Scholar
4. D. Chatzakou, N. Kourtellis, J. Blackburn, E. De Cristofaro, G. Stringhini and A. Vakali , Measuring# GamerGate: A tale of hate, sexism, and bullying, Proc. 26th Int. Conf. World Wide Web Companion (International World Wide Web Conferences Steering Committee, 2017), pp. 1285–1290. Crossref, Google Scholar
5. M. Dadvar, D. Trieschnigg, R. Ordelman and F. D. Jong , Improving cyberbullying detection with user context, European Conf. Information Retrieval (Springer, Berlin, 2013), pp. 693–696. Crossref, Google Scholar
6. C. Dalal, S. Tandon and A. Mukerjee , Insult detection in Hindi, Tech. Rep. Artif. Intell. 18(1) (2014) 1–8. Google Scholar
7. B. Gambäck and U. K. Sikdar , Using convolutional neural networks to classify hate-speech, Proc. First Workshop on Abusive Languages (Association for Computational Linguistics, Vancouver, 2017), pp. 85–90. Crossref, Google Scholar
8. R. Gomez et al., Exploring hate speech detection in multimodal publications, 2020 IEEE Winter Conf. Applications of Computer Vision (WACV), Snowmass, CO, 1–5 March 2020, pp. 1459–1467. Crossref, Google Scholar
9. E. Greevy, Automatic text categorisation of racist webpages, PhD thesis, Dublin City University (2004). Google Scholar
10. E. Greevy and A. F. Smeaton , Classifying racist texts using a support vector machine, SIGIR ’04: Proc. 27th Annual Int. ACM SIGIR Conf. Research and Development in Information Retrieval (2004), pp. 468–469. Crossref, Google Scholar
11. V. Gupta, P. Dass, V. Bansal and R. Arora , A truncated deep neural network for identifying age groups in real time images, J. Interdiscipl. Math. 25(3) (2022) 851–861. Crossref, Web of Science, Google Scholar
12. V. Gupta, N. Jain, S. Shubham, A. Madan, A. Chaudhary and Q. Xin , Toward integrated CNN-based sentiment analysis of tweets for scarce-resource language—Hindi, Trans. Asian Low-Resour. Lang. Inf. Process. 20(5) (2021) 1–23. Crossref, Web of Science, Google Scholar
13. V. Gupta, S. Juyal and Y. C. Hu , Understanding human emotions through speech spectrograms using deep neural network, J. Supercomput. 78(5) (2022) 6944–6973. Crossref, Web of Science, Google Scholar
14. V. Gupta, N. Jain, P. Katariya, A. Kumar, S. Mohan, A. Ahmadian and M. Ferrara , An emotion care model using multimodal textual analysis on COVID-19, Chaos, Solitons Fractals 144 (2021) 110708. Crossref, Web of Science, Google Scholar
15. G. B. Herwanto, A. M. Ningtyas, K. E. Nugraha and I. N. P. Trisna , Hate speech and abusive language classification using fastText, 2019 Int. Seminar on Research of Information Technology and Intelligent Systems (ISRITI) (IEEE, 2019), pp. 69–72. Crossref, Google Scholar
16. W. Khan, A. Daud, J. A. Nasir and T. Amjad , A survey on the state-of-the-art machine learning models in the context of NLP, Kuwait J. Sci. 43 (2016) 95–113. Web of Science, Google Scholar
17. S. Kumar, F. Spezzano and V. Subrahmanian , Accurately detecting trolls in slashdot zoo via decluttering, Proc. 2014 IEEE/ACM Int. Conf. Advances in Social Networks Analysis and Mining (ASONAM 2014), Beijing, China, 17–20 August 2014, pp. 188–195. Crossref, Google Scholar
18. R. Kumar, A. K. Ojha, S. Malmasi and M. Zampieri , Benchmarking aggression identification in social media, Proc. First Workshop on Trolling, Aggression and Cyberbullying (Association for Computational Linguistics, Santa Fe, New Mexico, USA, 2018), pp. 1–11. Google Scholar
19. I. Kwok and Y. Wang , Locate the hate: Detecting tweets against blacks, AAAI 27(1) (2013) 1621–1622, https://doi.org/10.1609/aaai.v27i1.8539. Crossref, Google Scholar
20. S. Malmasi and M. Zampieri , Detecting hate speech in social media, Proc. Int. Conf. Recent Advances in Natural Language Processing, RANLP (INCOMA, Varna, 2017), pp. 467–472. Crossref, Google Scholar
21. M. Märtens, S. Shen, A. Iosup and F. Kuipers , Toxicity detection in multiplayer online games, Proc. 2015 Int. Workshop on Network and Systems Support For Games (IEEE Press, 2015), Article 5, 1–6. Crossref, Google Scholar
22. A. Marantz , Antisocial: Online Extremists, Techno-Utopians, and the Hijacking of the American Conversation (Penguin Books, 2020). Google Scholar
23. F. Menczer, R. Fulper, G. L. Ciampaglia, E. Ferrara, Y. Ahn, A. Flammini, B. Lewis and K. Rowe , Misogynistic language on Twitter and sexual violence, Proc. ACM Web Science Workshop on Computational Approaches to Social Modeling (ChASM), https://doi.org/10.6084/m9.figshare.1291081.v1 (2015). Google Scholar
24. T. Mihaylov, G. D. Georgiev, A. Ontotext and P. Nakov , Finding opinion manipulation trolls in news community forums, Proc. Nineteenth Conf. Computational Natural Language Learning (Association for Computational Linguistics, Beijing, 2015), pp. 310–314. Crossref, Google Scholar
25. L. G. M. de la Vega and V. Ng , Modeling trolling in social media conversations, in Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (European Language Resources Association, Miyazaki, 2018). Google Scholar
26. S. Nikiforos et al., Bullying in virtual learning communities, Adv. Exp. Med. Biol. 989 (2017) 211–216. Crossref, Web of Science, Google Scholar
27. A. Onan , Biomedical text categorization based on ensemble pruning and optimized topic modelling, Comput. Math. Methods Med. 2018 (2018) 2497471. Crossref, Web of Science, Google Scholar
28. A. Onan , Mining opinions from instructor evaluation reviews: A deep learning approach, Comput. Appl. Eng. Edu. 28 (2020) 117–138. Crossref, Web of Science, Google Scholar
29. A. Onan , Consensus clustering-based undersampling approach to imbalanced learning, Sci. Program. 2019 (2019) 5901087:1–5901087:14. Web of Science, Google Scholar
30. A. Onan, S. Korukoglu and H. Bulut , Ensemble of keyword extraction methods and classifiers in text classification, Expert Syst. Appl. 57 (2016) 232–247. Crossref, Web of Science, Google Scholar
31. A. Onan and S. Korukoglu , A feature selection model based on genetic rank aggregation for text sentiment classification, J. Inf. Sci. 43 (2017) 25–38. Crossref, Web of Science, Google Scholar
32. A. Onan, S. Korukoglu and H. Bulut , A hybrid ensemble pruning approach based on consensus clustering and multi-objective evolutionary algorithm for sentiment classification, Inf. Process. Manag. 53 (2017) 814–833. Crossref, Web of Science, Google Scholar
33. A. Onan , Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks, Concurrency Comput. Pract. Exp. 33 (2020) e5909. Crossref, Web of Science, Google Scholar
34. A. Onan , Sentiment analysis on massive open online course evaluations: A text mining and deep learning approach, Comput. Appl. Eng. Edu. 29 (2021) 572–589. Crossref, Web of Science, Google Scholar
35. A. Onan , An ensemble scheme based on language function analysis and feature engineering for text genre classification, J. Inf. Sci. 44 (2018) 28–47. Crossref, Web of Science, Google Scholar
36. A. Onan , Topic-enriched word embeddings for sarcasm identification, in Advances in Intelligent Systems and Computing, Vol. 984 (Springer, Cham, 2019), pp. 293–304. Google Scholar
37. M. T. Ribeiro, S. Singh, and C. Guestrin , “Why should I trust you?”: Explaining the predictions of any classifier, Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (Association for Computing Machinery, New York, NY, USA, 2016), pp. 1135–1144. Google Scholar
38. V. Rozova, K. Witt, J. Robinson, Y. Li and K. Verspoor , Detection of self-harm and suicidal ideation in emergency department triage notes, J. Am. Med. Inf. Assoc. 29(3) (2022) 472–480. Crossref, Web of Science, Google Scholar
39. S. Sax, Flame wars: Automatic insult detection, Technical report, Stanford University (2016). Google Scholar
40. J. Salminen, H. Almerekhi, M. Milenković, S. G. Jung, J. An, H. Kwak and B. J. Jansen , Anatomy of online hate: Developing a taxonomy and machine learning models for identifying and classifying hate in online news media, Proc. Twelfth Int. AAAI Conf. Web and Social Media, Vol. 12 (2018), pp. 330–339. Crossref, Google Scholar
41. J. Salminen, M. Hopf, S. A. Chowdhury, S. G. Jung, H. Almerekhi and B. J. Jansen , Developing an online hate classifier for multiple social media platforms, Hum-Centric Comput. Inf. Sci. 10(1) (2020) 1–34. Crossref, Web of Science, Google Scholar
42. N. S. Samghabadi, S. Maharjan, A. Sprague, R. Diaz-Sprague and T. Solorio , Detecting nastiness in social media, Proc. First Workshop on Abusive Language Online (Association for Computational Linguistics, Vancouver, BC, Canada, 2017), pp. 63–72. Crossref, Google Scholar
43. A. Schmidt and M. Wiegand , A survey on hate speech detection using natural language processing, Proc. Fifth Int. Workshop on Natural Language Processing for Social Media (Association for Computational Linguistics, Valencia, Spain, 2017). Crossref, Google Scholar
44. J. Schuurmans et al., Intent classification for dialogue utterances, IEEE Intell. Syst. 35 (2020) 82–88. Crossref, Web of Science, Google Scholar
45. S. O. Sood et al., Automatic identification of personal insults on social news sites, J. Am. Soc. Inform. Sci. Technol. 63 (2012) 270–285. Crossref, Web of Science, Google Scholar
46. Q. Sun and C. Shen , Who would respond to A troll? A social network analysis of reactions to trolls in online communities, Comput. Hum. Behav. 121 (2021) 106786. Crossref, Web of Science, Google Scholar
47. J. van Doorn , Anger, feelings of revenge, and hate, Emotion Rev. 10(4) (2018) 321–322. Crossref, Web of Science, Google Scholar
48. S. Wachs, M. F. Wright and A. T. Vazsonyi , Understanding the overlap between cyberbullying and cyberhate perpetration: Moderating effects of toxic online disinhibition, Criminal Behav. Mental Health 29(3) (2019) 179–188. Crossref, Web of Science, Google Scholar
49. Z. Waseem , Are you a racist or am I seeing things? Annotator influence on hate speech detection on twitter, Proc. First Workshop on NLP and Computational Social Science (Association for Computational Linguistics, Austin, Texas, 2016), pp. 138–142. Crossref, Google Scholar
50. Z. Waseem and D. Hovy , Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter, Proc. NAACL-HLT 2016 (Association for Computational Linguistics, San Diego, California, 2016), pp. 88–93. Crossref, Google Scholar
51. M. Wiegand, M. Siegel and I. Ruppenhofer , Overview of the GermEval 2018 shared task on the identification of offensive language, Proc. GermEval, Vienna, Austria, 21 September 2018, pp. 1–10. Google Scholar
52. J. M. Xu, K. S. Jun, X. Zhu and A. Bellmore , Learning from bullying traces in social media, Proc. 2012 Conf. North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Association for Computational Linguistics, Montréal, Canada, 2012), pp. 656–666. Google Scholar
53. Z. Yi, S. Li, J. Ma, J. Yu, Y. Tan and Q. Wu , Towards an efficient and robust adversarial attack against neural text classifier, Int. J. Pattern Recognit. Artif. Intell. 36 (2022) 2253007. Link, Web of Science, Google Scholar
54. H. Ye, W. Zhang and M. Nie , An improved semi-supervised variational autoencoder with gate mechanism for text classification, Int. J. Pattern Recognit. Artif. Intell. 26 (2022) 2253006. Link, Google Scholar