Research Paper

SSTSA: A Self-Supervised Topic Sentiment Analysis Using Semantic Similarity Measures and Transformers

Azam Seilsepour

Department of Computer Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran

E-mail Address: aza.seilsepour.eng@iauctb.ac.ir

Reza Ravanmehr

https://orcid.org/0000-0001-9605-5839

Department of Computer Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran

E-mail Address: r.ravanmehr@iauctb.ac.ir

Corresponding author.

Ramin Nassiri

Department of Computer Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran

E-mail Address: r_nasiri@iauctb.ac.ir

https://doi.org/10.1142/S0219622023500736

Abstract

The exponentially growing volume of data that the public generates on social media platforms is a valuable source of information that can be used to discover topics and analyze comments. Some researchers have extended the Latent Dirichlet Allocation (LDA) method with a sentiment layer to find topics and their related sentiments simultaneously. However, most of these approaches do not achieve satisfactory accuracy in Topic Sentiment Analysis (TSA), particularly when training data is insufficient or the texts are complex, ambiguous, and short. This paper proposes SSTSA, a novel self-supervised approach to TSA that extracts the hidden topics and analyzes the overall sentiment related to each topic. SSTSA introduces a new method called Pseudo-label Generator. It first employs semantic similarity and Word Mover’s Distance (WMD) measures, and then uses a document embedding technique to semantically estimate the sentiment orientation of samples and generate pseudo-labels (positive or negative). Afterward, a hybrid classifier composed of a pre-trained Robustly Optimized BERT (RoBERTa) model and a Long Short-Term Memory (LSTM) model is trained to predict the sentiment of unseen data. Evaluation results on datasets from various domains demonstrate that SSTSA outperforms similar unsupervised and self-supervised methods.
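
The idea of generating pseudo-labels from a distance measure can be illustrated with a small sketch. This is not the authors' Pseudo-label Generator: the abstract does not specify its details, so everything below is an assumption made for illustration. The two-dimensional word vectors, the seed word lists, and the function names `relaxed_wmd` and `pseudo_label` are hypothetical, and the code uses the relaxed lower bound of WMD (each word moves to its nearest counterpart) with uniform word weights rather than the exact transport problem.

```python
import math

# Hypothetical 2-D word vectors, hand-picked for illustration only.
# A real system would load pre-trained embeddings instead.
TOY_VECTORS = {
    "good": (1.0, 0.1),
    "great": (0.9, 0.2),
    "love": (0.8, 0.3),
    "bad": (-1.0, 0.1),
    "awful": (-0.9, 0.2),
    "hate": (-0.8, 0.3),
    "battery": (0.0, 1.0),
    "screen": (0.1, 0.9),
}

# Assumed seed lexicons standing in for "positive" and "negative" references.
POSITIVE_SEEDS = ["good", "great", "love"]
NEGATIVE_SEEDS = ["bad", "awful", "hate"]


def _one_way_cost(source, target):
    """Average distance from each source word to its nearest target word."""
    return sum(
        min(math.dist(TOY_VECTORS[s], TOY_VECTORS[t]) for t in target)
        for s in source
    ) / len(source)


def relaxed_wmd(doc_a, doc_b):
    """Relaxed WMD: the larger of the two one-way nearest-neighbour costs."""
    return max(_one_way_cost(doc_a, doc_b), _one_way_cost(doc_b, doc_a))


def pseudo_label(tokens):
    """Return 'positive' or 'negative'; None if no token is known or on a tie."""
    known = [t for t in tokens if t in TOY_VECTORS]
    if not known:
        return None
    d_pos = relaxed_wmd(known, POSITIVE_SEEDS)
    d_neg = relaxed_wmd(known, NEGATIVE_SEEDS)
    if d_pos == d_neg:
        return None
    return "positive" if d_pos < d_neg else "negative"


if __name__ == "__main__":
    for text in (["great", "battery"], ["awful", "screen"], ["unknown"]):
        print(text, "->", pseudo_label(text))
```

In a pipeline of the kind the abstract describes, labels produced this way would serve only as noisy training targets for a downstream classifier; returning `None` for unknown or tied samples is a design choice of this sketch that leaves such samples unlabeled.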
