No Access

Combining Sentiment Analysis with Socialization Bias in Social Networks for Stock Market Trend Prediction

Faculty of Information Technology, King Mongku’t University of Technology North Bangkok, 1518 Pibulsongkram Road, Bangsue, Bangkok, Thailand

E-mail Address: kmlj171@163.com

Search for more papers by this author

and

Phayung Meesad

Faculty of Information Technology, King Mongku’t University of Technology North Bangkok, 1518 Pibulsongkram Road, Bangsue, Bangkok, Thailand

E-mail Address: pym@kmutnb.ac.th

Search for more papers by this author

https://doi.org/10.1142/S1469026816500036Cited by:14 (Source: Crossref)

Abstract

According to the indirect relationship between information and stock trend, information such as comments and tweets can be used for stock trend prediction. When conducting classification on text data, feature sparse issues occur during conversion between tweets and word vectors. Another problem is that the unreliability of average sentiment scores to indicate one day’s sentiment. This is especially caused by the unbalanced number between positive and negative within one day, thus a large bias between sentiment and stock trend arises. In addion, information has social attributes when created and diffused in social networks, bias containing people’s belief in social networks also have become socialization bias. In order to solve those problems, this work proposes a sentiment analysis based prediction model and an inverse bias algorithm. Instead of applying sentiment analysis to add sentiment related features, this work uses SentiWordNet to give an additional weight to the selected features, and applies two kinds of sentiment analysis to inverse the socialization bias. Aiming at labeling the tweets to sentiment related groups to help find socialization bias, this work also proposes an extended wordlist based on a semi-supervised Naïve Bayes classification algorithm. After finishing the inverse socialization bias, stock trends were used to label example sets. Different classification algorithms were compared in this work. The proposed model with SVM linear algorithm proves to yield accuracy of 90.33% at its best performance.

Keywords:

Remember to check out the Most Cited Articles!
Check out these titles in artificial intelligence!

References

1. J. Bollen, H. Mao and X. Zeng, Twitter mood predicts the stock market, J. Comput. Sci. 2 (1) (2011) 1–8. Crossref, Google Scholar
2. J. Zhang, Y. Kawai, S. Nakajima, Y. Matsumoto and K. Tanaka, Sentiment Bias Detection in Support of News Credibility Judgment, 2011 44th Hawaii International Conference on System Sciences (HICSS) (2011) pp. 1–10. Google Scholar
3. C.-H. L. Lee, A. Liu and W.-S. Chen, Pattern discovery of fuzzy time series for financial prediction, IEEE Trans. Knowl. Data Eng. 18 (5) (2006) 613–625. Crossref, Google Scholar
4. T. Kimoto, K. Asakawa, M. Yoda and M. Takeoka, Stock market prediction system with modular neural networks, 1990 IJCNN International Joint Conference on Neural Networks 1 (1990) 1–6. Crossref, Google Scholar
5. K. Kim, Financial time series forecasting using support vector machines, Neurocomputing 55 (1–2) (2003) 307–319. Crossref, Google Scholar
6. M. Hagenau, M. Liebmann, M. Hedwig and D. Neumann, Automated news reading: Stock price prediction based on financial news using context-specific features, 2012 45th Hawaii International Conference on System Science (HICSS) (2012) pp. 1040–1049. Google Scholar
7. M. Makrehchi, S. Shah and W. Liao, Stock Prediction Using Event-Based Sentiment Analysis, 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) (2013) pp. 337–342. Google Scholar
8. K. Kim and J. Lee, Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction, Pattern Recognit (2013) 758–768. Google Scholar
9. A. Pak and P. Paroubek, Twitter for Sentiment Analysis: When Language Resources are Not Available, 2011 22nd International Workshop on Database and Expert Systems Applications (DEXA) (2011) pp. 111–115. Google Scholar
10. K. Jedrzejewski and M. Morzy, Opinion mining and social networks: A promising match, 2011 International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (2011) pp. 599–604. Google Scholar
11. M. Yassine and H. Hajj, A Framework for emotion mining from text in online social networks, 2010 IEEE International Conference on Data Mining Workshops (ICDMW) (2010) pp. 1136–1142. Google Scholar
12. M. I. Kaya and M. E. Karsligil, Stock price prediction using financial news articles, 2010 2nd IEEE International Conference on Information and Financial Engineering (ICIFE) (2010) pp. 478–482. Google Scholar
13. K. Zhang, L. Li, P. Li and W. Teng, Stock trend forecasting method based on sentiment analysis and system similarity model, 2011 6th International Forum on Strategic Technology (IFOST) (2011) pp. 890–894. Google Scholar
14. A. A. Bhat and S. S. Kamath, Automated stock price prediction and trading framework for Nifty intraday trading, DeepDyve (2013). Google Scholar
15. Y. Chen, C. Wang and X. Liang, How external factors influence stock market: A model based SVM, 2010 IEEE International Conference on Service Operations and Logistics and Informatics (SOLI) (2010) pp. 325–329. Google Scholar
16. S. S. Kamath, A. Bagalkotkar, A. Kandelwal, S. Pandey and K. Poornima, Sentiment Analysis Based Approaches for Understanding User Context in Web Content, 2013 International Conference on Communication Systems and Network Technologies (CSNT) (2013) 607–611. Google Scholar
17. V. N. Vapnik, The Nature of Statistical Learning Theory. (New York, NY, USA: Springer-Verlag New York Inc., 1995). Crossref, Google Scholar
18. G. Li and F. Liu, A clustering-based approach on sentiment analysis, 2010 International Conference on Intelligent Systems and Knowledge Engineering (ISKE) (2010) pp. 331–337. Google Scholar
19. K. Denecke, Using SentiWordNet for multilingual sentiment analysis, IEEE 24th International Conference on Data Engineering Workshop, 2008. ICDEW (2008) pp. 507–512. Google Scholar
20. S. Blair-goldensohn, T. Neylon, K. Hannan, G. A. Reis, R. Mcdonald and J. Reynar, Building a sentiment summarizer for local service reviews, In NLP in the Information Explosion Era (2008). Google Scholar