Journal of Big Data (Oct 2024)
Siamese Graph Convolutional Split-Attention Network with NLP based Social Sentimental Data for enhanced stock price predictions
Abstract
Abstract Predicting stock market behavior using sentiment analysis has become increasingly popular, as customer responses on platforms like Twitter can influence market trends. However, most existing sentiment-based models struggle with two major issues: inaccuracy and high complexity. These problems lead to frequent prediction errors and make the models difficult to implement in real-time trading systems. To address these challenges, this paper proposes a new method called Siagra-ConSA-HSOA (Siamese Graph Convolutional Split-Attention Network with NLP-based Social Sentiment Data). Two data sources feed the model: specifically, NIFTY-50 Stock Market and real-time Twitter sentiment. Through Natural Language Processing (NLP), the raw data is pre-processed and key features are extracted before they are fused into a unified dataset using a cross-domain transformer, namely CDSFT, and then Circle-Inspired Optimization Algorithm (CIOA) selects the most important features from this dataset. This decreases the complexity of the model without losing essential information. Finally, a Graph Convolutional Split-Attention Network (SGCSAN) for promisingly predicting whether the stock prices are going to hit the ground and fly high again or is going to nosedive with Humboldt Squid Optimization Algorithm (HSOA) is introduced to further improve accuracy with lesser error generation. The proposed model Siagra-ConSA-HSOA achieved 99.9% accuracy and 99.8% recall in the testing stage, meaning that such a model performs better than the current approaches both in prediction accuracy and efficiency. Thus, this is a glimmer that the model shall be able to overcome some of the main problems with the current techniques used in predicting the behavior of the stock market. GitHub Repository: https://github.com/jramans2/Siamese-GCN-SplitAttention-Stock-Prediction.git
Keywords