Genetic Algorithm for Feature Selection Applied to Financial Time Series Monotonicity Prediction: Experimental Cases in Cryptocurrencies and Brazilian Assets
Rodrigo Colnago Contreras,
Vitor Trevelin Xavier da Silva,
Igor Trevelin Xavier da Silva,
Monique Simplicio Viana,
Francisco Lledo dos Santos,
Rodrigo Bruno Zanin,
Erico Fernandes Oliveira Martins,
Rodrigo Capobianco Guido
Affiliations
Rodrigo Colnago Contreras
Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto 15054-000, SP, Brazil
Vitor Trevelin Xavier da Silva
Department of Applied Mathematics and Statistics, Institute of Mathematical and Computer Sciences, University of São Paulo, São Carlos 13566-590, SP, Brazil
Igor Trevelin Xavier da Silva
Department of Applied Mathematics and Statistics, Institute of Mathematical and Computer Sciences, University of São Paulo, São Carlos 13566-590, SP, Brazil
Monique Simplicio Viana
Department of Computing, Federal University of São Carlos, São Carlos 13565-905, SP, Brazil
Francisco Lledo dos Santos
Faculty of Architecture and Engineering, Mato Grosso State University, Cáceres 78217-900, MT, Brazil
Rodrigo Bruno Zanin
Faculty of Architecture and Engineering, Mato Grosso State University, Cáceres 78217-900, MT, Brazil
Erico Fernandes Oliveira Martins
Faculty of Architecture and Engineering, Mato Grosso State University, Cáceres 78217-900, MT, Brazil
Rodrigo Capobianco Guido
Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto 15054-000, SP, Brazil
Since financial assets on stock exchanges were created, investors have sought to predict their future values. Currently, cryptocurrencies are also seen as assets. Machine learning is increasingly adopted to assist and automate investments. The main objective of this paper is to make daily predictions about the movement direction of financial time series through classification models, financial time series preprocessing methods, and feature selection with genetic algorithms. The target time series are Bitcoin, Ibovespa, and Vale. The methodology of this paper includes the following steps: collecting time series of financial assets; data preprocessing; feature selection with genetic algorithms; and the training and testing of machine learning models. The results were obtained by evaluating the models with the area under the ROC curve metric. For the best prediction models for Bitcoin, Ibovespa, and Vale, values of 0.61, 0.62, and 0.58 were obtained, respectively. In conclusion, the feature selection allowed the improvement of performance in most models, and the input series in the form of percentage variation obtained a good performance, although it was composed of fewer attributes in relation to the other sets tested.