Mathematics (Apr 2025)

Text Mining arXiv: A Look Through Quantitative Finance Papers

  • Michele Leonardo Bianchi

DOI
https://doi.org/10.3390/math13091375
Journal volume & issue
Vol. 13, no. 9
p. 1375

Abstract

Read online

This paper explores articles hosted on the arXiv preprint server with the aim of uncovering valuable insights hidden in this vast collection of research. Employing text mining techniques and through the application of natural language processing methods, I xamine the contents of quantitative finance papers posted in arXiv from 1997 to 2022. I extract and analyze, without relying on ad hoc software or proprietary databases, crucial information from the entire documents, including the references, to understand the topic trends over time and to find out the most cited researchers and journals in this domain. Additionally, I compare numerous algorithms for performing topic modeling, including state-of-the-art approaches.

Keywords