University of Sindh Journal of Information and Communication Technology (Sep 2022)

Towards the Optimal Use of Machine Learning Algorithms in Text Mining: A Quick Review

  • Syed Zafar Ali Shah,
  • Sadaqat Jan,
  • Ibrar Ali Shah

Journal volume & issue
Vol. 6, no. 3
pp. 89 – 94

Abstract

This paper provides a quick review to jump-start research in text mining, a field in which Machine Learning (ML) algorithms have been used and several accomplishments have been reported by the research community. Text mining comprises different categories, and the literature supports the use of ML algorithms and techniques in them as giving promising results. However, in this area of study, most of the research time and effort is consumed in the initial stages, where implementations and experiments are carried out to evaluate various combinations. Accomplishments in this field can be advanced further by presenting early investigations concisely and analytically. The benefits of this paper are therefore threefold: first, it provides a platform for new researchers to start quickly, with a shorter literature review and more precise knowledge of the combinations of text mining and ML; second, it presents a clear analysis of the text mining categories in which ML algorithms have been reported to perform successfully; and third, it identifies the problems for which the algorithms were used in various studies. This will enable new researchers to target a problem directly instead of implementing the existing techniques. With the help of well-structured questions, the results are more analytical and present multidimensional views of this research issue. The main findings are that ML has been widely used in document classification and that Support Vector Machine (SVM) is the most successful algorithm reported.

Keywords