International Journal Bioautomation (Jun 2018)

A Novel Classification Method for Class-imbalanced Data and Its Application in microRNA Recognition

  • Xia Geng,
  • Yu-Quan Zhu,
  • Zhi Yang

DOI
https://doi.org/10.7546/ijba.2018.22.2.133-146
Journal volume & issue
Vol. 22, no. 2
pp. 133 – 146

Abstract

Read online

For non-coding RNA gene mining, especially microRNA mining, there are many challenges in the classification of imbalanced data. A novel classification method based on the Adaboost algorithm is proposed to handle the imbalance of positive and negative cases. Unstable-Adaboost is improved with respect to the initial weight assignment, the base classifier selection, the weight adjustment mechanism and other aspects. Furthermore, the Stable-Adaboost algorithm is proposed, which adjusts the weight of the sample set to rapidly achieve a more balanced training set. In addition, the Stable-Adaboost algorithm can ensure that the follow-up training set is maintained in a balanced state by optimizing the weight adjustment mechanism of incorrectly classified samples and stabilizing the classification performance. Experimental results show the superiority of Unstable-Adaboost and Stable-Adaboost in imbalance classification.

Keywords