BioData Mining (Nov 2023)

6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site

  • Guohua Huang,
  • Xiaohong Huang,
  • Wei Luo

DOI
https://doi.org/10.1186/s13040-023-00348-8
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 15

Abstract

Read online

Abstract DNA N6-adenine methylation (N6-methyladenine, 6mA) plays a key regulating role in the cellular processes. Precisely recognizing 6mA sites is of importance to further explore its biological functions. Although there are many developed computational methods for 6mA site prediction over the past decades, there is a large root left to improve. We presented a cross validation-based stacking ensemble model for 6mA site prediction, called 6mA-StackingCV. The 6mA-StackingCV is a type of meta-learning algorithm, which uses output of cross validation as input to the final classifier. The 6mA-StackingCV reached the state of the art performances in the Rosaceae independent test. Extensive tests demonstrated the stability and the flexibility of the 6mA-StackingCV. We implemented the 6mA-StackingCV as a user-friendly web application, which allows one to restrictively choose representations or learning algorithms. This application is freely available at http://www.biolscience.cn/6mA-stackingCV/ . The source code and experimental data is available at https://github.com/Xiaohong-source/6mA-stackingCV .

Keywords