Jisuanji kexue (Sep 2021)
Predicting Drug Molecular Properties Based on Ensembling Neural Networks Models
Abstract
Artificial intelligence (AI) methods have achieved great success in predicting the chemical properties and bioactivity of drug molecules in bioinformatics. Neural networks are widely applied in drug discovery. However, shallow neural networks (SNNs) give lower accuracy, while deep neural networks (DNNs) are prone to overfitting. In traditional machine learning, model ensembling is expected to further improve the predictive performance of weak learners. This work is therefore the first to apply a model ensembling strategy to predicting the properties of drug molecules. After the molecular structures are encoded, the averaging and stacking combination strategies are adopted to increase the accuracy of pKa prediction for drug molecules. Compared with the DNN, the stacking strategy gives the best predictive accuracy, with a Pearson coefficient of 0.86. Ensembling weak neural network learners can reproduce the accuracy of a DNN while retaining satisfactory generalization ability. The results show that the ensembling method can increase predictive accuracy and reliability.
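The two combination strategies named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the base-model predictions are taken as given (hypothetical toy numbers standing in for the outputs of shallow networks), the stacking meta-learner is assumed to be a plain linear regression, and all function names are illustrative.

```python
import math
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def mse(pred, target):
    """Mean squared error."""
    return mean((p - t) ** 2 for p, t in zip(pred, target))


def average_ensemble(base_preds):
    """Averaging: the ensemble output is the mean of the base predictions."""
    return [mean(col) for col in zip(*base_preds)]


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for i in range(n):
        p = max(range(i, n), key=lambda r: abs(m[r][i]))
        m[i], m[p] = m[p], m[i]
        for r in range(i + 1, n):
            f = m[r][i] / m[i][i]
            for c in range(i, n + 1):
                m[r][c] -= f * m[i][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_stacking(base_preds, y):
    """Stacking with an assumed linear meta-learner (least squares).

    Fits y ~ w0 + sum_k w_k * pred_k through the normal equations.
    """
    rows = [[1.0, *col] for col in zip(*base_preds)]
    k = len(rows[0])
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(k)]
    return solve_linear(ata, aty)


def predict_stacking(weights, base_preds):
    """Apply the fitted meta-learner to base-model predictions."""
    return [
        weights[0] + sum(w * p for w, p in zip(weights[1:], col))
        for col in zip(*base_preds)
    ]


# Hypothetical toy data: target pKa values and two base-model outputs.
y_true = [2.0, 4.0, 6.0, 8.0, 10.0]
base_preds = [
    [2.4, 3.7, 6.5, 7.6, 10.3],  # hypothetical weak learner A
    [1.2, 4.6, 5.3, 8.9, 9.4],   # hypothetical weak learner B
]

avg_pred = average_ensemble(base_preds)
weights = fit_stacking(base_preds, y_true)
stack_pred = predict_stacking(weights, base_preds)

print("averaging: MSE", mse(avg_pred, y_true), "Pearson", pearson(avg_pred, y_true))
print("stacking:  MSE", mse(stack_pred, y_true), "Pearson", pearson(stack_pred, y_true))
```

For brevity the meta-learner here is fitted and scored on the same samples, so its error is optimistic. In practice the meta-learner is usually trained on out-of-fold predictions of the base models and evaluated on held-out molecules.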
Keywords