Acta Pharmaceutica Sinica B (Nov 2019)

Predicting complexation performance between cyclodextrins and guest molecules by integrated machine learning and molecular modeling techniques

  • Qianqian Zhao,
  • Zhuyifan Ye,
  • Yan Su,
  • Defang Ouyang

Journal volume & issue
Vol. 9, no. 6
pp. 1241 – 1252

Abstract

Read online

Most pharmaceutical formulation developments are complex and ideal formulations are generally obtained after extensive experimentation. Machine learning is increasingly advancing many aspects in modern society and has achieved significant success in multiple subjects. Current research demonstrated that machine learning can be adopted to build up high-accurate predictive models in drugs/cyclodextrins (CDs) systems. Molecular descriptors of compounds and experimental conditions were employed as inputs, while complexation free energy as outputs. Results showed that the light gradient boosting machine provided significantly improved predictive performance over random forest and deep learning. The mean absolute error was 1.38 kJ/mol and squared correlation coefficient was 0.86. The evaluation of relative importance of molecular descriptors further demonstrated the key factors affecting molecular interactions in drugs/CD systems. In the specific ketoprofen–CD systems, machine learning model showed better predictive performance than molecular modeling calculation, while molecular simulation could provide structural, dynamic and energetic information. The integration of machine learning and molecular simulation could produce synergistic effect for interpreting and predicting pharmaceutical formulations. In conclusion, the developed predictive models were able to quickly and accurately predict the solubilizing capacity of CD systems. Current research has taken an important step toward the application of machine learning in pharmaceutical formulation design. Key words: Machine learning, Deep learning, LightGBM, Random forest, Cyclodextrin, Binding free energy, Molecular modeling, Ketoprofen