Artificial Intelligence Chemistry (Dec 2024)
Conf-GEM: A geometric information-assisted direct conformation generation model
Abstract
Molecular conformations generation (MCG) aims to efficiently obtain reasonable and stable three-dimensional (3D) atomic coordinates of the atoms in the molecule from scratch, providing a structural foundation for molecular representation learning models and advanced downstream molecular design tasks such as molecular property prediction, molecular generation, and molecular docking. Existing MCG methods mostly rely on indirect distance-based strategies, which which can result in geometrically unrealistic conformations, or direct coordinate-based methods, which have larger search spaces and are prone to overfitting. Therefore, this study introduces Conf-GEM, a novel geometric information-assisted direct conformation generation model based on E-GeoGNN, a geometrically augmented 3D graph neural network with multiple scales. Pre-training and divide-and-conquer strategies, are integrated into the proposed model. Conf-GEM outperforms RDKit and nine deep-learning-based MCG models on the GEOM-QM9 and GEOM-Drugs datasets, achieving conformational coverage of 96.69% and 96.07%, respectively, without force field optimization. It also excels on the X-ray diffraction crystal structure dataset with up to 97.04% conformational coverage. In conclusion, Conf-GEM provides a novel solution for stabilizing 3D conformations generation. We provide an online prediction service (https://confgem.cmdrg.com) with a user-friendly interface for researchers.