Double-head transformer neural network for molecular property prediction

Yuanbing Song; Jinghua Chen; Wenju Wang; Gang Chen; Zhichong Ma

doi:10.1186/s13321-023-00700-4

Journal of Cheminformatics (Feb 2023)

Double-head transformer neural network for molecular property prediction

Yuanbing Song,
Jinghua Chen,
Wenju Wang,
Gang Chen,
Zhichong Ma

Affiliations

Yuanbing Song: College of Communication and Art Design, University of Shanghai for Science and Technology
Jinghua Chen: College of Communication and Art Design, University of Shanghai for Science and Technology
Wenju Wang: College of Communication and Art Design, University of Shanghai for Science and Technology
Gang Chen: College of Communication and Art Design, University of Shanghai for Science and Technology
Zhichong Ma: College of Communication and Art Design, University of Shanghai for Science and Technology

DOI: https://doi.org/10.1186/s13321-023-00700-4
Journal volume & issue: Vol. 15, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Existing molecular property prediction methods based on deep learning ignore the generalization ability of the nonlinear representation of molecular features and the reasonable assignment of weights of molecular features, making it difficult to further improve the accuracy of molecular property prediction. To solve the above problems, an end-to-end double-head transformer neural network (DHTNN) is proposed in this paper for high-precision molecular property prediction. For the data distribution characteristics of the molecular dataset, DHTNN specially designs a new activation function, beaf, which can greatly improve the generalization ability of the nonlinear representation of molecular features. A residual network is introduced in the molecular encoding part to solve the gradient explosion problem and ensure that the model can converge quickly. The transformer based on double-head attention is used to extract molecular intrinsic detail features, and the weights are reasonably assigned for predicting molecular properties with high accuracy. Our model, which was tested on the MoleculeNet [1] benchmark dataset, showed significant performance improvements over other state-of-the-art methods.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords