Relative molecule self-attention transformer

Łukasz Maziarka; Dawid Majchrowski; Tomasz Danel; Piotr Gaiński; Jacek Tabor; Igor Podolak; Paweł Morkisz; Stanisław Jastrzębski

doi:10.1186/s13321-023-00789-7

Journal of Cheminformatics (Jan 2024)

Relative molecule self-attention transformer

Łukasz Maziarka,
Dawid Majchrowski,
Tomasz Danel,
Piotr Gaiński,
Jacek Tabor,
Igor Podolak,
Paweł Morkisz,
Stanisław Jastrzębski

Affiliations

Łukasz Maziarka: Faculty of Mathematics and Computer Science, Jagiellonian University
Dawid Majchrowski: NVIDIA
Tomasz Danel: Faculty of Mathematics and Computer Science, Jagiellonian University
Piotr Gaiński: Faculty of Mathematics and Computer Science, Jagiellonian University
Jacek Tabor: Faculty of Mathematics and Computer Science, Jagiellonian University
Igor Podolak: Faculty of Mathematics and Computer Science, Jagiellonian University
Paweł Morkisz: NVIDIA
Stanisław Jastrzębski: Molecule.one

DOI: https://doi.org/10.1186/s13321-023-00789-7
Journal volume & issue: Vol. 16, no. 1
pp. 1 – 14

Abstract

Read online

Abstract The prediction of molecular properties is a crucial aspect in drug discovery that can save a lot of money and time during the drug design process. The use of machine learning methods to predict molecular properties has become increasingly popular in recent years. Despite advancements in the field, several challenges remain that need to be addressed, like finding an optimal pre-training procedure to improve performance on small datasets, which are common in drug discovery. In our paper, we tackle these problems by introducing Relative Molecule Self-Attention Transformer for molecular representation learning. It is a novel architecture that uses relative self-attention and 3D molecular representation to capture the interactions between atoms and bonds that enrich the backbone model with domain-specific inductive biases. Furthermore, our two-step pretraining procedure allows us to tune only a few hyperparameter values to achieve good performance comparable with state-of-the-art models on a wide selection of downstream tasks.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords