Accelerating the inference of string generation-based chemical reaction models for industrial applications

Mikhail Andronov; Natalia Andronova; Michael Wand; Jürgen Schmidhuber; Djork-Arné Clevert

doi:10.1186/s13321-025-00974-w

Journal of Cheminformatics (Mar 2025)

Accelerating the inference of string generation-based chemical reaction models for industrial applications

Mikhail Andronov,
Natalia Andronova,
Michael Wand,
Jürgen Schmidhuber,
Djork-Arné Clevert

Affiliations

Mikhail Andronov: IDSIA, USI, SUPSI
Natalia Andronova
Michael Wand: IDSIA, USI, SUPSI
Jürgen Schmidhuber: IDSIA, USI, SUPSI
Djork-Arné Clevert: Machine Learning Research, Pfizer Research and Development

DOI: https://doi.org/10.1186/s13321-025-00974-w
Journal volume & issue: Vol. 17, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Transformer-based, template-free SMILES-to-SMILES translation models for reaction prediction and single-step retrosynthesis are of interest to computer-aided synthesis planning systems, as they offer state-of-the-art accuracy. However, their slow inference speed limits their practical utility in such applications. To address this challenge, we propose speculative decoding with a simple chemically specific drafting strategy and apply it to the Molecular Transformer, an encoder-decoder transformer for conditional SMILES generation. Our approach achieves over 3X faster inference in reaction product prediction and single-step retrosynthesis with no loss in accuracy, increasing the potential of the transformer as the backbone of synthesis planning systems. To accelerate the simultaneous generation of multiple precursor SMILES for a given query SMILES in single-step retrosynthesis, we introduce Speculative Beam Search, a novel algorithm tackling the challenge of beam search acceleration with speculative decoding. Our methods aim to improve transformer-based models’ scalability and industrial applicability in synthesis planning.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords