Optimized model architectures for deep learning on genomic data

Hüseyin Anil Gündüz; René Mreches; Julia Moosbauer; Gary Robertson; Xiao-Yin To; Eric A. Franzosa; Curtis Huttenhower; Mina Rezaei; Alice C. McHardy; Bernd Bischl; Philipp C. Münch; Martin Binder

doi:10.1038/s42003-024-06161-1

Communications Biology (Apr 2024)

Optimized model architectures for deep learning on genomic data

Hüseyin Anil Gündüz,
René Mreches,
Julia Moosbauer,
Gary Robertson,
Xiao-Yin To,
Eric A. Franzosa,
Curtis Huttenhower,
Mina Rezaei,
Alice C. McHardy,
Bernd Bischl,
Philipp C. Münch,
Martin Binder

Affiliations

Hüseyin Anil Gündüz: Department of Statistics, LMU Munich
René Mreches: Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research
Julia Moosbauer: Department of Statistics, LMU Munich
Gary Robertson: Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research
Xiao-Yin To: Department of Statistics, LMU Munich
Eric A. Franzosa: Department of Biostatistics, Harvard School of Public Health
Curtis Huttenhower: Department of Biostatistics, Harvard School of Public Health
Mina Rezaei: Department of Statistics, LMU Munich
Alice C. McHardy: Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research
Bernd Bischl: Department of Statistics, LMU Munich
Philipp C. Münch: Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research
Martin Binder: Department of Statistics, LMU Munich

DOI: https://doi.org/10.1038/s42003-024-06161-1
Journal volume & issue: Vol. 7, no. 1
pp. 1 – 10

Abstract

Read online

Abstract The success of deep learning in various applications depends on task-specific architecture design choices, including the types, hyperparameters, and number of layers. In computational biology, there is no consensus on the optimal architecture design, and decisions are often made using insights from more well-established fields such as computer vision. These may not consider the domain-specific characteristics of genome sequences, potentially limiting performance. Here, we present GenomeNet-Architect, a neural architecture design framework that automatically optimizes deep learning models for genome sequence data. It optimizes the overall layout of the architecture, with a search space specifically designed for genomics. Additionally, it optimizes hyperparameters of individual layers and the model training procedure. On a viral classification task, GenomeNet-Architect reduced the read-level misclassification rate by 19%, with 67% faster inference and 83% fewer parameters, and achieved similar contig-level accuracy with ~100 times fewer parameters compared to the best-performing deep learning baselines.

Published in Communications Biology

ISSN: 2399-3642 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General)
Website: https://www.nature.com/commsbio/

About the journal