Computational and Structural Biotechnology Journal (Jan 2021)

AMR-Diag: Neural network based genotype-to-phenotype prediction of resistance towards β-lactams in Escherichia coli and Klebsiella pneumoniae

  • Ekaterina Avershina
  • Priyanka Sharma
  • Arne M. Taxt
  • Harpreet Singh
  • Stephan A. Frye
  • Kolin Paul
  • Arti Kapil
  • Umaer Naseer
  • Punit Kaur
  • Rafi Ahmad

Journal volume & issue
Vol. 19
pp. 1896 – 1906

Abstract

Antibiotic resistance poses a major threat to public health. More effective approaches to antibiotic prescription are needed to delay its spread. Sequencing technologies coupled with trained neural network algorithms for genotype-to-phenotype prediction will reduce the time needed to identify an antibiotic susceptibility profile from days to hours.

In this work, we sequenced and phenotypically characterized 171 clinical isolates of Escherichia coli and Klebsiella pneumoniae from Norway and India. Based on these data, we created neural networks to predict susceptibility to ampicillin, 3rd generation cephalosporins, and carbapenems. All networks were trained on unassembled data, enabling prediction within minutes after the sequencing information becomes available. Moreover, they can be used on both Illumina- and MinION-generated data and do not require high genome coverage for phenotype prediction. We cross-checked our networks against previously published algorithms for genotype-to-phenotype prediction and their corresponding datasets. We also created an ensemble of networks trained on different datasets, which improved cross-dataset prediction compared to a single network.

Additionally, we used data from direct sequencing of spiked blood cultures and found that AMR-Diag networks, coupled with MinION sequencing, can predict bacterial species, resistome, and phenotype within 1–8 h of the start of sequencing. To our knowledge, this is the first genotype-to-phenotype prediction study to (1) employ a neural network method; (2) use data from more than one sequencing platform; and (3) utilize sequence data from spiked blood cultures.

Keywords