Systematic Reviews (Sep 2023)

Semi-automating abstract screening with a natural language model pretrained on biomedical literature

  • Sheryl Hui-Xian Ng,
  • Kiok Liang Teow,
  • Gary Yee Ang,
  • Woan Shin Tan,
  • Allyn Hum

DOI
https://doi.org/10.1186/s13643-023-02353-8
Journal volume & issue
Vol. 12, no. 1
pp. 1 – 3

Abstract

Read online

Abstract We demonstrate the performance and workload impact of incorporating a natural language model, pretrained on citations of biomedical literature, on a workflow of abstract screening for studies on prognostic factors in end-stage lung disease. The model was optimized on one-third of the abstracts, and model performance on the remaining abstracts was reported. Performance of the model, in terms of sensitivity, precision, F1 and inter-rater agreement, was moderate in comparison with other published models. However, incorporating it into the screening workflow, with the second reviewer screening only abstracts with conflicting decisions, translated into a 65% reduction in the number of abstracts screened by the second reviewer. Subsequent work will look at incorporating the pre-trained BERT model into screening workflows for other studies prospectively, as well as improving model performance.

Keywords