Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study

John Zimmerman; Robin E. Soler; James Lavinder; Sarah Murphy; Charisma Atkins; LaShonda Hulbert; Richard Lusk; Boon Peng Ng

doi:10.1186/s13643-021-01640-6

Systematic Reviews (Apr 2021)

Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study

John Zimmerman,
Robin E. Soler,
James Lavinder,
Sarah Murphy,
Charisma Atkins,
LaShonda Hulbert,
Richard Lusk,
Boon Peng Ng

Affiliations

John Zimmerman: Deloitte Consulting, LLP
Robin E. Soler: Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation
James Lavinder: Deloitte Consulting, LLP
Sarah Murphy: Deloitte Consulting, LLP
Charisma Atkins: Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation
LaShonda Hulbert: Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation
Richard Lusk: Deloitte Consulting, LLP
Boon Peng Ng: Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Diabetes Translation

DOI: https://doi.org/10.1186/s13643-021-01640-6
Journal volume & issue: Vol. 10, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. Methods In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. Results The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. Conclusions We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR.

Published in Systematic Reviews

ISSN: 2046-4053 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://systematicreviewsjournal.biomedcentral.com

About the journal

Abstract

Keywords