Usefulness of machine learning softwares to screen titles of systematic reviews: a methodological study

Ana Helena Salles dos Reis; Ana Luiza Miranda de Oliveira; Carolina Fritsch; James Zouch; Paulo Ferreira; Janaine Cunha Polese

doi:10.1186/s13643-023-02231-3

Systematic Reviews (Apr 2023)

Usefulness of machine learning softwares to screen titles of systematic reviews: a methodological study

Ana Helena Salles dos Reis,
Ana Luiza Miranda de Oliveira,
Carolina Fritsch,
James Zouch,
Paulo Ferreira,
Janaine Cunha Polese

Affiliations

Ana Helena Salles dos Reis: Post-Graduate Program of Health Sciences, Faculdade Ciências Médicas de Minas Gerais
Ana Luiza Miranda de Oliveira: Post-Graduate Program of Health Sciences, Faculdade Ciências Médicas de Minas Gerais
Carolina Fritsch: Faculty of Medicine and Health, School of Health Sciences, Sydney Musculoskeletal Health, The Kolling Institute, The University of Sydney
James Zouch: Faculty of Health Sciences, The University of Sydney
Paulo Ferreira: Faculty of Health Sciences, The University of Sydney
Janaine Cunha Polese: Post-Graduate Program of Health Sciences, Faculdade Ciências Médicas de Minas Gerais

DOI: https://doi.org/10.1186/s13643-023-02231-3
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Objective To investigate the usefulness and performance metrics of three freely-available softwares (Rayyan®, Abstrackr® and Colandr®) for title screening in systematic reviews. Study design and setting In this methodological study, the usefulness of softwares to screen titles in systematic reviews was investigated by the comparison between the number of titles identified by software-assisted screening and those by manual screening using a previously published systematic review. To test the performance metrics, sensitivity, specificity, false negative rate, proportion missed, workload and timing savings were calculated. A purposely built survey was used to evaluate the rater's experiences regarding the softwares’ performances. Results Rayyan® was the most sensitive software and raters correctly identified 78% of the true positives. All three softwares were specific and raters correctly identified 99% of the true negatives. They also had similar values for precision, proportion missed, workload and timing savings. Rayyan®, Abstrackr® and Colandr® had 21%, 39% and 34% of false negatives rates, respectively. Rayyan presented the best performance (35/40) according to the raters. Conclusion Rayyan®, Abstrackr® and Colandr® are useful tools and provided good metric performance results for systematic title screening. Rayyan® appears to be the best ranked on the quantitative and on the raters’ perspective evaluation. The most important finding of this study is that the use of software to screen titles does not remove any title that would meet the inclusion criteria for the final review, being valuable resources to facilitate the screening process.

Published in Systematic Reviews

ISSN: 2046-4053 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://systematicreviewsjournal.biomedcentral.com

About the journal

Abstract

Keywords