What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study

Azeem Majeed; Mansour Taghavi Azar Sharabiani; Saba Mian; Austen El-Osta; Emmanouil Bagkeris; Aos Alaa; Iman Webber

doi:10.1136/bmjopen-2021-053566

BMJ Open (Apr 2022)

What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study

Azeem Majeed,
Mansour Taghavi Azar Sharabiani,
Saba Mian,
Austen El-Osta,
Emmanouil Bagkeris,
Aos Alaa,
Iman Webber

Affiliations

Azeem Majeed: professor of primary care and public health
Mansour Taghavi Azar Sharabiani: Department of Primary Care and Public Health, Imperial College London, London, UK
Saba Mian: Department of Primary Care and Public Health, Imperial College London, London, UK
Austen El-Osta: Department of Primary Care and Public Health, Imperial College London, London, UK
Emmanouil Bagkeris: National Heart and Lung Institute, Imperial College London, London, UK
Aos Alaa: Self-Care Academic Research Unit (SCARU), Department of Primary Care and Public Health, Imperial College London Faculty of Medicine, London, UK
Iman Webber: Self-Care Academic Research Unit (SCARU), Department of Primary Care and Public Health, Imperial College London Faculty of Medicine, London, UK

DOI: https://doi.org/10.1136/bmjopen-2021-053566
Journal volume & issue: Vol. 12, no. 4

Abstract

Read online

Objective Assess the suitability of clinical vignettes in benchmarking the performance of online symptom checkers (OSCs).Design Observational study using a publicly available free OSC.Participants Healthily OSC, which provided consultations in English, was used to record consultation outcomes from two lay and four expert inputters using 139 standardised patient vignettes. Each vignette included three diagnostic solutions and a triage recommendation in one of three categories of triage urgency. A panel of three independent general practitioners interpreted the vignettes to arrive at an alternative set of diagnostic and triage solutions. Both sets of diagnostic and triage solutions were consolidated to arrive at a final consolidated version for benchmarking.Main outcome measures Six inputters simulated 834 standardised patient evaluations using Healthily OSC and recorded outputs (triage solution, signposting, and whether the correct diagnostic solution appeared first or within the first three differentials). We estimated Cohen’s kappa to assess how interpretations by different inputters could lead to divergent OSC output even when using the same vignette or when compared with a separate panel of physicians.Results There was moderate agreement on triage recommendation (kappa=0.48), and substantial agreement on consultation outcomes between all inputters (kappa=0.73). OSC performance improved significantly from baseline when compared against the final consolidated diagnostic and triage solution (p<0.001).Conclusions Clinical vignettes are inherently limited in their utility to benchmark the diagnostic accuracy or triage safety of OSC. Real-world evidence studies involving real patients are recommended to benchmark the performance of OSC against a panel of physicians.

Published in BMJ Open

ISSN: 2044-6055 (Online)
Publisher: BMJ Publishing Group
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://bmjopen.bmj.com

About the journal