The Lancet: Digital Health (Nov 2021)
Multi-step validation of a deep learning-based system for the quantification of bowel preparation: a prospective, observational study
Abstract
Summary: Background: Inadequate bowel preparation is associated with a decrease in adenoma detection rate (ADR). A deep learning-based bowel preparation assessment system based on the Boston bowel preparation scale (BBPS) has been previously established to calculate the automatic BBPS (e-BBPS) score (ranging 0–20). The aims of this study were to investigate whether there was a statistically inverse relationship between the e-BBPS score and the ADR, and to determine the threshold of e-BBPS score for adequate bowel preparation in colonoscopy screening. Methods: In this prospective, observational study, we trained and internally validated the e-BBPS system using retrospective colonoscopy images and videos from the Endoscopy Center of Wuhan University, annotated by endoscopists. We externally validated the system using colonoscopy images and videos from the First People's Hospital of Yichang and the Third Hospital of Wuhan. To prospectively validate the system, we recruited consecutive patients at Renmin Hospital of Wuhan University aged between 18 and 75 years undergoing colonoscopy. The exclusion criteria included: contraindication to colonoscopy, family polyposis syndrome, inflammatory bowel disease, history of surgery for colorectal or colorectal cancer, known or suspected bowel obstruction or perforation, patients who were pregnant or lactating, inability to receive caecal intubation, and lumen obstruction. We did colonoscopy procedures and collected withdrawal videos, which were reviewed and the e-BBPS system was applied to all colon segments. The primary outcome of this study was ADR, defined as the proportion of patients with one or more conventional adenomas detected during colonoscopy. We calculated the ADR of each e-BBPS score and did a correlation analysis using Spearman analysis. Findings: From May 11 to Aug 10, 2020, 616 patients underwent screening colonoscopies, which evaluated. There was a significant inverse correlation between the e-BBPS score and ADR (Spearman's rank −0·976, p<0·010). The ADR for the e-BBPS scores 1–8 was 28·57%, 28·68%, 26·79%, 19·19%, 17·57%, 17·07%, 14·81%, and 0%, respectively. According to the 25% ADR standard for screening colonoscopy, an e-BBPS score of 3 was set as a threshold to guarantee an ADR of more than 25%, and so high-quality endoscopy. Patients with scores of more than 3 had a significantly lower ADR than those with a score of 3 or less (ADR 15·93% vs 28·03%, p<0·001, 95% CI 0·28–0·66, odds ratio 0·43). Interpretation: The e-BBPS system has potential to provide a more objective and refined threshold for the quantification of adequate bowel preparation. Funding: Project of Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision and Hubei Province Major Science and Technology Innovation Project.