BMJ Open (Sep 2020)

Automatic deep learning-based colorectal adenoma detection system and its similarities with pathologists

  • Yong Huang,
  • Wei Jin,
  • Jing Yuan,
  • Zhigang Song,
  • Chunkai Yu,
  • Shuangmei Zou,
  • Wenmiao Wang,
  • Xiaohui Ding,
  • Jinhong Liu,
  • Liwei Shao,
  • Xiangnan Gou,
  • Zhanbo Wang,
  • Huang Chen,
  • Cancheng Liu,
  • Zhuo Sun,
  • Calvin Ku,
  • Yongqiang Zhang,
  • Xianghui Dong,
  • Shuhao Wang,
  • Ning Lv,
  • Huaiyin Shi

DOI
https://doi.org/10.1136/bmjopen-2019-036423
Journal volume & issue
Vol. 10, no. 9

Abstract

Read online

Objectives The microscopic evaluation of slides has been gradually moving towards all digital in recent years, leading to the possibility for computer-aided diagnosis. It is worthwhile to know the similarities between deep learning models and pathologists before we put them into practical scenarios. The simple criteria of colorectal adenoma diagnosis make it to be a perfect testbed for this study.Design The deep learning model was trained by 177 accurately labelled training slides (156 with adenoma). The detailed labelling was performed on a self-developed annotation system based on iPad. We built the model based on DeepLab v2 with ResNet-34. The model performance was tested on 194 test slides and compared with five pathologists. Furthermore, the generalisation ability of the learning model was tested by extra 168 slides (111 with adenoma) collected from two other hospitals.Results The deep learning model achieved an area under the curve of 0.92 and obtained a slide-level accuracy of over 90% on slides from two other hospitals. The performance was on par with the performance of experienced pathologists, exceeding the average pathologist. By investigating the feature maps and cases misdiagnosed by the model, we found the concordance of thinking process in diagnosis between the deep learning model and pathologists.Conclusions The deep learning model for colorectal adenoma diagnosis is quite similar to pathologists. It is on-par with pathologists’ performance, makes similar mistakes and learns rational reasoning logics. Meanwhile, it obtains high accuracy on slides collected from different hospitals with significant staining configuration variations.