Computer-aided diagnosis of laryngeal cancer via deep learning based on laryngoscopic images
Hao Xiong,
Peiliang Lin,
Jin-Gang Yu,
Jin Ye,
Lichao Xiao,
Yuan Tao,
Zebin Jiang,
Wei Lin,
Mingyue Liu,
Jingjing Xu,
Wenjie Hu,
Yuewen Lu,
Huaifeng Liu,
Yuanqing Li,
Yiqing Zheng,
Haidi Yang
Affiliations
Hao Xiong
Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, China; Institute of Hearing and Speech-Language Science, Sun Yat-sen University, China
Peiliang Lin
Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, China; Institute of Hearing and Speech-Language Science, Sun Yat-sen University, China
Jin-Gang Yu
School of Automation Science and Engineering, South China University of Technology, China
Jin Ye
Department of Otolaryngology, the Third Affiliated Hospital, Sun Yat-sen University, China
Lichao Xiao
School of Automation Science and Engineering, South China University of Technology, China
Yuan Tao
Department of Otolaryngology, Peking University Shenzhen Hospital, China
Zebin Jiang
Department of Otolaryngology, Puning People's Hospital, China
Wei Lin
Department of Otolaryngology, Taizhou First People‘s Hospital, China
Mingyue Liu
Department of Otolaryngology, the Third Affiliated Hospital, Sun Yat-sen University, China
Jingjing Xu
Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China
Wenjie Hu
Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China
Yuewen Lu
Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China
Huaifeng Liu
Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China
Yuanqing Li
School of Automation Science and Engineering, South China University of Technology, China; Correspondence to: Y. Li, School of Automation Science and Engineering, South China University of Technology, 381 Wushan Road, Guangzhou 510641, China.
Yiqing Zheng
Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, China; Institute of Hearing and Speech-Language Science, Sun Yat-sen University, China; Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China; Correspondence to: Y. Zheng and H. Yang, Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, 107 West Yan Jiang Road, Guangzhou 510120, China.
Haidi Yang
Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, China; Institute of Hearing and Speech-Language Science, Sun Yat-sen University, China; Department of Hearing and Speech-Language Science, Xinhua College, Sun Yat-sen University, China; Correspondence to: Y. Zheng and H. Yang, Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, 107 West Yan Jiang Road, Guangzhou 510120, China.
Objective: To develop a deep convolutional neural network (DCNN) that can automatically detect laryngeal cancer (LCA) in laryngoscopic images. Methods: A DCNN-based diagnostic system was constructed and trained using 13,721 laryngoscopic images of LCA, precancerous laryngeal lesions (PRELCA), benign laryngeal tumors (BLT) and normal tissues (NORM) from 2 tertiary hospitals in China, including 2293 from 206 LCA subjects, 1807 from 203 PRELCA subjects, 6448 from 774 BLT subjects and 3191 from 633 NORM subjects. An independent test set of 1176 laryngoscopic images from other 3 tertiary hospitals in China, including 132 from 44 LCA subjects, 129 from 43 PRELCA subjects, 504 from 168 BLT subjects and 411 from 137 NORM subjects, was applied to the constructed DCNN to evaluate its performance against experienced endoscopists. Results: The DCCN achieved a sensitivity of 0.731, a specificity of 0.922, an AUC of 0.922, and the overall accuracy of 0.867 for detecting LCA and PRELCA among all lesions and normal tissues. When compared to human experts in an independent test set, the DCCN’ s performance on detection of LCA and PRELCA achieved a sensitivity of 0.720, a specificity of 0.948, an AUC of 0.953, and the overall accuracy of 0.897, which was comparable to that of an experienced human expert with 10–20 years of work experience. Moreover, the overall accuracy of DCNN for detection of LCA was 0.773, which was also comparable to that of an experienced human expert with 10–20 years of work experience and exceeded the experts with less than 10 years of work experience. Conclusions: The DCNN has high sensitivity and specificity for automated detection of LCA and PRELCA from BLT and NORM in laryngoscopic images. This novel and effective approach facilitates earlier diagnosis of early LCA, resulting in improved clinical outcomes and reducing the burden of endoscopists.