Tongue feature dataset construction and real-time detection.

Wen-Hsien Chang; Chih-Chieh Chen; Han-Kuei Wu; Po-Chi Hsu; Lun-Chien Lo; Hsueh-Ting Chu; Hen-Hong Chang

doi:10.1371/journal.pone.0296070

PLoS ONE (Jan 2024)

DOI: https://doi.org/10.1371/journal.pone.0296070
Journal volume & issue: Vol. 19, no. 3, p. e0296070

Abstract

Background

Tongue diagnosis in traditional Chinese medicine (TCM) provides clinically important, objective evidence from direct observation of specific features that assist with diagnosis. However, interpreting tongue features currently requires a significant amount of manpower and time, and TCM physicians may interpret the features displayed by the same tongue differently. An automated system that interprets tongue features would expedite the process and yield more consistent results.

Materials and methods

This study applied deep learning visualization to tongue diagnosis. After tongue images and the corresponding interpretation reports by TCM physicians were collected in a single teaching hospital, various tongue features such as fissures, tooth marks, and different types of coatings were annotated manually with rectangles. These annotated data and images were used to train a deep learning object detection model. Upon completion of training, the position of each tongue feature was dynamically marked.

Results

A large, high-quality, manually annotated tongue feature dataset was constructed and analyzed. A detection model was trained that achieved an average precision (AP) of 47.67%, 58.94%, 71.25%, and 59.78% for fissures, tooth marks, thick coatings, and yellow coatings, respectively. Running at over 40 frames per second on an NVIDIA GeForce GTX 1060, the model was capable of detecting tongue features from any viewpoint in real time.

Conclusions/significance

This study constructed a tongue feature dataset and trained a deep learning object detection model to locate tongue features in real time. The model provided interpretability and intuitiveness that are often lacking in general neural network models, which suggests good feasibility for clinical application.
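For readers unfamiliar with the average precision (AP) metric reported for each tongue feature, the following is a minimal sketch of how per-class AP is commonly computed for rectangle-based object detection. It is not the authors' evaluation code. The abstract does not state the IoU threshold or the AP variant used, so the 0.5 threshold and the all-point interpolation below are assumptions, and the function names `iou` and `average_precision` are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(detections, ground_truth, iou_threshold=0.5):
    """AP for one feature class (assumed: IoU >= 0.5, all-point interpolation).

    detections:   list of (image_id, confidence, box)
    ground_truth: dict mapping image_id -> list of annotated boxes
    """
    total_gt = sum(len(boxes) for boxes in ground_truth.values())
    if total_gt == 0:
        return 0.0

    # Each annotated rectangle may be matched by at most one detection.
    matched = {img: [False] * len(boxes) for img, boxes in ground_truth.items()}
    hits = []
    for img, _score, box in sorted(detections, key=lambda d: d[1], reverse=True):
        best_iou, best_idx = 0.0, -1
        for idx, gt_box in enumerate(ground_truth.get(img, [])):
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        if best_idx >= 0 and best_iou >= iou_threshold and not matched[img][best_idx]:
            matched[img][best_idx] = True
            hits.append(1)  # true positive
        else:
            hits.append(0)  # false positive (poor overlap or duplicate)

    # Precision and recall after each detection, in confidence order.
    precisions, recalls, true_positives = [], [], 0
    for rank, hit in enumerate(hits, start=1):
        true_positives += hit
        precisions.append(true_positives / rank)
        recalls.append(true_positives / total_gt)

    # Replace each precision with the maximum precision at any higher recall.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Area under the interpolated precision-recall curve.
    ap, previous_recall = 0.0, 0.0
    for precision, recall in zip(precisions, recalls):
        ap += (recall - previous_recall) * precision
        previous_recall = recall
    return ap
```

Under these assumptions, a class with two annotated rectangles and three detections, of which the first and third in confidence order are correct, yields an AP of about 0.83. Other conventions (for example, averaging over several IoU thresholds) would give different values for the same detections.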

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/