Journal of Translational Medicine (Oct 2024)
LMCD-OR: a large-scale, multilevel categorized diagnostic dataset for oral radiography
Abstract
Abstract In recent years, digital dentistry has increasingly utilized advanced image analysis techniques, such as image classification and disease diagnosis, to improve clinical outcomes. Despite these advances, the lack of comprehensive benchmark datasets is a significant barrier. To address this gap, our research team develop LMCD-OR, a substantial collection of oral radiograph images designed to support extensive artificial intelligence (AI)-driven diagnostics. LMCD-OR comprises 3,818 digital imaging and communications in medicine (DICOM) oral X-ray images from local medical institutions that are meticulously annotated to provide broad category information for both primary dental outpatient services and detailed secondary disease diagnoses. This dataset is engineered to train and validate multiclassification models to improve the precision and scope of oral disease diagnostics. To ensure robust dataset validation, we employ four cutting-edge visual neural network classification models as benchmarks. These models are tested against rigorous performance metrics, demonstrating the ability of the dataset to support advanced image classification and disease diagnosis tasks. LMCD-OR is publicly available at http://dentaldataset.zeroacademy.net .
Keywords