Scientific Data (Jan 2025)
A multi-modal dental dataset for semi-supervised deep learning image segmentation
Abstract
Abstract In response to the increasing prevalence of dental diseases, dental health, a vital aspect of human well-being, warrants greater attention. Panoramic X-ray images (PXI) and Cone Beam Computed Tomography (CBCT) are key tools for dentists in diagnosing and treating dental conditions. Additionally, deep learning for tooth segmentation can focus on relevant treatment information and localize lesions. However, the scarcity of publicly available PXI and CBCT datasets hampers their use in tooth segmentation tasks. Therefore, this paper presents a multimodal dataset for Semi-supervised Tooth Segmentation (STS-Tooth) in dental PXI and CBCT, named STS-2D-Tooth and STS-3D-Tooth. STS-2D-Tooth includes 4,000 images and 900 masks, categorized by age into children and adults. Moreover, we have collected CBCTs providing more detailed and three-dimensional information, resulting in the STS-3D-Tooth dataset comprising 148,400 unlabeled scans and 8,800 masks. To our knowledge, this is the first multimodal dataset combining dental PXI and CBCT, and it is the largest tooth segmentation dataset, a significant step forward for the advancement of tooth segmentation.