Scientific Data (Nov 2024)
In-vivo non-contact multispectral oral disease image dataset with segmentation
Abstract
In imaging spectroscopy, oral tissue spectral data gathered from resected samples may not accurately represent tissue signatures because of time-dependent changes, blood loss, protein degeneration, and preservation chemicals. In-vivo spectral imaging addresses these limitations, but it introduces challenges of its own, such as device dimensions, tissue accessibility, and motion artifacts, all of which affect data quality and reliability. This study presents a dataset of in-vivo spectral images of oral diseases that addresses these challenges. We used a state-of-the-art multispectral camera, capturing images at a resolution of 270 × 510 pixels in 16 spectral bands (460 nm to 600 nm). The dataset includes 91 participants (15 healthy and 76 diseased), with multiple images per patient, totalling 243 spectral images. It covers three oral health conditions: Oral Submucous Fibrosis (OSMF), Leukoplakia, and Oral Squamous Cell Carcinoma (OSCC). Detailed patient history records accompany each case. This publicly available oral health multispectral dataset has the potential to advance spectroscopy-based diagnosis. Integrating artificial intelligence with a comprehensive repository of spectral signatures holds promise for accurate disease analysis.
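To make the dataset description more concrete, the following minimal Python sketch shows one way a 16-band image and its segmentation mask might be reduced to a mean spectral signature. It is illustrative only: the function names `band_centres` and `mean_spectrum`, the `cube[row][col][band]` layout, and the evenly spaced band centres between 460 nm and 600 nm are all assumptions, not the dataset's documented file format or the camera's actual band positions.

```python
# Hypothetical sketch: names and data layout are assumptions, not the
# dataset's documented format.

def band_centres(start_nm=460.0, stop_nm=600.0, n_bands=16):
    """Return nominal band centres, ASSUMING even spacing across the range.

    The real camera's band centres may not be evenly spaced.
    """
    step = (stop_nm - start_nm) / (n_bands - 1)
    return [start_nm + i * step for i in range(n_bands)]


def mean_spectrum(cube, mask):
    """Average each spectral band over the pixels selected by the mask.

    cube: nested list indexed as cube[row][col][band] (assumed layout)
    mask: nested list indexed as mask[row][col]; truthy marks the region
    """
    n_bands = len(cube[0][0])
    totals = [0.0] * n_bands
    count = 0
    for row_pixels, row_mask in zip(cube, mask):
        for pixel, inside in zip(row_pixels, row_mask):
            if inside:
                count += 1
                for b, value in enumerate(pixel):
                    totals[b] += value
    if count == 0:
        raise ValueError("mask selects no pixels")
    return [t / count for t in totals]
```

Under these assumptions, a full image would be a 270 × 510 grid of 16-element pixels, and `mean_spectrum` would return one 16-value signature per segmented region, which could then be compared across healthy, OSMF, Leukoplakia, and OSCC cases.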