OTONet: Deep Neural Network for Precise Otoscopy Image Classification

Divya Rao; Rohit Singh; Sudiksha Kottachery Kamath; Sanjeev Kushal Pendekanti; Divya Pai; Sucheta V. Kolekar; M. Raviraja Holla; Sameena Pathan

doi:10.1109/ACCESS.2024.3351668

IEEE Access (Jan 2024)

OTONet: Deep Neural Network for Precise Otoscopy Image Classification

Divya Rao,
Rohit Singh,
Sudiksha Kottachery Kamath,
Sanjeev Kushal Pendekanti,
Divya Pai,
Sucheta V. Kolekar,
M. Raviraja Holla,
Sameena Pathan

Affiliations

Divya Rao: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
Rohit Singh: ORCiD; Department of Otorhinolaryngology, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, India
Sudiksha Kottachery Kamath: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
Sanjeev Kushal Pendekanti: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
Divya Pai: ORCiD; Department of Orthodontics and Dentofacial Orthopedics, Manipal College of Dental Sciences, Manipal Academy of Higher Education, Manipal, India
Sucheta V. Kolekar: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
M. Raviraja Holla: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India
Sameena Pathan: ORCiD; Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India

DOI: https://doi.org/10.1109/ACCESS.2024.3351668
Journal volume & issue: Vol. 12
pp. 7734 – 7746

Abstract

Read online

Otoscopy is a diagnostic procedure to visualize the external ear canal and eardrum, facilitating the detection of various ear pathologies and conditions. Timely otoscopy image classification offers significant advantages, including early detection, reduced patient anxiety, and personalized treatment plans. This paper introduces a novel OTONet framework specifically tailored for otoscopy image classification. It leverages octave 3D convolution and a combination of feature and region-focus modules to create an accurate and robust classification system capable of distinguishing between various otoscopic conditions. This architecture is designed to efficiently capture and process the spatial and feature information present in otoscopy images. Using a public otoscopy dataset, OTONet has reached a classification accuracy of 99.3% and an F1 score of 99.4% across 11 classes of ear conditions. A comparative analysis demonstrates that OTONet surpasses other established machine learning models, including ResNet50, ResNet50v2, VGG16, Dense-Net169, and ConvNeXtTiny, across various evaluation metrics. The research’s contribution to improved diagnostic accuracy reduced human error, expedited diagnostics, and its potential for telemedicine applications.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords