Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks

Ummey Tanin; Adrienne Duimering; Christine Law; Jessica Ruzicki; Gabriela Luna; Matthew Holden

doi:10.1049/htl2.12078

Healthcare Technology Letters (Apr 2024)

Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks

Ummey Tanin,
Adrienne Duimering,
Christine Law,
Jessica Ruzicki,
Gabriela Luna,
Matthew Holden

Affiliations

Ummey Tanin: School of Computer Science Carleton University Ottawa Ontario Canada
Adrienne Duimering: Department of Ophthalmology, School of Medicine Queen's University Kingston Ontario Canada
Christine Law: Department of Ophthalmology, School of Medicine Queen's University Kingston Ontario Canada
Jessica Ruzicki: Department of Ophthalmology, School of Medicine Queen's University Kingston Ontario Canada
Gabriela Luna: Department of Ophthalmology, School of Medicine Queen's University Kingston Ontario Canada
Matthew Holden: School of Computer Science Carleton University Ottawa Ontario Canada

DOI: https://doi.org/10.1049/htl2.12078
Journal volume & issue: Vol. 11, no. 2-3
pp. 189 – 195

Abstract

Read online

Abstract An important part of surgical training in ophthalmology is understanding how to proficiently perform cataract surgery. Operating skill in cataract surgery is typically assessed by real‐time or video‐based expert review using a rating scale. This is time‐consuming, subjective and labour‐intensive. A typical trainee graduates with over 100 complete surgeries, each of which requires review by the surgical educators. Due to the consistently repetitive nature of this task, it lends itself well to machine learning‐based evaluation. Recent studies utilize deep learning models trained on tool motion trajectories obtained using additional equipment or robotic systems. However, the process of tool recognition by extracting frames from the videos to perform phase recognition followed by skill assessment is exhaustive. This project proposes a deep learning model for skill evaluation using raw surgery videos that is cost‐effective and end‐to‐end trainable. An advanced ensemble of convolutional neural network models is leveraged to model technical skills in cataract surgeries and is evaluated using a large dataset comprising almost 200 surgical trials. The highest accuracy of 0.8494 is observed on the phacoemulsification step data. Our model yielded an average accuracy of 0.8200 and an average AUC score of 0.8800 for all four phase datasets of cataract surgery proving its robustness against different data. The proposed ensemble model with 2D and 3D convolutional neural networks demonstrated a promising result without using tool motion trajectories to evaluate surgery expertise.

Published in Healthcare Technology Letters

ISSN: 2053-3713 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Medical technology
Website: https://ietresearch.onlinelibrary.wiley.com/journal/20533713

About the journal

Abstract

Keywords