Machine learning for technical skill assessment in surgery: a systematic review

Kyle Lam; Junhong Chen; Zeyu Wang; Fahad M. Iqbal; Ara Darzi; Benny Lo; Sanjay Purkayastha; James M. Kinross

doi:10.1038/s41746-022-00566-0

npj Digital Medicine (Mar 2022)

Machine learning for technical skill assessment in surgery: a systematic review

Kyle Lam,
Junhong Chen,
Zeyu Wang,
Fahad M. Iqbal,
Ara Darzi,
Benny Lo,
Sanjay Purkayastha,
James M. Kinross

Affiliations

Kyle Lam: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Junhong Chen: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Zeyu Wang: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Fahad M. Iqbal: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Ara Darzi: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Benny Lo: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
Sanjay Purkayastha: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College
James M. Kinross: Department of Surgery and Cancer, 10th Floor Queen Elizabeth the Queen Mother Building, St Mary’s Hospital, Imperial College

DOI: https://doi.org/10.1038/s41746-022-00566-0
Journal volume & issue: Vol. 5, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Accurate and objective performance assessment is essential for both trainees and certified surgeons. However, existing methods can be time consuming, labor intensive, and subject to bias. Machine learning (ML) has the potential to provide rapid, automated, and reproducible feedback without the need for expert reviewers. We aimed to systematically review the literature and determine the ML techniques used for technical surgical skill assessment and identify challenges and barriers in the field. A systematic literature search, in accordance with the PRISMA statement, was performed to identify studies detailing the use of ML for technical skill assessment in surgery. Of the 1896 studies that were retrieved, 66 studies were included. The most common ML methods used were Hidden Markov Models (HMM, 14/66), Support Vector Machines (SVM, 17/66), and Artificial Neural Networks (ANN, 17/66). 40/66 studies used kinematic data, 19/66 used video or image data, and 7/66 used both. Studies assessed the performance of benchtop tasks (48/66), simulator tasks (10/66), and real-life surgery (8/66). Accuracy rates of over 80% were achieved, although tasks and participants varied between studies. Barriers to progress in the field included a focus on basic tasks, lack of standardization between studies, and lack of datasets. ML has the potential to produce accurate and objective surgical skill assessment through the use of methods including HMM, SVM, and ANN. Future ML-based assessment tools should move beyond the assessment of basic tasks and towards real-life surgery and provide interpretable feedback with clinical value for the surgeon. PROSPERO: CRD42020226071

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal