Improved Repeatability of Mouse Tibia Volume Segmentation in Murine Myelofibrosis Model Using Deep Learning

Aman Kushwaha; Rami F. Mourad; Kevin Heist; Humera Tariq; Heang-Ping Chan; Brian D. Ross; Thomas L. Chenevert; Dariya Malyarenko; Lubomir M. Hadjiiski

doi:10.3390/tomography9020048

Tomography (Mar 2023)

Improved Repeatability of Mouse Tibia Volume Segmentation in Murine Myelofibrosis Model Using Deep Learning

Aman Kushwaha,
Rami F. Mourad,
Kevin Heist,
Humera Tariq,
Heang-Ping Chan,
Brian D. Ross,
Thomas L. Chenevert,
Dariya Malyarenko,
Lubomir M. Hadjiiski

Affiliations

Aman Kushwaha: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Rami F. Mourad: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Kevin Heist: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Humera Tariq: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Heang-Ping Chan: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Brian D. Ross: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Thomas L. Chenevert: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Dariya Malyarenko: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
Lubomir M. Hadjiiski: Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA

DOI: https://doi.org/10.3390/tomography9020048
Journal volume & issue: Vol. 9, no. 2
pp. 589 – 602

Abstract

Read online

A murine model of myelofibrosis in tibia was used in a co-clinical trial to evaluate segmentation methods for application of image-based biomarkers to assess disease status. The dataset (32 mice with 157 3D MRI scans including 49 test–retest pairs scanned on consecutive days) was split into approximately 70% training, 10% validation, and 20% test subsets. Two expert annotators (EA1 and EA2) performed manual segmentations of the mouse tibia (EA1: all data; EA2: test and validation). Attention U-net (A-U-net) model performance was assessed for accuracy with respect to EA1 reference using the average Jaccard index (AJI), volume intersection ratio (AVI), volume error (AVE), and Hausdorff distance (AHD) for four training scenarios: full training, two half-splits, and a single-mouse subsets. The repeatability of computer versus expert segmentations for tibia volume of test–retest pairs was assessed by within-subject coefficient of variance (%wCV). A-U-net models trained on full and half-split training sets achieved similar average accuracy (with respect to EA1 annotations) for test set: AJI = 83–84%, AVI = 89–90%, AVE = 2–3%, and AHD = 0.5 mm–0.7 mm, exceeding EA2 accuracy: AJ = 81%, AVI = 83%, AVE = 14%, and AHD = 0.3 mm. The A-U-net model repeatability wCV [95% CI]: 3 [2, 5]% was notably better than that of expert annotators EA1: 5 [4, 9]% and EA2: 8 [6, 13]%. The developed deep learning model effectively automates murine bone marrow segmentation with accuracy comparable to human annotators and substantially improved repeatability.

Published in Tomography

ISSN: 2379-1381 (Print); 2379-139X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.mdpi.com/journal/tomography

About the journal

Abstract

Keywords