Multi-scale V-net architecture with deep feature CRF layers for brain extraction

Jong Sung Park; Shreyas Fadnavis; Eleftherios Garyfallidis

doi:10.1038/s43856-024-00452-8

Communications Medicine (Feb 2024)

Multi-scale V-net architecture with deep feature CRF layers for brain extraction

Jong Sung Park,
Shreyas Fadnavis,
Eleftherios Garyfallidis

Affiliations

Jong Sung Park: Intelligent Systems Engineering, Indiana University Bloomington
Shreyas Fadnavis: Massachusetts General Hospital, Harvard Medical School
Eleftherios Garyfallidis: Intelligent Systems Engineering, Indiana University Bloomington

DOI: https://doi.org/10.1038/s43856-024-00452-8
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Brain extraction is a computational necessity for researchers using brain imaging data. However, the complex structure of the interfaces between the brain, meninges and human skull have not allowed a highly robust solution to emerge. While previous methods have used machine learning with structural and geometric priors in mind, with the development of Deep Learning (DL), there has been an increase in Neural Network based methods. Most proposed DL models focus on improving the training data despite the clear gap between groups in the amount and quality of accessible training data between. Methods We propose an architecture we call Efficient V-net with Additional Conditional Random Field Layers (EVAC+). EVAC+ has 3 major characteristics: (1) a smart augmentation strategy that improves training efficiency, (2) a unique way of using a Conditional Random Fields Recurrent Layer that improves accuracy and (3) an additional loss function that fine-tunes the segmentation output. We compare our model to state-of-the-art non-DL and DL methods. Results Results show that even with limited training resources, EVAC+ outperforms in most cases, achieving a high and stable Dice Coefficient and Jaccard Index along with a desirable lower Surface (Hausdorff) Distance. More importantly, our approach accurately segmented clinical and pediatric data, despite the fact that the training dataset only contains healthy adults. Conclusions Ultimately, our model provides a reliable way of accurately reducing segmentation errors in complex multi-tissue interfacing areas of the brain. We expect our method, which is publicly available and open-source, to be beneficial to a wide range of researchers.

Published in Communications Medicine

ISSN: 2730-664X (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://www.nature.com/commsmed/

About the journal