Frontiers in Oncology (Feb 2024)

Identification of ipsilateral supraclavicular lymph node metastasis in breast cancer based on LASSO regression with a high penalty factor

  • Haohan Zhang,
  • Jin Yin,
  • Jin Yin,
  • Jin Yin,
  • Chen Zhou,
  • Chen Zhou,
  • Chen Zhou,
  • Jiajun Qiu,
  • Jiajun Qiu,
  • Jiajun Qiu,
  • Junren Wang,
  • Junren Wang,
  • Qing Lv,
  • Qing Lv,
  • Qing Lv,
  • Ting Luo,
  • Ting Luo

DOI
https://doi.org/10.3389/fonc.2024.1349315
Journal volume & issue
Vol. 14

Abstract

Read online

Aiming at the problems of small sample size and large feature dimension in the identification of ipsilateral supraclavicular lymph node metastasis status in breast cancer using ultrasound radiomics, an optimized feature combination search algorithm is proposed to construct linear classification models with high interpretability. The genetic algorithm (GA) is used to search for feature combinations within the feature subspace using least absolute shrinkage and selection operator (LASSO) regression. The search is optimized by applying a high penalty to the L1 norm of LASSO to retain excellent features in the crossover operation of the GA. The experimental results show that the linear model constructed using this method outperforms those using the conventional LASSO regression and standard GA. Therefore, this method can be used to build linear models with higher classification performance and more robustness.

Keywords