Scientific Reports (Jun 2024)

Deep learning model integrating cfDNA methylation and fragment size profiles for lung cancer diagnosis

  • Minjung Kim,
  • Juntae Park,
  • Seonghee Oh,
  • Byeong-Ho Jeong,
  • Yuree Byun,
  • Sun Hye Shin,
  • Yunjoo Im,
  • Jong Ho Cho,
  • Eun-Hae Cho

DOI
https://doi.org/10.1038/s41598-024-63411-2
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Detecting aberrant cell-free DNA (cfDNA) methylation is a promising strategy for lung cancer diagnosis. In this study, our aim is to identify methylation markers to distinguish patients with lung cancer from healthy individuals. Additionally, we sought to develop a deep learning model incorporating cfDNA methylation and fragment size profiles. To achieve this, we utilized methylation data collected from The Cancer Genome Atlas and Gene Expression Omnibus databases. Then we generated methylated DNA immunoprecipitation sequencing and genome-wide Enzymatic Methyl-seq (EM-seq) form lung cancer tissue and plasma. Using these data, we selected 366 methylation markers. A targeted EM-seq panel was designed using the selected markers, and 142 lung cancer and 56 healthy samples were produced with the panel. Additionally, cfDNA samples from healthy individuals and lung cancer patients were diluted to evaluate sensitivity. Its lung cancer detection performance reached an accuracy of 81.5% and an area under the receiver operating characteristic curve of 0.87. In the serial dilution experiment, we achieved tumor fraction detection of 1% at 98% specificity and 0.1% at 80% specificity. In conclusion, we successfully developed and validated a combination of methylation panel and a deep learning model that can distinguish between patients with lung cancer and healthy individuals.

Keywords