Mathematical Biosciences and Engineering (Jan 2023)

Antibody sequences assembly method based on weighted de Bruijn graph

  • Yi Lu,
  • heng Ge,
  • Biao Cai ,
  • Qing Xu,
  • Ren Kong ,
  • Shan Chang

DOI
https://doi.org/10.3934/mbe.2023266
Journal volume & issue
Vol. 20, no. 4
pp. 6174 – 6190

Abstract

Read online

With the development of next-generation protein sequencing technologies, sequence assembly algorithm has become a key technology for de novo sequencing process. At present, the existing methods can address the assembly of an unknown single protein chain. However, for monoclonal antibodies with light and heavy chains, the assembly is still an unsolved question. To address this problem, we propose a new assembly method, DBAS, which integrates the quality scores and sequence alignment scores from de novo sequencing peptides into a weighted de Bruijn graph to assemble the final protein sequences. The established method is used to assembling sequences from two datasets with mixed light and heavy chains from antibodies. The results show that the DBAS can assemble long antibody sequences for both mixed light and heavy chains and single chains. In addition, DBAS is able to distinguish the light and heavy chains by using BLAST sequence alignment. The results show that the algorithm has good performance for both target sequence coverage and contig assembly accuracy.

Keywords