mAbs (Dec 2024)

Seq2scFv: a toolkit for the comprehensive analysis of display libraries from long-read sequencing platforms

  • Marianne Bachmann Salvy,
  • Luca Santuari,
  • Emanuel Schmid-Siegert,
  • Nikolaos Lykoskoufis,
  • Ioannis Xenarios,
  • Bulak Arpat

DOI
https://doi.org/10.1080/19420862.2024.2408344
Journal volume & issue
Vol. 16, no. 1

Abstract

Read online

Antibodies have emerged as the leading class of biotherapeutics, yet traditional screening methods face significant time and resource challenges in identifying lead candidates. Integrating high-throughput sequencing with computational approaches marks a pivotal advancement in antibody discovery, expanding the antibody space to explore. In this context, a major breakthrough has been the full-length sequencing of single-chain variable fragments (scFvs) used in in vitro display libraries. However, few tools address the task of annotating the paired heavy and light chain variable domains (VH and VL), which is the primary advantage of full-scFv sequencing. To address this methodological gap, we introduce Seq2scFv, a novel open-source toolkit designed for analyzing in vitro display libraries from long-read sequencing platforms. Seq2scFv facilitates the identification and thorough characterization of V(D)J recombination in both VH and VL regions. In addition to providing annotated scFvs, translated sequences and numbered chains, Seq2scFv enables linker inference and characterization, sequence encoding with unique identifiers and quantification of identical sequences across selection rounds, thereby simplifying enrichment identification. With its versatile and standalone functionality, we anticipate that the implementation of Seq2scFv tools in antibody discovery pipelines will efficiently expedite the full characterization of display libraries and potentially facilitate the identification of high-affinity antibody candidates.

Keywords