SoftwareX (May 2024)

ANDez: An open-source tool for author name disambiguation using machine learning

  • Jinseok Kim,
  • Jenna Kim

Journal volume & issue
Vol. 26
p. 101719

Abstract

Read online

Author name disambiguation in bibliographic data is challenging due to the same names of different authors and name variations of authors. Various machine learning (ML) methods address this, but a unified framework for comparing them is lacking. This study introduces ANDez, an open-source tool that integrates top-performing ML techniques for author name disambiguation. Developed in Python using popular ML libraries, ANDez provides a transparent system, merging complex procedures from different ML approaches. This promotes the assessment, modification, and benchmarking of ML techniques in author name disambiguation. ANDez's user-friendly design also helps researchers analyze ambiguous bibliographic data without needing advanced ML coding expertise.

Keywords