SoftwareX (May 2024)
ANDez: An open-source tool for author name disambiguation using machine learning
Abstract
Author name disambiguation in bibliographic data is challenging due to the same names of different authors and name variations of authors. Various machine learning (ML) methods address this, but a unified framework for comparing them is lacking. This study introduces ANDez, an open-source tool that integrates top-performing ML techniques for author name disambiguation. Developed in Python using popular ML libraries, ANDez provides a transparent system, merging complex procedures from different ML approaches. This promotes the assessment, modification, and benchmarking of ML techniques in author name disambiguation. ANDez's user-friendly design also helps researchers analyze ambiguous bibliographic data without needing advanced ML coding expertise.