Applied Sciences (Dec 2023)
Name Disambiguation Scheme Based on Heterogeneous Academic Sites
Abstract
Academic researchers publish their work in various formats, such as papers, patents, and research reports, on different academic sites. When searching for a particular researcher’s work, it can be challenging to pinpoint the right individual, especially when there are multiple researchers with the same name. In order to handle this issue, we propose a name disambiguation scheme for researchers with the same name based on heterogeneous academic sites. The proposed scheme collects and integrates research results from these varied academic sites, focusing on attributes crucial for disambiguation. It then employs clustering techniques to identify individuals who share the same name. Additionally, we implement the proposed rule-based algorithm name disambiguation method and the existing deep learning-based identification method. This approach allows for the selection of the most accurate disambiguation scheme, taking into account the metadata available in the academic sites, using a multi-classifier approach. We consider various researchers’ achievements and metadata of articles registered in various academic search sites. The proposed scheme showed an exceptionally high F1-measure value of 0.99. In this paper, we propose a multi-classifier that executes the most appropriate disambiguation scheme depending on the inputted metadata. The proposed multi-classifier shows the high F1-measure value of 0.67.
Keywords