Cybernetics and Information Technologies (Jun 2016)

Duplicate Literature Detection for Cross-Library Search

  • Liu Wei,
  • Zeng Jianxun

DOI
https://doi.org/10.1515/cait-2016-0028
Journal volume & issue
Vol. 16, no. 2
pp. 160 – 178

Abstract

The proliferation of online digital libraries gives users a great opportunity to search the Web for the literature they need. Cross-library search applications help users retrieve more literature by querying multiple digital libraries at once. Because digital libraries are heterogeneous and autonomous, duplicate literature detection is a necessary step when merging search results from multiple libraries. To this end, this paper proposes a holistic solution comprising automatic acquisition of the training set, holistic attribute mapping, and attribute weight training. Experiments on real digital libraries show that the proposed solution is highly effective.
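
As a rough illustration of how attribute weights can drive duplicate detection, the sketch below scores two already-mapped records by a weighted average of per-attribute string similarities. It is not the authors' method: the attribute names, the hand-picked weights, the `0.85` threshold, and the function names are all assumptions made for this example, whereas the abstract describes weights obtained through training.

```python
from difflib import SequenceMatcher

# Hypothetical weights chosen by hand for illustration only; the abstract
# describes training attribute weights rather than fixing them like this.
WEIGHTS = {"title": 0.6, "authors": 0.3, "year": 0.1}


def attribute_similarity(x: str, y: str) -> float:
    """String similarity in [0, 1], ignoring case and surrounding whitespace."""
    return SequenceMatcher(None, x.strip().lower(), y.strip().lower()).ratio()


def record_similarity(a: dict, b: dict, weights: dict = WEIGHTS) -> float:
    """Weighted average of per-attribute similarities over the given attributes."""
    total = sum(weights.values())
    score = sum(
        w * attribute_similarity(a.get(name, ""), b.get(name, ""))
        for name, w in weights.items()
    )
    return score / total


def is_duplicate(a: dict, b: dict, threshold: float = 0.85) -> bool:
    """Treat two records as duplicates when their weighted score reaches the threshold."""
    return record_similarity(a, b) >= threshold
```

Two search results would first need their attributes mapped to a shared schema (here `title`, `authors`, `year`) before `is_duplicate` can compare them; the threshold trades missed duplicates against false merges and would normally be tuned on labelled pairs.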

Keywords