IEEE Access (Jan 2019)
SwICS: Section-Wise In-Text Citation Score
Abstract
Since past several years, finding relevant documents from plethora of web repositories has become prime attention of the scientific community. To find out relevant research articles, state-of-the-art techniques employ content, metadata, citations, and collaborative filtering based approaches. Among all of them, citation based approaches hold strong potential because most of the time, authors cite relevant papers. Bibliographic coupling is one of the well-known citation based approaches for recommending relevant papers. In this paper, we present an approach SwICS that harnesses number of common references between pair of documents as similarity measure whereas the distribution of in-text citations within the text are not analyzed. The proposed approach explores the in-text citation frequencies within contents of the paper and in-text citation patterns between different logical sections for bibliographically coupled papers. For evaluation, the employed data set contains 1150 research documents are obtained from a well-known autonomous citation index known as: CiteSeer. A comprehensive user study is conducted to build a gold standard for comparing the proposed approach. The approach is compared with the state-of-the-art bibliographic coupling and content similarity based techniques. The comparison results revealed that proposed approach significantly performs better than the contemporary approaches. The comparison result with gold standard yielded an average of 0.73. The average gain achieved by the proposed approach is 60% from state-of-the-art: bibliographic coupling. Whereas, the correlation between gold standard and content based approach remains 20%. The proposed approach can play a significant role for search engines and citation indexers in terms of improving the quality of their results.
Keywords