Iranian Journal of Information Processing & Management (Dec 2020)

Users clustering based on search behavior analysis using the LRFM model (case study: Iran scientific information database (Ganj))

  • Somayeh Fatahi,
  • Mohammad Rabiei

Journal volume & issue
Vol. 36, no. 2
pp. 419 – 442

Abstract

Read online

Iran scientific information database (Ganj) which includes almost one million scientific records provides the search opportunity in dissertations, domestic scientific journals, articles, conferences, research projects, and governmental reports. A large number of researchers meet the needs of their scientific and research resources from the Ganj database daily. Users’ needs and behaviors are variant and understanding it helps system administrators to use different strategies to manage the better databases and provide efficient services to users. One way to understand users’ needs is to cluster them based on their behavior and identify the features of each cluster. This study aims to cluster the users based on the analysis of their search behavior using the LRFM model. In this study, the search log data of Ganj users were collected for three months. In this research, the LRFM attributes were calculated, and then the K-means algorithm was applied to them. The optimal number of clusters was calculated based on different criteria. Based on customer value matrix, the results of customer clustering users in four groups are efficient, suspicious, unreliable, and intermittent and base on customer loyalty Marcus users categorizes in loyal, potential, insecure and newcomers.

Keywords