Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore

Al-Agha Iyad; Abed Ahmed

doi:10.22059/jitm.2020.303225.2535

Journal of Information Technology Management (Dec 2020)

Affiliations

Al-Agha Iyad: Associate Prof., Department of Computer Science, Faculty of Information Technology, The Islamic University of Gaza, Palestine.
Abed Ahmed: MSc, Department of Computer Science, Faculty of Information Technology, The Islamic University of Gaza, Gaza Strip, Palestine.

DOI: https://doi.org/10.22059/jitm.2020.303225.2535
Journal volume & issue: Vol. 12, no. 4
pp. 160 – 179

Abstract

The huge amount of data published on the Web has made web search more difficult, and it is sometimes hard to get the expected results, especially when users are less certain about their information needs. Several approaches have been proposed to support exploratory search on the Web by using query expansion, faceted search, or supplementary information extracted from external knowledge resources. However, these solutions have not been well explored for general web search in an open-domain setting. In addition, they mostly focus on supporting search over content expressed in English and Latin-based languages. In this research, we propose a fully automated approach that aims to support exploratory search over Arabic web content. It exploits the Arabic version of Wikipedia to extract complementary information that supports visual representation and deeper exploration of the search engine's results. Key Wikipedia entities are extracted from the text snippets produced by the search engine in response to the user's query. Entities are then filtered and ranked by a novel ranking algorithm that extends the conventional PageRank algorithm. Finally, a graph is built and presented to the user to visually represent highly ranked topics and their relationships. The proposed approach was realized by developing ArabXplore, a system that integrates with the web browser and supports the web search process by executing our approach at query time. The system was evaluated on a dataset of 100 Arabic search queries covering different domains, and its results were rated by human subjects. The underlying ranking algorithm was also compared with conventional PageRank.
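For readers unfamiliar with the baseline mentioned in the abstract, the following is a minimal sketch of conventional PageRank applied to a small entity graph. It is not the ranking algorithm proposed in the paper, whose extension is not described here. The graph construction rule (linking entities that appear in the same snippet), the function names `build_entity_graph` and `pagerank`, the damping factor, and the sample entities are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def build_entity_graph(snippet_entities):
    """Link two entities whenever they occur in the same snippet.

    This co-occurrence rule is an illustrative assumption, not the
    paper's method for relating Wikipedia entities.
    """
    graph = defaultdict(set)
    for entities in snippet_entities:
        unique = set(entities)
        for entity in unique:
            graph[entity]  # register the node even if it has no neighbors
        for a, b in combinations(sorted(unique), 2):
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def pagerank(graph, damping=0.85, iterations=50):
    """Conventional PageRank by power iteration on an undirected graph."""
    nodes = list(graph)
    n = len(nodes)
    if n == 0:
        return {}
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without neighbors is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not graph[node])
        new_rank = {}
        for node in nodes:
            # Each neighbor passes on an equal share of its current rank.
            incoming = sum(rank[nb] / len(graph[nb]) for nb in graph[node])
            new_rank[node] = (1 - damping) / n + damping * (incoming + dangling / n)
        rank = new_rank
    return rank


# Hypothetical entities extracted from three search-result snippets.
snippets = [
    ["Gaza", "Palestine"],
    ["Palestine", "Jerusalem"],
    ["Palestine", "Gaza", "Mediterranean Sea"],
]
entity_graph = build_entity_graph(snippets)
ranks = pagerank(entity_graph)
top_entity = max(ranks, key=ranks.get)
```

In this sketch the most connected entity receives the highest score. Sorting `ranks` by value would give the order in which topics could be emphasized in a visual graph.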

Published in Journal of Information Technology Management

ISSN: 2008-5893 (Print); 2423-5059 (Online)
Publisher: University of Tehran
Country of publisher: Iran, Islamic Republic of
LCC subjects: Bibliography. Library science. Information resources: Information resources (General)
Website: https://jitm.ut.ac.ir/
