Practical Detection of Click Spams Using Efficient Classification-Based Algorithms

Mahdieh Fallah; Sajjad Zarifzadeh

International Journal of Information and Communication Technology Research (Jun 2018)

Practical Detection of Click Spams Using Efficient Classification-Based Algorithms

Mahdieh Fallah,
Sajjad Zarifzadeh

Affiliations

Mahdieh Fallah: Department. of Computer Engineering Yazd University Yazd, Iran
Sajjad Zarifzadeh: Department of Computer Engineering Yazd University Yazd, Iran

Journal volume & issue: Vol. 10, no. 2
pp. 63 – 71

Abstract

Read online

Most of today’s Internet services utilize user feedback (e.g. clicks) to improve the quality of their services. For example, search engines use click information as a key factor in document ranking. As a result, some websites cheat to get a higher rank by fraudulently absorbing clicks to their pages. This phenomenon, known as “Click Spam”, is initiated by programs called “Click Bot”. The problem of distinguishing bot-generated traffic from the user traffic is critical for the viability of Internet services, like search engines. In this paper, we propose a novel classification-based system to effectively identify fraudulent clicks in a practical manner. We first model user sessions with three different levels of features, i.e. session-based, user-based and IP-based features. Then, we classify sessions with two different methods: a one-class and a two-class classification that both work based on the well-known K-Nearest Neighbor algorithm. Finally, we analyze our methods with the real log of a Persian search engine. Experimental results show that the proposed algorithms can detect fraudulent clicks with a precision of up to 96% which outperform the previous works by more than 5%.

Published in International Journal of Information and Communication Technology Research

ISSN: 2251-6107 (Print); 2783-4425 (Online)
Publisher: Iran Telecom Research Center
Country of publisher: Iran, Islamic Republic of
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ijict.itrc.ac.ir/

About the journal

Abstract

Keywords