Development of anti-phishing browser based on random forest and rule of extraction framework

Mohith Gowda HR; Adithya MV; Gunesh Prasad S; Vinay S

doi:10.1186/s42400-020-00059-1

Cybersecurity (Oct 2020)

Affiliations

Mohith Gowda HR: B.E in Computer Science and Engineering, PES College of Engineering
Adithya MV: B.E in Computer Science and Engineering, PES College of Engineering
Gunesh Prasad S: B.E in Computer Science and Engineering, PES College of Engineering
Vinay S: Information Science and Engineering, PES College of Engineering

DOI: https://doi.org/10.1186/s42400-020-00059-1
Journal volume & issue: Vol. 3, no. 1
pp. 1 – 14

Abstract

Phishing is a social engineering attack widely used to obtain sensitive user information such as login credentials and credit and debit card details. It is carried out by an attacker masquerading as a legitimate individual. Various anti-phishing techniques have been developed to protect web users from these attacks, but they still fail to protect users in various ways. In this paper, we propose a novel technique that identifies phishing websites on the client side by means of a novel browser architecture. The system uses the rule of extraction framework to extract the properties, or features, of a website from its URL alone. The resulting list consists of 30 different properties of a URL, which are then used by a Random Forest classification model to determine the authenticity of the website. A dataset of 11,055 tuples is used to train the model. These processes are carried out on the client side with the help of a redesigned browser architecture. Researchers have developed machine learning frameworks to detect phishing sites, but these are not in a form that individuals without technical knowledge can use. To make such tools accessible to everyone, we have introduced detection methods into a browser architecture named the ‘Embedded Phishing Detection Browser’ (EPDB), a novel method that preserves the existing user experience while improving security. The newly designed browser architecture introduces a special segment that performs phishing detection in real time. We have prototyped this technique to improve security; it achieved an accuracy of 99.36% in identifying phishing websites in real time.
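The URL-only feature extraction described in the abstract can be illustrated with a minimal sketch. The abstract does not list the 30 properties or their rules, so the six features below, their names, the numeric thresholds, and the -1/0/1 encoding (phishing / suspicious / legitimate) are all assumptions chosen for illustration, not the authors' actual rule set. In a full pipeline, a vector like this would be passed to a trained Random Forest classifier.

```python
import ipaddress
from urllib.parse import urlparse


def _host_is_ip(host):
    """Return True if the host part is a literal IP address."""
    try:
        ipaddress.ip_address(host)
        return True
    except ValueError:
        return False


def extract_features(url):
    """Map a URL to rule-based features encoded as -1, 0, or 1.

    All rules and thresholds here are illustrative assumptions.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""

    features = {}
    # A raw IP address instead of a domain name is treated as suspicious.
    features["ip_in_host"] = -1 if _host_is_ip(host) else 1
    # Very long URLs can hide the real destination (assumed cut-offs).
    length = len(url)
    features["url_length"] = 1 if length < 54 else (0 if length <= 75 else -1)
    # Text before '@' can be used to disguise the real host.
    features["at_symbol"] = -1 if "@" in url else 1
    # Hyphens in the host are often used to imitate known brands.
    features["hyphen_in_host"] = -1 if "-" in host else 1
    # Count the dots after dropping a leading "www." to estimate subdomain depth.
    bare_host = host[4:] if host.startswith("www.") else host
    dots = bare_host.count(".")
    features["subdomain_depth"] = 1 if dots <= 1 else (0 if dots == 2 else -1)
    # Plain HTTP is scored as less trustworthy than HTTPS.
    features["uses_https"] = 1 if parsed.scheme == "https" else -1
    return features
```

For example, `extract_features("http://192.168.0.1/login")` flags both `ip_in_host` and `uses_https` as -1 under these assumed rules, while an ordinary short HTTPS URL on a two-label domain scores 1 on every feature.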

Published in Cybersecurity

ISSN: 2523-3246 (Online)
Publisher: SpringerOpen
Country of publisher: Singapore
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://cybersecurity.springeropen.com/
