Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks

Hadia Bibi; Syed Rehan Shah; Mirza Murad Baig; Muhammad Imran Sharif; Mehwish Mehmood; Zahid Akhtar; Kamran Siddique

doi:10.3844/jcssp.2024.1069.1079

Research Article Open Access

Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks

Hadia Bibi¹, Syed Rehan Shah², Mirza Murad Baig³, Muhammad Imran Sharif⁴, Mehwish Mehmood⁵, Zahid Akhtar⁶ and Kamran Siddique⁷

¹ Department of Computer Science, Bahauddin Zakariya University, Multan, Pakistan
² Department of Computer Science, Muhammad Nawaz Shareef University of Agriculture, Multan, Pakistan
³ Department of Computer Science, National University of Modern Languages, Islamabad, Pakistan
⁴ Department of Computer Science, Kansas State University, Manhattan, KS, United States
⁵ Department of Electrical and Computer Engineering, Comsats University Islamabad, Islamabad Campus, Pakistan
⁶ Department of Network and Computer Security, University of New York Polytechnic Institute, Utica, United States
⁷ Department of Computer Science and Engineering, University of Alaska Anchorage, Anchorage, United States

Abstract

The internet has become an essential part of many fields: Communication, entertainment, commerce, industrial production, agriculture, etc. Unfortunately, online users are vulnerable to various attacks; this could lead to financial damages and loss of personal information. Phishing is seen as an internet threat and a cybercrime where anyone can capture personal information and data by posing as a reliable source. Data may include passwords to access confidential private or industrial repositories, emails, banks, financial information, etc. The prediction task is one of the crucial aspects of modern security systems, including anti-virus, firewall, and anti-spyware software. Currently, there is no availability of a single technique that can effectively detect every phishing attack. This study proposes a novel intelligent approach, phishing Prediction, using machine learning and deep learning to accurately predict phishing websites. We apply a pre-processing pipeline and develop the model using four machine learning models namely decision tree, Naive Bayes, support vector machine random forest, and Convolutional Neural Network (CNN) as a deep learning model. The UCI machine learning repository dataset comprised 11,055 websites, including lists of 4898 phishing and 6157 legitimate websites. The multilayered CNN has achieved the highest accuracy of 99.1% among all the listed algorithms, showcasing a precision of 97, a recall of 96%, and an F1-score of 96%.

Journal of Computer Science

Volume 20 No. 9, 2024, 1069-1079

DOI: https://doi.org/10.3844/jcssp.2024.1069.1079

Submitted On: 26 February 2024 Published On: 6 July 2024

How to Cite: Bibi, H., Shah, S. R., Baig, M. M., Sharif, M. I., Mehmood, M., Akhtar, Z. & Siddique, K. (2024). Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks. Journal of Computer Science, 20(9), 1069-1079. https://doi.org/10.3844/jcssp.2024.1069.1079

Copyright: © 2024 Hadia Bibi, Syed Rehan Shah, Mirza Murad Baig, Muhammad Imran Sharif, Mehwish Mehmood, Zahid Akhtar and Kamran Siddique. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

1,006 Views
491 Downloads
0 Citations

Download

Keywords

Machine Learning
Deep Learning
Phishing Website Prediction
Classification
CNN