Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks
- 1 Department of Computer Science, Bahauddin Zakariya University, Multan, Pakistan
- 2 Department of Computer Science, Muhammad Nawaz Shareef University of Agriculture, Multan, Pakistan
- 3 Department of Computer Science, National University of Modern Languages, Islamabad, Pakistan
- 4 Department of Computer Science, Kansas State University, Manhattan, KS, United States
- 5 Department of Electrical and Computer Engineering, Comsats University Islamabad, Islamabad Campus, Pakistan
- 6 Department of Network and Computer Security, University of New York Polytechnic Institute, Utica, United States
- 7 Department of Computer Science and Engineering, University of Alaska Anchorage, Anchorage, United States
Abstract
The internet has become an essential part of many fields: Communication, entertainment, commerce, industrial production, agriculture, etc. Unfortunately, online users are vulnerable to various attacks; this could lead to financial damages and loss of personal information. Phishing is seen as an internet threat and a cybercrime where anyone can capture personal information and data by posing as a reliable source. Data may include passwords to access confidential private or industrial repositories, emails, banks, financial information, etc. The prediction task is one of the crucial aspects of modern security systems, including anti-virus, firewall, and anti-spyware software. Currently, there is no availability of a single technique that can effectively detect every phishing attack. This study proposes a novel intelligent approach, phishing Prediction, using machine learning and deep learning to accurately predict phishing websites. We apply a pre-processing pipeline and develop the model using four machine learning models namely decision tree, Naive Bayes, support vector machine random forest, and Convolutional Neural Network (CNN) as a deep learning model. The UCI machine learning repository dataset comprised 11,055 websites, including lists of 4898 phishing and 6157 legitimate websites. The multilayered CNN has achieved the highest accuracy of 99.1% among all the listed algorithms, showcasing a precision of 97, a recall of 96%, and an F1-score of 96%.
DOI: https://doi.org/10.3844/jcssp.2024.1069.1079
Copyright: © 2024 Hadia Bibi, Syed Rehan Shah, Mirza Murad Baig, Muhammad Imran Sharif, Mehwish Mehmood, Zahid Akhtar and Kamran Siddique. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 1,006 Views
- 491 Downloads
- 0 Citations
Download
Keywords
- Machine Learning
- Deep Learning
- Phishing Website Prediction
- Classification
- CNN