Loading [MathJax]/extensions/MathMenu.js
Phishing Detection System Through Hybrid Machine Learning Based on URL | IEEE Journals & Magazine | IEEE Xplore

Phishing Detection System Through Hybrid Machine Learning Based on URL


Cyber threat detection system based on Canopy Feature Selection with LR+SVC+DT (LSD) Ensemble Learning Model using Grid Search Hyperparameter tuning and Cross Fold Valida...

Abstract:

Currently, numerous types of cybercrime are organized through the internet. Hence, this study mainly focuses on phishing attacks. Although phishing was first used in 1996...Show More

Abstract:

Currently, numerous types of cybercrime are organized through the internet. Hence, this study mainly focuses on phishing attacks. Although phishing was first used in 1996, it has become the most severe and dangerous cybercrime on the internet. Phishing utilizes email distortion as its underlying mechanism for tricky correspondences, followed by mock sites, to obtain the required data from people in question. Different studies have presented their work on the precaution, identification, and knowledge of phishing attacks; however, there is currently no complete and proper solution for frustrating them. Therefore, machine learning plays a vital role in defending against cybercrimes involving phishing attacks. The proposed study is based on the phishing URL-based dataset extracted from the famous dataset repository, which consists of phishing and legitimate URL attributes collected from 11000+ website datasets in vector form. After preprocessing, many machine learning algorithms have been applied and designed to prevent phishing URLs and provide protection to the user. This study uses machine learning models such as decision tree (DT), linear regression (LR), random forest (RF), naive Bayes (NB), gradient boosting classifier (GBM), K-neighbors classifier (KNN), support vector classifier (SVC), and proposed hybrid LSD model, which is a combination of logistic regression, support vector machine, and decision tree (LR+SVC+DT) with soft and hard voting, to defend against phishing attacks with high accuracy and efficiency. The canopy feature selection technique with cross fold valoidation and Grid Search Hyperparameter Optimization techniques are used with proposed LSD model. Furthermore, to evaluate the proposed approach, different evaluation parameters were adopted, such as the precision, accuracy, recall, F1-score, and specificity, to illustrate the effects and efficiency of the models. The results of the comparative analyses demonstrate that the proposed approach outperf...
Cyber threat detection system based on Canopy Feature Selection with LR+SVC+DT (LSD) Ensemble Learning Model using Grid Search Hyperparameter tuning and Cross Fold Valida...
Published in: IEEE Access ( Volume: 11)
Page(s): 36805 - 36822
Date of Publication: 03 March 2023
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.