On Phishing: Proposing a Traffic Behavior-Based Model to Detect, Prevent, and Classify Webpage Suspicious and Malicious Activities
Phishing is a criminal act in which a Phisher sends a well- counterfeit Webpage, using its Lexical, Host-Based, and Content (LHC) features, containing unseen security threats and stealthy attacks to unwary victims tickling them to... See more
Phishing is a criminal act in which a Phisher sends a well- counterfeit Webpage, using its Lexical, Host-Based, and Content (LHC) features, containing unseen security threats and stealthy attacks to unwary victims tickling them to disclose sensitive credentials such as financial data, address, etc. The Webpage will probably pass under anti-phishing techniques (APT) because they mainly focus on detecting and classifying Webpages as either Malicious or Benign, neglecting Webpage traffic behavior (TB). In this research, we propose the detection, prevention, and classification (DPC) of Webpages' (W) suspicious and malicious activities based on their TB model, namely DPC based on the B - WTB or L2 model, as the second line of defense against a classified Benign Webpage has passed under the APT line of defense undetected. The L2 model is encapsulated in a sandbox to avoid system failure and keep attacks from spreading around the network, which will classify L1 Webpages as Benign, Suspicious, or Malicious based on their TB when they attempt to access unauthorized resources. Using 10369 records from ISCX - URL2016 dataset, the L2 model achieves an accuracy of 90.07%, 91.85%, and 92.62%, using KNN, LR, and SVM machine learning algorithms. In addition, the implementation of the proposed L2 model shows a significant observation regarding classified Webpages' attempt to access restricted resources based on their maximum number of access violation attempts for each of the restricted resources and an accumulative number of access attempts over time for each violation access attempts on the restricted resources. The experimental results show the precision score, the recall score, and the F1 score for each model.