From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks

Title:
From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks
Journal Title:
Digital Threats: Research and Practice
Publication Date:
23 May 2025
Citation:
Kulkarni, A., Balachandran, V., Divakaran, D. M., & Das, T. (2025). From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks. Digital Threats: Research and Practice. https://doi.org/10.1145/3737295
Abstract:
Phishing attacks deceive users into revealing sensitive information, posing a significant cybersecurity threat. Advances in machine learning (ML) and deep learning (DL) have led to the development of numerous phishing webpage detection solutions, but these models remain vulnerable to adversarial attacks. Evaluating their robustness against adversarial phishing webpages is therefore essential. Existing tools contain datasets of pre-designed phishing webpages for a limited number of brands and lack diversity in phishing features. To address these challenges, we develop PhishOracle, a tool that generates adversarial phishing webpages by embedding diverse phishing features into legitimate webpages. We evaluate the robustness of three existing task-specific models (Stack model, VisualPhishNet, and Phishpedia) against PhishOracle-generated adversarial phishing webpages and observe a significant drop in their detection rates. In contrast, a multimodal large language model (MLLM)-based phishing detector demonstrates stronger robustness against these adversarial attacks but is still prone to evasion. Our findings highlight the vulnerability of phishing detection models to adversarial attacks, emphasizing the need for more robust detection approaches. Furthermore, we conduct a user study to evaluate whether PhishOracle-generated adversarial phishing webpages can deceive users. The results show that many of these phishing webpages evade not only existing detection models but also users.
License type:
Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
Funding Info:
There was no specific funding for this research.
Description:
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. © 2025 Copyright held by the owner/author(s).
ISSN:
2576-5337