Robustifying Zero-Shot Vision Language Models by Subspaces Alignment

Title:
Robustifying Zero-Shot Vision Language Models by Subspaces Alignment
Conference Title:
ICCV 2025
DOI:
Keywords:
Publication Date:
Citation:
Dong, Junhao, et al. "Robustifying zero-shot vision language models by subspaces alignment." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2025.
Abstract:
Vision-Language Models (VLMs) enjoy strong zero-shot performance but are vulnerable to adversarial attacks, posing security risks. Adversarially robust fine-tuning enhances zero-shot robustness on new datasets while preserving the natural performance of pre-trained VLMs. However, prior methods rely on sample-wise adversarial fine-tuning, neglecting the underlying second-order statistics that characterize entire groups of samples. This leads to a feature-level discrepancy between clean and adversarial samples and their augmented variants. We therefore propose to represent groups of samples as subspaces that capture their distributions, turning traditional sample-wise adversarial fine-tuning into its distributional counterpart. For each image, we build distributions from (i) the clean sample with its augmentations and (ii) their adversarial counterparts. For text, we build distributions from (iii) a clean prompt with its synonymous prompts and (iv) their adversarial counterparts. We then align image subspaces with text subspaces, and also align "adversarial" subspaces toward "clean" subspaces. Consequently, all samples underlying these distributions (conceptually infinitely many) are aligned as well, leading to generalizable robustness. Evaluations on 15 datasets are provided.
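As a rough illustration of the subspace idea in the abstract (not the authors' implementation), a subspace can be built from the embeddings of a sample and its augmentations via SVD, and the alignment between two such subspaces can be measured with the projection metric derived from principal angles. The function names and the NumPy-based setup below are assumptions for illustration only.

```python
import numpy as np

def subspace_basis(features, k):
    """Return an orthonormal basis (d, k) spanning the top-k directions
    of a feature matrix (n_samples, d), e.g. embeddings of one clean
    sample plus its augmentations."""
    centered = features - features.mean(axis=0, keepdims=True)
    # Right singular vectors give the principal directions in feature space.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k].T  # columns are orthonormal

def subspace_distance(U, V):
    """Projection-metric distance between subspaces spanned by orthonormal
    bases U and V (both d x k): sqrt(k - ||U^T V||_F^2).
    It is 0 for identical subspaces and sqrt(k) for orthogonal ones."""
    k = U.shape[1]
    overlap = np.linalg.norm(U.T @ V, ord="fro") ** 2
    return float(np.sqrt(max(k - overlap, 0.0)))

# Toy usage: a "clean" subspace vs. a perturbed one.
rng = np.random.default_rng(0)
clean_feats = rng.normal(size=(8, 16))          # 1 sample + 7 augmentations
adv_feats = clean_feats + 0.1 * rng.normal(size=(8, 16))
U = subspace_basis(clean_feats, 3)
V = subspace_basis(adv_feats, 3)
gap = subspace_distance(U, V)  # a loss like this could pull V toward U
```

In a distributional fine-tuning scheme of the kind the abstract describes, a distance of this form (between "adversarial" and "clean" subspaces, and between image and text subspaces) would serve as an alignment objective, so that all samples drawn from the underlying distributions are aligned rather than individual pairs.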
License type:
Attribution 4.0 International (CC BY 4.0)
Funding Info:
This research/project is supported by the NRF - AI-based urban cooling technology development
Grant Reference no. : AISG3-TC-2024-014-SGKR
Description:
ISBN:
Dong_2025_ICCV