跳到主要导航 跳到搜索 跳到主要内容

Towards Gradient-Based Saliency Consensus Training for Adversarial Robustness

  • Xi'an Jiaotong University

科研成果: 期刊稿件文章同行评审

6 引用 (Scopus)

摘要

In recent works, robust networks have consistently exhibited more discriminative saliency map that proves to indicate sufficient adversarial robustness. In existed safe training paradigms e.g., adversarial training, however, the progressive saliency information regarding on what input semantic feature model prediction relies, have not yet been fully-explored. Due to this, we consider the incorporation of posterior saliency properties of robust model in training, as an efficient supervision signal on robust learning. It thus provides an alternative direction to enhance robustness, from the saliency interpretability perspective. In this article, to harden model we propose to optimize the discrimination of intermediate gradient-based saliency and maintain its consensus in training, which encourage model to behave according to task-relevant feature from the salient region such as object edges in image. Then, we introduce Adversarially Gradient-based Saliency Consensus Training method, dubbed Adv-GSCT. Within it, we preserve the similarity between the learned model saliency and the target one as label, approximated in the most offending case representing the least but essential information scenario. Meanwhile, a constructed pseudo-input coupled with feature importance, is feed into model to ensure the discrimination of estimated target saliency. Besides providing a novel insight into adversarial defense, Adv-GSCT differs from the current most effective adversarial training and does not need multiple iterative generations of adversarial perturbation whose computational cost and sensitivity direction of prediction concern. Finally, extensive performance evaluations on MNIST, CIFAR-10 and ImageNet datasets demonstrate the superiority of our proposed method.

源语言英语
页(从-至)530-541
页数12
期刊IEEE Transactions on Dependable and Secure Computing
21
2
DOI
出版状态已出版 - 1 3月 2024

学术指纹

探究 'Towards Gradient-Based Saliency Consensus Training for Adversarial Robustness' 的科研主题。它们共同构成独一无二的指纹。

引用此