无需触发器与辅助数据集的模型后门攻击

Translated title of the contribution: Trigger-free and data-free backdoor attacks on deep neural networks
  • Jiahao Wang
  • , Xianglong Zhang
  • , Huanle Zhang
  • , Xiaobo Ma
  • , Xiuzhen Cheng
  • , Pengfei Hu
  • , Guoming Zhang

Research output: Contribution to journalArticlepeer-review

Abstract

With the rapid deployment of deep neural networks (DNNs) across critical application domains, backdoor attacks have emerged as a significant security threat. However, most existing methods rely on access to the target model's original training data and require explicit triggers to activate malicious behavior, which limits their practicality and compromises stealth.This paper proposes a novel trigger-free and data-free backdoor attack framework that enhances both the practicality and concealment of attacks. Our approach leverages a fine-tuning strategy to embed the semantics of malicious data into the feature space of an attacker-specified target class, enabling adversarial samples to be misclassified consistently without any visible trigger.To preserve the model's performance on clean inputs, we incorporate a knowledge distillation mechanism in place of the original training data and design an elastic weight consolidation-based parameter importance estimation method to guide the injection process.Extensive experiments conducted on three real-world benchmark datasets demonstrate the effectiveness, stealthiness, and real-world feasibility of the proposed method. Additionally, we explore the potential of auxiliary data and model inversion techniques in further enhancing attack success.

Translated title of the contributionTrigger-free and data-free backdoor attacks on deep neural networks
Original languageChinese (Traditional)
Pages (from-to)2798-2816
Number of pages19
JournalScientia Sinica Informationis
Volume55
Issue number11
DOIs
StatePublished - 1 Nov 2025

Fingerprint

Dive into the research topics of 'Trigger-free and data-free backdoor attacks on deep neural networks'. Together they form a unique fingerprint.

Cite this