MULTI-ATTENTION ENHANCED DISCRIMINATOR FOR GAN-BASED ANOMALOUS SOUND DETECTION

Research output: Contribution to journalConference articlepeer-review

7 Scopus citations

Abstract

Generative adversarial networks (GAN) have been regarded as promising for anomalous sound detection (ASD) by training an unsupervised one-class classifier to pick out the anomalous sample. Existing GAN-based anomaly detection models usually focus on the generator to reduce the reconstruction error. The generator even reconstructs anomalous samples effectively without learning their features, which increases the burden on the discriminator to differentiate between original and reconstructed samples. In this paper, we propose a multi-attention enhanced discriminator for GAN-based ASD named EDGAN. It integrates attention mechanisms from multiple dimensions into the discriminator to make it more effective. Besides, we incorporate the reconstruction error to devise a new abnormal score for efficient anomaly assessment. We conducted extensive experiments on the MIMII dataset, which illustrates the average AUC 8.29% improvement compared to the state-of-the-art methods.

Original languageEnglish
Pages (from-to)6715-6719
Number of pages5
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Keywords

  • Anomalous Sound Detection
  • Anomaly Score
  • Generative Adversarial Network

Fingerprint

Dive into the research topics of 'MULTI-ATTENTION ENHANCED DISCRIMINATOR FOR GAN-BASED ANOMALOUS SOUND DETECTION'. Together they form a unique fingerprint.

Cite this