TY - JOUR
T1 - Contrastive feature-based learning-guided elevated deep reinforcement learning
T2 - Developing an imbalanced fault quantitative diagnosis under variable working conditions
AU - He, Shuilong
AU - Cui, Qianwen
AU - Chen, Jinglong
AU - Pan, Tongyang
AU - Hu, Chaofan
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/4/1
Y1 - 2024/4/1
N2 - Fault diagnosis is subject to the challenge of implementing model learning in the presence of small samples and imbalanced data (i.e., variable operating conditions), which is a fundamental and crucial problem that hinders their applications in real industrial scenarios. Herein, a novel deep reinforcement learning strategy (SIMC-PERDRL) that combines SimCLR and elevated prioritized experience replay (PER) is proposed for machinery fault quantitative diagnosis in non-ideal data scenarios. First, unsupervised contrastive learning pre-trains the feature extraction layer to mine optimal discriminative features with more optimal intra-class compactness and inter-class separability to reduce inter-class overlap. Second, the experience priority is quantified by reward and TD error to enhance the learning frequency of rare high-value samples; the reward function is skillfully constructed using adaptive unbalanced distribution, which immensely increases the agent's sensitivity to minorities, and enhances the model's domain adaptability by dynamically fine-tuning the agent's decision through real-time feedback. Moreover, ResNet utilizes the Convolutional Block Attention Module (CBAM) to construct a deep Q-network; thus, the agent's learning ability of critical fault features is enhanced. Finally, SIMC-PERDRL was validated online using three rotating machinery datasets. The results indicate that the method can automatically realize accurate qualitative identification under different rotational speeds, different loads, and class unbalanced conditions, with excellent effectiveness, stability, and versatility.
AB - Fault diagnosis is subject to the challenge of implementing model learning in the presence of small samples and imbalanced data (i.e., variable operating conditions), which is a fundamental and crucial problem that hinders their applications in real industrial scenarios. Herein, a novel deep reinforcement learning strategy (SIMC-PERDRL) that combines SimCLR and elevated prioritized experience replay (PER) is proposed for machinery fault quantitative diagnosis in non-ideal data scenarios. First, unsupervised contrastive learning pre-trains the feature extraction layer to mine optimal discriminative features with more optimal intra-class compactness and inter-class separability to reduce inter-class overlap. Second, the experience priority is quantified by reward and TD error to enhance the learning frequency of rare high-value samples; the reward function is skillfully constructed using adaptive unbalanced distribution, which immensely increases the agent's sensitivity to minorities, and enhances the model's domain adaptability by dynamically fine-tuning the agent's decision through real-time feedback. Moreover, ResNet utilizes the Convolutional Block Attention Module (CBAM) to construct a deep Q-network; thus, the agent's learning ability of critical fault features is enhanced. Finally, SIMC-PERDRL was validated online using three rotating machinery datasets. The results indicate that the method can automatically realize accurate qualitative identification under different rotational speeds, different loads, and class unbalanced conditions, with excellent effectiveness, stability, and versatility.
KW - Class imbalance
KW - Contrastive Learning
KW - Dueling Double Deep Q Network (D3QN)
KW - Fault quantitative diagnosis
KW - Prioritized experience replay
UR - https://www.scopus.com/pages/publications/85184478753
U2 - 10.1016/j.ymssp.2024.111192
DO - 10.1016/j.ymssp.2024.111192
M3 - 文章
AN - SCOPUS:85184478753
SN - 0888-3270
VL - 211
JO - Mechanical Systems and Signal Processing
JF - Mechanical Systems and Signal Processing
M1 - 111192
ER -