跳到主要导航 跳到搜索 跳到主要内容

CMB: A Novel Structural Re-parameterization Block without Extra Training Parameters

  • Xi'an Jiaotong University

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

Structural re-parameterization is a raising field, which aims at improving the performance of convolutional neural networks (CNNs) through training an over-parameterization model and transferring it into a compact inference model. However, the performance improvements of prior structural re-parameterization works often come at the cost of heavy extra training resources, which increases carbon emissions and limits the potential applications on large-scale industrial tasks. To this end, first, we conduct experiments with a series of blocks composed of multiple identical branches to investigate the mechanism behind the structural re-parameterization, and then provide an interpretation. Moreover, motivated by the studies of effective receptive fields in the biological visual systems and neural networks, we propose a novel compact block named circular mask block (CMB). Given a neural network, we replace the regular convolutional layer with CMB to construct a training architecture, which can be trained to gain an accuracy boost with No extra training parameters and limited extra training FLOPs. After training, the training architecture can be transformed into the original architecture for inference. Extensive experiments are performed on CIFAR-10 and ImageNet to evaluate the effectiveness of our method. For example, we improve 0.85% top-1 accuracy of ResNet-50 on ImageNet without extra training parameters and only 11.32M extra training FLOPs, which saves 434x training FLOPs compared with prior works.

源语言英语
主期刊名2022 International Joint Conference on Neural Networks, IJCNN 2022 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728186719
DOI
出版状态已出版 - 2022
活动2022 International Joint Conference on Neural Networks, IJCNN 2022 - Padua, 意大利
期限: 18 7月 202223 7月 2022

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks
ISSN(印刷版)2161-4393
ISSN(电子版)2161-4407

会议

会议2022 International Joint Conference on Neural Networks, IJCNN 2022
国家/地区意大利
Padua
时期18/07/2223/07/22

学术指纹

探究 'CMB: A Novel Structural Re-parameterization Block without Extra Training Parameters' 的科研主题。它们共同构成独一无二的指纹。

引用此