TY - JOUR
T1 - Accelerating Very Deep Convolutional Networks for Classification and Detection
AU - Zhang, Xiangyu
AU - Zou, Jianhua
AU - He, Kaiming
AU - Sun, Jian
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2016/10/1
Y1 - 2016/10/1
N2 - This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs [1] that have substantially impacted the computer vision community. Unlike previous methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We develop an effective solution to the resulting nonlinear optimization problem without the need of stochastic gradient descent (SGD). More importantly, while previous methods mainly focus on optimizing one or two layers, our nonlinear method enables an asymmetric reconstruction that reduces the rapidly accumulated error when multiple (e.g., ≥ 10) layers are approximated. For the widely used very deep VGG-16 model [1] , our method achieves a whole-model speedup of 4× with merely a 0.3 percent increase of top-5 error in ImageNet classification. Our 4× accelerated VGG-16 model also shows a graceful accuracy degradation for object detection when plugged into the Fast R-CNN detector [2].
AB - This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs [1] that have substantially impacted the computer vision community. Unlike previous methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We develop an effective solution to the resulting nonlinear optimization problem without the need of stochastic gradient descent (SGD). More importantly, while previous methods mainly focus on optimizing one or two layers, our nonlinear method enables an asymmetric reconstruction that reduces the rapidly accumulated error when multiple (e.g., ≥ 10) layers are approximated. For the widely used very deep VGG-16 model [1] , our method achieves a whole-model speedup of 4× with merely a 0.3 percent increase of top-5 error in ImageNet classification. Our 4× accelerated VGG-16 model also shows a graceful accuracy degradation for object detection when plugged into the Fast R-CNN detector [2].
KW - Convolutional neural networks
KW - acceleration
KW - image classification
KW - object detection
UR - https://www.scopus.com/pages/publications/84986325670
U2 - 10.1109/TPAMI.2015.2502579
DO - 10.1109/TPAMI.2015.2502579
M3 - 文章
AN - SCOPUS:84986325670
SN - 0162-8828
VL - 38
SP - 1943
EP - 1955
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 10
M1 - 7332968
ER -