TY - GEN
T1 - Hardware implementation of reconfigurable separable convolution
AU - Rao, Lei
AU - Zhang, Bin
AU - Zhao, Jizhong
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/8/7
Y1 - 2018/8/7
N2 - Convolution operations occupy large amounts of computation resource in convolutional neural networks (CNNs). Separable convolution can greatly reduce computational complexity. Unfortunately, most trained kernels in CNNs are not separable. In this paper, least squares approach is applied to decompose a non-separable 2D kernel into two 1D kernels. A reconfigurable convolutional architecture is proposed to convert a 2D convolution into 1D convolution in convolutional layers. Moreover, a denoising CNN is mapped to the proposed convolution architecture. Experimental results show that the hardware architecture can restore a 1280 720 image in 0.83s, which achieves an 8.4 speed-up over GPU implementation. Verification experiments demonstrate that our approach and hardware architecture can drastically reduce the computational complexity in convolution operations without sacrificing the performance.
AB - Convolution operations occupy large amounts of computation resource in convolutional neural networks (CNNs). Separable convolution can greatly reduce computational complexity. Unfortunately, most trained kernels in CNNs are not separable. In this paper, least squares approach is applied to decompose a non-separable 2D kernel into two 1D kernels. A reconfigurable convolutional architecture is proposed to convert a 2D convolution into 1D convolution in convolutional layers. Moreover, a denoising CNN is mapped to the proposed convolution architecture. Experimental results show that the hardware architecture can restore a 1280 720 image in 0.83s, which achieves an 8.4 speed-up over GPU implementation. Verification experiments demonstrate that our approach and hardware architecture can drastically reduce the computational complexity in convolution operations without sacrificing the performance.
KW - Convolutional Neural Networks
KW - Hardware Implementation
KW - Reconfigurable Architecture
KW - Separable Convolution
UR - https://www.scopus.com/pages/publications/85052142174
U2 - 10.1109/ISVLSI.2018.00051
DO - 10.1109/ISVLSI.2018.00051
M3 - 会议稿件
AN - SCOPUS:85052142174
SN - 9781538670996
T3 - Proceedings of IEEE Computer Society Annual Symposium on VLSI, ISVLSI
SP - 232
EP - 237
BT - Proceedings - 2018 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2018
PB - IEEE Computer Society
T2 - 17th IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2018
Y2 - 9 July 2018 through 11 July 2018
ER -