Skip to main navigation Skip to search Skip to main content

Hardware implementation of reconfigurable separable convolution

  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Convolution operations occupy large amounts of computation resource in convolutional neural networks (CNNs). Separable convolution can greatly reduce computational complexity. Unfortunately, most trained kernels in CNNs are not separable. In this paper, least squares approach is applied to decompose a non-separable 2D kernel into two 1D kernels. A reconfigurable convolutional architecture is proposed to convert a 2D convolution into 1D convolution in convolutional layers. Moreover, a denoising CNN is mapped to the proposed convolution architecture. Experimental results show that the hardware architecture can restore a 1280 720 image in 0.83s, which achieves an 8.4 speed-up over GPU implementation. Verification experiments demonstrate that our approach and hardware architecture can drastically reduce the computational complexity in convolution operations without sacrificing the performance.

Original languageEnglish
Title of host publicationProceedings - 2018 IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2018
PublisherIEEE Computer Society
Pages232-237
Number of pages6
ISBN (Print)9781538670996
DOIs
StatePublished - 7 Aug 2018
Event17th IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2018 - Hong Kong, Hong Kong
Duration: 9 Jul 201811 Jul 2018

Publication series

NameProceedings of IEEE Computer Society Annual Symposium on VLSI, ISVLSI
Volume2018-July
ISSN (Print)2159-3469
ISSN (Electronic)2159-3477

Conference

Conference17th IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2018
Country/TerritoryHong Kong
CityHong Kong
Period9/07/1811/07/18

Keywords

  • Convolutional Neural Networks
  • Hardware Implementation
  • Reconfigurable Architecture
  • Separable Convolution

Fingerprint

Dive into the research topics of 'Hardware implementation of reconfigurable separable convolution'. Together they form a unique fingerprint.

Cite this