Abstract
Multiple Convolutional Neural Networks (CNNs) become widely used in modern AI systems. There is increasingly necessity to apply different CNN shapes for different scenarios. However, it also brings challenges on throughput, energy efficiency and flexibility to hardware. In this paper, a novel accelerator, called reconfigurable neural accelerator (RNA), was proposed based on reconfigurable computing technology. In addition, image row broadcast (IRB) and zero detection technology (ZDT) were applied for increased energy efficiency and throughput. IRB can optimize the convolutional dataflow on spatial array architecture with 22×22 processing elements, increasing data reuse and reducing data movement. ZDT reduces the weight data access of the fully connected layer. At the cost of 10.25W power consumption on Virtex UltraScale XCVU440 platform, RNA can process the convolutional layers at 97.4 GOPS for AlexNet, at 90.75GOPS for VGG and at 100.8 GOPS for Lenet-5, respectively.
| Original language | English |
|---|---|
| Title of host publication | 2018 14th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2018 - Proceedings |
| Editors | Ting-Ao Tang, Fan Ye, Yu-Long Jiang |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781538644409 |
| DOIs | |
| State | Published - 5 Dec 2018 |
| Event | 14th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2018 - Qingdao, China Duration: 31 Oct 2018 → 3 Nov 2018 |
Publication series
| Name | 2018 14th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2018 - Proceedings |
|---|
Conference
| Conference | 14th IEEE International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2018 |
|---|---|
| Country/Territory | China |
| City | Qingdao |
| Period | 31/10/18 → 3/11/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- CNN
- Image Row Broadcast dataflow
- Reconfigurable computing
- Zero Detection Technology
Fingerprint
Dive into the research topics of 'An Energy-Efficient and Flexible Accelerator based on Reconfigurable Computing for Multiple Deep Convolutional Neural Networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver