Abstract
Compared with avid research activities of deep convolutional neural networks (DCNNs) in practice, the study of theoretical behaviors of DCNNs lags heavily behind. In particular, the universal consistency of DCNNs remains open. In this paper, we prove that implementing empirical risk minimization on DCNNs with expansive convolution (with zero-padding) is strongly universally consistent. Motivated by the universal consistency, we conduct a series of experiments to show that without any fully connected layers, DCNNs with expansive convolution perform not worse than the widely used deep neural networks with hybrid structure containing contracting (without zero-padding) convolutional layers and several fully connected layers.
| Original language | English |
|---|---|
| Pages (from-to) | 4610-4617 |
| Number of pages | 8 |
| Journal | IEEE Transactions on Information Theory |
| Volume | 68 |
| Issue number | 7 |
| DOIs | |
| State | Published - 1 Jul 2022 |
Keywords
- Deep learning
- convolutional neural networks
- universal consistency