Universal Consistency of Deep Convolutional Neural Networks

Research output: Contribution to journal › Article › peer-review

18 Scopus citations

Abstract

Compared with the intense research activity on deep convolutional neural networks (DCNNs) in practice, the study of their theoretical behavior lags far behind. In particular, the universal consistency of DCNNs remains an open question. In this paper, we prove that implementing empirical risk minimization on DCNNs with expansive convolution (with zero-padding) is strongly universally consistent. Motivated by this consistency result, we conduct a series of experiments showing that, without any fully connected layers, DCNNs with expansive convolution perform no worse than the widely used deep neural networks with a hybrid structure containing contracting (without zero-padding) convolutional layers and several fully connected layers.
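The distinction between the two convolution types named in the abstract can be made concrete. Below is a minimal PyTorch sketch, not the authors' code; the input length, filter length, and variable names are illustrative assumptions. It shows that a zero-padded ("expansive") 1-D convolution lengthens its input, while an unpadded ("contracting") one shortens it.

```python
# Minimal sketch (illustrative only): expansive vs. contracting 1-D convolution.
import torch
import torch.nn as nn

d, s = 16, 3                # hypothetical input length d and filter length s
x = torch.randn(1, 1, d)    # a batch of one single-channel signal

# Expansive convolution: full zero-padding (p = s - 1), output length d + s - 1.
expansive = nn.Conv1d(1, 1, kernel_size=s, padding=s - 1)
print(expansive(x).shape)   # torch.Size([1, 1, 18])

# Contracting convolution: no padding, output length d - s + 1.
contracting = nn.Conv1d(1, 1, kernel_size=s, padding=0)
print(contracting(x).shape) # torch.Size([1, 1, 14])
```

With padding s - 1, each layer's output grows by s - 1 entries, which is the expansive structure the abstract refers to; with no padding, each layer shrinks its input, which is the contracting structure typically followed by fully connected layers.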

Original language: English
Pages (from-to): 4610-4617
Number of pages: 8
Journal: IEEE Transactions on Information Theory
Volume: 68
Issue number: 7
DOIs
State: Published - 1 Jul 2022

Keywords

  • Deep learning
  • convolutional neural networks
  • universal consistency
