Skip to main navigation Skip to search Skip to main content

High-level spatial modeling in convolutional neural network with application to pedestrian detection

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Convolutional neural network (CNN) has achieved great success in many vision tasks. A key to this success is its ability to powerful automatically learns both high-level and low-level features. In general, low-level features have a small size of receptive fields and appear multiple times in different locations of objects, while high-level semantic features have a relatively large size of receptive fields and only appear once in a specific location of objects. However, traditional CNN treats these two kinds of features in the same manner, i.e., learning them by the convolution operation, which can be approximately considered as cumulating the probabilities that a feature appears in different locations. This strategy is reasonable for low-level features but not for high-level semantic ones, especially in the case of pedestrian detection, where a local feature can be shared by different locations but a semantic part, e.g., a head, only appears once for a human. To jointly model the spatial structure and appearance of high-level semantic features, we propose a new module to learn spatially weighted max pooling in CNN. The proposed method is evaluated on several pedestrian detection databases and the experimental results show that it achieves much better performance than traditional CNN.

Original languageEnglish
Title of host publication2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering, CCECE 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages778-783
Number of pages6
EditionJune
ISBN (Electronic)9781479958276
DOIs
StatePublished - 19 Jun 2015
Externally publishedYes
Event2015 28th IEEE Canadian Conference on Electrical and Computer Engineering, CCECE 2015 - Halifax, Canada
Duration: 3 May 20156 May 2015

Publication series

NameCanadian Conference on Electrical and Computer Engineering
NumberJune
Volume2015-June
ISSN (Print)0840-7789

Conference

Conference2015 28th IEEE Canadian Conference on Electrical and Computer Engineering, CCECE 2015
Country/TerritoryCanada
CityHalifax
Period3/05/156/05/15

Fingerprint

Dive into the research topics of 'High-level spatial modeling in convolutional neural network with application to pedestrian detection'. Together they form a unique fingerprint.

Cite this