TY - GEN
T1 - Scene categorization using boosted back-propagation neural networks
AU - Qian, Xueming
AU - Yan, Zhe
AU - Hang, Kaiyu
AU - Liu, Guizhong
AU - Wang, Huan
AU - Wang, Zhe
AU - Li, Zhi
PY - 2010
Y1 - 2010
N2 - Scene categorization plays an important role in computer vision, image content understanding, and image retrieval. In this paper, back-propagation neural network (BPN) is served as the basic classifier for multi-class scene/image categorization. Four features, namely, SPM (spatial pyramid appearance descriptor represented by scale invariant feature transform), PHOG (pyramid histogram of oriented gradient), GIST, and HWVP (hierarchical wavelet packet transform) are selected as the basic inputs of BPNs. They are the appearance, shape and texture descriptors respectively. For an M (M>2) classes scene categorization problem, we cascade M one-versus-all BPNs to determine the accurate label of an image. An offline multi-class Adaboost algorithm is proposed to fuse multiple BPN classifiers trained with complementary features to improve scene categorization performance. Experimental results on the widely used Scene-13 and Sport Event datasets show the effectiveness of the proposed boosted BPN based scene categorization approach. Scene categorization performances of BPN classifiers with input features: SPM, PHOG, GIST and HWVP, boosted BPN classifiers of each of the four features, and the boosted classifiers of all the four features are given. Relationships of boosted classifiers number and the scene categorization performance are also discussed. Comparisons with some existing scene categorization methods using the authors' datasets further show effectiveness of the proposed boosted BPN based approach.
AB - Scene categorization plays an important role in computer vision, image content understanding, and image retrieval. In this paper, back-propagation neural network (BPN) is served as the basic classifier for multi-class scene/image categorization. Four features, namely, SPM (spatial pyramid appearance descriptor represented by scale invariant feature transform), PHOG (pyramid histogram of oriented gradient), GIST, and HWVP (hierarchical wavelet packet transform) are selected as the basic inputs of BPNs. They are the appearance, shape and texture descriptors respectively. For an M (M>2) classes scene categorization problem, we cascade M one-versus-all BPNs to determine the accurate label of an image. An offline multi-class Adaboost algorithm is proposed to fuse multiple BPN classifiers trained with complementary features to improve scene categorization performance. Experimental results on the widely used Scene-13 and Sport Event datasets show the effectiveness of the proposed boosted BPN based scene categorization approach. Scene categorization performances of BPN classifiers with input features: SPM, PHOG, GIST and HWVP, boosted BPN classifiers of each of the four features, and the boosted classifiers of all the four features are given. Relationships of boosted classifiers number and the scene categorization performance are also discussed. Comparisons with some existing scene categorization methods using the authors' datasets further show effectiveness of the proposed boosted BPN based approach.
KW - Scene Categorization
KW - adaboost
KW - back-propagation neural network
UR - https://www.scopus.com/pages/publications/78049406088
U2 - 10.1007/978-3-642-15702-8_20
DO - 10.1007/978-3-642-15702-8_20
M3 - 会议稿件
AN - SCOPUS:78049406088
SN - 3642157017
SN - 9783642157011
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 215
EP - 226
BT - Advances in Multimedia Information Processing, PCM 2010 - 11th Pacific Rim Conference on Multimedia, Proceedings
T2 - 11th Pacific Rim Conference on Multimedia, PCM 2010
Y2 - 21 September 2010 through 24 September 2010
ER -