TY - JOUR
T1 - Detection of salient objects with focused attention based on spatial and temporal coherence
AU - Wu, Yang
AU - Zheng, Nan Ning
AU - Yuan, Ze Jian
AU - Jiang, Huai Zu
AU - Liu, Tie
PY - 2011/4
Y1 - 2011/4
N2 - The understanding and analysis of video content are fundamentally important for numerous applications, including video summarization, retrieval, navigation, and editing. An important part of this process is to detect salient (which usually means important and interesting) objects in video segments. Unlike existing approaches, we propose a method that combines the saliency measurement with spatial and temporal coherence. The integration of spatial and temporal coherence is inspired by the focused attention in human vision. In the proposed method, the spatial coherence of low-level visual grouping cues (e. g. appearance and motion) helps per-frame object-background separation, while the temporal coherence of the object properties (e. g. shape and appearance) ensures consistent object localization over time, and thus the method is robust to unexpected environment changes and camera vibrations. Having developed an efficient optimization strategy based on coarse-to-fine multi-scale dynamic programming, we evaluate our method using a challenging dataset that is freely available together with this paper. We show the effectiveness and complementariness of the two types of coherence, and demonstrate that they can significantly improve the performance of salient object detection in videos.
AB - The understanding and analysis of video content are fundamentally important for numerous applications, including video summarization, retrieval, navigation, and editing. An important part of this process is to detect salient (which usually means important and interesting) objects in video segments. Unlike existing approaches, we propose a method that combines the saliency measurement with spatial and temporal coherence. The integration of spatial and temporal coherence is inspired by the focused attention in human vision. In the proposed method, the spatial coherence of low-level visual grouping cues (e. g. appearance and motion) helps per-frame object-background separation, while the temporal coherence of the object properties (e. g. shape and appearance) ensures consistent object localization over time, and thus the method is robust to unexpected environment changes and camera vibrations. Having developed an efficient optimization strategy based on coarse-to-fine multi-scale dynamic programming, we evaluate our method using a challenging dataset that is freely available together with this paper. We show the effectiveness and complementariness of the two types of coherence, and demonstrate that they can significantly improve the performance of salient object detection in videos.
KW - focused attention
KW - salient object detection
KW - spatial and temporal coherence
KW - visual attention
UR - https://www.scopus.com/pages/publications/79955100523
U2 - 10.1007/s11434-010-4387-1
DO - 10.1007/s11434-010-4387-1
M3 - 文章
AN - SCOPUS:79955100523
SN - 1001-6538
VL - 56
SP - 1055
EP - 1062
JO - Chinese Science Bulletin
JF - Chinese Science Bulletin
IS - 10
ER -