TY - GEN
T1 - Beyond sum and weighted aggregation
T2 - 1st International ACM Thematic Workshops, Thematic Workshops 2017
AU - Pang, Shanmin
AU - Zhang, Wei
AU - Zhu, Li
AU - Zhu, Jihua
AU - Xue, Jianru
N1 - Publisher Copyright: © 2017 Association for Computing Machinery.
PY - 2017/10/23
Y1 - 2017/10/23
N2 - Image search with local descriptors usually represents an image by embedding a set of patch descriptors and aggregating them into a single vector. Standard aggregation operations include sum aggregation and weighted aggregation. Sum aggregation is highly efficient but lacks discriminative power. In contrast, weighted aggregation shows promising retrieval performance but suffers from an extremely high time cost. In this paper, we present a general mixed aggregation method that unifies the sum and weighted aggregation methods. Owing to its general formulation, our method can balance the trade-off between quality and speed. Furthermore, we propose to compute multiple weighting coefficients, rather than one, for each vector to be aggregated by partitioning it into several components. Experimental results demonstrate that image search frameworks with our mixed aggregation method achieve state-of-the-art performance while running over ten times faster than the baselines. Inspired by our aggregation method, we also present a new embedding strategy. Unlike existing embedding methods, which map each descriptor individually into a single embedded vector, our embedding method maps a group of local descriptors into a single vector, which significantly speeds up the aggregation step. As demonstrated by the experiments, retrieval frameworks with our embedding method are more than fifty times faster than the baselines while maintaining competitive retrieval performance.
KW - Aggregation
KW - Embedding
KW - Image representation
KW - Image search
UR - https://www.scopus.com/pages/publications/85034866623
U2 - 10.1145/3126686.3126704
DO - 10.1145/3126686.3126704
M3 - Conference contribution
AN - SCOPUS:85034866623
T3 - Thematic Workshops 2017 - Proceedings of the Thematic Workshops of ACM Multimedia 2017, co-located with MM 2017
SP - 59
EP - 67
BT - Thematic Workshops 2017 - Proceedings of the Thematic Workshops of ACM Multimedia 2017, co-located with MM 2017
PB - Association for Computing Machinery, Inc
Y2 - 23 October 2017 through 27 October 2017
ER -