TY - GEN
T1 - CS-GAN
T2 - 4th International Conference on Image, Video and Signal Processing, IVSP 2022
AU - Fang, Xin
AU - Li, Fan
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/3/18
Y1 - 2022/3/18
N2 - Cross-Media retrieval has become necessary to satisfy the urgent needs of multimedia economic and cultural value. The heterogeneous gap makes it difficult to directly retrieve media of different modalities. Existing common space-based Cross-Media retrieval methods usually focus on increasing the inter-class distance while paying little attention to decreasing the intra- class distance, they neglect to correlate the mapping process from original space to common space, which limits the narrowing of intra-class distance. To address this problem, we propose Centrosymmetric Generative Adversarial Network (CS-GAN) for Cross-Media retrieval, which consists of two generative models and our proposed Heterogeneous Input Discriminative Model (HIDM), forming a centrosymmetric structure. Our proposed HIDM conjoins the mapping processes of each media type by dimensionality reduction and knowledge transfer, which ensures the unification of internal processing and well retains the underlying manifold structure information. Besides, it discriminates the distribution of common representations for both image and text at the same time, which can interrelate the generative mapping, preserve the distribution of the original features and narrow the intra-class distances of common embeddings. The experimental results on PKU XMediaNet and Pascal Sentences show the effectiveness of our proposed CS-GAN.
AB - Cross-Media retrieval has become necessary to satisfy the urgent needs of multimedia economic and cultural value. The heterogeneous gap makes it difficult to directly retrieve media of different modalities. Existing common space-based Cross-Media retrieval methods usually focus on increasing the inter-class distance while paying little attention to decreasing the intra- class distance, they neglect to correlate the mapping process from original space to common space, which limits the narrowing of intra-class distance. To address this problem, we propose Centrosymmetric Generative Adversarial Network (CS-GAN) for Cross-Media retrieval, which consists of two generative models and our proposed Heterogeneous Input Discriminative Model (HIDM), forming a centrosymmetric structure. Our proposed HIDM conjoins the mapping processes of each media type by dimensionality reduction and knowledge transfer, which ensures the unification of internal processing and well retains the underlying manifold structure information. Besides, it discriminates the distribution of common representations for both image and text at the same time, which can interrelate the generative mapping, preserve the distribution of the original features and narrow the intra-class distances of common embeddings. The experimental results on PKU XMediaNet and Pascal Sentences show the effectiveness of our proposed CS-GAN.
KW - Cross-Media retrieval
KW - Generative adversarial networks
KW - Intra-class distance
UR - https://www.scopus.com/pages/publications/85131856409
U2 - 10.1145/3531232.3531246
DO - 10.1145/3531232.3531246
M3 - 会议稿件
AN - SCOPUS:85131856409
T3 - ACM International Conference Proceeding Series
SP - 100
EP - 106
BT - IVSP 2022 - 2022 4th International Conference on Image, Video and Signal Processing
PB - Association for Computing Machinery
Y2 - 18 March 2022 through 20 March 2022
ER -