TY - GEN
T1 - User-Adjustable Image Cropping Based on Visual Semantic Awareness
AU - Li, Xinyi
AU - Yang, Xinyu
AU - Zhang, Shuo
AU - Sun, Jiazhe
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - Image cropping aims to create visually appealing pictures aligned with user preferences. As social media develops, user demand for visual content in images has become more diverse. Previous cropping methods struggle to capture the semantics of images, failing to highlight representative content and adapt to the varying preferences of users for non-representative content. To address these issues, we propose a novel visual-semantic-aware cropping method that uses a visual semantic aggregation approach to identify key visual patches in the image, then integrates these patch features and basic image features through partition enhancement and graph-based feature interaction, thereby extracting representative content with high aesthetic value. To consider user preferences for non-representative content, we introduce a user-adjustable semantic coordination proportional mechanism, allowing users to adjust the visual richness of non-representative content in cropped images. Experiments show our method outperforms state-of-the-art methods in achieving aesthetic crops, while providing users with adjustable cropping options.
AB - Image cropping aims to create visually appealing pictures aligned with user preferences. As social media develops, user demand for visual content in images has become more diverse. Previous cropping methods struggle to capture the semantics of images, failing to highlight representative content and adapt to the varying preferences of users for non-representative content. To address these issues, we propose a novel visual-semantic-aware cropping method that uses a visual semantic aggregation approach to identify key visual patches in the image, then integrates these patch features and basic image features through partition enhancement and graph-based feature interaction, thereby extracting representative content with high aesthetic value. To consider user preferences for non-representative content, we introduce a user-adjustable semantic coordination proportional mechanism, allowing users to adjust the visual richness of non-representative content in cropped images. Experiments show our method outperforms state-of-the-art methods in achieving aesthetic crops, while providing users with adjustable cropping options.
KW - Image Cropping
KW - Representative Content
KW - User-Adjustable
KW - Visual-Semantic-Aware
UR - https://www.scopus.com/pages/publications/105028411048
U2 - 10.1007/978-981-95-5679-3_19
DO - 10.1007/978-981-95-5679-3_19
M3 - 会议稿件
AN - SCOPUS:105028411048
SN - 9789819556786
T3 - Lecture Notes in Computer Science
SP - 270
EP - 284
BT - Pattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Proceedings
A2 - Kittler, Josef
A2 - Xiong, Hongkai
A2 - Yang, Jian
A2 - Chen, Xilin
A2 - Lu, Jiwen
A2 - Lin, Weiyao
A2 - Yu, Jingyi
A2 - Zheng, Weishi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025
Y2 - 15 October 2025 through 18 October 2025
ER -