跳到主要导航 跳到搜索 跳到主要内容

InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective

  • Yuanhong Zhang
  • , Muyao Yuan
  • , Weizhan Zhang
  • , Tieliang Gong
  • , Wen Wen
  • , Jiangyong Ying
  • , Weijie Shi
  • Xi'an Jiaotong University
  • Ltd.

科研成果: 期刊稿件会议文章同行评审

1 引用 (Scopus)

摘要

The Segment Anything Model (SAM), a vision foundation model, exhibits impressive zero-shot capabilities in general tasks but struggles in specialized domains. Parameter-efficient fine-tuning (PEFT) is a promising approach to unleash the potential of SAM in novel scenarios. However, existing PEFT methods for SAM neglect the domain-invariant relations encoded in the pretrained model. To bridge this gap, we propose InfoSAM, an information-theoretic approach that enhances SAM fine-tuning by distilling and preserving its pre-trained segmentation knowledge. Specifically, we formulate the knowledge transfer process as two novel mutual information-based objectives: (i) to compress the domain-invariant relation extracted from pre-trained SAM, excluding pseudo-invariant information as possible, and (ii) to maximize mutual information between the relational knowledge learned by the teacher (pretrained SAM) and the student (fine-tuned model). The proposed InfoSAM establishes a robust distillation framework for PEFT of SAM. Extensive experiments across diverse benchmarks validate InfoSAM’s effectiveness in improving SAM family’s performance on real-world tasks, demonstrating its adaptability and superiority in handling specialized scenarios. The code and models are available at InfoSAM project page.

源语言英语
页(从-至)76655-76677
页数23
期刊Proceedings of Machine Learning Research
267
出版状态已出版 - 2025
活动42nd International Conference on Machine Learning, ICML 2025 - Vancouver, 加拿大
期限: 13 7月 202519 7月 2025

学术指纹

探究 'InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective' 的科研主题。它们共同构成独一无二的指纹。

引用此