跳到主要导航 跳到搜索 跳到主要内容

FuPaD: Scalable Pose Estimation by Fusing Patch-Wise VGGT with Dense Bundle Adjustment

  • Xi'an Jiaotong University
  • Shaanxi Key Laboratory of Intelligent Robots

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pose estimation, a cornerstone of 3D computer vision, is crucial for applications such as autonomous driving and augmented reality. Global feed-forward methods, such as VGGT, demonstrate potential in direct scene reconstruction and pose inference. However, they are often constrained by prohibitive memory requirements when processing long sequences typical in large-scale environments. Furthermore, the accuracy of their single-pass predictions is often limited by the absence of explicit local geometric modeling or iterative refinement. To address these limitations, we introduce FuPaD, a novel hierarchical approach for scalable pose estimation. FuPaD integrates global pose priors derived from a tailored VGGT with the local refinement offered by dense bundle adjustment (DBA). First, a tracking-informed patch sampling strategy is introduced to select salient image patches from keyframes. These patches are subsequently processed by the tailored VGGT to yield globally consistent keyframe pose priors, meanwhile significantly reducing the memory footprint compared to frame-wise processing. These global keyframe poses are then integrated with dense local pose estimates from DBA within a pose graph optimization framework. Finally, a global DBA module further refines all poses. Such hierarchical fusion ensures the global consistency while benefiting from the fine-grained local refinement provided by DBA. Evaluation on benchmarks indicates that FuPaD achieves competitive pose accuracy, particularly in large-scale scenarios, while exhibiting computational and memory efficiency.

源语言英语
主期刊名Intelligent Robotics and Applications - 18th International Conference, ICIRA 2025, Proceedings
编辑Takayuki Matsuno, Honghai Liu, Lianqing Liu, Zhouping Yin, Xiangyang Zhu, Weihong Ren, Zhiyong Wang, Yixuan Sheng
出版商Springer Science and Business Media Deutschland GmbH
508-520
页数13
ISBN(印刷版)9789819521005
DOI
出版状态已出版 - 2026
活动18th International Conference on Intelligent Robotics and Applications, ICIRA 2025 - Okayama, 日本
期限: 6 8月 20259 8月 2025

出版系列

姓名Lecture Notes in Computer Science
16076 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议18th International Conference on Intelligent Robotics and Applications, ICIRA 2025
国家/地区日本
Okayama
时期6/08/259/08/25

学术指纹

探究 'FuPaD: Scalable Pose Estimation by Fusing Patch-Wise VGGT with Dense Bundle Adjustment' 的科研主题。它们共同构成独一无二的指纹。

引用此