跳到主要导航 跳到搜索 跳到主要内容

An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling

  • Henghui Zhi
  • , Chenyang Yin
  • , Huibin Li
  • , Shanmin Pang
  • Xi'an Jiaotong University

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Unsupervised deep learning methods have shown great success in jointly estimating camera pose and depth from monocular videos. However, previous methods mostly ignore the importance of multi-scale information, which is crucial for pose estimation and depth estimation, especially when the motion pattern is changed. This article proposes an unsupervised framework for monocular visual odometry (VO) that can model multi-scale information. The proposed method utilizes densely linked atrous convolutions to increase the receptive field size without losing image information, and adopts a non-local self-attention mechanism to effectively model the long-range dependency. Both of them can model objects of different scales in the image, thereby improving the accuracy of VO, especially in rotating scenes. Extensive experiments on the KITTI dataset have shown that our approach is competitive with other state-of-the-art unsupervised learning-based monocular methods and is comparable to supervised or model-based methods. In particular, we have achieved state-of-the-art results on rotation estimation.

源语言英语
文章编号5193
期刊Sensors (Switzerland)
22
14
DOI
出版状态已出版 - 7月 2022

学术指纹

探究 'An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling' 的科研主题。它们共同构成独一无二的指纹。

引用此