
Representing Multimodal Behaviors With Mean Location for Pedestrian Trajectory Prediction

  • Xi'an Jiaotong University
  • Meta
  • University of Illinois at Chicago
  • Wormpex AI Research

Research output: Contribution to journal › Article › peer-review

46 citations (Scopus)

Abstract

Representing multimodal behaviors is a critical challenge for pedestrian trajectory prediction. Previous methods commonly represent this multimodality with multiple latent variables repeatedly sampled from a latent space, which makes interpretable trajectory prediction difficult. Moreover, the latent space is usually built by encoding global interaction into the future trajectory, which inevitably introduces superfluous interactions and thus reduces performance. To tackle these issues, we propose a novel Interpretable Multimodality Predictor (IMP) for pedestrian trajectory prediction, whose core idea is to represent a specific mode by its mean location. We model the distribution of the mean location as a Gaussian Mixture Model (GMM) conditioned on sparse spatio-temporal features, and sample multiple mean locations from the decoupled components of the GMM to encourage multimodality. Our IMP brings four benefits: 1) interpretable prediction that provides semantics about the motion behavior of a specific mode; 2) friendly visualization for presenting multimodal behaviors; 3) sound theoretical feasibility for estimating the distribution of mean locations, supported by the central limit theorem; 4) effective sparse spatio-temporal features that reduce superfluous interactions and model the temporal continuity of interaction. Extensive experiments validate that our IMP not only outperforms state-of-the-art methods but can also achieve controllable prediction by customizing the corresponding mean location.
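The sampling step described in the abstract, drawing mean locations from the decoupled components of a GMM, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `sample_mean_locations`, the three hand-picked components, and the fixed covariances are all assumptions for illustration. In IMP the GMM parameters would be predicted from sparse spatio-temporal features, which this sketch does not model.

```python
import numpy as np


def sample_mean_locations(means, covs, rng):
    """Draw one 2-D mean location from each Gaussian component.

    means: (K, 2) component centers; covs: (K, 2, 2) covariance matrices.
    Sampling each component separately, instead of sampling the mixture
    as a whole, yields exactly one candidate location per mode.
    """
    return np.stack(
        [rng.multivariate_normal(m, c) for m, c in zip(means, covs)]
    )


# Hypothetical GMM parameters (assumed values, not taken from the paper):
# three modes that could be read as "go straight", "turn left", "turn right".
means = np.array([[4.0, 0.0], [3.0, 2.0], [3.0, -2.0]])
covs = np.stack([0.05 * np.eye(2)] * 3)

rng = np.random.default_rng(0)
samples = sample_mean_locations(means, covs, rng)  # shape (3, 2)
```

Each row of `samples` is one candidate mean location. Under this reading, replacing a row with a hand-chosen point corresponds to the controllable prediction the abstract mentions, where a mode is customized through its mean location.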

Original language: English
Pages (from-to): 11184-11202
Number of pages: 19
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 45
Issue number: 9
DOI
Publication status: Published - 1 Sep 2023
