Abstract
Detection, parsing, and future prediction on sequence data (e.g., videos) require algorithms that capture the non-Markovian and compositional properties of high-level semantics. Context-free grammars are a natural choice for capturing such properties, but traditional grammar parsers (e.g., the Earley parser) only take symbolic sentences as input. In this paper, we generalize the Earley parser to parse sequence data that is neither segmented nor labeled. Given the output of an arbitrary probabilistic classifier, this generalized Earley parser finds the optimal segmentation and labels in the language defined by the input grammar. Based on the parsing results, it makes top-down future predictions. The proposed method is generic, principled, and widely applicable. Experimental results show the benefit of our method for both human activity parsing and prediction on three video datasets.
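To make the phrase "optimal segmentation and labels in the language defined by the input grammar" concrete, the following is a minimal sketch and not the paper's algorithm. It assumes the language is small enough to enumerate as a finite list of label sequences, and it scores each sentence with a Viterbi-style alignment against frame-wise classifier probabilities; the generalized Earley parser instead searches the grammar directly. The names `align`, `parse`, `probs`, and `language` are hypothetical and chosen for illustration only.

```python
import math

def align(log_probs, sentence):
    """Best split of T frames into len(sentence) contiguous, non-empty segments.

    log_probs[t][k] is the classifier's log-probability of label k at frame t.
    Returns (log score, per-frame labels). Illustrative helper, not from the paper.
    """
    T, n = len(log_probs), len(sentence)
    NEG = -math.inf
    if n == 0 or n > T:
        return NEG, []
    # dp[t][i]: best score for frames 0..t when frame t carries sentence[i]
    dp = [[NEG] * n for _ in range(T)]
    advanced = [[False] * n for _ in range(T)]
    dp[0][0] = log_probs[0][sentence[0]]
    for t in range(1, T):
        for i in range(min(t + 1, n)):
            stay = dp[t - 1][i]                           # continue the current segment
            move = dp[t - 1][i - 1] if i > 0 else NEG     # start the next segment
            best = stay
            if move > stay:
                best, advanced[t][i] = move, True
            if best > NEG:
                dp[t][i] = best + log_probs[t][sentence[i]]
    if dp[T - 1][n - 1] == NEG:
        return NEG, []
    # Backtrace from the last frame to recover the per-frame labels.
    labels, i = [0] * T, n - 1
    for t in range(T - 1, -1, -1):
        labels[t] = sentence[i]
        if advanced[t][i]:
            i -= 1
    return dp[T - 1][n - 1], labels

def parse(probs, language):
    """Pick the sentence in an enumerated (assumed finite) language with the best alignment."""
    log_probs = [[math.log(p) for p in frame] for frame in probs]  # assumes all p > 0
    return max((align(log_probs, s) + (s,) for s in language), key=lambda r: r[0])

# Toy input: 5 frames, 3 labels, and three hypothetical sentences.
probs = [
    [0.8, 0.1, 0.1],
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.1, 0.2, 0.7],
    [0.1, 0.1, 0.8],
]
language = [[0, 1, 2], [0, 2], [1, 0]]
score, labels, sentence = parse(probs, language)
```

Enumerating sentences only works for tiny, finite languages; it is used here purely to show how a grammar constraint turns noisy frame-wise scores into a single consistent segmentation.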
| Original language | English |
|---|---|
| Article number | 9018126 |
| Pages (from-to) | 2538-2554 |
| Number of pages | 17 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 43 |
| Issue number | 8 |
| DOI | |
| Publication status | Published - 1 Aug 2021 |
Paper title: A Generalized Earley Parser for Human Activity Parsing and Prediction