摘要
In recent years, Industry 5.0 has emphasized human-centric intelligent manufacturing, positioning human-robot collaboration as a cornerstone for achieving mass customization. Consequently, enabling robots to perceive human state has become critical for efficient and safe human-robot collaborative assembly. However, current vision-based methods for assembly intention recognition face challenges such as limited dataset modalities, difficulties in reflecting real-world assembly processes, and inconsistent annotation workflows. To address these issues, this paper introduces the MCV-Intention dataset—a multimodalities, cross-view dataset designed for assembly scene understanding. Collected from 15 subjects, each assembly sequence encompasses six modalities and two views, capturing data from operators both before and after training. In this paper, we first outline the dataset collection process, detailing the hardware and software systems as well as the assembly objects involved. Subsequently, we present a comprehensive annotation protocol for assembly intention recognition and analyze the dataset from the viewpoints of structure and distribution. Finally, we conducted a series of benchmark experiments using state-of-the-art algorithms to establish baselines for future researches.
| 源语言 | 英语 |
|---|---|
| 文章编号 | 1784 |
| 期刊 | Scientific Data |
| 卷 | 12 |
| 期 | 1 |
| DOI | |
| 出版状态 | 已出版 - 12月 2025 |
学术指纹
探究 'MCV-Intention: A Multimodalities and Cross-View Dataset for Human Assembly Intention Recognition' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver