Abstract
Road accident classification which is an essential but rarely explored problem in the safe driving field. The core issue of accident classification is to learn the feature representation for partitioning different kinds of accidents. Compared with accident detection or anticipation only with occurrence probability, feature representation learning in accident classification is more challenging because of the extremely imbalanced accident categories. In addition, the severe light or weather conditions, various occasions, and complex crashing-object movement exacerbate the challenges. In this work, we form a text-video benchmark for fine-grained road accident classification (named Accident-CLIP). Accident-CLIP owns 13,669 dashcam videos with 58 kinds of accidents, where each accident is annotated with the text description of the accident type and the accident window. In the benchmarking stage, six state-of-the-art methods are evaluated from different video frame sampling methods, frame mixup strategy, input frame length, and the adaptation for long-tailed accident distribution. From experiments, we observe that current video classification models need a large space (the best Top-1 value is 41.39% on 2000 testing videos) to adapt to the extremely imbalanced road accident classification, and the formed Accident-CLIP benchmark provides a promising evaluation platform.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 3rd 2023 International Conference on Autonomous Unmanned Systems (3rd ICAUS 2023) - Volume III |
| Editors | Yi Qu, Mancang Gu, Yifeng Niu, Wenxing Fu |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 487-498 |
| Number of pages | 12 |
| ISBN (Print) | 9789819710867 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
| Event | 3rd International Conference on Autonomous Unmanned Systems, ICAUS 2023 - Nanjing, China Duration: 9 Sep 2023 → 11 Sep 2023 |
Publication series
| Name | Lecture Notes in Electrical Engineering |
|---|---|
| Volume | 1173 LNEE |
| ISSN (Print) | 1876-1100 |
| ISSN (Electronic) | 1876-1119 |
Conference
| Conference | 3rd International Conference on Autonomous Unmanned Systems, ICAUS 2023 |
|---|---|
| Country/Territory | China |
| City | Nanjing |
| Period | 9/09/23 → 11/09/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Long-tailed accident distribution
- Text-video benchmark
- Traffic accident classification
Fingerprint
Dive into the research topics of 'Accident-CLIP: Text-Video Benchmarking for Fine-Grained Accident Classification in Driving Scenes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver