
Beat Tracking Algorithm Based on Multi-scale Feature Fusion and Attention Mechanism

  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Automatic beat and downbeat tracking is an important research direction in music information retrieval. This paper proposes a beat tracking algorithm based on multi-scale feature fusion and attention mechanisms for the joint tracking of beats and downbeats. First, we propose a convolutional feature extraction layer based on multi-scale feature fusion, which lets the model attend to different levels of musical information and exchange instrument information across separated tracks. Then, building on dilated self-attention, we introduce a dilated neighborhood attention module and a global attention module with multi-scale features. The former reduces time complexity while enabling information exchange across the time and instrument dimensions, which improves the accuracy of beat detection; the latter determines the globally optimal beat sequence while fusing temporal information at different scales, which improves the stability of beat detection. By combining information from different musical levels with several attention mechanisms, our model better captures both the global and local characteristics of beats. We evaluated the model on four widely used datasets: Ballroom, Hainsworth, harmonic, and Carnatic. The experimental results show that our model outperforms recent deep learning methods in both beat tracking and downbeat tracking. Compared with the baseline, the F-measure for beat tracking and downbeat tracking on the Ballroom dataset improves by 1.2% and 2.8%, respectively.
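
The abstract reports its gains as F-measure but does not describe how the metric is computed. As a rough illustration only, the sketch below shows one common way a beat-tracking F-measure can be calculated: an estimated beat counts as a hit when it falls within a tolerance window of an annotated beat that has not already been matched. The function name `beat_f_measure`, the greedy matching, and the 70 ms default tolerance are assumptions made for this example and are not taken from the paper.

```python
def beat_f_measure(reference, estimated, tolerance=0.07):
    """F-measure between annotated and estimated beat times, in seconds.

    Assumption for this sketch: an estimate is a hit if it lies within
    +/- `tolerance` seconds of an annotation that is not yet matched.
    """
    reference = sorted(reference)
    estimated = sorted(estimated)
    if not reference or not estimated:
        return 0.0

    hits = 0
    i = j = 0
    # Walk both sorted lists once, pairing each annotation with at most
    # one estimate (greedy one-to-one matching).
    while i < len(reference) and j < len(estimated):
        diff = estimated[j] - reference[i]
        if abs(diff) <= tolerance:
            hits += 1
            i += 1
            j += 1
        elif diff < 0:
            j += 1  # estimate is too early for this annotation: false positive
        else:
            i += 1  # no estimate close enough to this annotation: miss

    if hits == 0:
        return 0.0
    precision = hits / len(estimated)
    recall = hits / len(reference)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    annotated = [0.5, 1.0, 1.5, 2.0]
    detected = [0.52, 1.0, 1.7, 2.05]
    print(beat_f_measure(annotated, detected))
```

In this toy input, three of the four detected beats fall inside the window, so precision and recall are both 3/4. Downbeat tracking can be scored the same way by passing downbeat times instead of beat times.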

Original language: English
Title of host publication: Artificial Intelligence and Robotics - 10th International Symposium, ISAIR 2025, Revised Selected Papers
Editor: Huimin Lu
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 8-18
Number of pages: 11
ISBN (Print): 9789819548279
DOI
Publication status: Published - 2026
Event: 10th International Symposium on Artificial Intelligence and Robotics, ISAIR 2025 - Nantong, China
Duration: 24 Aug 2025 to 26 Aug 2025

Publication series

Name: Communications in Computer and Information Science
Volume: 2746 CCIS
ISSN (Print): 1865-0929
ISSN (Electronic): 1865-0937

Conference

Conference: 10th International Symposium on Artificial Intelligence and Robotics, ISAIR 2025
Country/Territory: China
City: Nantong
Period: 24/08/25 to 26/08/25
