Skip to main navigation Skip to search Skip to main content

Beat Tracking Algorithm Based on Multi-scale Feature Fusion and Attention Mechanism

  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Automatic beat and downbeat tracking is an important research direction in the field of music information retrieval. This paper proposes a beat tracking algorithm based on multi-scale feature fusion and attention mechanism for the joint tracking of beat and downbeat. Firstly, we propose a convolution feature extraction layer based on multiscale feature fusion, which makes the model pay attention to different levels of music information and exchange musical instrument information with separated tracks. Then, based on the dilated self-attention, we introduce the dilated neighborhood attention module and the global attention module with multi-scale features. The former not only reduces the time complexity, but also realizes the information exchange of time instrument dimension characteristics, and improves the accuracy of beat detection; The latter can determine the global optimal beat sequence while fusing the time information of different scales, which improves the stability of beat detection. By comprehensively utilizing the information of different musical levels and a variety of attention mechanisms, our model can better perceive the global and local characteristics of beat. We performed experimental verification on four widely used datasets, including ballroom, Hainsworth, harmonic and Carnatic datasets. The experimental results show that, compared with the deep learning method in recent years, our proposed model shows better performance in beat tracking and downbeat tracking. Compared with baseline, the F-measure indexes of beat tracking and downbeat tracking on ballroom dataset are improved by 1.2% and 2.8% respectively.

Original languageEnglish
Title of host publicationArtificial Intelligence and Robotics - 10th International Symposium, ISAIR 2025, Revised Selected Papers
EditorsHuimin Lu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages8-18
Number of pages11
ISBN (Print)9789819548279
DOIs
StatePublished - 2026
Event10th International Symposium on Artificial Intelligence and Robotics, ISAIR 2025 - Nantong, China
Duration: 24 Aug 202526 Aug 2025

Publication series

NameCommunications in Computer and Information Science
Volume2746 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference10th International Symposium on Artificial Intelligence and Robotics, ISAIR 2025
Country/TerritoryChina
CityNantong
Period24/08/2526/08/25

Keywords

  • Attention mechanism
  • Beat-tracking
  • Transformer

Fingerprint

Dive into the research topics of 'Beat Tracking Algorithm Based on Multi-scale Feature Fusion and Attention Mechanism'. Together they form a unique fingerprint.

Cite this