TY - GEN
T1 - Music Genre Classification Based on Chroma Features and Deep Learning
AU - Shi, Leisi
AU - Li, Chen
AU - Tian, Lihua
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - Music genre classification is an important branch of content-based music signal analysis. It is a challenging task in the field of music information retrieval (MIR). At present, the method based on deep learning has achieved good results. This paper constructs a neural network framework for music genre classification based on chroma feature. The chroma feature can represent the time domain and the frequency domain of music character and consider the existence of harmony. Besides, it is independent of the timbre, volume, absolute pitch, which are completely irrelevant to the genre classification. It is relatively robust to the background noise and can represent the primary information such as monophonic and polyphonic music distribution. In this paper, we estimate the type of music audio based on chroma feature combined with deep learning network. We input this feature into VGG16 network for training, and improve the last three layers. In the experiment, the classifier is trained by GTZAN dataset. The experimental results show that the framework can obtain higher classification accuracy and better performance.
AB - Music genre classification is an important branch of content-based music signal analysis. It is a challenging task in the field of music information retrieval (MIR). At present, the method based on deep learning has achieved good results. This paper constructs a neural network framework for music genre classification based on chroma feature. The chroma feature can represent the time domain and the frequency domain of music character and consider the existence of harmony. Besides, it is independent of the timbre, volume, absolute pitch, which are completely irrelevant to the genre classification. It is relatively robust to the background noise and can represent the primary information such as monophonic and polyphonic music distribution. In this paper, we estimate the type of music audio based on chroma feature combined with deep learning network. We input this feature into VGG16 network for training, and improve the last three layers. In the experiment, the classifier is trained by GTZAN dataset. The experimental results show that the framework can obtain higher classification accuracy and better performance.
KW - convolutional neural network
KW - deep learning
KW - music genre classification
KW - music information retrieval
UR - https://www.scopus.com/pages/publications/85082241365
U2 - 10.1109/ICICIP47338.2019.9012215
DO - 10.1109/ICICIP47338.2019.9012215
M3 - 会议稿件
AN - SCOPUS:85082241365
T3 - 10th International Conference on Intelligent Control and Information Processing, ICICIP 2019
SP - 81
EP - 86
BT - 10th International Conference on Intelligent Control and Information Processing, ICICIP 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Conference on Intelligent Control and Information Processing, ICICIP 2019
Y2 - 14 December 2019 through 19 December 2019
ER -