TY - GEN
T1 - Timbre identification of instrumental music via energy distribution modeling
AU - Guo, Jinxi
AU - Ding, Mengying
AU - Guan, Xiaohong
AU - Du, Youtian
AU - Feng, Jicheng
AU - Gao, Qinping
AU - Liu, Zheng
N1 - Publisher Copyright:
© 2015 ACM.
PY - 2015/8/19
Y1 - 2015/8/19
N2 - The traditional evaluation of instrumental music is gener-ally based on experts. However, the expert-based evalua-tion strategy is seriously affected by a number of factors such as human's subjectivity and then decreases the eval-uation reliability. This paper aims at automatically identi-fying the timbre of saxophone music, and proposes a new method based on the energy distribution in frequency do-main of music signals. First, we transform music signals into frequency domain using short-time Fourier transformation (STFT). Then, we compute the spectral envelope, which may describe the rule of frequency attitude, based on linear predictive coding (LPC). At last, we find that the energy dis-tribution can be approximated by an exponential function, and present a ONE-dimensional feature named average lin-earity value (ALV). The ALV feature measures the extent of closeness between the energy distribution and exponen-tial functions and is used to distinguish high-level timbres from low-level timbres in our work. The experiment is con-ducted on 9 groups of data, and the experimental results demonstrate the effectiveness of this method.
AB - The traditional evaluation of instrumental music is gener-ally based on experts. However, the expert-based evalua-tion strategy is seriously affected by a number of factors such as human's subjectivity and then decreases the eval-uation reliability. This paper aims at automatically identi-fying the timbre of saxophone music, and proposes a new method based on the energy distribution in frequency do-main of music signals. First, we transform music signals into frequency domain using short-time Fourier transformation (STFT). Then, we compute the spectral envelope, which may describe the rule of frequency attitude, based on linear predictive coding (LPC). At last, we find that the energy dis-tribution can be approximated by an exponential function, and present a ONE-dimensional feature named average lin-earity value (ALV). The ALV feature measures the extent of closeness between the energy distribution and exponen-tial functions and is used to distinguish high-level timbres from low-level timbres in our work. The experiment is con-ducted on 9 groups of data, and the experimental results demonstrate the effectiveness of this method.
KW - Frequency spectrum analysis
KW - Linear predictive coding
KW - Short-time Fourier transformation
KW - Timbre analysis
UR - https://www.scopus.com/pages/publications/84947557256
U2 - 10.1145/2808492.2808559
DO - 10.1145/2808492.2808559
M3 - 会议稿件
AN - SCOPUS:84947557256
T3 - ACM International Conference Proceeding Series
SP - 135
EP - 139
BT - ICIMCS 2015 - Proceedings of the 7th International Conference on Internet Multimedia Computing and Service
A2 - Jain, Ramesh
A2 - Jiang, Shuqiang
A2 - Smith, John
A2 - Sang, Jitao
A2 - Li, Guohui
A2 - Zhang, Tianzhu
A2 - Wang, Shuhui
PB - Association for Computing Machinery
T2 - 7th International Conference on Internet Multimedia Computing and Service, ICIMCS 2015
Y2 - 19 August 2015 through 21 August 2015
ER -