TY - GEN
T1 - Underdetermined blind source separation of speech mixtures based on K-means clustering
AU - Xie, Yuan
AU - Xie, Kan
AU - Wu, Zongze
AU - Xie, Shengli
N1 - Publisher Copyright:
© 2019 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2019/7
Y1 - 2019/7
N2 - Underdetermined blind source separation is to recover the source signals from the observed signals without prior knowledge of the mixing channel. In signal processing, it is an open problem attracted the attention of more and more researchers. In this paper, we presents a fast and effective time-frequency algorithm to separate speech source signals in the underdetermined mixture case. In the proposed algorithm, the time-domain mixture signals are transformed to the frequency-domain by using short-time Fourier transform (STFT). Then the mixing matrix is estimated using K-means clustering, and frequency-domain sources are separated by solving a low-dimensional linear programming problem based on the estimated mixing matrix. Finally, the time-domain source signals are obtained using inverse STFT. The proposed algorithm has two advantages, one is to save time consumption, the other is to obtain better separation performance. Experimental results based on two mixtures of four speech sources demonstrate the feasibility and superiority of the proposed algorithm.
AB - Underdetermined blind source separation is to recover the source signals from the observed signals without prior knowledge of the mixing channel. In signal processing, it is an open problem attracted the attention of more and more researchers. In this paper, we presents a fast and effective time-frequency algorithm to separate speech source signals in the underdetermined mixture case. In the proposed algorithm, the time-domain mixture signals are transformed to the frequency-domain by using short-time Fourier transform (STFT). Then the mixing matrix is estimated using K-means clustering, and frequency-domain sources are separated by solving a low-dimensional linear programming problem based on the estimated mixing matrix. Finally, the time-domain source signals are obtained using inverse STFT. The proposed algorithm has two advantages, one is to save time consumption, the other is to obtain better separation performance. Experimental results based on two mixtures of four speech sources demonstrate the feasibility and superiority of the proposed algorithm.
KW - K-means clustering
KW - Short-time Fourier transform
KW - Speech source signal mixtures
KW - Underdetermined blind source separation
UR - https://www.scopus.com/pages/publications/85074429050
U2 - 10.23919/ChiCC.2019.8865385
DO - 10.23919/ChiCC.2019.8865385
M3 - 会议稿件
AN - SCOPUS:85074429050
T3 - Chinese Control Conference, CCC
SP - 42
EP - 46
BT - Proceedings of the 38th Chinese Control Conference, CCC 2019
A2 - Fu, Minyue
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 38th Chinese Control Conference, CCC 2019
Y2 - 27 July 2019 through 30 July 2019
ER -