跳到主要导航 跳到搜索 跳到主要内容

GMM and CNN Hybrid Method for Short Utterance Speaker Recognition

  • Zheli Liu
  • , Zhendong Wu
  • , Tong Li
  • , Jin Li
  • , Chao Shen
  • Nankai University
  • Hangzhou Dianzi University
  • Guangzhou University

科研成果: 期刊稿件文章同行评审

154 引用 (Scopus)

摘要

During the last few years, the speaker recognition technique has been widely attractive for its extensive application in many fields, such as speech communications, domestics services, and smart terminals. As a critical method, the Gaussian mixture model (GMM) makes it possible to achieve the recognition capability that is close to the hearing ability of human in a long speech. However, the GMM is failing to recognize a short utterance speaker with a high accuracy. Aiming at solving this problem, in this paper, we propose a novel model to enhance the recognition accuracy of the short utterance speaker recognition system. Different from traditional models based on the GMM, we design a method to train a convolutional neural network to process spectrograms, which can describe speakers better. Thus, the recognition system gains the considerable accuracy as well as the reasonable convergence speed. The experiment results show that our model can help to decrease the equal error rate of the recognition from 4.9% to 2.5%.

源语言英语
页(从-至)3244-3252
页数9
期刊IEEE Transactions on Industrial Informatics
14
7
DOI
出版状态已出版 - 7月 2018

学术指纹

探究 'GMM and CNN Hybrid Method for Short Utterance Speaker Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此