跳到主要导航 跳到搜索 跳到主要内容

Simultaneous Flexible Keyword Detection and Text-dependent Speaker Recognition for Low-resource Devices

  • Hiroshi Fujimura
  • , Ning Ding
  • , Daichi Hayakawa
  • , Takehiko Kagoshima

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper proposes a new method for simultaneous flexible keyword detection and text-dependent speaker identification using a recognized keyword. The purpose is to identify a speaker from among a set of preregistered speakers on the basis of a short-command utterance in an office or home on low-resource chip devices. The first contribution is to construct the process that includes a neural network (NN) and a customized Viterbi-based algorithm for flexible keyword detection, and Gaussian mixture models (GMMs) for speaker identification. Outputs of a middle layer in the NN and alignment information for keyword detection are also used for creating feature vectors for speaker GMMs. The second contribution is to apply DropConnect in speaker-modeling uncertainties of the Bayesian NN that is used for speaker reacognition. It results in robust speaker models when enrollment utterances are few. Evaluation was conducted using 39 Japanese keywords by 100 speakers. Recognition performance was measured on the basis of false acceptances and false rejects using keyword utterances. Speaker identification for 100 pre-registered speakers for recognized keywords was simultaneously evaluated. The identification rate when using a conventional i-vector method was 71.22%. By contrast, the identification rate of the proposed method was 89.29% while using low-cost resources.

源语言英语
主期刊名ICPRAM 2020 - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods, Volume 1
编辑Maria De Marsico, Gabriella Sanniti di Baja, Ana L.N. Fred
出版商Science and Technology Publications, Lda
297-307
页数11
ISBN(印刷版)9789897583971
DOI
出版状态已出版 - 2020
活动9th International Conference on Pattern Recognition Applications and Methods , ICPRAM 2020 - Valletta, 马耳他
期限: 22 2月 202024 2月 2020

出版系列

姓名International Conference on Pattern Recognition Applications and Methods
1
ISSN(电子版)2184-4313

会议

会议9th International Conference on Pattern Recognition Applications and Methods , ICPRAM 2020
国家/地区马耳他
Valletta
时期22/02/2024/02/20

学术指纹

探究 'Simultaneous Flexible Keyword Detection and Text-dependent Speaker Recognition for Low-resource Devices' 的科研主题。它们共同构成独一无二的指纹。

引用此