跳到主要导航 跳到搜索 跳到主要内容

FGCL: Fine-Grained Contrastive Learning For Mandarin Stuttering Event Detection

  • Han Jiang
  • , Wenyu Wang
  • , Yiquan Zhou
  • , Hongwu Ding
  • , Jiacheng Xu
  • , Jihua Zhu
  • Xi'an Jiaotong University
  • Happy Elements

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

This paper presents the T031 team's approach to the StutteringSpeech Challenge in SLT2024. Mandarin Stuttering Event Detection (MSED) aims to detect instances of stuttering events in Mandarin speech. We propose a detailed acoustic analysis method to improve the accuracy of stutter detection by capturing subtle nuances that previous Stuttering Event Detection (SED) techniques have overlooked. To this end, we introduce the Fine-Grained Contrastive Learning (FGCL) framework for MSED. Specifically, we model the frame-level probabilities of stuttering events and introduce a mining algorithm to identify both easy and confusing frames. Then, we propose a stutter contrast loss to enhance the distinction between stuttered and fluent speech frames, thereby improving the discriminative capability of stuttered feature embeddings. Extensive evaluations on English and Mandarin datasets demonstrate the effectiveness of FGCL, achieving a significant increase of over 5.0% in F1 score on Mandarin data1,.

源语言英语
主期刊名Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024
出版商Institute of Electrical and Electronics Engineers Inc.
379-384
页数6
ISBN(电子版)9798350392258
DOI
出版状态已出版 - 2024
活动2024 IEEE Spoken Language Technology Workshop, SLT 2024 - Macao, 中国
期限: 2 12月 20245 12月 2024

出版系列

姓名Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024

会议

会议2024 IEEE Spoken Language Technology Workshop, SLT 2024
国家/地区中国
Macao
时期2/12/245/12/24

学术指纹

探究 'FGCL: Fine-Grained Contrastive Learning For Mandarin Stuttering Event Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此