Chinese Speech Processing via Chinese Character Feature

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper focuses on the basic structure of Chinese characters: semantic-phonetic compound characters. This paper takes advantage of this feature of Chinese characters and innovatively proposes a Chinese speech-processing method based on character shape. We use the association between the character shape and pronunciation of Chinese characters to construct a new character stroke-based dataset. We use two neural network structures, RNN and Transformer, to verify our proposed Chinese speech processing method.It is proved through experiments that the method improves the performance of Mandarin ASR(Automatic Speech Recognition) by about 2% and AEC(ASR Error Correction) by about 1%. Theoretically, this method applies to all Chinese speech-processing algorithms based on the attention mechanism.

Original languageEnglish
Title of host publication2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
EditorsBhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350368741
DOIs
StatePublished - 2025
Event2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, India
Duration: 6 Apr 202511 Apr 2025

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Country/TerritoryIndia
CityHyderabad
Period6/04/2511/04/25

Keywords

  • Automatic Speech Recognition
  • Chinese Character Feature
  • Semantic-phonetic Compound Character
  • Text Recorrection

Fingerprint

Dive into the research topics of 'Chinese Speech Processing via Chinese Character Feature'. Together they form a unique fingerprint.

Cite this