跳到主要导航 跳到搜索 跳到主要内容

SYKI-SVC: Advancing Singing Voice Conversion with Post-Processing Innovations and an Open-Source Professional Testset

  • Yiquan Zhou
  • , Wenyu Wang
  • , Hongwu Ding
  • , Jiacheng Xu
  • , Jihua Zhu
  • , Xin Gao
  • , Shihao Li
  • Xi'an Jiaotong University
  • Happy Elements
  • East China Normal University
  • Union Wheatland Culture and Media Ltd.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Singing voice conversion aims to transform a source singing voice into that of a target singer while preserving the original lyrics, melody, and various vocal techniques. In this paper, we propose a high-fidelity singing voice conversion system. Our system builds upon the SVCC T02 framework and consists of three key components: A feature extractor, a voice converter, and a post-processor. The feature extractor utilizes the ContentVec and Whisper models to derive F0 contours and extract speaker-independent linguistic features from the input singing voice. The voice converter then integrates the extracted timbre, F0, and linguistic content to synthesize the target speaker's waveform. The post-processor augments high-frequency information directly from the source through simple and effective signal processing to enhance audio quality. Due to the lack of a standardized professional dataset for evaluating expressive singing conversion systems, we have created and made publicly available a specialized test set. Comparative evaluations demonstrate that our system achieves a remarkably high level of naturalness, and further analysis confirms the efficacy of our proposed system design.

源语言英语
主期刊名2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
编辑Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350368741
DOI
出版状态已出版 - 2025
已对外发布
活动2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, 印度
期限: 6 4月 202511 4月 2025

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷版)1520-6149

会议

会议2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
国家/地区印度
Hyderabad
时期6/04/2511/04/25

学术指纹

探究 'SYKI-SVC: Advancing Singing Voice Conversion with Post-Processing Innovations and an Open-Source Professional Testset' 的科研主题。它们共同构成独一无二的指纹。

引用此