摘要
Discovering new user intents based on existing intents from constantly incoming unlabeled data is an important task in many intelligent systems deployed in the real world (e.g., dialogue systems). Since data with new intents are completely unlabeled, most current approaches employ clustering methods to generate pseudo labels to train their models. However, due to intent gaps between existing and new intents, pseudo labels generated by these models are noisy, and prior knowledge from existing intents is not fully utilized. To mitigate these issues, we propose a robust pseudo label training and source domain joint-training network to refine the noisy pseudo labels and make full use of prior knowledge. Experimental results on three intent detection datasets show that our model is more effective and robust than state-of-the-art methods. The code and data are released at https://github.com/Lackel/PTJN.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 21-31 |
| 页数 | 11 |
| 期刊 | IEEE Intelligent Systems |
| 卷 | 38 |
| 期 | 4 |
| DOI | |
| 出版状态 | 已出版 - 1 7月 2023 |
学术指纹
探究 'New User Intent Discovery With Robust Pseudo Label Training and Source Domain Joint Training' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver