跳到主要导航 跳到搜索 跳到主要内容

The performance of improved XLNet on text classification

  • Lanzhou Jiaotong University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Text classification is the most common application of Natural Language Processing(NLP), and Transformer models have dominated the field in recent years. Currently, pre-training modeling of text through deep learning methods is a common way of text classification. This paper firstly proposes an improved XLNet text classification model based on the problems of long-term dependence and insufficient contextual semantic expression in previous pre-trained language models, and uses XLNet pre-trained language modeling to represent text as low-dimensional word vectors to obtain sequences. Secondly, the generated word vector sequence is passed into the LSTM network, and the two-way features of the sentence are extracted by the memory unit of the LSTM. On the basis of effectively extracting text features, the Multi-head attention model is used to calculate the multi-angle attention probability, to focus on the important words. Finally, text classification is achieved through a fully connected network. The experimental results show that the accuracy of the model is 0.9457 and the loss rate is 0.3133. Compared with other related models, it has achieved better results.

源语言英语
主期刊名Third International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2022
编辑Shuangming Yang
出版商SPIE
ISBN(电子版)9781510657281
DOI
出版状态已出版 - 2022
已对外发布
活动3rd International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2022 - Changsha, 中国
期限: 8 4月 202210 4月 2022

出版系列

姓名Proceedings of SPIE - The International Society for Optical Engineering
12329
ISSN(印刷版)0277-786X
ISSN(电子版)1996-756X

会议

会议3rd International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2022
国家/地区中国
Changsha
时期8/04/2210/04/22

学术指纹

探究 'The performance of improved XLNet on text classification' 的科研主题。它们共同构成独一无二的指纹。

引用此