Semantic Enhancement and Multi-level Label Embedding for Chinese News Headline Classification

  • Jiangnan Qi
  • , Yuan Rao
  • , Ling Sun
  • , Xiong Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

News headline classification is a specific example of short text classification, which aims to extract semantic information from the short text and classify it accurately. It can provide a fast classification method for data of various kinds of news media, thus arousing the common concern of academia and industry. Most short text classification methods are based on the semantic expansion of external knowledge, which is unable to expansion dynamically in real time and make full use of label information. To overcome these problems, we propose a novel method which consists of three parts: semantic enhancement, multi-dimensional feature fusion network and multi-level label embedding. Firstly, the word-level semantic information are embedded into the character encoding from pre-Train model to enhance semantic features. Secondly, both of Bi-GRU and multi-scale CNN are used to extract sequence and local features of text to enhance the semantic representation of the sentence. Furthermore, the multi-level label embedding is used to filter textual vector and assist classification in the word and sentence level respectively. Experimental results on NLPCC 2017 Chinese news headline classification task show that our model achieves 84.74% of accuracy and 84.75% of F1, improves over the best baseline model by 1.5% and 1.6%, respectively, and reaches the state-of-The-Art performance.

Original languageEnglish
Title of host publicationProceedings - 2019 14th International Joint Symposium on Artificial Intelligence and Natural Language Processing, iSAI-NLP 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728156316
DOIs
StatePublished - Oct 2019
Event14th International Joint Symposium on Artificial Intelligence and Natural Language Processing, iSAI-NLP 2019 - Chiang Mai, Thailand
Duration: 30 Oct 20191 Nov 2019

Publication series

NameProceedings - 2019 14th International Joint Symposium on Artificial Intelligence and Natural Language Processing, iSAI-NLP 2019

Conference

Conference14th International Joint Symposium on Artificial Intelligence and Natural Language Processing, iSAI-NLP 2019
Country/TerritoryThailand
CityChiang Mai
Period30/10/191/11/19

Keywords

  • News headlines classification
  • multi-dimensional feature fusion
  • multi-level label embedding
  • semantic enhance

Fingerprint

Dive into the research topics of 'Semantic Enhancement and Multi-level Label Embedding for Chinese News Headline Classification'. Together they form a unique fingerprint.

Cite this