
COS-training: A new semi-supervised learning method for keyphrase extraction based on co-training and SMOTE

  • Hefei University of Technology
  • City University of Hong Kong

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

Because keyphrases are a small set of words that best represent a document, they play a significant role in a variety of text-related tasks. In recent years, many unsupervised and supervised methods have been proposed for keyphrase extraction. However, keyphrase extraction is by nature an imbalanced classification problem and involves large amounts of unlabeled data, two issues that previous studies have not addressed. In this research, a new semi-supervised learning method, COS-training, is proposed for keyphrase extraction based on co-training and SMOTE. For testing and illustration, a keyphrase extraction dataset is selected to verify the effectiveness of the proposed method. Empirical results indicate that COS-training is a potential solution for keyphrase extraction: among the compared methods, COS-training achieves the best result. These results suggest that COS-training can be used as an alternative method for keyphrase extraction.
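The abstract names the two building blocks, co-training and SMOTE (Synthetic Minority Over-sampling Technique), but does not give the algorithm itself. The following is only a generic sketch of how the two ideas can be combined, not the authors' COS-training procedure. It assumes two feature views per candidate phrase, label 1 for the minority (keyphrase) class, and a simple nearest-centroid classifier standing in for whatever base learner the paper uses. All function names (`smote`, `balance`, `co_train`, and so on) are hypothetical.

```python
import numpy as np

def smote(X_min, n_new, k=3, rng=None):
    """Create n_new synthetic minority samples by interpolating between a
    minority sample and one of its k nearest minority neighbours."""
    rng = np.random.default_rng(rng)
    n = len(X_min)
    k = min(k, n - 1)  # requires at least two minority samples
    dist = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)          # a sample is not its own neighbour
    neighbours = np.argsort(dist, axis=1)[:, :k]
    base = rng.integers(0, n, size=n_new)
    nb = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))            # interpolation factor in [0, 1)
    return X_min[base] + gap * (X_min[nb] - X_min[base])

def balance(X, y, rng):
    """Oversample class 1 with SMOTE until it matches class 0 in size."""
    n_min, n_maj = int((y == 1).sum()), int((y == 0).sum())
    if n_min < 2 or n_min >= n_maj:
        return X, y
    synth = smote(X[y == 1], n_maj - n_min, rng=rng)
    return np.vstack([X, synth]), np.concatenate([y, np.ones(len(synth), dtype=y.dtype)])

def fit_centroids(X, y):
    """Stand-in base learner: one centroid per class (assumes both classes present)."""
    return np.stack([X[y == 0].mean(axis=0), X[y == 1].mean(axis=0)])

def predict_with_margin(centroids, X):
    """Predicted class and a confidence score (gap between centroid distances)."""
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return d.argmin(axis=1), np.abs(d[:, 0] - d[:, 1])

def co_train(views_l, y_l, views_u, rounds=5, per_round=2, seed=0):
    """Each round, a classifier per view is trained on SMOTE-balanced labeled
    data and its most confident predictions on the unlabeled pool are added
    to the shared labeled set."""
    rng = np.random.default_rng(seed)
    views_l = [v.copy() for v in views_l]
    y_l = y_l.copy()
    pool = np.arange(len(views_u[0]))       # indices of still-unlabeled items
    for _ in range(rounds):
        if len(pool) == 0:
            break
        picked = {}
        for v in range(2):
            Xb, yb = balance(views_l[v], y_l, rng)
            pred, margin = predict_with_margin(fit_centroids(Xb, yb), views_u[v][pool])
            for i in np.argsort(-margin)[:per_round]:
                picked.setdefault(pool[i], pred[i])   # first view to pick wins
        idx = np.array(list(picked))
        views_l = [np.vstack([views_l[v], views_u[v][idx]]) for v in range(2)]
        y_l = np.concatenate([y_l, np.array([picked[i] for i in idx])])
        pool = np.setdiff1d(pool, idx)
    return views_l, y_l
```

In this sketch SMOTE is applied separately inside each view before every training step, so the synthetic samples never enter the shared labeled set; whether the paper balances the data this way is not stated in the abstract.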

Original language: English
Pages (from-to): 233-238
Number of pages: 6
Journal: ICIC Express Letters, Part B: Applications
Volume: 6
Issue number: 1
State: Published - 1 Jan 2015
Externally published: Yes

Keywords

  • Co-training
  • Keyphrase extraction
  • SMOTE
  • Semi-supervised learning
