Abstract
As keyphrase is a small set of words that can best represent a document, they play significant roles in varieties of text-related tasks. In recent years, many unsupervised and supervised methods have been proposed for keyphrase extraction. However, keyphrase extraction is an imbalanced classification problem in nature and contains many unlabeled data, which have not been paid attention to in the previous studies. In this research, a new semi-supervised learning method, COS-training, is proposed for keyphrase extraction based on co-training and SMOTE. For the testing and illustration purpose, a keyphrase extraction dataset is selected to verify the effectiveness of the proposed method. Empirical results reveal that COS-training is a potential solution for keyphrase extraction. Among the compared methods, COS-training gets the best result. Al l these results illustrate that COS-training can be used as an alternative method for keyphrase extraction.
| Original language | English |
|---|---|
| Pages (from-to) | 233-238 |
| Number of pages | 6 |
| Journal | ICIC Express Letters, Part B: Applications |
| Volume | 6 |
| Issue number | 1 |
| State | Published - 1 Jan 2015 |
| Externally published | Yes |
Keywords
- Co-training
- Keyphrase extraction
- SMOTE
- Semi-supervised learning
Fingerprint
Dive into the research topics of 'COS-training: A new semi-supervised learning method for keyphrase extraction based on co-training and SMOTE'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver