跳到主要导航 跳到搜索 跳到主要内容

An Unbalanced Emotion Classification Method for Interactive Texts Based on Multiple-Domain Instance Transfer

  • Xi'an Jiaotong University
  • Coventry University

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

A data level sampling method of target dataset-oriented instance transfer is proposed to solve the problem that the characteristics of interactive texts such as short sentences, missing parts of sentences and unbalanced class distribution in multiple-domains result in difficulties of high dimension, sparse eigenvalue in feature space and lack of positive instances. A function is employed to choose features for evaluating the instance similarity between source and target datasets. The function calculates the sum of the information gains of Top-N common features of these two datasets and their proportions in the sum. Moreover, a homogenization processing method is presented for feature spaces of the target dataset and the source dataset to overcome the feature spaces inconsistency between these two datasets. A method for selecting and transferring instances from a domain of source dataset to the corresponding one of target dataset is adopted to solve the problem of unbalanced class distribution in multiple domains. Experimental results show that the proposed method effectively alleviates the unbalanced problem in target dataset. The proposed method running with four classic classification methods, i.e. support vector machine, random forest, naive Bayes, and random committee, results in an 11.3% improvement in average of weighted receiver operating characteristic curve (ROC).

源语言英语
页(从-至)67-72
页数6
期刊Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University
49
4
DOI
出版状态已出版 - 10 4月 2015

学术指纹

探究 'An Unbalanced Emotion Classification Method for Interactive Texts Based on Multiple-Domain Instance Transfer' 的科研主题。它们共同构成独一无二的指纹。

引用此