TY - JOUR
T1 - Robust mixture clustering using Pearson type VII distribution
AU - Sun, Jianyong
AU - Kabán, Ata
AU - Garibaldi, Jonathan M.
PY - 2010/12/1
Y1 - 2010/12/1
N2 - A mixture of Student t-distributions (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach by modeling the data sets using mixture of Pearson type VII distributions (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. An outlier detection criterion is derived from the EM solution. Controlled experimental results on the synthetic datasets show that the MoP is more viable than the MoT. The MoP performs comparably if not better, on average, in terms of outlier detection accuracy and out-of-sample log-likelihood with the MoT. Furthermore, we compared the performances of the Pearson type VII and the student t mixtures on the classification of several real pattern recognition data sets. The comparison favours the developed Pearson type VII mixtures.
AB - A mixture of Student t-distributions (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach by modeling the data sets using mixture of Pearson type VII distributions (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. An outlier detection criterion is derived from the EM solution. Controlled experimental results on the synthetic datasets show that the MoP is more viable than the MoT. The MoP performs comparably if not better, on average, in terms of outlier detection accuracy and out-of-sample log-likelihood with the MoT. Furthermore, we compared the performances of the Pearson type VII and the student t mixtures on the classification of several real pattern recognition data sets. The comparison favours the developed Pearson type VII mixtures.
KW - Outlier detection
KW - Pearson type VII distribution
KW - Robust learning
KW - Robust mixture modeling
UR - https://www.scopus.com/pages/publications/77957278305
U2 - 10.1016/j.patrec.2010.07.015
DO - 10.1016/j.patrec.2010.07.015
M3 - 文章
AN - SCOPUS:77957278305
SN - 0167-8655
VL - 31
SP - 2447
EP - 2454
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
IS - 16
ER -