TY - GEN
T1 - Model-based clustering with nonconvex penalty
AU - Chang, Xiangyu
AU - Cao, Xiangyong
AU - Liang, Dong
AU - Lu, Xiaoling
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/9/22
Y1 - 2016/9/22
N2 - Nonconvex penalty functions, which include the smoothly clipped absolute deviation (SCAD) penalty, minimax concave penalty (MCP) and ℓq(0 ≤ q < 1) norm penalty, have been demonstrated to have attractive theoretical properties and excellent performance on experiment studies in the area of penalized regressions, compressive sensing and matrix completion. To take their advantages, we propose a penalized model-based clustering framework via the nonconvex penalty functions for dealing with high-dimensional data clustering problems. We establish an expectation-maximization (EM) algorithm to fit the suggested framework efficiently. To illustrate the general framework, we utilize four popular nonconvex penalties (SCAD, MCP, ℓ0 and ℓ1/2) to construct specific models. They are compared with the ℓ1 penalty in the simulations and a real world application. Based on our experiments, the finite sample performance of the four proposed models is well exhibited. In particular, our numerical results suggest that the model-based clustering with the MCP or ℓ0 penalty is the preferred approach.
AB - Nonconvex penalty functions, which include the smoothly clipped absolute deviation (SCAD) penalty, minimax concave penalty (MCP) and ℓq(0 ≤ q < 1) norm penalty, have been demonstrated to have attractive theoretical properties and excellent performance on experiment studies in the area of penalized regressions, compressive sensing and matrix completion. To take their advantages, we propose a penalized model-based clustering framework via the nonconvex penalty functions for dealing with high-dimensional data clustering problems. We establish an expectation-maximization (EM) algorithm to fit the suggested framework efficiently. To illustrate the general framework, we utilize four popular nonconvex penalties (SCAD, MCP, ℓ0 and ℓ1/2) to construct specific models. They are compared with the ℓ1 penalty in the simulations and a real world application. Based on our experiments, the finite sample performance of the four proposed models is well exhibited. In particular, our numerical results suggest that the model-based clustering with the MCP or ℓ0 penalty is the preferred approach.
UR - https://www.scopus.com/pages/publications/84991727631
U2 - 10.1109/CYBER.2016.7574796
DO - 10.1109/CYBER.2016.7574796
M3 - 会议稿件
AN - SCOPUS:84991727631
T3 - 6th Annual IEEE International Conference on Cyber Technology in Automation, Control and Intelligent Systems, IEEE-CYBER 2016
SP - 61
EP - 66
BT - 6th Annual IEEE International Conference on Cyber Technology in Automation, Control and Intelligent Systems, IEEE-CYBER 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th Annual IEEE International Conference on Cyber Technology in Automation, Control and Intelligent Systems, IEEE-CYBER 2016
Y2 - 19 June 2016 through 22 June 2016
ER -