摘要
In high-dimensional data clustering practices, the cluster structure is commonly assumed to be confined to a limited number of relevant features, rather than the entire feature set. However, for high-dimensional data, identifying the relevant features and discovering the cluster structure are still challenging problems. To solve these problems, this paper proposes a novel fuzzy $ {c}$-means (FCM) model with sparse regularization ( $ {\ell-{q}(0<q\leq 1)}$-norm regularization), by reformulating the FCM objective function into the weighted between-cluster sum of square form and imposing the sparse regularization on the weights. An algorithm is also developed to explicitly solve the proposed model. Compared with the existing clustering models, the proposed model can shrink the weights of irrelevant features (noisy features) to exact zero, and also can be efficiently solved in analytic forms when $ {q=1,1/2}$. Experiments on both synthetic and real-world data sets show that the proposed approach outperforms the existing clustering approaches.
| 源语言 | 英语 |
|---|---|
| 文章编号 | 7763769 |
| 页(从-至) | 2616-2627 |
| 页数 | 12 |
| 期刊 | IEEE Transactions on Cybernetics |
| 卷 | 47 |
| 期 | 9 |
| DOI | |
| 出版状态 | 已出版 - 9月 2017 |
学术指纹
探究 'Sparse Regularization in Fuzzy $c$-Means for High-Dimensional Data Clustering' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver