Abstract
Sparse clustering, which aims at finding a proper partition of extremely high dimensional data set with fewest relevant features, has been attracted more and more attention. Most researches model the problem through minimizing weighted feature contributions subject to a l1 constraint. However, the l0 constraint is the essential constraint for sparse modeling while the l1 constraint is only a convex relaxation of it. In this article, we bridge the gap between the l0 constraint and the l 1 constraint through development of two new sparse clustering models, which are the sparse k-means with the lq(0 < q < 1) constraint and the sparse k-means with the 10 constraint. By proving the certain forms of the optimal solutiion of particular lq(0 = q < 1) non-convex optimizations, two efficient iterative algorithms are proposed. We conclude with experiments on both synthetic data and the Allen Developing on both synthetic data and the lq(0 = q < 1) models exhibit the advantages compared with the standard k-mans and sparse k-means with the l1 constraint.
| Original language | English |
|---|---|
| Article number | 6729564 |
| Pages (from-to) | 797-806 |
| Number of pages | 10 |
| Journal | Proceedings - IEEE International Conference on Data Mining, ICDM |
| DOIs | |
| State | Published - 2013 |
| Event | 13th IEEE International Conference on Data Mining, ICDM 2013 - Dallas, TX, United States Duration: 7 Dec 2013 → 10 Dec 2013 |
Keywords
- 0l(0 = q < 1) Constraint
- High-Dimensional Clustering
- Sparse K-means
Fingerprint
Dive into the research topics of 'Sparse K-means with the lq(0 ≤ q < 1) constraint for high-dimensional data clustering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver