TY - GEN
T1 - Hybrid genetic clustering by using FCM and geodesic distance for complex distributed data
AU - Yang, Yongsheng
AU - Li, Gang
AU - Zhu, Yongsheng
AU - Zhang, Youyun
PY - 2013
Y1 - 2013
N2 - To efficiently find hidden clusters in datasets with complex distributed data,inspired by complementary strategies, a hybrid genetic clustering algorithm was developed, which is on the basis of the geodesic distance metric, and combined with the Fuzzy C-Means clustering (FCM) algorithm. First, instead of using Euclidean distance,the new approach employs geodesic distance based dissimilarity metric during all fitness evaluation. And then, with the help of FCM clustering, some sub-clusters with spherical distribution are partitioned effectively. Next, a genetic algorithm based clustering using geodesic distance metric, named GCGD, is adopted to cluster the clustering centers obtained from FCM clustering. Finally, the final results are acquired based on above two clustering results. Experimental results on eight benchmark datasets clustering questions show the effectiveness of the algorithm as a clustering technique. Compared with conventional GCGD, the hybrid clustering can decrease the computational time obviously, while retaining high clustering correct ratio.
AB - To efficiently find hidden clusters in datasets with complex distributed data,inspired by complementary strategies, a hybrid genetic clustering algorithm was developed, which is on the basis of the geodesic distance metric, and combined with the Fuzzy C-Means clustering (FCM) algorithm. First, instead of using Euclidean distance,the new approach employs geodesic distance based dissimilarity metric during all fitness evaluation. And then, with the help of FCM clustering, some sub-clusters with spherical distribution are partitioned effectively. Next, a genetic algorithm based clustering using geodesic distance metric, named GCGD, is adopted to cluster the clustering centers obtained from FCM clustering. Finally, the final results are acquired based on above two clustering results. Experimental results on eight benchmark datasets clustering questions show the effectiveness of the algorithm as a clustering technique. Compared with conventional GCGD, the hybrid clustering can decrease the computational time obviously, while retaining high clustering correct ratio.
KW - Data clustering
KW - Fuzzy c-means
KW - Genetic algorithm
KW - Geodesic distance
UR - https://www.scopus.com/pages/publications/84872470252
U2 - 10.4028/www.scientific.net/AMM.263-266.2597
DO - 10.4028/www.scientific.net/AMM.263-266.2597
M3 - 会议稿件
AN - SCOPUS:84872470252
SN - 9783037855744
T3 - Applied Mechanics and Materials
SP - 2597
EP - 2601
BT - Information Technology Applications in Industry
T2 - 2012 International Conference on Information Technology and Management Innovation, ICITMI 2012
Y2 - 10 November 2012 through 11 November 2012
ER -