Max-D clustering K-means algorithm for Autogeneration of Centroids and Distance of Data Points Cluster
K-Means is one of the unsupervised learning and partitioning clustering algorithms. It is very popular and widely used for its simplicity and fastness. The main drawback of this algorithm is that user should specify the number of cluster in advance. As an iterative clustering strategy, K-Means algor...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/6871/ http://umpir.ump.edu.my/id/eprint/6871/ http://umpir.ump.edu.my/id/eprint/6871/1/Max-D_clustering_K-means_algorithm_for_Autogeneration.pdf |
Summary: | K-Means is one of the unsupervised learning and partitioning clustering algorithms. It is very popular and widely used for its simplicity and fastness. The main drawback of this algorithm is that user should specify the number of cluster in advance. As an iterative clustering strategy, K-Means algorithm is very sensitive to the initial starting conditions. In this paper has
been proposed a clustering technique called MaxD K-Means clustering algorithm. MaxD K-Means algorithm auto generates initial k (the desired number of cluster) without asking for input from the user. MaxD k-means also
used a novel strategy of setting the initial centroids. The experiment of the Max-D means has been conducted using synthetic data, which is taken from the Llyod’s K-Means experiments. Another experiment has been done using reallife data focusing on student’s results in higher-education institution in Malaysia. The results from the new algorithm show that the number of iteration improves tremendously, and the number of iterations is reduced. The improvement rate is around 78%. |
---|