A Novel Center Point Initialization Technique for K-means Clustering Algorithm


  •  Dauda Usman    
  •  Ismail Mohamad    

Abstract

Clustering is a major data analysis tool utilized in numerous domains. The basic K-means method has been widely discussed and applied in many applications. But unfortunately failed to offer good clustering result due to the initial center points are chosen randomly. In this article, we present a new method of centre points initialization and we prove that the distance of the new method follows a Chi-square distribution. The new method overcomes the drawbacks of the basic K-means. Experimental analysis shows that the new method performs well on infectious diseases dataset when compare with the basic K-means clustering method and a histogram measures the quality of the new method.



This work is licensed under a Creative Commons Attribution 4.0 License.