
Research Article
Fast Estimation for the Number of Clusters
@INPROCEEDINGS{10.1007/978-3-030-63941-9_27, author={Xiaohong Zhang and Zhenzhen He and Zongpu Jia and Jianji Ren}, title={Fast Estimation for the Number of Clusters}, proceedings={6GN for Future Wireless Networks. Third EAI International Conference, 6GN 2020, Tianjin, China, August 15-16, 2020, Proceedings}, proceedings_a={6GN}, year={2021}, month={1}, keywords={Clustering The number of clusters Density Distance}, doi={10.1007/978-3-030-63941-9_27} }
- Xiaohong Zhang
Zhenzhen He
Zongpu Jia
Jianji Ren
Year: 2021
Fast Estimation for the Number of Clusters
6GN
Springer
DOI: 10.1007/978-3-030-63941-9_27
Abstract
Clustering analysis has been widely used in many areas. In many cases, the number of clusters is required to been assigned artificially, while inappropriate assignments affect analysis negatively. Many solutions have been proposed to estimate the optimal number of clusters. However, the accuracy of those solutions drop severely on overlapping data sets. To handle the accuracy problem, we propose a fast estimation solution based on the cluster centers selected in a static way. In the solution, each data point is assigned with one score calculated according to a density-distance model. The score of each data point does not change any more once it is generated. The solution takes the top k data points with the highest scores as the centers of k clusters. It utilizes the significant change of the minimal distance between cluster centers to identify the optimal number of the clusters in overlapping data sets. The experiment results verify the usefulness and effectiveness of our solution.