本文已被:浏览 477次 下载 1307次
Received:August 13, 2021 Revised:September 13, 2021
Received:August 13, 2021 Revised:September 13, 2021
中文摘要: 以K-means为代表的聚类算法被广泛地应用在许多领域, 但是K-means不能直接处理不完整数据集. km-means是一种处理不完整数据集的聚类算法, 通过调整局部距离计算方式, 减少不完整数据对聚类过程的影响. 然而km-means初始化阶段选取的聚类中心存在较大的不可靠性, 容易陷入局部最优解. 针对此问题, 本文引入可信度, 提出了结合可信度的km-means聚类算法, 通过可信度调整距离计算, 增大初始化过程中选取聚类中心的可靠性, 提高聚类算法的准确度. 最后, 通过UCI和UCR数据集验证算法的有效性.
Abstract:The clustering algorithm represented by K-means is widely used in many fields, but K-means cannot directly deal with incomplete data. km-means is a clustering algorithm for processing incomplete data. It reduces the impact of incomplete data on the clustering process by adjusting the calculation method of partial distance. However, the centroids selected in the initialization stage of km-means are unreliable, resulting in local optimal solutions. For incomplete data, a clustering algorithm that combined credibility was proposed to solve this problem. The calculation of distance was adjusted by credibility to increase the reliability of cluster centroids in the initialization stage, improving the performance of clustering algorithm. Finally, the algorithm was verified by the experimental results from the UCI and UCR dataset.
文章编号: 中图分类号: 文献标志码:
基金项目:福建省自然科学基金(2018J01794)
引用文本:
熊君竹,何振峰.结合可信度的km-means算法.计算机系统应用,2022,31(6):175-181
XIONG Jun-Zhu,HE Zhen-Feng.Clustering Algorithm of km-means with Credibility.COMPUTER SYSTEMS APPLICATIONS,2022,31(6):175-181
熊君竹,何振峰.结合可信度的km-means算法.计算机系统应用,2022,31(6):175-181
XIONG Jun-Zhu,HE Zhen-Feng.Clustering Algorithm of km-means with Credibility.COMPUTER SYSTEMS APPLICATIONS,2022,31(6):175-181