Clustering Algorithm of k<sub>m</sub>-means with Credibility

doi:10.15888/j.cnki.csa.008498

2022, Vol. 31, No. 6, pp. 175-181

Author:

XIONG Jun-Zhu
College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
HE Zhen-Feng
College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China

Fund Project: Natural Science Foundation of Fujian Province (2018J01794)

Abstract:

Clustering algorithms represented by K-means are widely used in many fields, but K-means cannot directly handle incomplete datasets. k_m-means is a clustering algorithm for incomplete datasets that reduces the influence of incomplete data on the clustering process by adjusting how the partial distance is calculated. However, the cluster centroids selected in the initialization stage of k_m-means are highly unreliable, so the algorithm easily falls into a local optimum. To address this problem, this paper introduces credibility and proposes a k_m-means clustering algorithm combined with credibility. Credibility is used to adjust the distance calculation, which makes the centroids selected during initialization more reliable and improves clustering accuracy. Finally, the effectiveness of the algorithm is verified on UCI and UCR datasets.

Key words: incomplete data; cluster centroids; credibility; partial distance; K-means
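
The abstract does not give the distance formula, so the following is only a minimal sketch of the general partial-distance idea for incomplete records, not the paper's k_m-means or its credibility weighting (which the abstract does not define). It assumes missing values are encoded as NaN and rescales the sum over observed attributes by d/m, a common convention; the function names `partial_distance` and `nearest_centroid` are hypothetical.

```python
import math
from typing import Sequence


def partial_distance(x: Sequence[float], c: Sequence[float]) -> float:
    """Squared Euclidean distance over attributes observed in both vectors.

    Assumption of this sketch: missing values are encoded as NaN. The sum
    over the m observed attributes is rescaled by d / m (d = total number
    of attributes) so records with different amounts of missing data stay
    comparable.
    """
    if len(x) != len(c):
        raise ValueError("vectors must have the same length")
    observed = [(a, b) for a, b in zip(x, c)
                if not (math.isnan(a) or math.isnan(b))]
    if not observed:
        return math.inf  # no shared observed attribute to compare on
    total = sum((a - b) ** 2 for a, b in observed)
    return total * len(x) / len(observed)


def nearest_centroid(x: Sequence[float],
                     centroids: Sequence[Sequence[float]]) -> int:
    """Index of the centroid with the smallest partial distance to x."""
    return min(range(len(centroids)),
               key=lambda j: partial_distance(x, centroids[j]))


# A record whose second attribute is missing.
record = [1.0, math.nan, 3.0]
centroids = [[0.0, 0.0, 0.0], [1.0, 5.0, 2.0]]
print(partial_distance(record, centroids[1]))  # 1.5
print(nearest_centroid(record, centroids))     # 1
```

In this sketch the missing attribute is simply skipped, which is why a poorly observed record can be assigned to a centroid on very little evidence; a reliability weight of the kind the abstract calls credibility would enter at the distance step.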

Cite this article

XIONG Jun-Zhu, HE Zhen-Feng. Clustering Algorithm of k_m-means with Credibility. Computer Systems & Applications, 2022, 31(6): 175-181

History

Received: 2021-08-13
Revised: 2021-09-13
Published online: 2022-05-26
