基于集束搜索的可解释阈值树构造

doi:10.15888/j.cnki.csa.009309

微信公众号

网站二维码

首页 > 过刊浏览>2023年第32卷第11期 >247-252. DOI:10.15888/j.cnki.csa.009309

PDF HTML阅读 XML下载导出引用引用提醒

基于集束搜索的可解释阈值树构造
DOI:
                        10.15888/j.cnki.csa.009309
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:

Explainable Threshold Tree Construction Based on Beam Search

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

传统的聚类算法能够将数据集划分成不同的簇, 但是这些簇通常都是难以解释的. IMM (iterative mistake minimization)是一种常见的可解释聚类算法, 通过单个特征来构造阈值树, 每个簇都可以用根节点到叶子节点路径上的特征-阈值对进行解释. 然而, 阈值树在每一轮划分数据时仅考虑错误最少的特征-阈值对, 这种贪心的方法容易导致局部最优解. 针对这一问题, 本文引入了集束搜索, 通过在阈值树的每一轮划分过程当中保留预定数量的状态来减缓局部最优, 进而提高阈值树提供的聚类划分与初始聚类划分的一致性. 最后, 通过实验验证了该算法的有效性.

Abstract:

Traditional clustering algorithms can split the dataset into different clusters, whereas these clusters are usually difficult to explain. Iterative mistake minimization (IMM) is a common explainable clustering algorithm, which constructs a threshold tree from a single feature, and each cluster can be explained by feature-threshold pairs on the path from the root node to the leaf node. However, the threshold tree only considers the feature-threshold pair with the fewest errors when dividing the data in each round, and this greedy method is easy to lead to the local optimal solution. To solve this problem, this study introduces beam search, which slows local optimization by retaining a predetermined number of states in each round of division, thereby improving the consistency between the clustering provided by the threshold tree and the initial clustering. Finally, the effectiveness of the algorithm is verified by experiments.

参考文献

相似文献

引证文献

引用本文

李钰群,何振峰.基于集束搜索的可解释阈值树构造.计算机系统应用,2023,32(11):247-252

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-03-21
最后修改日期:2023-05-11
录用日期:
在线发布日期: 2023-08-22
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码