Computer Systems Applications, 2023, 32(11): 247-252
Explainable Threshold Tree Construction Based on Beam Search
(College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China)
Received: March 21, 2023    Revised: May 11, 2023
Abstract: Traditional clustering algorithms partition a dataset into clusters, but the resulting clusters are usually hard to explain. Iterative mistake minimization (IMM) is a common explainable clustering algorithm that builds a threshold tree in which every split tests a single feature, so each cluster can be explained by the feature-threshold pairs on the path from the root to its leaf. However, in each round of splitting, the threshold tree considers only the feature-threshold pair with the fewest mistakes, and this greedy strategy is prone to local optima. To address this problem, this study introduces beam search, which retains a predetermined number of states in each round of splitting to mitigate local optima, thereby improving the consistency between the clustering given by the threshold tree and the initial clustering. Finally, experiments verify the effectiveness of the algorithm.
Keywords: explainable clustering; beam search; threshold tree; K-means
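The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `candidate_splits` and `beam_threshold_tree` are hypothetical, and several details are assumptions made for illustration, namely that candidate thresholds are midpoints between center coordinates, that a mistake is a point separated from its own cluster center, that each state expands its first unsplit node, and that states are ranked by cumulative mistakes.

```python
import numpy as np

def candidate_splits(X, labels, centers, pts, ctrs):
    """Yield (mistakes, cut, left, right) for each cut that separates the node's centers."""
    for f in range(X.shape[1]):
        vals = np.unique(centers[ctrs, f])        # sorted distinct center coordinates
        for t in (vals[:-1] + vals[1:]) / 2:      # assumed thresholds: midpoints
            c_left = centers[:, f] <= t           # side of each center
            p_left = X[pts, f] <= t               # side of each point in this node
            ok = p_left == c_left[labels[pts]]    # point stays with its own center
            left = (pts[ok & p_left], [c for c in ctrs if c_left[c]])
            right = (pts[ok & ~p_left], [c for c in ctrs if not c_left[c]])
            yield int((~ok).sum()), (f, float(t)), left, right

def beam_threshold_tree(X, labels, centers, beam_width=1):
    """Return (total mistakes, list of (feature, threshold) cuts) of the best kept state.

    Assumes the centers are pairwise distinct, so every node holding more than
    one center admits at least one separating cut.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    centers = np.asarray(centers, dtype=float)
    k = len(centers)
    # A state is (cumulative mistakes, nodes still to split, cuts chosen so far).
    beam = [(0, [(np.arange(len(X)), list(range(k)))], [])]
    for _ in range(k - 1):                        # a tree with k leaves needs k - 1 cuts
        expanded = []
        for cost, open_nodes, cuts in beam:
            (pts, ctrs), rest = open_nodes[0], open_nodes[1:]
            for m, cut, left, right in candidate_splits(X, labels, centers, pts, ctrs):
                # Mistaken points are dropped; only multi-center children stay open.
                children = [n for n in (left, right) if len(n[1]) > 1]
                expanded.append((cost + m, rest + children, cuts + [cut]))
        expanded.sort(key=lambda s: s[0])         # fewest cumulative mistakes first
        beam = expanded[:beam_width]              # keep a fixed number of states
    return beam[0][0], beam[0][2]
```

With `beam_width=1` the sketch keeps only the single best split per round, which corresponds to the greedy choice described above. Larger widths keep more alternatives alive; this sketch only guarantees a result at least as good as the greedy one when the width is large enough that no state is ever pruned.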
Citation:
LI Yu-Qun, HE Zhen-Feng. Explainable Threshold Tree Construction Based on Beam Search. Computer Systems Applications, 2023, 32(11): 247-252