加入跳跃连接的深度嵌入K-means聚类

doi:10.15888/j.cnki.csa.009348

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月1日 10:57 星期二

首页 > 过刊浏览>2024年第33卷第1期 >11-21. DOI:10.15888/j.cnki.csa.009348

PDF HTML阅读 XML下载导出引用引用提醒

加入跳跃连接的深度嵌入K-means聚类
DOI:
                        10.15888/j.cnki.csa.009348
                    
CSTR:
                        32024.14.csa.009348
                    
作者:
                        李顺勇李顺勇
山西大学 数学科学学院, 太原 030006
在期刊界中查找
在百度中查找
在本站中查找
胥瑞胥瑞
山西大学 数学科学学院, 太原 030006
在期刊界中查找
在百度中查找
在本站中查找
李师毅李师毅
山西大学 数学科学学院, 太原 030006
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(82274360, 61976128); 2022 年度山西省研究生教育教学改革课题(2022YJJG010); 山西省横向课题(109023901054)

Deep Embedded K-means Clustering with Skip Connections

Author:

LI Shun-Yong
LI Shun-Yong
School of Mathematical Sciences, Shanxi University, Taiyuan 030006, China
在期刊界中查找
在百度中查找
在本站中查找
XU Rui
XU Rui
School of Mathematical Sciences, Shanxi University, Taiyuan 030006, China
在期刊界中查找
在百度中查找
在本站中查找
LI Shi-Yi
LI Shi-Yi
School of Mathematical Sciences, Shanxi University, Taiyuan 030006, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

现有的深度聚类算法大多采用对称的自编码器来提取高维数据的低维特征, 但随着自编码器训练次数的不断增加, 数据的低维特征空间在一定程度上发生了扭曲, 这样得到的数据低维特征空间无法反映原始数据空间中潜在的聚类结构信息. 为了解决上述问题, 本文提出了一种新的深度嵌入K-means算法(SDEKC). 首先, 在低维特征提取阶段, 在对称的卷积自编码器中相对应的编码器与解码器之间以一定的权重加入两个跳跃连接, 以减弱解码器对编码器的编码要求同时突出卷积自编码器的编码能力, 这样可以更好地保留原始数据空间中蕴含的聚类结构信息; 其次, 在聚类阶段, 通过一个标准正交变换矩阵将低维数据空间转换为一个新的揭示聚类结构信息的空间; 最后, 本文以端到端的方式采用贪婪算法迭代优化数据的低维表示及其聚类, 在6个真实数据集上验证了本文提出新算法的有效性.

关键词:跳跃连接;深度学习;卷积自编码器;嵌入K-means

Abstract:

Most of the existing deep clustering algorithms adopt symmetric autoencoders to extract low-dimensional features of high-dimensional data. However, with the increasing training times of autoencoders, the low-dimensional feature space of the data is distorted to a certain extent, and then the obtained data low-dimensional feature space cannot reflect the potential clustering structure information in the original data space. To this end, this study proposes a new deep embedded K-means algorithm (SDEKC). First, during low-dimensional feature extraction, two skip connections are added with a certain weight between the corresponding encoder and decoder in the symmetric convolutional autoencoder. As a result, the encoding requirements of the decoder for the encoder are reduced, and the coding ability of the convolutional autoencoder is highlighted, which can better retain the clustering structure information in the original data space. Second, the low-dimensional data space is converted into a new space revealing clustering structure information by an orthogonal transformation matrix in the clustering stage. Finally, this study utilizes the greedy algorithm to iteratively optimize the low-dimensional representation of the data and its clustering in an end-to-end way and verifies the effectiveness of the proposed new algorithm on six real datasets.

Key words:skip connections;deep learning;convolutional autoencoder;embedded K-means

引用本文

李顺勇,胥瑞,李师毅.加入跳跃连接的深度嵌入K-means聚类.计算机系统应用,2024,33(1):11-21

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-06-29
最后修改日期:2023-07-27
录用日期:
在线发布日期: 2023-11-17
出版日期: 2023-01-05

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码