Unbalanced Data Mining of Self-Encoding Network under Spectral Clustering Undersampling

doi:10.15888/j.cnki.csa.008105

WeChat

Mobile website

Home > Archive>Volume 30, Issue 10, 2021 >331-335. DOI:10.15888/j.cnki.csa.008105

PDF HTML XML Export Cite reminder

Unbalanced Data Mining of Self-Encoding Network under Spectral Clustering Undersampling
DOI:
                        10.15888/j.cnki.csa.008105
                    
CSTR:
                        [cstr]
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The application fields of unbalanced data sets are becoming increasingly extensive, and the demand for them is getting higher. Taking the spectral clustering undersampling as a prerequisite, this study develops an unbalanced data mining method based on a self-encoding network to improve the classification accuracy of the overall data set. The clustering problem is converted into the multi-path partition problem of an undirected graph, and the spectral clustering is completed depending on the undirected graph and standardized processing. The majority of data sets are processed through selective undersampling to yield the classification boundary offset. The learning process is a self-encoding network of unsupervised learning, based on which the dimensionality of data is increased or reduced so that hidden features of each dimension can be obtained and the efficient representation and learning of data are realized at all levels. The self-encoding network is adjusted according to the comparison between the maximum mean difference and the preset threshold. The unbalanced data mining is then completed with the obtained classification interface. UCI data sets with different practical application backgrounds are selected, from which 10 sets of data are extracted as test sets. After spectral clustering undersampling, the simulation experiments demonstrate that the proposed method greatly improves the classification accuracy of the minority and overall mining performance, which shows good applicability and feasibility.

Reference

Cited by

Get Citation

王舒梵,严涛,姜新盈.谱聚类欠取样下自编码网络不平衡数据挖掘.计算机系统应用,2021,30(10):331-335

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:December 24,2020
Revised:January 25,2021
Adopted:
Online: October 08,2021
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063