###

计算机系统应用英文版:2019,28(8):120-128

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

面向不平衡数据的分类算法

蒋宗礼, 史倩月

(北京工业大学信息学部, 北京 100124)

Classification Algorithm for Imbalanced Data Set

JIANG Zong-Li, SHI Qian-Yue

(Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 2020次下载 2647次
Received:January 08, 2019 Revised:February 03, 2019

中文摘要: 不平衡数据在分类时往往会偏向"多数"，传统过采样生成的样本不能较好的表达原始数据集分布特征.改进的变分自编码器结合数据预处理方法，通过少数类样本训练，使用变分自编码器的生成器生成样本，用于以均衡训练数据集，从而解决传统采样导致的不平衡数据引起分类过拟合问题.我们在UCI四个常用的数据集上进行了实验，结果表明该算法在保证准确率的同时提高了F_measure和G_mean.

中文关键词: 不平衡数据分类变分自编码器过采样深度学习

Abstract:Imbalanced dataset tends to be biased towards "majority" when classifying, and samples generated by traditional over-sampling cannot well express the distribution characteristics of the original dataset. The improved variational autoencoders combine with data preprocessing method, generate samples by the generator of variational autoencoders trained by the minority class samples to balance the training data set, solve the overfitting problem caused by imbalanced dataset of traditional sampling. Experiments are carried out on four commonly used UCI datasets, the results demonstrate that the proposed method shows better classification performance in F_measure and G_mean with high accuracy.

keywords: imbalanced dataset classification variational autoencoders over-sampling deep learning

文章编号： 中图分类号： 文献标志码：

基金项目:

Author Name	Affiliation	E-mail
JIANG Zong-Li	Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
SHI Qian-Yue	Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China	qianyue_sophia@163.com

Author Name	Affiliation	E-mail
JIANG Zong-Li	Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
SHI Qian-Yue	Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China	qianyue_sophia@163.com

引用文本：
蒋宗礼,史倩月.面向不平衡数据的分类算法.计算机系统应用,2019,28(8):120-128
JIANG Zong-Li,SHI Qian-Yue.Classification Algorithm for Imbalanced Data Set.COMPUTER SYSTEMS APPLICATIONS,2019,28(8):120-128