Borderline-mixup Imbalanced Data Sets Classification Method

doi:10.15888/j.cnki.csa.009297

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 19

Home > Archive>Volume 32, Issue 11, 2023 >73-82. DOI:10.15888/j.cnki.csa.009297

PDF HTML XML Export Cite reminder

Borderline-mixup Imbalanced Data Sets Classification Method
DOI:
                        10.15888/j.cnki.csa.009297
                    
CSTR:
                        [cstr]
                    
Author:
                        WU Zhen-XuanWU Zhen-Xuan
College of Computer and Cyber Security, Fujian Normal University, Fuzhou 350117, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
GUO Gong-DeGUO Gong-De
College of Computer and Cyber Security, Fujian Normal University, Fuzhou 350117, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
WANG HuiWANG Hui
School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast BT9 5BN, UK
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Related [20]

Cited by

Materials

Comments

Abstract:

The problem of imbalanced datasets has attracted people’s attention since two decades ago, and various solutions have been proposed. Mixup is a popular data synthesis method in recent years, with many variants extended. However, there are not many Mixup variants proposed for imbalanced datasets. This study proposes a Mixup variant, namely Borderline-mixup, to address the classification problem of imbalanced datasets, which uses a support vector machine (SVM) to select boundary samples and increases the probability that the boundary sample is sampled in the sampler. Two boundary samplers are constructed to replace the original random sampler. Extensive experiments have been conducted on 14 UCI datasets and CIFAR10 long-tail datasets. The results show that Borderline-mixup has outperformed Mixup consistently on UCI datasets by up to 49.3% and on CIFAR10 long-tail datasets by about 3%–3.6%. Therefore, the proposed Borderline-mixup is effective in the classification of imbalanced datasets.

Key words:Mixup;support vector machine (SVM);imbalanced data sets;boundary samples;classification

Get Citation

吴振煊,郭躬德,王晖. Borderline-mixup不平衡数据集分类方法.计算机系统应用,2023,32(11):73-82

Copy

Article Metrics

Abstract:673
PDF: 2081
HTML: 1580
Cited by: 0

History

Received:April 30,2023
Revised:May 29,2023
Adopted:
Online: September 15,2023
Published:

Article QR Code

You are the first992323Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063