Personal Credit Prediction Based on Feature Optimization and Boosting Algorithm

doi:10.15888/j.cnki.csa.008959

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 10

Home > Archive>Volume 32, Issue 3, 2023 >224-231. DOI:10.15888/j.cnki.csa.008959

PDF HTML XML Export Cite reminder

Personal Credit Prediction Based on Feature Optimization and Boosting Algorithm
DOI:
                        10.15888/j.cnki.csa.008959
                    
CSTR:
                        [cstr]
                    
Author:
                        CHANG San-QiangCHANG San-Qiang
School of Management, University of Science and Technology of China, Hefei 230026, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
ZHOU Chui-RiZHOU Chui-Ri
School of Management, University of Science and Technology of China, Hefei 230026, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

With the rapid growth of Internet finance and electronic payment business, resulting personal credit problems are also increasing. Personal credit prediction is essentially an imbalanced binary sequence classification issue. Such an issue is faced with a large size and high dimension of data samples and extremely imbalanced data distribution. To effectively distinguish the credit situation of applicants, this study proposes a personal credit prediction method based on feature optimization and ensemble learning (PL-SmoteBoost). This method involves the construction of a personal credit prediction model within the boosting ensemble framework. Specifically, data initialization analysis with the Pearson correlation coefficient is conducted to eliminate redundant data; some features are selected with the least absolute shrinkage and selection operator (Lasso) to reduce data dimension and thereby lower high dimensional risks; linear interpolation among the minority classes in the dimension-reduced data is carried out by SMOTE oversampling to solve the class imbalance problem; finally, to verify the effectiveness of the proposed algorithm, this study takes the algorithms commonly used to deal with binary classification issues as comparison methods and tests the algorithms with the high dimensional imbalance datasets downloaded from the open databases of Kaggle and Microsoft. With the area under the curve (AUC) as the algorithm evaluation index, the test results are analyzed by the statistical test method. The results show that the proposed PL-SmoteBoost algorithm has significant advantages over other algorithms.

Key words:personal credit;SMOTE;ensemble learning;feature optimization

Get Citation

常三强,周垂日.基于特征优化和Boosting算法的个人信用预测.计算机系统应用,2023,32(3):224-231

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:July 26,2022
Revised:August 26,2022
Adopted:
Online: October 28,2022
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063