Random Forest Credit Evaluation Based on Combination Feature Selection
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [14]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    In the process of building a personal credit risk evaluation model, feature engineering largely determines the performance of the evaluator. Traditional feature selection methods cannot fully consider the impact of high-dimensional indicators on the evaluation results, and most studies artificially determines the size of the feature set in the process of building the model, leading to high randomness and low credibility. Therefore, a random forest model (IV-XGBoostRF) based on traditional risk control indicators to optimize XGBoost is proposed. The traditional risk control indicators IV and XGBoost are combined to screen the original feature set to build a relatively complete credit evaluation model. The results of comparison experiments show that the accuracy of the improved random forest model is increased by 0.90%, and other evaluation indicators are better than the traditional credit evaluation model, which proves the feasibility of the feature selection method and has certain application value.

    Reference
    [1] Hoque N, Bhattacharyya DK, Kalita JK. MIFS-ND: A mutual information-based feature selection method. Expert Systems with Applications, 2014, 41(14): 6371–6385. [doi: 10.1016/j.eswa.2014.04.019
    [2] Fernandes GB, Artes R. Spatial dependence in credit risk and its improvement in credit scoring. European Journal of Operational Research, 2016, 249(2): 517–524. [doi: 10.1016/j.ejor.2015.07.013
    [3] 郭畅. 基于类别不平衡的企业信用风险违约测度探索——以制造业上市公司为例. 重庆工商大学学报(自然科学版), 2021, 38(1): 113–119. [doi: 10.16055/j.issn.1672-058X.2021.0001.017
    [4] 周毓萍, 陈官羽. 基于机器学习方法的个人信用评价研究. 金融理论与实践, 2019, (12): 1–8. [doi: 10.3969/j.issn.1003-4625.2019.12.001
    [5] Harris T. Credit scoring using the clustered support vector machine. Expert Systems with Applications, 2015, 42(2): 741–750. [doi: 10.1016/j.eswa.2014.08.029
    [6] Abdou H, Pointon J, El-Masry A. Neural nets versus conventional techniques in credit scoring in Egyptian banking. Expert Systems with Applications, 2008, 35(3): 1275–1292. [doi: 10.1016/j.eswa.2007.08.030
    [7] 陈云, 石松, 潘彦, 等. 基于SVM混合集成的信用风险评估模型. 计算机工程与应用, 2016, 52(4): 115–120. [doi: 10.3778/j.issn.1002-8331.1403-0473
    [8] Twala B. Combining classifiers for credit risk prediction. Journal of Systems Science and Systems Engineering, 2009, 18(3): 292–311. [doi: 10.1007/s11518-009-5109-y
    [9] Zhu Y, Xie C, Wang GJ, et al. Comparison of individual, ensemble and integrated ensemble machine learning methods to predict China’s SME credit risk in supply chain finance. Neural Computing and Applications, 2017, 28(1): 41–50. [doi: 10.1007/s00521-016-2304-x
    [10] 萧超武, 蔡文学, 黄晓宇, 等. 基于随机森林的个人信用评估模型研究及实证分析. 管理现代化, 2014, 34(6): 111–113. [doi: 10.3969/j.issn.1003-1154.2014.06.038
    [11] 周永圣, 崔佳丽, 周琳云, 等. 基于改进的随机森林模型的个人信用风险评估研究. 征信, 2020, 38(1): 28–32. [doi: 10.3969/j.issn.1674-747X.2020.01.006
    [12] 李欣, 俞卫琴. 基于改进GS-XGBoost的个人信用评估. 计算机系统应用, 2020, 29(11): 145–150. [doi: 10.15888/j.cnki.csa.007624
    [13] Chen TQ, Guestrin C. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conferenceon Knowledge Discovery and Data Mining. New York: ACM, 2016. 785–794.
    [14] Breiman L. Random forests. Machine Learning, 2001, 45(1): 5–32. [doi: 10.1023/A:1010933404324
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

饶姗姗,冷小鹏.基于组合特征选择的随机森林信用评估.计算机系统应用,2022,31(3):345-350

Copy
Share
Article Metrics
  • Abstract:1132
  • PDF: 2350
  • HTML: 2048
  • Cited by: 0
History
  • Received:May 17,2021
  • Revised:June 14,2021
  • Online: January 24,2022
Article QR Code
You are the first992272Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063