Retail Products Sales Forecast Based on Clustering and Machine Learning

doi:10.15888/j.cnki.csa.008147

2021, Vol. 30, No. 11, pp. 188-194

Funding: National Natural Science Foundation of China (71771179, 71532015)


Abstract:

We propose a forecasting model that combines K-means clustering with machine learning regression algorithms to forecast sales of multiple products in the retail industry. First, cluster analysis identifies products with similar sales patterns and partitions the dataset accordingly. Then support vector regression, random forest, and XGBoost models are trained separately on each sub-dataset. Building a data pool increases both the amount of training data and the range of candidate predictor variables. The proposed models are validated on a real sales dataset from a retail company. The experimental results show that the model based on K-means and support vector regression performs best, and that the proposed models clearly outperform both the benchmark models and the machine learning models trained without clustering.
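
The cluster-then-forecast idea in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation. It assumes toy sales series, a hand-written K-means with a deterministic farthest-point initialization, and a pooled one-lag linear regression standing in for the support vector regression, random forest, and XGBoost models named in the abstract. All function names and the data are hypothetical. The "data pool" is approximated by fitting one model per cluster on training pairs pooled from every product in that cluster.

```python
from statistics import mean


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def normalize(series):
    """Scale a series to mean 1 so clustering compares shape, not volume."""
    m = mean(series)
    return [v / m for v in series] if m else list(series)


def kmeans(points, k, max_iter=100):
    """Plain Lloyd's K-means; returns one cluster label per point.

    Assumption: centers start from a deterministic farthest-point rule
    rather than the random initialization a library would normally use.
    """
    centers = [list(points[0])]
    while len(centers) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        centers.append(list(far))
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [mean(col) for col in zip(*members)]
    return labels


def fit_pooled_ar1(series_list):
    """Least-squares fit of y_t = a + b * y_{t-1} on pairs pooled across series.

    Stand-in regressor (assumption); the paper trains SVR, random forest,
    and XGBoost models instead.
    """
    xs = [s[t - 1] for s in series_list for t in range(1, len(s))]
    ys = [s[t] for s in series_list for t in range(1, len(s))]
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var if var else 0.0
    return my - slope * mx, slope


def cluster_then_forecast(sales, k=2):
    """Cluster products by sales shape, then forecast one step per product."""
    names = list(sales)
    profiles = [normalize(sales[n]) for n in names]
    labels = kmeans(profiles, k)
    forecasts = {}
    for c in set(labels):
        idx = [i for i, lab in enumerate(labels) if lab == c]
        intercept, slope = fit_pooled_ar1([profiles[i] for i in idx])
        for i in idx:
            scale = mean(sales[names[i]])  # undo the normalization
            forecasts[names[i]] = (intercept + slope * profiles[i][-1]) * scale
    return dict(zip(names, labels)), forecasts


# Hypothetical toy data: two rising and two falling products.
sales = {
    "A": [10, 12, 14, 16, 18, 20],
    "B": [100, 120, 140, 160, 180, 200],
    "C": [20, 18, 16, 14, 12, 10],
    "D": [200, 180, 160, 140, 120, 100],
}
labels, forecasts = cluster_then_forecast(sales, k=2)
print(labels)
print(forecasts)
```

Normalizing each series before clustering groups products by sales pattern rather than by volume, which is one plausible reading of "similar sales patterns". With scikit-learn available, `sklearn.cluster.KMeans` and `sklearn.svm.SVR` would be the natural replacements for the hand-written pieces.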

Cite this article

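Zhou Yu (周雨), Duan Yongrui (段永瑞). Retail Products Sales Forecast Based on Clustering and Machine Learning. Computer Systems & Applications (计算机系统应用), 2021, 30(11): 188-194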

History

Received: 2021-01-21
Last revised: 2021-02-23
Published online: 2021-10-22
