基于RFM模型的半监督聚类算法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

四川省高等教育改革项目([2014]156551)


Semi-Supervised Clustering Algorithm Based on RFM Model
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    客户分类作为客户关系管理(CRM)的重要管理方法,是企业进行市场营销的重要依据.通过对客户进行分类,有利于对客户价值进行准确评估,方便进行精准营销.本文通过对RFM模型数据集本身潜藏的先验结构化信息进行研究,标记出两组客户数据作为先验类别标记,进而得到两个初始聚类中心.基于传统K-means算法使用自适应方法确定K值和初始聚类中心.引入Must-link和Cannot-link两种约束将类别标记转换为成对约束信息,基于HMRF-KMeans成对约束,引入约束惩罚项和约束奖励项,实现对聚类引导和聚类结果的调整.使用改进的半监督聚类算法(RFM-SS-means)对标准数据集进行了测试,同时使用Food mart数据集对比了RFM-SS-means算法与传统K-means算法、two-steps算法的聚类效果.由实验结果可知,RFM-SS-means的CH系数最大,无需事先确定K值和初始聚类中心,聚类效果良好.

    Abstract:

    As an important management method of customer relationship management (CRM), the customer classification is the basis for enterprises to carry out marketing. The classification of customers is conducive to accurate assessment of customer value and facilitate the precise marketing. In this paper, we study the priori structured information hidden in the RFM model dataset, and mark two sets of customer data as a priori category mark, and then get two initial clustering centers. Based on the traditional K-means algorithm, the K value and the initial clustering center are determined with the adaptive method. Combining the two types of constraints of Must-link and Cannot-link, the category markers are transformed into pairs of constraint information. Based on HMRF-KMeans pairs, the constraints and constraint bonuses are introduced to improve the clustering guidance and clustering results. The improved semi-supervised clustering algorithm (RFM-SS-means) was used to test the standard data set, and the Food mart data set was also used to compare the RFM-SS-means algorithm with the traditional K-means algorithm and the two-steps algorithm Class effect. From the experimental results, it can be seen that the CH coefficient of RFM-SS-means is the largest, and the clustering effect is good without prior determination of K value and initial clustering center.

    参考文献
    相似文献
    引证文献
引用本文

程汝娇,徐鸿雁.基于RFM模型的半监督聚类算法.计算机系统应用,2017,26(11):170-175

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2017-02-21
  • 最后修改日期:2017-03-23
  • 录用日期:
  • 在线发布日期: 2017-10-30
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京海淀区中关村南四街4号 中科院软件园区 7号楼305房间,邮政编码:100190
电话:010-62661041 传真: Email:csa (a) iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号