本文已被:浏览 1738次 下载 2211次
Received:April 25, 2009
Received:April 25, 2009
中文摘要: FP-growth算法用于关联规则挖掘分成两个阶段:构建频繁模式树和进行频繁模式挖掘;对这两个阶段分别进行改进,若项头表中存在同频度的频繁项,在构建FP-tree的过程动态调整其位置,构建压缩的最优化FP-tree,提出了IMFP-tree算法。在进行频繁模式挖掘阶段,提出CFP-mine 算法,CFP-mine算法采用一种新方法构建条件模式基,且采用组合方式挖掘频繁项集,有别于传统FP-growth算法的挖掘过程,理论上证明和实验验证本算法的正确性和高效性。
中文关键词: 关联规则 FP树 频繁项集
Abstract:FP-growth algorithm for mining association rules is divided into two phases: building a FP-tree and mining frequent patterns. In this paper new algorithms are proposed to improve the two stages separately. In the first stage, if frequent items in header table have the same support, their position can be dynamically changed to construct a compressed and optimized FP-tree. IMFP-tree algorithm is proposed to realize that function. In the second stage, CFP-mine algorithm is proposed, which constructs pattern-base by using a new method different from the conditional pattern-base in FP-growth.This paper mines frequent itemsets with a new combination method without recursive construction of conditional FP-tree. It has theoretically proved and experimentally verified the correctness and efficiency of CFP-mine algorithm.
keywords: Association rules FP-tree Frequent itemsets
文章编号: 中图分类号: 文献标志码:
基金项目:重庆市自然科学基金(CSTC,2007BB6118);中国博士后科学基金(20080430750)
Author Name | Affiliation |
WANG Cheng-Liang | 重庆大学 计算机学院 重庆 400044 重庆大学 电气工程学院 重庆 400044 |
LUO Chang-Yin | 重庆大学 计算机学院 重庆 400044 |
Author Name | Affiliation |
WANG Cheng-Liang | 重庆大学 计算机学院 重庆 400044 重庆大学 电气工程学院 重庆 400044 |
LUO Chang-Yin | 重庆大学 计算机学院 重庆 400044 |
引用文本:
汪成亮,罗昌银.一种基于组合方式改进的频繁项集挖掘算法.计算机系统应用,2010,19(1):67-71
WANG Cheng-Liang,LUO Chang-Yin.An Improved Frequent Itemsets Mining Algorithm Based on Combination Approach.COMPUTER SYSTEMS APPLICATIONS,2010,19(1):67-71
汪成亮,罗昌银.一种基于组合方式改进的频繁项集挖掘算法.计算机系统应用,2010,19(1):67-71
WANG Cheng-Liang,LUO Chang-Yin.An Improved Frequent Itemsets Mining Algorithm Based on Combination Approach.COMPUTER SYSTEMS APPLICATIONS,2010,19(1):67-71