Evaluation of Impact of Path Completion in Weblog Mining

    Abstract:

    Web users access sites almost entirely anonymously. The main goal of weblog mining is to extract users' behavior patterns from web logs, and then to analyze the mined results in order to understand user behavior and improve the structure of the site. The first step of weblog mining is data preprocessing, which is also the most time-consuming stage in web page analysis. This paper first studies the preprocessing process, which comprises data cleaning, user identification, session identification, and path completion, and proposes a path completion algorithm. It then poses the hypothesis that path completion has a significant impact on the quantity and quality of the extracted rules, and tests this hypothesis experimentally to assess the effect of path completion in weblog mining. The experimental results also provide an empirical basis for deciding to what extent data preparation should be carried out.
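
The abstract does not spell out the proposed path completion algorithm, so the following is only a minimal sketch of one common referrer-based heuristic, not the authors' method. It assumes each request in a session carries a page and an optional referrer, and that a referrer differing from the previously visited page means the user navigated back through cached pages that never reached the server log. The names `Request` and `complete_path` are hypothetical.

```python
from typing import List, Optional, Tuple

# Assumed record shape: (requested page, referrer page or None if absent).
Request = Tuple[str, Optional[str]]


def complete_path(session: List[Request]) -> List[str]:
    """Return the page sequence with presumed cached back-navigation re-inserted."""
    path: List[str] = []
    for page, referrer in session:
        if (
            path
            and referrer is not None
            and referrer != path[-1]
            and referrer in path
        ):
            # Index of the most recent visit to the referrer page.
            idx = len(path) - 1 - path[::-1].index(referrer)
            # Pages the user presumably stepped back through, ending at the
            # referrer (the current last page itself is not repeated).
            backtrack = path[idx:len(path) - 1][::-1]
            path.extend(backtrack)
        path.append(page)
    return path


if __name__ == "__main__":
    # Logged requests: A, then B from A, then C from B, then D from A.
    logged = [("A", None), ("B", "A"), ("C", "B"), ("D", "A")]
    print(complete_path(logged))
```

In this sketch the request for D with referrer A implies the user went back from C through B to A, so those two pages are re-inserted before D. Requests whose referrer never appears in the session are appended unchanged, which is a simplifying assumption.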

Citation:

Cai Weixin, Feng Zhenyu, Yang Jian. Evaluation of Impact of Path Completion in Weblog Mining. Computer Systems & Applications, 2011, 20(3): 226-229

History
  • Received: July 12, 2010
  • Revised: September 5, 2010