Received: March 4, 2013; Revised: April 7, 2013
Abstract: This paper first introduces the differences between Web mining and data mining, then analyzes why Web crawlers are necessary in Web mining and where modern Web mining technology is heading. Building on a close study of the principles and functions of Web crawlers, it proposes a general mining model for modern websites and uses that model to design a Web crawler. Test examples show that the crawler can efficiently fetch a larger and more varied set of page data.
Keywords: data mining; Web crawler; Web mining technology
Citation:
XIAO Yi, ZHANG Lin, NIE Xiao-Yi. Design and Realization of Web Crawler Based on Web Mining. Computer Systems Applications, 2013, 22(9): 60-63