###
DOI:
计算机系统应用英文版:2013,22(9):60-63
本文二维码信息
码上扫一扫!
基于WEB挖掘的网络爬虫设计与实现
(1.湖南农业大学 信息科学技术学院, 长沙 410128;2.湖南农业大学 东方科技学院, 长沙 410128)
Design and Realization of Web Crawlwer Based on Web Minning
(1.Information Science and Technology College, Hunan Agricultural University, Changsha 410128, China;2.Orient Science&Technology College, Hunan Agricultural University, Changsha 410128, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 1482次   下载 4413
Received:March 04, 2013    Revised:April 07, 2013
中文摘要: 从介绍Web挖掘与数据挖掘的差异入手, 分析Web挖掘中Web爬虫的必要性和现代Web挖掘技术的发展方向, 在深入了解Web爬虫的原理及其功能的基础上, 提出一个现代网站通用的挖掘模型, 并利用该模型设计一种网络爬虫. 经实例证明, 该爬虫能高效爬取更多的各种页面数据.
中文关键词: 数据挖掘  Web爬虫  挖掘技术
Abstract:The diffeences between web-minning and data-mining were introduced in this paper firstly, then the necessity of Web crawler during web-minning and the development of modern web-minning technology were analysed. Based on the deep understanding of the principle and its function of Web crawler, a minning model popular in modern website was put forward, and a web crawler was designed by the use of this model. Tested by several examples, this kind of crawler can get more diversified pagedata efficiently.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
肖毅,张林,聂笑一.基于WEB挖掘的网络爬虫设计与实现.计算机系统应用,2013,22(9):60-63
XIAO Yi,ZHANG Lin,NIE Xiao-Yi.Design and Realization of Web Crawlwer Based on Web Minning.COMPUTER SYSTEMS APPLICATIONS,2013,22(9):60-63