本文已被:浏览 1782次 下载 3366次
Received:July 16, 2010 Revised:August 19, 2010
Received:July 16, 2010 Revised:August 19, 2010
中文摘要: 针对从Web 页面获取信息的广泛需求,分析了从中提取信息的关键技术如URL 地址、HTML 页面和HtmlParse 解析库;以从Google Map 中获取企业黄页信息为例,根据从中自动提取数据的技术和步骤,设计和实现了该系统原型,并指出的相关问题及其解决办法。
Abstract:In order to respond some extensive requirements for getting information from Web pages, some key techniques such as URL, HTML page and HtmlParse API, were analyzed. Getting yellow page information from Google maps was taken as an example, and according to related techniques and steps of abstracting information from it, the system prototype was designed and implemented. Some related problems were presented, and its corresponding solution were discussed too.
keywords: Web page HtmlParse Google map information extract system
文章编号: 中图分类号: 文献标志码:
基金项目:宁夏科技攻关计划项目(KGX-01-10-01)
引用文本:
马龙,张春涛,杨德仁.一种批量抽取动态Web 信息系统.计算机系统应用,2011,20(3):41-44
MA Long,ZHANG Chun-Tao,YANG De-Ren.Batch Extraction Information System from Dynamic Web.COMPUTER SYSTEMS APPLICATIONS,2011,20(3):41-44
马龙,张春涛,杨德仁.一种批量抽取动态Web 信息系统.计算机系统应用,2011,20(3):41-44
MA Long,ZHANG Chun-Tao,YANG De-Ren.Batch Extraction Information System from Dynamic Web.COMPUTER SYSTEMS APPLICATIONS,2011,20(3):41-44