Computer Systems Applications, 2011, 20(3): 41-44

Batch Extraction Information System from Dynamic Web

MA Long¹, ZHANG Chun-Tao¹, YANG De-Ren²

(1. Ningxia Wanwei IT Technology Co., Yinchuan 750000, China; 2. Science College of Ningxia Medical University, Yinchuan 750000, China)

Received: July 16, 2010; Revised: August 19, 2010

Abstract: To meet the widespread demand for obtaining information from Web pages, this paper analyzes the key techniques for extracting it, namely URLs, HTML pages, and the HtmlParse parsing library. Taking the retrieval of business yellow-page information from Google Maps as an example, a system prototype was designed and implemented according to the techniques and steps for automatically extracting data from it. Related problems and their solutions are also discussed.

Keywords: Web page; HtmlParse; Google Maps; information extraction system

Funding: Ningxia Science and Technology Key Research Program project (KGX-01-10-01)

Author Name	Affiliation
MA Long	Ningxia Wanwei IT Technology Co, Yinchuan 750000, China
ZHANG Chun-Tao	Ningxia Wanwei IT Technology Co, Yinchuan 750000, China
YANG De-Ren	Science College of Ningxia Medical University, Yinchuan 750000, China

Cite this article:
MA Long, ZHANG Chun-Tao, YANG De-Ren. Batch Extraction Information System from Dynamic Web. Computer Systems Applications, 2011, 20(3): 41-44