Thematic VSM Based on Ontology Semantic Tree
DOI:
CSTR:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Based on the traditional search model, combining the concept of ontology, this paper proposes a thematic network crawling model based on ontology semantic tree. Unlike the traditional keyword-based subject description methods, the model can describe a subject with semantic concept tree with which it is simple to describe the semantic relationships between concepts. On this basis, the paper presents a method to calculate the relevance of HTML pages and the topic. When analyzing the relevance of URL, it does not only analyze the relevance of link anchor text and the topic, but also analyzes the relevance of the link with an improved PageRank algorithm. Only when the relevance does not reach a given threshold will it download the page corresponding to the URL. This calculation method can greatly reduce unnecessary computational overhead, and make fully use of anchor text and link importance of information. Finally, it calculates the relevance of a web page which is not sure whether it is related to the topic, and ultimately determines whether this page should be collected or not.

    Reference
    Related
    Cited by
Get Citation

卢承山.基于本体语义树的主题空间向量模型.计算机系统应用,2011,20(10):44-48

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 01,2011
  • Revised:March 14,2011
  • Adopted:
  • Online:
  • Published:
Article QR Code
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063