Illegitimate Website Detection Based on Multi-Dimensional Features

doi:10.15888/j.cnki.csa.005597

WeChat

Mobile website

Home > Archive>Volume 26, Issue 2, 2017 >207-211. DOI:10.15888/j.cnki.csa.005597

PDF HTML XML Export Cite reminder

Illegitimate Website Detection Based on Multi-Dimensional Features
DOI:
                        10.15888/j.cnki.csa.005597
                    
CSTR:
                        [cstr]
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The Web Information Extraction and Knowledge Presentation System is proposed to extract information from data intensive web pages. It downloads dynamic web pages, based on a knowledge database, changes them to XML documents after preprocessing, finds repeated patterns from them, by using a PAT-array based pattern discovery algorithm, recognizes their data display structure models, automatically based on the repeated patterns and an ontology-based keyword library, and then extracts the data and stores them in the knowledge database with the object-relational mapping technology of XML. Through these steps, web data is extracted automatically, and the knowledge database is also expanded automatically. Experiments on the traffic information auto-extraction and mixed traffic travel schemes auto-creation system showed that the system has high precision and is adaptive to web pages in different domains with different structures.

Reference

Cited by

Get Citation

田双柱,陈勇,延志伟,李晓东.基于多维度特征的不良网站检测.计算机系统应用,2017,26(2):207-211

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:May 17,2016
Revised:June 27,2016
Adopted:
Online: February 15,2017
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063