###

计算机系统应用英文版:2019,28(12):226-231

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于文档分层表示的恶意网页快速检测方法

袁梁¹, 林金芳²

(1.无锡城市职业技术学院师范学院, 无锡 214153;2.国防科技大学系统工程学院, 长沙 410073)

Hierarchical Representation Approach to Fast Detection of Malicious Webpages

YUAN Liang¹, LIN Jin-Fang²

(1.Normal College, Wuxi City College of Vocational Technology, Wuxi 214153, China;2.Collcge of Systems Engineering, National University of Defense Technology, Changsha 410073, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 898次下载 1358次
Received:April 25, 2019 Revised:May 21, 2019

中文摘要: 近年来，恶意网页检测主要依赖于语义分析或代码模拟执行来提取特征，但是这类方法实现复杂，需要高额的计算开销，并且增加了攻击面.为此，提出了一种基于深度学习的恶意网页检测方法，首先使用简单的正则表达式直接从静态HTML文档中提取与语义无关的标记，然后采用神经网络模型捕获文档在多个分层空间尺度上的局部性表示，实现了能够从任意长度的网页中快速找到微小恶意代码片段的能力.将该方法与多种基线模型和简化模型进行对比实验，结果表明该方法在0.1%的误报率下实现了96.4%的检测率，获得了更好的分类准确率.本方法的速度和准确性使其适合部署到端点、防火墙和Web代理中.

中文关键词: 深度学习恶意Web内容网页分类恶意网页识别

Abstract:In recent years, the web content detection mainly focuses on how to extract features from HTML document through semantic analysis or emulation execution, while it is undesirable, because it significantly complicates implementation which requires high computational overhead, and opens up an attack surface within the detector. A deep learning approach to detect malicious web pages is proposed. Firstly, we take advantage of the non-complex regular expression to extract tokens from static HTML document, then capture locality representation at multiple hierarchical spatial scales over the document with neural network model, by which the mode can quickly find tiny fragments of malicious code in any length of web pages. The experimental results show that this approach achieves a detection rate of 96.4% at a false positive rate of 0.1%, much better than the baseline and simplified model at the classification accuracy. The speed and accuracy of proposed approach makes it appropriate for deployment to endpoints, firewalls and web proxies.

keywords: deep learning malicious Web content Web page classification malicious Web page detection

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金（91430214）

Author Name	Affiliation
YUAN Liang	Normal College, Wuxi City College of Vocational Technology, Wuxi 214153, China
LIN Jin-Fang	Collcge of Systems Engineering, National University of Defense Technology, Changsha 410073, China

Author Name	Affiliation
YUAN Liang	Normal College, Wuxi City College of Vocational Technology, Wuxi 214153, China
LIN Jin-Fang	Collcge of Systems Engineering, National University of Defense Technology, Changsha 410073, China

引用文本：
袁梁,林金芳.基于文档分层表示的恶意网页快速检测方法.计算机系统应用,2019,28(12):226-231
YUAN Liang,LIN Jin-Fang.Hierarchical Representation Approach to Fast Detection of Malicious Webpages.COMPUTER SYSTEMS APPLICATIONS,2019,28(12):226-231