基于改进人工蜂群算法的文本对抗样本生成

doi:10.15888/j.cnki.csa.008820

AIPUB归智期刊联盟

微信公众号

网站二维码

首页 > 过刊浏览>2022年第31卷第11期 >238-245. DOI:10.15888/j.cnki.csa.008820

PDF HTML阅读 XML下载导出引用引用提醒

基于改进人工蜂群算法的文本对抗样本生成
DOI:
                        10.15888/j.cnki.csa.008820
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金创新群体项目(61521003)

Text Adversarial Samples Generation Based on Improved Artificial Bee Colony Algorithm

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

文本对抗样本的生成对于研究基于深度学习的自然语言处理系统的脆弱性, 提升这类系统的鲁棒性具有重要的意义. 本文对词级对抗样本生成中的重要步骤, 替换词的搜索展开研究, 针对现有算法存在的早熟收敛和有效性差的问题, 提出了基于改进人工蜂群搜索算法的文本对抗样本生成方法. 首先, 根据知网HowNet库中单词的义原标注筛选得到拟被替换词的搜索空间; 然后, 基于改进的人工蜂群算法搜索并定位替换词生成高质量的文本对抗样本. 本文针对当前主流的基于深度神经网络的文本分类模型, 在两个文本分类数据集上进行了攻击测试. 结果表明, 跟已有文本对抗样本生成方法相比, 本文提出的方法能以较高的攻击成功率误导文本分类系统, 并更多地保留语义和语法的正确性.

Abstract:

The generation of text adversarial samples is of great significance for studying the vulnerability of deep learning-based natural language processing (NLP) systems and improving the robustness of such systems. This work studies the important steps in the generation of word-level adversarial samples and the search for replacement words. Considering the problems of premature convergence and poor effectiveness of existing algorithms, a text adversarial sample generation method is proposed, which is based on an improved artificial bee colony (ABC) search algorithm. Firstly, the search space of the words to be replaced is obtained by the screening of the sememe annotations of the words in the HowNet database. Then, the improved ABC algorithm is employed to search and locate the replacement words for the generation of high-quality text adversarial samples. Finally, attack tests are conducted on two text classification datasets for a comparison with the current mainstream text classification models based on deep neural networks (DNNs). The results demonstrate that compared with the existing text adversarial sample generation methods, the proposed method can mislead the text classification system with a higher success rate of attack and preserve semantic and grammatical correctness to a larger extent.

参考文献

相似文献

引证文献

引用本文

杨帆,李邵梅,金柯君.基于改进人工蜂群算法的文本对抗样本生成.计算机系统应用,2022,31(11):238-245

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-03-04
最后修改日期:2022-04-02
录用日期:
在线发布日期: 2022-08-26
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码