Received: January 29, 2022  Revised: February 24, 2022
Chinese abstract (translated): Text matching is one of the key techniques in natural language understanding; its task is to determine the degree of similarity between two pieces of text. In recent years, with the development of pre-trained models, text matching techniques based on pre-trained language models have been widely applied. However, such text matching models still face two challenges: poor generalization ability in a specific domain and weak robustness in semantic matching. To address this, this paper proposes incremental pre-training and adversarial training methods based on low-frequency words to improve the performance of text matching models. Incremental pre-training targeted at in-domain low-frequency words helps the model transfer to the target domain and strengthens its generalization ability; in addition, several adversarial training methods targeting low-frequency words are explored to improve the model's adaptability to word-level perturbations and enhance its robustness. Experimental results on the LCQMC dataset and a text matching dataset in the real estate domain show that incremental pre-training, adversarial training, and their combination all clearly improve text matching results.
Abstract: Text matching is one of the key techniques in natural language understanding, and its task is to determine the similarity of two texts. In recent years, with the development of pre-trained models, text matching techniques based on pre-trained language models have been widely used. However, these text matching models still face the challenges of poor generalization ability in a particular domain and weak robustness in semantic matching. Therefore, this study proposes incremental pre-training and adversarial training methods for low-frequency words to improve the performance of the text matching model. Incremental pre-training on in-domain low-frequency words helps the model transfer to the target domain and enhances its generalization ability. Additionally, various adversarial training methods for low-frequency words are explored to improve the model's adaptability to word-level perturbations and its robustness. The experimental results on the LCQMC dataset and a text matching dataset in the real estate domain indicate that incremental pre-training, adversarial training, and the combination of the two approaches can significantly improve text matching results.
keywords: text matching; pre-trained model; incremental pre-training; adversarial training; low-frequency word; deep learning; natural language processing (NLP)
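The abstract describes the two components only at a high level. As a rough illustration of how embedding-level adversarial training is commonly paired with a pre-trained text matcher, a minimal PyTorch-style sketch of the Fast Gradient Method (FGM) is given below. The class name FGM, the epsilon value, and the word_embeddings parameter name are assumptions for illustration; the paper's low-frequency-word-specific perturbation strategies and its in-domain incremental (masked-language-model) pre-training step are not reproduced here.

```python
import torch


class FGM:
    """Fast Gradient Method (illustrative sketch, not the paper's exact variant):
    perturbs the word-embedding weights along the gradient direction before a
    second forward/backward pass, then restores them."""

    def __init__(self, model, epsilon=1.0, emb_name="word_embeddings"):
        self.model = model
        self.epsilon = epsilon      # perturbation radius (assumed value)
        self.emb_name = emb_name    # substring of the embedding parameter name (BERT-style, assumed)
        self.backup = {}

    def attack(self):
        # Add an L2-normalized perturbation to the embedding matrix.
        for name, param in self.model.named_parameters():
            if param.requires_grad and self.emb_name in name and param.grad is not None:
                self.backup[name] = param.data.clone()
                norm = torch.norm(param.grad)
                if norm != 0 and not torch.isnan(norm):
                    param.data.add_(self.epsilon * param.grad / norm)

    def restore(self):
        # Undo the perturbation after the adversarial backward pass.
        for name, param in self.model.named_parameters():
            if name in self.backup:
                param.data = self.backup[name]
        self.backup = {}


# Typical use inside a fine-tuning loop (model, batch, and optimizer assumed):
#   loss = model(**batch).loss; loss.backward()   # gradients on the clean input
#   fgm.attack()                                  # perturb the embedding weights
#   model(**batch).loss.backward()                # accumulate adversarial gradients
#   fgm.restore()                                 # restore the original embeddings
#   optimizer.step(); optimizer.zero_grad()
```

In this kind of setup, the adversarial loss is accumulated on top of the clean-input gradients before a single optimizer step, which is what makes the model more tolerant of word-level perturbations.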
Foundation item: National Natural Science Foundation of China (62176074)
Citation:
SI Zhi-Bo-Wen, LI Shao-Bo, SHAN Li-Li, SUN Cheng-Jie, LIU Bing-Quan. Text Matching Model Based on Incremental Pre-training and Adversarial Training. COMPUTER SYSTEMS APPLICATIONS, 2022, 31(11): 349-357.