###
计算机系统应用英文版:2022,31(7):158-164
本文二维码信息
码上扫一扫!
融合多角度特征的文本匹配模型
(湘潭大学 计算机学院·网络空间安全学院, 湘潭 411105)
Text Matching Model Incorporating Multi-angle Features
(School of Computer Science & School of Cyberspace Security, Xiangtan University, Xiangtan 411105, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 596次   下载 1032
Received:September 22, 2021    Revised:October 19, 2021
中文摘要: 文本匹配是自然语言处理的一个核心研究领域, 深度文本匹配模型大致可以分为表示型和交互型两种类型, 表示型模型容易失去语义焦点难以衡量词上下文重要性, 交互型模型缺少句型、句间等全局性信息. 针对以上问题提出一种融合多角度特征的文本匹配模型, 该模型以孪生网络为基本架构, 利用BERT模型生成词向量进行词相似度融合加强语义特征, 利用Bi-LSTM对文本的句型结构特征进行编码, 即融合文本词性序列的句型结构信息, 使用Transformer编码器对文本句型结构特征和文本特征进行多层次交互, 最后拼接向量推理计算出两个文本之间的相似度. 在Quora部分数据集上的实验表明, 本模型相比于经典深度匹配模型有更好的表现.
Abstract:Text matching is a core research area in natural language processing. Deep text matching models can be broadly classified into representational models and interactive models. The former tends to lose semantic focus and fails to measure the contextual importance of words. The latter lacks global information such as sentence type and inter-sentence information. To address these problems, we propose a text matching model incorporating multi-angle features based on Siamese neural network. The model generates word vectors using the BERT model and enhances semantic features by the similarity fusion of words. It then encodes the syntactic structured features using Bi-LSTM, namely the syntactic structured information containing the lexical sequence. A Transformer encoder is utilized to realize the multi-level interaction between the syntactic structured features and the text features. Finally, the similarity is deduced by spliced vectors. Experiments on part of Quora question pair show that this model performs better than the classical deep matching model.
文章编号:     中图分类号:    文献标志码:
基金项目:智能化公共法律服务关键技术湖南省重点研发项目(2022SK2106)
引用文本:
李广,刘新,马中昊,黄浩钰,张远明.融合多角度特征的文本匹配模型.计算机系统应用,2022,31(7):158-164
LI Guang,LIU Xin,MA Zhong-Hao,HUANG Hao-Yu,ZHANG Yuan-Ming.Text Matching Model Incorporating Multi-angle Features.COMPUTER SYSTEMS APPLICATIONS,2022,31(7):158-164