LTM-B Model of Long Text Matching

doi:10.15888/j.cnki.csa.008313

2022, Vol. 31, No. 2, pp. 291-297

Authors: LIU Long, LIU Xin, CAI Lin-Jie, TANG Chao

Affiliation: School of Computer Science & School of Cyberspace Science, Xiangtan University, Xiangtan 411105, China

Fund Project: Key Research and Development Project of Hunan Province (2022SK2106)

Abstract:

Long text matching is a fundamental task in natural language processing and plays a key role in applications such as text clustering and news recommendation. Limited by the available corpora, by document length and structure, and by text representation techniques, progress on long text matching has been slow. The bidirectional encoder representations from Transformers (BERT) model, proposed in recent years, performs very well at text representation. For BERT, there are three common ways of handling long texts: truncation, segmentation, and compression. Truncation discards a large amount of text; segmentation retains the text but loses part of the semantic information; compression may lose some key information. To address these problems, this study improves the segmentation method and proposes a long text matching model based on BERT (LTM-B). Built on a Siamese network, the model follows a hierarchical idea: each document is split into multiple segments, and BERT vectorizes the segments to produce a matrix representation of the document. A bidirectional long short-term memory network (BiLSTM) generates a position matrix, and the sum of the document matrix and its position matrix is fed to a Transformer encoder for feature extraction. Finally, the two document matrices are interacted, pooled, and concatenated, and a fully connected layer classifies the result to output the matching decision. Experiments show that LTM-B outperforms the other methods compared on long text matching.

Keywords: long text matching; BERT; Siamese neural network; BiLSTM; Transformer
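
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. It assumes that each segment has already been turned into one vector (for example a BERT [CLS] embedding computed elsewhere), that the BiLSTM runs over the document matrix to produce the position matrix, and that the interaction step is a soft alignment between the two documents followed by an element-wise product. The abstract does not specify these details. The names split_into_segments, DocEncoder, and LTMBSketch, and all hyperparameters, are hypothetical.

```python
import torch
import torch.nn as nn


def split_into_segments(tokens, seg_len=200):
    """Cut a token list into consecutive chunks of at most seg_len tokens."""
    return [tokens[i:i + seg_len] for i in range(0, len(tokens), seg_len)]


class DocEncoder(nn.Module):
    """One Siamese branch: segment vectors -> contextualised document matrix."""

    def __init__(self, dim=768, heads=8, layers=2):
        super().__init__()
        # Each LSTM direction outputs dim // 2 features, so the BiLSTM output
        # has `dim` features and can be added to the document matrix.
        self.bilstm = nn.LSTM(dim, dim // 2, batch_first=True, bidirectional=True)
        layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=4 * dim, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)

    def forward(self, doc):
        # doc: (batch, n_segments, dim), one row per segment embedding
        pos, _ = self.bilstm(doc)          # assumed form of the position matrix
        return self.encoder(doc + pos)     # sum, then Transformer encoder


class LTMBSketch(nn.Module):
    """Two documents in, match logits out. Interaction is an assumption."""

    def __init__(self, dim=768, heads=8, layers=2, n_classes=2):
        super().__init__()
        self.branch = DocEncoder(dim, heads, layers)  # weights shared by both documents
        self.classifier = nn.Linear(4 * dim, n_classes)

    def forward(self, doc_a, doc_b):
        a, b = self.branch(doc_a), self.branch(doc_b)
        # Assumed interaction: align every segment of one document with a
        # softmax-weighted mix of the other document's segments.
        sim = torch.bmm(a, b.transpose(1, 2))                        # (batch, na, nb)
        a_aligned = torch.bmm(torch.softmax(sim, dim=-1), b)         # (batch, na, dim)
        b_aligned = torch.bmm(torch.softmax(sim.transpose(1, 2), dim=-1), a)
        # Max-pool over segments, concatenate, classify.
        feats = [a, a * a_aligned, b, b * b_aligned]
        pooled = torch.cat([f.max(dim=1).values for f in feats], dim=-1)
        return self.classifier(pooled)


if __name__ == "__main__":
    # Random tensors stand in for BERT segment embeddings (5 and 3 segments).
    model = LTMBSketch(dim=32, heads=4, layers=1)
    logits = model(torch.randn(2, 5, 32), torch.randn(2, 3, 32))
    print(logits.shape)  # torch.Size([2, 2])
```

Because the alignment is computed from a similarity matrix, the two documents may have different numbers of segments. A real system would also need padding masks for batched documents of unequal length, which this sketch omits.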

Cite this article

LIU Long, LIU Xin, CAI Lin-Jie, TANG Chao. LTM-B Model of Long Text Matching. Computer Systems & Applications, 2022, 31(2): 291-297

History

Received: 2021-04-25
Last revised: 2021-05-19
Published online: 2022-01-28