基于深度哈希的文本表示学习

doi:10.15888/j.cnki.csa.008496

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月4日 19:48 星期五

首页 > 过刊浏览>2022年第31卷第6期 >158-166. DOI:10.15888/j.cnki.csa.008496

PDF HTML阅读 XML下载导出引用引用提醒

基于深度哈希的文本表示学习
DOI:
                        10.15888/j.cnki.csa.008496
                    
CSTR:
                        
                    
作者:
                        邹傲邹傲
陆军工程大学 指挥控制工程学院, 南京 210007
在期刊界中查找
在百度中查找
在本站中查找
郝文宁郝文宁
陆军工程大学 指挥控制工程学院, 南京 210007
在期刊界中查找
在百度中查找
在本站中查找
田媛田媛
陆军工程大学 指挥控制工程学院, 南京 210007
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(61806221)

Text Representation Learning Based on Deep Hashing

Author:

ZOU Ao
ZOU Ao
Command & Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
在期刊界中查找
在百度中查找
在本站中查找
HAO Wen-Ning
HAO Wen-Ning
Command & Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
在期刊界中查找
在百度中查找
在本站中查找
TIAN Yuan
TIAN Yuan
Command & Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

文本表示学习作为自然语言处理的一项重要基础性工作, 在经历了向量空间模型、词向量模型以及上下文分布式表示的一系列发展后, 其语义表示能力已经取得了较大突破, 并直接促进模型在机器阅读、文本检索等下游任务上的表现不断提升. 然而, 预训练语言模型作为当前最先进的文本表示学习方法, 在训练阶段和预测阶段的时空复杂度较高, 造成了较高的使用门槛. 为此, 本文提出了一种基于深度哈希和预训练的新的文本表示学习方法, 旨在以更低的计算量实现尽可能高的文本表示能力. 实验结果表明, 在牺牲有限性能的情况下, 本文所提出的方法可以大幅降低模型在预测阶段的计算复杂度, 在很大程度上提升了模型在预测阶段的使用效率.

关键词:深度哈希;预训练语言模型;Transformer结构;文本表示学习;深度学习;注意力机制

Abstract:

As a cornerstone of natural language processing, text representation learning has made a great breakthrough in its semantic representation ability when it undergoes the development of the vector space model, word embedding model, and contextual distributed representation. In addition, it directly promotes the continuous improvement of the performance of models in downstream tasks such as machine reading and text retrieval. However, as the most advanced text representation learning method, the pre-trained language model has high space-time complexity in the training and prediction stages, which results in a high threshold of use. Therefore, this study proposes a new text representation learning method based on deep hashing and pre-training, which aims to achieve as high a text representation ability as possible with less computation. The experimental results show that the proposed method can remarkably reduce the computational complexity and to a great extent improve the efficiency of the model in the prediction stage.

Key words:deep hashing;pre-trained language models;Transformer;text representation learning;deep learning;attention mechanism

引用本文

邹傲,郝文宁,田媛.基于深度哈希的文本表示学习.计算机系统应用,2022,31(6):158-166

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2021-08-13
最后修改日期:2021-09-13
录用日期:
在线发布日期: 2022-05-26
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码