Abstractive Summarization Based on Entity Copy and Dual Granularity Guidance

doi:10.15888/j.cnki.csa.009508


2024, Vol. 33, No. 5, pp. 210-217


CSTR: 32024.14.csa.009508
Authors: ZHOU Zi-Li, GAO Shi-Liang, AN Run-Lu, BAO Xin-Yue

Affiliation (all authors): School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China

Fund Project: Natural Science Foundation of Shandong Province (ZR2021MD115); Project of the Science and Technology Commission of Shanghai Municipality (21511100302)



Abstract:

Neural abstractive models have made considerable progress in text summarization. However, because abstractive summarization is flexible, the generated summaries are prone to poor faithfulness and may even deviate from the semantic gist of the source document. To address this problem, this paper proposes two methods for improving summary fidelity. (1) Since entities play an important role in summaries and usually come from the original document, the model is allowed to copy entities from the source document, ensuring that generated entities match those in the source and helping to prevent inconsistent entities. (2) To better prevent the generated summary from drifting semantically from the original text, key entities and key tokens are used as guidance signals at two different granularities during summary generation. The methods are evaluated with the ROUGE metric on two widely used text summarization datasets, CNNDM and XSum. The experimental results show that both methods significantly improve model performance. In addition, the experiments show that the entity copy mechanism can, to some extent, use the guidance information to correct semantic noise that has been introduced.

Keywords: abstractive summarization; entity copy; dual granularity guidance; deep learning; pre-trained model
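The abstract does not give the formulation of the entity copy mechanism, so the following is only a minimal sketch of one common way such a mechanism can be built: a pointer-generator-style mixture in which copy probability is restricted to source positions marked as entities. The function name `entity_copy_distribution`, the `entity_mask` input, and the scalar gate `p_gen` are all illustrative assumptions, not the authors' implementation.

```python
from typing import Dict, Sequence


def entity_copy_distribution(
    p_vocab: Dict[str, float],
    attention: Sequence[float],
    source_tokens: Sequence[str],
    entity_mask: Sequence[bool],
    p_gen: float,
) -> Dict[str, float]:
    """Mix a generation distribution with a copy distribution over entity tokens.

    Hypothetical sketch: p_gen weights the decoder's vocabulary distribution,
    and (1 - p_gen) weights attention mass renormalised over entity positions.
    """
    if not (len(attention) == len(source_tokens) == len(entity_mask)):
        raise ValueError("attention, source_tokens and entity_mask must align")
    if not 0.0 <= p_gen <= 1.0:
        raise ValueError("p_gen must lie in [0, 1]")

    # Zero out attention on non-entity positions so only entities can be copied.
    masked = [a if is_entity else 0.0 for a, is_entity in zip(attention, entity_mask)]
    total = sum(masked)
    if total == 0.0:
        # Nothing to copy from: fall back to pure generation.
        return dict(p_vocab)

    # Scale the generation distribution by the gate.
    final = {token: p_gen * prob for token, prob in p_vocab.items()}
    # Add the renormalised copy mass; out-of-vocabulary entities get new entries.
    for token, weight in zip(source_tokens, masked):
        if weight > 0.0:
            final[token] = final.get(token, 0.0) + (1.0 - p_gen) * weight / total
    return final


# Toy example with made-up numbers.
example = entity_copy_distribution(
    p_vocab={"the": 0.5, "visited": 0.3, "London": 0.2},
    attention=[0.6, 0.2, 0.2],
    source_tokens=["Obama", "visited", "Paris"],
    entity_mask=[True, False, True],
    p_gen=0.5,
)
```

In this toy example the source entities "Obama" and "Paris" receive probability even though they are absent from the generation vocabulary, while the non-entity source token "visited" receives no copy mass. In a trained model the gate and attention weights would come from the decoder rather than being fixed by hand.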

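Cite this article

ZHOU Zi-Li, GAO Shi-Liang, AN Run-Lu, BAO Xin-Yue. Abstractive Summarization Based on Entity Copy and Dual Granularity Guidance. Computer Systems & Applications (计算机系统应用), 2024, 33(5): 210-217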


History

Received: 2023-12-12
Last revised: 2024-01-10
Published online: 2024-04-01
