Multi-attribute Controllable Text Summary Model Based on Pointer Generator Network and Extended Transformer
    Abstract:

    Controllable text summarization models can generate summaries that conform to user preferences. Previous models focus on controlling a single attribute in isolation rather than a combination of attributes. When multiple control attributes must be satisfied simultaneously, the traditional Seq2Seq multi-attribute controllable summarization model cannot integrate all control attributes, accurately reproduce key information from the source text, or handle words outside its vocabulary. This study therefore proposes a model based on an extended Transformer and a pointer-generator network (PGN). The extended Transformer expands the single-encoder/single-decoder Transformer into a dual-encoder form that extracts semantic information from two text streams, paired with a single decoder that fuses guidance-signal features. The PGN component then either copies words directly from the source text or generates new words from the vocabulary, alleviating the out-of-vocabulary (OOV) problem common in summarization tasks. Additionally, to encode position information efficiently, the model adopts relative position representations in the attention layers to introduce the sequence order of the text. The model can control several important summary attributes, including length, topic, and specificity. Experiments on the public MACSum dataset show that, compared with previous methods, the proposed model better preserves summary quality while conforming more closely to the attribute requirements given by users.
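    The abstract does not spell out how the single decoder fuses the two encoder outputs, but a dual-encoder/single-decoder layout is commonly realized by letting each decoder layer cross-attend to the source-text memory and the guidance-signal memory in sequence (in the spirit of GSum-style guided summarization). The sketch below illustrates that idea only; all function names are hypothetical, and layer norm and feed-forward sublayers are omitted:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def cross_attention(queries, memory):
    """Plain single-head scaled dot-product cross-attention.
    For brevity the memory serves as both keys and values."""
    d_k = queries.shape[-1]
    return softmax(queries @ memory.T / np.sqrt(d_k)) @ memory

def dual_encoder_decoder_layer(dec_states, src_memory, guide_memory):
    """One decoder layer fusing two encoder memories: first attend to the
    source-text encoder output, then to the guidance-signal encoder output,
    with residual connections (layer norm / feed-forward omitted)."""
    x = dec_states + cross_attention(dec_states, src_memory)  # source fusion
    x = x + cross_attention(x, guide_memory)                  # guidance fusion
    return x

# Toy shapes: 3 decoder positions, 6 source tokens, 4 guidance tokens, d_k = 8.
rng = np.random.default_rng(1)
dec = rng.standard_normal((3, 8))
src = rng.standard_normal((6, 8))
guide = rng.standard_normal((4, 8))
out = dual_encoder_decoder_layer(dec, src, guide)
```

Attending to the guidance memory after the source memory lets the guidance signal re-weight an already source-grounded representation, which is one plausible reading of "fusing guidance-signal features" in the abstract.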
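    The PGN copy/generate mixture described above can be sketched in a few lines of NumPy. This is an illustrative sketch of the standard pointer-generator mechanism (See et al.), not the paper's implementation; `pgn_final_distribution` and its arguments are made-up names. At each decoding step the final distribution is p_gen times the vocabulary distribution plus (1 − p_gen) times the attention-based copy distribution, over an extended vocabulary that gives each source OOV word its own id:

```python
import numpy as np

def pgn_final_distribution(p_vocab, attention, source_ids, p_gen, vocab_size):
    """Mix a generator's vocabulary distribution with a copy distribution
    over source positions, as in pointer-generator networks.

    p_vocab    : (vocab_size,) softmax over the fixed vocabulary
    attention  : (src_len,)    attention weights over source positions
    source_ids : (src_len,)    extended-vocabulary id of each source token;
                               OOV source words get ids >= vocab_size
    p_gen      : float in [0, 1], probability of generating vs. copying
    """
    extended_size = max(vocab_size, max(source_ids) + 1)
    final = np.zeros(extended_size)
    final[:vocab_size] = p_gen * p_vocab            # generate from vocabulary
    for pos, tok_id in enumerate(source_ids):       # copy from the source text
        final[tok_id] += (1.0 - p_gen) * attention[pos]
    return final

# Toy step: a 5-word vocabulary; the source holds one in-vocab token (id 2)
# and one OOV token assigned extended id 5.
p_vocab = np.full(5, 0.2)
final = pgn_final_distribution(p_vocab, np.array([0.5, 0.5]), [2, 5], 0.6, 5)
```

Because the OOV source word receives its own slot (id 5) with probability mass (1 − p_gen) · attention, the model can emit it verbatim instead of producing an unknown-word token, which is how the PGN sidesteps the OOV problem the abstract mentions.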
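    The relative position representations mentioned above (Shaw et al., 2018) add a learned embedding of the clipped offset j − i to each attention score, so the model sees token order without absolute position embeddings. A minimal single-head sketch, with illustrative names and a deliberately naive double loop; setting the relative embeddings to zero recovers plain scaled dot-product attention:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def relative_attention(Q, K, V, rel_k, max_dist):
    """Single-head attention with relative position representations:
    the score for pair (i, j) gets an extra term Q[i] . rel_k[offset],
    where offset is j - i clipped to [-max_dist, max_dist].

    Q, K, V : (n, d_k) query/key/value matrices
    rel_k   : (2 * max_dist + 1, d_k) embeddings of the clipped offsets
    """
    n, d_k = Q.shape
    scores = Q @ K.T
    for i in range(n):
        for j in range(n):
            offset = int(np.clip(j - i, -max_dist, max_dist)) + max_dist
            scores[i, j] += Q[i] @ rel_k[offset]
    return softmax(scores / np.sqrt(d_k)) @ V

# Sanity check: all-zero relative embeddings reduce to vanilla attention.
rng = np.random.default_rng(0)
Q = rng.standard_normal((4, 8))
K = rng.standard_normal((4, 8))
V = rng.standard_normal((4, 8))
out = relative_attention(Q, K, V, np.zeros((5, 8)), max_dist=2)
plain = softmax(Q @ K.T / np.sqrt(8)) @ V
```

Clipping offsets to a fixed window keeps the embedding table small and lets the model generalize to sequence lengths not seen in training, which is why the abstract calls this an efficient way to encode position information.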

Get Citation

Xian GM, Li FL, Zheng ZM. Multi-attribute controllable text summary model based on pointer generator network and extended Transformer. Computer Systems & Applications, 2024, 33(4): 246–253 (in Chinese).

History
  • Received: September 28, 2023
  • Revised: November 09, 2023
  • Online: January 30, 2024