Deep Text Retrieval Re-Ranking Based on Adversarial Data Augmentation

doi:10.15888/j.cnki.csa.008114

WeChat

Mobile website

Home > Archive>Volume 30, Issue 7, 2021 >204-209. DOI:10.15888/j.cnki.csa.008114

PDF HTML XML Export Cite reminder

Deep Text Retrieval Re-Ranking Based on Adversarial Data Augmentation
DOI:
                        10.15888/j.cnki.csa.008114
                    
CSTR:
                        [cstr]
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The neural network ranking model has been widely used in the ranking task of the information retrieval field. It requires extremely high data quality; however, the information retrieval datasets usually contain a lot of noise, and documents irrelevant to the query cannot be accurately obtained. High-quality negative samples are essential to training a high-performance neural network ranking model. Inspired by the existing doc2query method, we propose a deep and end-to-end model AQGM. This model increases the diversity of queries and enhances the quality of negative samples by learning mismatched query document pairs and generating adversarial queries irrelevant to the documents and similar to the original query. Then, we train a deep ranking model based on BERT with the real samples and the samples generated by the AQGM model. Compared with the baseline model BERT-base, our model improves the MRR index by 0.3% and 3.2% on the MSMARCO and TrecQA datasets, respectively.

Reference

Cited by

Get Citation

陈丽萍,任俊超.基于对抗式数据增强的深度文本检索重排序.计算机系统应用,2021,30(7):204-209

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:November 09,2020
Revised:December 12,2020
Adopted:
Online: July 02,2021
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063