Text Similarity Matching Model Based on Positive and Negative Samples and Bi-LSTM
Abstract:

Similarity matching is crucial in natural language processing, and in particular for extracting answers in question answering systems. This study proposes a text similarity matching model based on positive and negative samples and Bi-LSTM. First, the model constructs question-answer pairs from positive and negative samples during training, which increases the similarity between a question and its correct answer. Second, it applies dual-layer word vector embedding in pre-training to reduce the experimental error caused by word segmentation mistakes. Third, it adopts an internal attention mechanism before feature extraction to avoid the backward offset of feature vectors introduced by a conventional attention mechanism. The data are then fed through a Bi-LSTM neural network, which retains the important temporal features. Finally, the model introduces a similarity function that incorporates semantic information to compute similarity at the semantic level. The proposed model is evaluated on the public dataset DuReader and compared with other models. The experimental results show that it achieves high accuracy and good robustness, with a top-1 accuracy of 78.34%.
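The training objective described above, making a question more similar to its correct answer than to a negative sample, can be sketched as a margin-based ranking loss over cosine similarity. This is a minimal illustration, not the paper's exact formulation: the function names, the margin value, and the toy embeddings below are assumptions, and in the actual model the sentence vectors would come from the Bi-LSTM encoder rather than being hand-written.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine similarity between two sentence vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def margin_ranking_loss(q, pos, neg, margin=0.2):
    """Hinge loss that pushes the question at least `margin` closer
    (in cosine similarity) to its correct answer than to a wrong one."""
    s_pos = cosine_similarity(q, pos)
    s_neg = cosine_similarity(q, neg)
    return max(0.0, margin - s_pos + s_neg)

# Toy embeddings standing in for Bi-LSTM sentence encodings (illustrative only).
q   = np.array([1.0, 0.0, 1.0])
pos = np.array([0.9, 0.1, 1.1])   # correct answer: close to q
neg = np.array([0.0, 1.0, 0.0])   # negative sample: dissimilar to q

print(margin_ranking_loss(q, pos, neg))  # 0.0 — the margin is already satisfied
print(margin_ranking_loss(q, neg, pos))  # positive — wrong ranking is penalized
```

When the correct answer is already ranked above the negative by more than the margin, the loss is zero and contributes no gradient; swapping the two answers produces a positive loss, which is what drives the encoder to separate positive and negative samples during training.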

Get Citation

周艳平, 朱小虎. Text Similarity Matching Model Based on Positive and Negative Samples and Bi-LSTM. 计算机系统应用 (Computer Systems & Applications), 2021, 30(4): 175–180.
History
  • Received: July 30, 2020
  • Revised: August 26, 2020
  • Online: March 31, 2021