Text Matching Based on SimCSE Framework Fused with Pre-trained Model Internal Hierarchical Features

doi:10.15888/j.cnki.csa.009538

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 17

Home > Archive>Volume 33, Issue 7, 2024 >103-111. DOI:10.15888/j.cnki.csa.009538

PDF HTML XML Export Cite reminder

Text Matching Based on SimCSE Framework Fused with Pre-trained Model Internal Hierarchical Features
DOI:
                        10.15888/j.cnki.csa.009538
                    
CSTR:
                        [cstr]
                    
Author:
                        SHENG Cheng-ChengSHENG Cheng-Cheng
Computer School, Beijing Information Science and Technology University, Beijing 100192, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
CHEN Jin-DongCHEN Jin-Dong
School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China;Beijing International Science and Technology Cooperation Base for Intelligent Decision and Big Data Application, Beijing 100192, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
ZHANG JianZHANG Jian
School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China;Beijing International Science and Technology Cooperation Base for Intelligent Decision and Big Data Application, Beijing 100192, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The simple contrastive learning of sentence embedding (SimCSE) framework only uses the classification [CLS]tokens as text vectors, and it also neglects the hierarchical information within the base model, which results in insufficient extraction of semantic features from the base model output. Based on the SimCSE framework, this study proposes a method that fuses hierarchical features of pre-trained models, SimCSE with hierarchical feature fusion (SimCSE-HFF). SimCSE-HFF is based on a dual-path parallel network, using short and long paths to strengthen feature learning. The short path uses a convolutional neural network to learn local text features and perform dimensionality reduction, while the long path uses a bidirectional gated recurrent neural network to learn deep semantic information. Additionally, in the long path, an autoencoder is used to fuse features from other layers within the base model, solving the problem of insufficient extraction of output features by the model. On the Chinese and English datasets of spring tools suite-bundle (STS-B), the SimCSE-HFF method outperforms traditional methods in terms of semantic similarity Spearman and Pearson correlation metrics, showing improvements on different pre-trained models. Additionally, it also outperforms the SimCSE framework in downstream task retrieval-based question answering, demonstrating better versatility.

Key words:text matching;SimCSE;feature fusion;autoencoder;parallel network

Get Citation

盛成城,陈进东,张健.基于SimCSE框架融合预训练模型层级特征的文本匹配.计算机系统应用,2024,33(7):103-111

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:January 10,2024
Revised:February 07,2024
Adopted:
Online: June 05,2024
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063