###
计算机系统应用英文版:2022,31(4):303-308
本文二维码信息
码上扫一扫!
基于句子的多属性融合相似度计算方法
(青岛科技大学 信息科学技术学院, 青岛 266061)
Multi-attribute Fusion Similarity Calculation Method Based on Sentence
(College of Information Science and Technology, Qingdao University of Science & Technology, Qingdao 266061, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 527次   下载 1676
Received:June 29, 2021    Revised:July 30, 2021
中文摘要: 针对现有的句子相似度计算方法没有考虑句子中的关键词的多属性信息, 无法更好衡量句子相似度的问题, 综合考虑句子的结构和包含的属性, 提出一种基于句子的多属性融合相似度计算方法. 该方法通过提取句子的词频属性、词序属性、词性属性及句长属性, 采用层次分析法(AHP)计算出各属性的权重, 并验证权重值的合理性, 继而加权融合4种属性的相似度. 将本文提出的多属性融合相似度计算方法在构建的数据集上进行实验, 验证此方法的可靠性及可行性, 并以召回率、准确率以及归一化F度量值为标准和其他传统方法进行对比分析, 结果表明, 该方法不仅有着均衡的召回率和准确率, 且F-度量值较高, 达到83.57%.
Abstract:The current sentence similarity calculation method does not consider the multi-attributes of the keywords in the sentence and cannot better measure the sentence similarity. Therefore, this study proposes a sentence similarity calculation method based on multi-attribute fusion, considering the sentence structure and the attributes contained. First, this method extracts the attributes of the sentence including the word frequency, word order, part of speech, and sentence length. Next, the analytic hierarchy process (AHP) is used to calculate the weight of each attribute and verify the rationality of the weight, and then the weighted fusion of the similarity of the four attributes is conducted. This proposed calculation method for multi-attribute sentence similarity is tested on the constructed dataset to verify its reliability and feasibility, and it is compared with other traditional methods in recall rates, accuracy rates, and normalized F-metric values.The results show that this method has balanced recall and accuracy rates and a high F-measure value of 83.57%.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
袁绍正,周艳平.基于句子的多属性融合相似度计算方法.计算机系统应用,2022,31(4):303-308
YUAN Shao-Zheng,ZHOU Yan-Ping.Multi-attribute Fusion Similarity Calculation Method Based on Sentence.COMPUTER SYSTEMS APPLICATIONS,2022,31(4):303-308