Computer Systems Applications (English version): 2015, 24(11):162-166
Web Name Disambiguation Approach Based on Combined Features
(School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China)
Received:March 09, 2015    Revised:May 08, 2015
Abstract: Name ambiguity is a common problem when searching the Web for information about a person. This paper studies Web person-name disambiguation. Different feature sets related to the ambiguous name are extracted, a vector space model is used to build combined features for each person entity, and a hierarchical clustering algorithm then merges the most similar documents first, which achieves the disambiguation. Experimental results on the WePS data set show that the proposed method disambiguates names effectively.
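The pipeline summarized in the abstract (per-feature vectors, a combined similarity, then bottom-up clustering that merges the most similar documents first) can be illustrated with a minimal sketch. The abstract does not specify the feature sets, the weights, the linkage criterion, or the stopping rule, so everything concrete here is an assumption made for illustration: the feature names `words` and `entities`, the equal weights, average linkage, the `threshold` value, and the toy documents. It is not the authors' implementation.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def combined_similarity(doc_a: dict, doc_b: dict, weights: dict) -> float:
    """Weighted sum of cosine similarities, one per feature set (assumed combination rule)."""
    return sum(
        w * cosine(Counter(doc_a.get(name, [])), Counter(doc_b.get(name, [])))
        for name, w in weights.items()
    )


def cluster(docs: list, weights: dict, threshold: float) -> list:
    """Average-link agglomerative clustering over document indices.

    The most similar pair of clusters is merged first; merging stops when
    the best available similarity falls below `threshold` (assumed rule).
    """
    n = len(docs)
    sim = {
        (i, j): combined_similarity(docs[i], docs[j], weights)
        for i in range(n)
        for j in range(i + 1, n)
    }

    def link(c1: list, c2: list) -> float:
        # Average pairwise similarity between the members of two clusters.
        pairs = [(min(i, j), max(i, j)) for i in c1 for j in c2]
        return sum(sim[p] for p in pairs) / len(pairs)

    clusters = [[i] for i in range(n)]
    while len(clusters) > 1:
        best, x, y = max(
            (link(clusters[x], clusters[y]), x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters))
        )
        if best < threshold:
            break
        clusters[x] = sorted(clusters[x] + clusters[y])
        del clusters[y]
    return clusters


# Hypothetical pages returned for one ambiguous name; tokens are made up.
docs = [
    {"words": ["professor", "database", "university"], "entities": ["MIT"]},
    {"words": ["database", "research", "university"], "entities": ["MIT"]},
    {"words": ["guitar", "album", "tour"], "entities": ["Nashville"]},
]
weights = {"words": 0.5, "entities": 0.5}  # assumed equal weighting

groups = cluster(docs, weights, threshold=0.5)  # -> [[0, 1], [2]]
```

Each resulting group is read as one real-world person: the first two pages are assigned to the same person, the third to a different one. In practice the weights and threshold would need tuning on held-out data such as WePS.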
Cite as:
XIN Tao, CHENG Shao-Yin, JIANG Fan. Web Name Disambiguation Approach Based on Combined Features. COMPUTER SYSTEMS APPLICATIONS, 2015, 24(11):162-166