本文已被:浏览 1252次 下载 1937次
Received:April 12, 2016 Revised:May 19, 2016
Received:April 12, 2016 Revised:May 19, 2016
中文摘要: 为了让读者可以更快地获取所有新闻评论中最有代表性的观点,提出一种新的新闻评论摘要采集算法,并依此设计出评论摘要采集系统.该算法将有效地结合聚类算法和排序算法,首先,使用改进的Borderflow算法对所有评论聚类;其次,采用类PageRank算法对聚类中的评论进行排序,选出排名最前的几条评论;最后,利用MMR算法对PageRank算法选出的所有评论进行再次排序,并选取名次最高的K条评论作为评论摘要.通过仿真实验得到的NDCG和MAP数据表明,使用本文算法得到的评论摘要具有更好的有效性和准确性,更符合读者直观感觉.
中文关键词: 评论摘要 BorderFlow算法 PageRank算法 MMR算法
Abstract:In order to make the readers get the most informative and representative opinions efficiently among the news comments, this paper proposes a novel news article comments summarization algorithm and then designs an article summarization system, which combines the clustering algorithm with the ranking algorithm.First, it groups comments using the modified BorderFlow clustering algorithm.Second, for each group, it uses the similar PageRank algorithm to score and rank comments, and selects top comments in each cluster as representation.At last, it ranks the selected comments by MMR algorithm and displays the top-K comments as the comments summarization.According to the experimental statics of NDCG and MAP data, the proposed method meets the intuitive sense of readers more.Meanwhile, it shows the better effectiveness and accuracy theoretically.
文章编号: 中图分类号: 文献标志码:
基金项目:
引用文本:
师昕,赵雪青.新型的面向新闻评论摘要采集算法.计算机系统应用,2017,26(1):163-167
SHI Xin,ZHAO Xue-Qing.Novel News Article Comments Summarization Algorithm of Computer Engineering and Applications.COMPUTER SYSTEMS APPLICATIONS,2017,26(1):163-167
师昕,赵雪青.新型的面向新闻评论摘要采集算法.计算机系统应用,2017,26(1):163-167
SHI Xin,ZHAO Xue-Qing.Novel News Article Comments Summarization Algorithm of Computer Engineering and Applications.COMPUTER SYSTEMS APPLICATIONS,2017,26(1):163-167