Hot Word Extraction for Microblog Based on Massive Data Filtering

WeChat

Mobile website

Home > Archive>Volume 21, Issue 11, 2012 >131-136

Hot Word Extraction for Microblog Based on Massive Data Filtering
DOI:
                        
CSTR:
                        [cstr]
                    
Author:
                        
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

This paper presents a Chinese microblog hot words extraction algorithm based on massive data Filtering. Firstly, it chooses the user behaviour characteristics and text characteristics to create user behavior models, and filters massive data to create topic-trees by a fast algorithm based on rules. Then, it uses hot words extraction algorithm to get the hot topic of topic-trees by word frequency feature. The experiment results show that the proposed algorithm can reduce the scale of the input data, with keeping lots of important information to extract hot words.

Reference

Cited by

Get Citation

汪洋,帅建梅,陈志刚.基于海量信息过滤的微博热词抽取方法.计算机系统应用,2012,21(11):131-136

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:March 13,2012
Revised:April 18,2012
Adopted:
Online:
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063