Research on Chinese Weibo Text Classification Based on Word2Vec
CSTR:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The Chinese Weibo is an indispensable communication tool for people today. Mining information in Weibo text is of great significance to automatic question and answer, public opinion analysis and other applied research. The short text classification study is the basis of short text mining. The neural network-based Word2Vec can solve problems of high-dimensional sparseness and semantic gap that traditional text categorization methods cannot solve. This study obtains the word vector based on Word2Vec, then the class factor is introduced into the traditional weight calculation method TF-IDF (Term Frequency-Inverse Document Frequency) to design the word vector weight. Finally, the SVM classifier is used for classification. The effectiveness of the method is verified by experiments on Weibo data.

    Reference
    Related
    Cited by
Get Citation

牛雪莹,赵恩莹.基于Word2Vec的微博文本分类研究.计算机系统应用,2019,28(8):256-261

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 17,2019
  • Revised:March 08,2019
  • Adopted:
  • Online: August 14,2019
  • Published: August 15,2019
Article QR Code
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063