News Text Classification Based on Weighted Word Vector and CNN

    Abstract:

    Among text classification methods, text representations based on Word2Vec ignore how much each word contributes to distinguishing texts. To address this, a method is designed that weights Word2Vec word vectors by TF-IDF and feeds them to a CNN. In news text classification, the importance of the news title is often neglected; this study therefore proposes an improved TF-IDF method that takes both the news title and the body into account. Experiments show that the news text classification method based on weighted word vectors and a CNN improves substantially on logistic regression classification, and that it outperforms the unweighted method by 2 to 3 percentage points.
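
    The weighting idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual formula, so everything specific here is an assumption: the `title_weight` factor, the smoothed IDF, the helper names `title_aware_tfidf` and `weighted_matrix`, and the toy two-dimensional `embeddings` that stand in for trained Word2Vec vectors. The sketch only shows one plausible way to let title words count more than body words and to scale each word vector by its weight before it reaches a CNN.

```python
import math
from collections import Counter


def title_aware_tfidf(docs, title_weight=2.0):
    """docs: list of (title_tokens, body_tokens) pairs.

    Returns one {term: weight} dict per document. The title_weight
    factor is a hypothetical choice, not the paper's formula.
    """
    n_docs = len(docs)
    # Document frequency: documents whose title or body contains the term.
    df = Counter()
    for title, body in docs:
        df.update(set(title) | set(body))

    weights = []
    for title, body in docs:
        title_counts, body_counts = Counter(title), Counter(body)
        # A title occurrence counts title_weight times a body occurrence.
        total = title_weight * len(title) + len(body)
        doc_weights = {}
        for term in set(title) | set(body):
            tf = (title_weight * title_counts[term] + body_counts[term]) / total
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0  # smoothed IDF
            doc_weights[term] = tf * idf
        weights.append(doc_weights)
    return weights


def weighted_matrix(tokens, doc_weights, embeddings):
    """Scale each token's vector by its weight; the rows form the CNN input."""
    return [
        [doc_weights[t] * x for x in embeddings[t]]
        for t in tokens
        if t in embeddings and t in doc_weights
    ]


# Toy data: two "news items", each a (title, body) pair of tokens.
docs = [
    (["stock", "market"], ["market", "rises", "today"]),
    (["football", "final"], ["team", "wins", "final", "today"]),
]
# Stand-in for Word2Vec vectors (hypothetical 2-dimensional values).
embeddings = {
    "stock": [0.1, 0.3],
    "market": [0.2, 0.4],
    "rises": [0.5, 0.1],
    "today": [0.3, 0.3],
}

weights = title_aware_tfidf(docs)
tokens = docs[0][0] + docs[0][1]
matrix = weighted_matrix(tokens, weights[0], embeddings)
```

    In this toy data, "market" appears in both the title and the body of the first item, so it receives a larger weight than "rises", which appears only in the body. "today" occurs in both items, so its IDF is lower and its weight is the smallest of the three. The resulting `matrix` has one weighted row per token and could be passed to a convolutional layer in place of the unweighted word vectors.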

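Citation

Hu Wanting, Jia Zhen. News Text Classification Based on Weighted Word Vector and Convolutional Neural Network. Computer Systems & Applications, 2020, 29(5): 275-279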

History
  • Received: October 2, 2019
  • Revised: October 29, 2019
  • Online: May 7, 2020
  • Published: May 15, 2020