###
计算机系统应用英文版:2018,27(9):118-123
←前一篇   |   后一篇→
本文二维码信息
码上扫一扫!
基于词向量技术与主题词特征的微博立场检测
(上海海事大学 信息工程学院, 上海 201306)
Stance Detection in Chinese Microblog Topic Based on Word Embedding Technology and Thematic Words Feature
(College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 2324次   下载 2784
Received:January 06, 2018    Revised:January 23, 2018
中文摘要: 微博话题随着移动互联网的发展变得火热起来,单个热门话题可能有数万条评论,微博话题的立场检测是针对某话题判断发言人对该话题的态度是支持的、反对的或中立的.本文一方面由Word2Vec训练语料库中每个词的词向量获取句子的语义信息,另一方面使用TextRank构建主题集作为话题的立场特征,同时结合情感词典获取句子的情感信息,最后将特征选择后的词向量使用支持向量机对其训练和预测完成最终的立场检测模型.实验表明基于主题词及情感词相结合的立场特征可以获得不错的立场检测效果.
Abstract:With the development of the mobile Internet, Microblog topic has become popular. A single hot topic may have tens of thousands of comments. The stance detection of Microblog topic aims to automatically determine whether the author of a text is in favor of the given target, against the given target, or neither. Firstly, Word2Vec trains out each word of the corpus of vector to extract semantics information from sentence. Then, TextRank keywords extracted method is used to construct the thematic words set as the stance's feature, meanwhile, the sentiment lexicon is used to extract the sentiment information of the sentence. Finally, the word vector of feature selection is trained and predicted by Support Vector Machine (SVM), so as to complete the model of stance detection. The experimental result shows that the stance feature based on the combination of thematic words and sentiment words can obtain good stance detection effect.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
郑海洋,高俊波,邱杰,焦凤.基于词向量技术与主题词特征的微博立场检测.计算机系统应用,2018,27(9):118-123
ZHENG Hai-Yang,GAO Jun-Bo,QIU Jie,JIAO Feng.Stance Detection in Chinese Microblog Topic Based on Word Embedding Technology and Thematic Words Feature.COMPUTER SYSTEMS APPLICATIONS,2018,27(9):118-123