Short Text Topic Model Based on Semantic Enhancement

doi:10.15888/j.cnki.csa.007937


Volume 30, Issue 6, 2021, pp. 141-147

Authors:
GAO Juan, School of Computer Science, Xi’an Polytechnic University, Xi’an 710600, China
ZHANG Xiao-Bin, School of Computer Science, Xi’an Polytechnic University, Xi’an 710600, China

Abstract:

Traditional topic models rely largely on word co-occurrence patterns to generate text topics. Short texts provide too little context, and the resulting data sparsity prevents traditional topic models from performing well on them. To address this, this study proposes a short text topic model based on semantic enhancement. The algorithm integrates the Dirichlet Multinomial Mixture (DMM) model with a word embedding model. It obtains vector representations of words by training global and local word embeddings, and measures the semantic correlation between word vectors with cosine similarity. It further enhances the semantics of words by calculating the weights of topic-related words. Experiments show that the proposed model achieves better topic coherence and higher classification accuracy on short texts.
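
The cosine-similarity step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy 3-dimensional vectors, and the 0.5 threshold are all assumptions chosen for illustration, and the abstract does not say how the global and local embeddings are combined or how the similarity feeds into the topic-related word weights.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    norm = np.linalg.norm(u) * np.linalg.norm(v)
    return float(np.dot(u, v) / norm) if norm > 0 else 0.0

def related_words(word, embeddings, threshold=0.5):
    """Return (other_word, similarity) pairs at or above the threshold, most similar first.

    `threshold` is a hypothetical cut-off, not a value taken from the paper.
    """
    target = embeddings[word]
    scored = [
        (other, cosine_similarity(target, vec))
        for other, vec in embeddings.items()
        if other != word
    ]
    kept = [pair for pair in scored if pair[1] >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)

# Toy vectors standing in for trained word embeddings (illustrative values only).
toy_embeddings = {
    "topic": np.array([1.0, 0.0, 0.0]),
    "theme": np.array([0.9, 0.1, 0.0]),
    "banana": np.array([0.0, 0.0, 1.0]),
}

result = related_words("topic", toy_embeddings)
print(result)
```

With these toy vectors, "theme" points in almost the same direction as "topic" and is kept, while "banana" is orthogonal to it and is filtered out.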

Keywords: short text; topic model; word embedding; semantic enhancement; Gibbs sampling

Citation:

GAO Juan, ZHANG Xiao-Bin. Short Text Topic Model Based on Semantic Enhancement. Computer Systems & Applications (计算机系统应用), 2021, 30(6): 141-147


History

Received: October 5, 2020
Revised: November 2, 2020
Online: June 5, 2021
