Improved Word Representation Based on GloVe Model

doi:10.15888/j.cnki.csa.006704

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 15

Home > Archive>Volume 28, Issue 1, 2019 >194-199. DOI:10.15888/j.cnki.csa.006704

PDF HTML XML Export Cite reminder

Improved Word Representation Based on GloVe Model
DOI:
                        10.15888/j.cnki.csa.006704
                    
CSTR:
                        [cstr]
                    
Author:
                        CHEN Zhen-RuiCHEN Zhen-Rui
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
DING Zhi-MingDING Zhi-Ming
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Word vector representation is a sound way to catch the grammatical and semantic information of words. In order to improve the accuracy of the semantic information of the word, this study proposes an improved training method model based on the GloVe by analyzing the characteristics of the co-occurrence matrix and using the distributed hypothesis. This method summarizes the general rules of irrelevant words and noise words in the co-occurrence matrix from analyzing the word frequency of Wikipedia statistics. Finally, we give the evaluation results of word vector in word analogy dataset and word correlation dataset. Experiments show that the method presented in this paper can effectively shorten the training time and the accuracy of the word semantic analogy experiment is improved in the same experimental environment.

Key words:word vector;Word2Vec;GloVe;cooccurrence matrix;unrelated words

Get Citation

陈珍锐,丁治明.基于GloVe模型的词向量改进方法.计算机系统应用,2019,28(1):194-199

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:June 04,2018
Revised:June 27,2018
Adopted:
Online: December 27,2018
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063