Using Word Clustering to Improve Recurrent Neural Network Language Model
Abstract:

Previous studies have shown that adding part-of-speech tag information to the input layer of a neural language model can significantly improve its performance. However, part-of-speech tagging requires hand-annotated data to train the tagger, which is costly, and the extra tagger also makes the model more complicated. To address this problem, this article proposes feeding the results of Brown clustering, instead of part-of-speech tags, into the input layer of the recurrent neural network language model. On the Penn Treebank corpus, the relative improvement over the original recurrent neural network language model reaches 8%-9%.
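To make the idea concrete, the sketch below shows one way an RNN language model's input layer can be augmented with Brown-cluster features: the one-hot word vector is concatenated with a one-hot vector for the word's cluster before the recurrent step. This is a minimal illustration under assumed settings, not the paper's implementation; the layer sizes, the word_to_cluster mapping, and all variable names are hypothetical, and in practice the cluster assignments would come from an unsupervised Brown-clustering run over the training corpus.

```python
import numpy as np

# Minimal Elman-style RNN language model whose input concatenates a one-hot
# word vector with a one-hot Brown-cluster vector (hypothetical sizes).
vocab_size = 10000   # |V|: vocabulary size
n_clusters = 100     # number of Brown clusters
hidden_size = 200    # recurrent hidden layer size

rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(hidden_size, vocab_size + n_clusters))
W_rec = rng.normal(scale=0.1, size=(hidden_size, hidden_size))
W_out = rng.normal(scale=0.1, size=(vocab_size, hidden_size))

def one_hot(index, size):
    v = np.zeros(size)
    v[index] = 1.0
    return v

def forward_step(word_id, cluster_id, h_prev):
    """One forward step: input = [one-hot word ; one-hot Brown cluster]."""
    x = np.concatenate([one_hot(word_id, vocab_size),
                        one_hot(cluster_id, n_clusters)])
    h = np.tanh(W_in @ x + W_rec @ h_prev)   # recurrent hidden state
    logits = W_out @ h
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                     # softmax over the next word
    return probs, h

# Placeholder cluster mapping; a real one would be produced by Brown clustering.
word_to_cluster = {w: w % n_clusters for w in range(vocab_size)}
h = np.zeros(hidden_size)
probs, h = forward_step(42, word_to_cluster[42], h)
```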

Get Citation

Liu Z, Chen XP. Using word clustering to improve recurrent neural network language model. Computer Systems & Applications, 2014, 23(5): 101-106.

History
  • Received: September 12, 2013
  • Revised: November 11, 2013
  • Online: May 29, 2014