N-gram模型综述

doi:10.15888/j.cnki.csa.006560

AIPUB归智期刊联盟

微信公众号

网站二维码

首页 > 过刊浏览>2018年第27卷第10期 >33-38. DOI:10.15888/j.cnki.csa.006560

PDF HTML阅读 XML下载导出引用引用提醒

N-gram模型综述
DOI:
                        10.15888/j.cnki.csa.006560
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:

Survey on N-gram Model

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

N-gram模型是自然语言处理中最常用的语言模型之一，广泛应用于语音识别、手写识别、拼写纠错、机器翻译和搜索引擎等众多任务.但是N-gram模型在训练和应用时经常会出现零概率问题，导致无法获得良好的语言模型，因此出现了拉普拉斯平滑、卡茨回退和Kneser-Ney平滑等平滑方法.在介绍了这些平滑方法的基本原理后，使用困惑度作为度量标准去比较了基于这几种平滑方法所训练出的语言模型.

Abstract:

The N-gram model is one of the most commonly used language models in natural language processing and is widely used in many tasks such as speech recognition, handwriting recognition, spelling correction, machine translation and search engines. However, the N-gram model often presents zero-probability problems in training and application, resulting in failure to obtain a good language model. As a result, smoothing methods such as Laplace smoothing, Katz back-off, and Kneser-Ney smoothing appeared. After introducing the basic principles of these smoothing methods, we use the perplexity as a metric to compare the language models trained based on these types of smoothing methods.

参考文献

相似文献

引证文献

引用本文

尹陈,吴敏. N-gram模型综述.计算机系统应用,2018,27(10):33-38

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2018-01-29
最后修改日期:2018-02-27
录用日期:
在线发布日期: 2018-09-29
出版日期:

微信公众号

网站二维码

引用本文

分享

相关视频

文章指标

历史

文章二维码