###

DOI:

计算机系统应用英文版:2011,20(12):60-63

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于上下文的拉丁维文拼写校对的研究

何晋一^1,2, 陈红英¹, 姜文斌², 张海波^2,3, 刘群²

(1.华南师范大学计算机学院,广州 510631;2.中国科学院计算技术研究所智能信息处理重点实验室,北京 100190;3.四川大学软件学院,成都 610065)

Latin-Uighur Spelling Check Based on Context

HE Jin-Yi^1,2, CHEN Hong-Ying¹, JIANG Wen-Bin², ZHANG Hai-Bo^2,3, LIU Qun²

(1.School of Computer, South China Normal University, Guangzhou 510631, China;2.Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China;3.School of Software Engineering, Sichuan University, Chengdu 610065, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 2475次下载 3832次
Received:March 29, 2011 Revised:May 04, 2011

中文摘要: 根据拉丁维文的特点,分析了拉丁维文常见的拼写错误类型,提出了一种将最小编辑距离、基于有向图模型的词语切分和trigram 语言模型融合的方法,实现了基于上下文的拉丁维文的自动拼写校对系统,从而大大提高了拉丁维文的校对准确率。在新疆大学提供的维文语料库的测试中,拉丁维文的校对准确率达到了90.1%。

中文关键词: 拉丁维文最小编辑距离有向图模型词语切分语言模型上下文拼写校对

Abstract:According to the characteristics of Latin-Uighur, this paper analyzed the common spelling error types of Latin-Uighur, and then proposed a method which merged the minimum edit distance, directed graph model based lexical segmentation, trigram language model together. Finally, we implemented the automatically spelling check system of Latin-Uighur based on context. It has increased the accuracy of Latin-Uighur spelling check largely. The experiment on the Uighur corpus provided by Xinjiang University reaches an accuracy of 90.1%.

keywords: Latin-Uighur minimum edit distance directed graph model lexical segmentation language model context spelling check

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金(60736014)

Author Name	Affiliation
HE Jin-Yi	School of Computer, South China Normal University, Guangzhou 510631, China Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
CHEN Hong-Ying	School of Computer, South China Normal University, Guangzhou 510631, China
JIANG Wen-Bin	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
ZHANG Hai-Bo	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China School of Software Engineering, Sichuan University, Chengdu 610065, China
LIU Qun	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China

Author Name	Affiliation
HE Jin-Yi	School of Computer, South China Normal University, Guangzhou 510631, China Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
CHEN Hong-Ying	School of Computer, South China Normal University, Guangzhou 510631, China
JIANG Wen-Bin	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
ZHANG Hai-Bo	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China School of Software Engineering, Sichuan University, Chengdu 610065, China
LIU Qun	Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China

引用文本：
何晋一,陈红英,姜文斌,张海波,刘群.基于上下文的拉丁维文拼写校对的研究.计算机系统应用,2011,20(12):60-63
HE Jin-Yi,CHEN Hong-Ying,JIANG Wen-Bin,ZHANG Hai-Bo,LIU Qun.Latin-Uighur Spelling Check Based on Context.COMPUTER SYSTEMS APPLICATIONS,2011,20(12):60-63