###

DOI:

计算机系统应用英文版:2015,24(8):85-90

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于最大熵模型的冠词错误纠正系统

陈朝才¹, 吴敏¹, 吴桂兴², 郭燕²

(1.中国科学技术大学现代教育技术中心, 合肥 230026;2.中国科学技术大学苏州研究院, 苏州 235123)

Article Error Correction System Based on Maximum Entropy Model

CHEN Zhao-Cai¹, WU Min¹, WU Gui-Xing², GUO Yan²

(1.Center of Modern Educational Technology, University of Science and Technology of China, Hefei 230026, China;2.Suzhou Institute of University of Science and Technology of China, Suzhou 235123, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 1581次下载 2192次
Received:December 12, 2014 Revised:February 02, 2015

中文摘要: 研究了英语语法中冠词错误的计算机自动纠正. 首先对冠词使用的错误进行定义分类, 并考虑到可能出现冠词缺失的情况, 通过采用基于最大熵模型的分类器, 选择包含上下文、上下文词性、短语结构等特征, 在训练集上进行模型预的训练, 然后使用模型对于输入句子进行预测并纠正存在的使用错误. 在NUCLE语料的实验中, 给出了语料处理、模型特点、训练语料的大小对于测试集效果的影响, 并且比较了自然语言处理中非常通用的朴素贝叶斯模型的结果, 还根据英语语法中存在的错误特点对模型进行改进, 最后在测试数据达到35.48%的F值, 相较于CoNLL2013的shared task中最好结果有小幅提升.

中文关键词: 冠词错误计算机自动纠正最大熵模型

Abstract:Computer automation correction of article errors in English grammar is been studied. First we define the categories of article errors, and missing articles is also included, by using a maximum entropy model, extracting features covering context, part of speech, noun phrase structure and so on, training the model on the training corpus, then use the model to predict and correct the article errors of an input sentence. In the experiment on NUCLE corpus, effects of corpus preprocess, model types and the size of the training corpus are discussed. We make a comparison with the popular Naive Bayes model, at last we introduce the characters of English grammar to improve the model, a F-score of 35.48% is achieved, the result is slightly better than the best result in CoNLL 2013 shared task.

keywords: aticle errors computer automation correction maximum entropy model

文章编号： 中图分类号： 文献标志码：

基金项目:

Author Name	Affiliation
CHEN Zhao-Cai	Center of Modern Educational Technology, University of Science and Technology of China, Hefei 230026, China
WU Min	Center of Modern Educational Technology, University of Science and Technology of China, Hefei 230026, China
WU Gui-Xing	Suzhou Institute of University of Science and Technology of China, Suzhou 235123, China
GUO Yan	Suzhou Institute of University of Science and Technology of China, Suzhou 235123, China

Author Name	Affiliation
CHEN Zhao-Cai	Center of Modern Educational Technology, University of Science and Technology of China, Hefei 230026, China
WU Min	Center of Modern Educational Technology, University of Science and Technology of China, Hefei 230026, China
WU Gui-Xing	Suzhou Institute of University of Science and Technology of China, Suzhou 235123, China
GUO Yan	Suzhou Institute of University of Science and Technology of China, Suzhou 235123, China

引用文本：
陈朝才,吴敏,吴桂兴,郭燕.基于最大熵模型的冠词错误纠正系统.计算机系统应用,2015,24(8):85-90
CHEN Zhao-Cai,WU Min,WU Gui-Xing,GUO Yan.Article Error Correction System Based on Maximum Entropy Model.COMPUTER SYSTEMS APPLICATIONS,2015,24(8):85-90