Preposition Error Correction System Based on Maximun Entropy Model
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [8]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    English preposition error correction system is to help English language learners to correct automatically the common mistakes of English prepositions. First, to classsify and count up the preposition errors in the marked corpus files, sum up 21 kinds of common prepositions, with English wiki corpus and use of computer algorithms automatically error interpolation algorithm to get the training set, and then based on the training set, by using a classification based on the maximum entropy model chosen, including context, prepositions complement other features, training model on the training set, and then use the model to predict the input sentence and correct use of the presence of errors. In NUCLE corpus experiment, given corpus processing, model features, size, number of iterations to test the effect of the impact of training data set, and compare the results of the Naive Bayes model, and finally to the F value 27.68 in the test data with respect to the shared task CoNLL2013 best results have slightly improved.

    Reference
    1 祝德勤.英语介词.北京:商务出版社,2004.
    2 Rozovskaya A, Sammons M, Roth D. The UI system in the HOO 2012 shared task on error correction. Proc. of the Seventh Workshop on Building Educational Applications Using NLP. Association for Computational Linguistics. 2012. 272-280.
    3 Freund Y, Schapire RE. Large margin classification using the perceptron algorithm. Machine Learning, 1999, 37(3):277-296.
    4 Wang P, Jia Z, Zhao H. Grammatical error detection and correction using a single maximum entropy model. CoNLL-2014. 2014. 74.
    5 Berger AL, Della Pietra SA, Della PVJ. A maximum entropy approach to natural language processing. Computational Linguistics, 1996, 22(1):39-71.
    6 Dahlmeier D, Ng HT, Wu SM. Building a large annotated corpus of learner English:the NUS corpus of learner English. Proc. of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications. 2013. 22-31.
    7 Klein D, Manning CD. Accurate unlexicalized parsing. Proc. of the 41st Annual Meeting on Association for Computational Linguistics-V1. Association for Computational Linguistics, 2003. 423-430.
    8 Ng HT, Wu SM, Wu Y, et al. The conll-2013 shared task on grammatical error correction. Proc. of CoNLL. 2013.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

李悦,吴敏,吴桂兴,郭燕.基于最大熵模型的介词纠错系统.计算机系统应用,2016,25(1):96-100

Copy
Share
Article Metrics
  • Abstract:1713
  • PDF: 2421
  • HTML: 0
  • Cited by: 0
History
  • Received:May 05,2015
  • Revised:June 08,2015
  • Online: January 15,2016
Article QR Code
You are the first990837Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063