Method for automatically correcting syntax errors in English composition based on multivariate features

A grammatical error and grammar technology, applied in the field of natural language processing technology and statistics, can solve problems such as taking a long time, limited types of English grammatical error correction, and the correct rate of error correction needs to be improved.

Active Publication Date: 2013-10-23
GUILIN UNIV OF ELECTRONIC TECH
View PDF4 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Disadvantages: Due to the flexibility and variability of English usage, it takes a long time to build a model of English grammar rules, and the summarized English grammar rules can only cover limited grammar rules, and the types of English grammar error correction are limited
[0008] Disadvantages: The correct

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically correcting syntax errors in English composition based on multivariate features
  • Method for automatically correcting syntax errors in English composition based on multivariate features
  • Method for automatically correcting syntax errors in English composition based on multivariate features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0131] The specific implementation of the method for automatically correcting grammatical errors in English composition based on multiple features of the present invention is divided into the following three steps.

[0132] Step 1: Execute the "Grammar Error Correction Preprocessing Module"

[0133] (1) Preprocess the training text set. The training text set used for preprocessing is drawn from a variety of English articles. They are 500,000-word correct English essays that do not contain any word errors, grammatical errors, or expression errors. One of the English essays The content is as follows:

[0134]In all my life there are several people who help me a lot at my crucial moments, among whom my dear second uncle influences me most. Since my childhood, my beloved uncle, who was a person of integrity and a learned man known far and near ,has told me again and again to be a good boy and to be a top student as well as an honest man.Being a boy I never made any mischief or to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for automatically correcting syntax errors in an English composition based on multivariate features. The method comprises a syntax error correcting preprocessing module, a syntax error correcting model training module and a syntax error checking and correcting module, wherein the syntax error correcting preprocessing module carries out part-of-speech tagging of words, syntactic parsing of sentences and word frequency statistics of words for input training texts; the syntax error correcting model training module extracts words and part-of-speech context syntactic features thereof, words and part-of-speech structure-dependent syntactic features thereof and words and part-of-speech syntactic features thereof, calculates syntactic feature weight of words and outputs a statistical model of syntax error correcting for a part-of-speech tagging library of input words, a syntax tree structure library of sentences, a word frequency statistics library of words and a part-of-speech and syntax confusion set of words; and the syntax error checking and correcting module utilizes the statistical model of syntax error correcting and a rule model of syntax error correcting to correct syntax errors in a composition to be corrected and outputs the corrected results of the syntax errors in the English composition. The method can automatically correct eleven kinds of common English syntax errors in the English composition.

Description

technical field [0001] The invention relates to natural language processing technology and statistics, in particular to a method for automatically correcting grammatical errors in English compositions based on multiple features. Background technique [0002] At present, the automatic correction methods of English composition grammatical errors are mainly divided into two categories, namely: the automatic correction method of English composition grammatical errors based on rules and the automatic correction method of English composition grammatical errors based on statistics. Among them, the rule-based automatic correction method for English composition grammatical errors is: by analyzing the grammatical features of English sentences, summarizing and summarizing fixed English grammatical rules to construct an English sentence grammatical rule error correction model, when correcting grammatical errors in English compositions , build an error correction model of English sentenc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/28
Inventor 黄桂敏周娅王晓娟
Owner GUILIN UNIV OF ELECTRONIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products