English grammar error correction method based on CNN and BERT model

An error correction method, an English technology, applied in neural learning methods, biological neural network models, natural language data processing, etc., can solve problems such as low accuracy, limited types of grammatical errors, and low training model efficiency

Inactive Publication Date: 2020-08-07
BEIJING BOZHITIANXIA INFORMATION TECH CO LTD
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of the traditional English error correction systems are purely based on the principle of statistical machine translation or based on certain grammatical rules. The accuracy is not high, the efficiency of training models is low, and the types of grammatical errors that can be detected are limited.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • English grammar error correction method based on CNN and BERT model
  • English grammar error correction method based on CNN and BERT model
  • English grammar error correction method based on CNN and BERT model

Examples

Experimental program
Comparison scheme
Effect test

example

[0012] Combined with a specific example method, the steps of the grammatical error correction operation process are as follows:

[0013] 1) Collect a large amount of "wrong-correct" parallel corpus;

[0014] 2) Use the convolutional neural network to train the error correction model;

[0015] 3) Use BERT to train the scoring model;

[0016] 4) Input the sentence to be corrected into the model in step 2, and input the corrected result into the model in step 3 to obtain the corresponding score, and the sentence with the highest score is the final corrected result.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

English grammar error correction is an important research direction in the field of natural language processing. A traditional grammar error correction system is mostly based on rule judgment, the types of detected errors are limited, and the expansion capacity is poor. An existing syntax error correction system based on a recurrent neural network is prone to losing head and tail information whenfacing long sentences, and due to the fact that characteristics cannot be extracted in parallel, the training period is long. The invention provides the English grammar error correction method based on the CNN and the BERT model. A CNN+Attention+BERT structure is adopted in the model, and an Encoder-Decoder framework is adopted in the implementation mode. Through convolution, the features of the context can be efficiently and accurately extracted; the Attention layer adds weights to different words, so that the model can learn more important features; the BERT adopts a Masked Lange Model modeto train a language model, (0, 1) classification tasks can be added to the language model through fin-tuning, the classification tasks are used for scoring sentences output by an error correction system, and the accuracy of the system is improved.

Description

technical field [0001] The present invention is a task in the field of natural language processing, mainly an English grammar error correction method based on CNN (Convolutional Neural Networks, Convolutional Neural Networks) and BERT (Bidirectional Encoder Representations from Transformers) models. Background technique [0002] For domestic English learners, grammar is often a difficult point in their English learning. Due to limited teaching resources, learners often cannot get correct correction opinions and examples in time for grammar problems encountered by learners. If there is a grammatical error correction system that enables learners to point out grammatical errors in the process of learning and using English in a timely manner, and give corresponding feedback and suggestions, it can greatly reduce the difficulty of learning for learners , and significantly improve their English proficiency. Most of the traditional English error correction systems are purely based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/232G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 马士成贾艳明
Owner BEIJING BOZHITIANXIA INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products