A Chinese grammatical error detection method based on word vectors added with text information

A technology for text information and grammatical errors, applied in the field of information processing, can solve problems such as not making good use of Chinese vocabulary and ignoring polysemy in a word

Active Publication Date: 2019-07-12
BEIJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, most of the existing detection methods do not make good use of the information expressed in Chinese vocabulary, ignoring the phenomenon of polysemy in Chinese

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Chinese grammatical error detection method based on word vectors added with text information
  • A Chinese grammatical error detection method based on word vectors added with text information
  • A Chinese grammatical error detection method based on word vectors added with text information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] Next, embodiments of the present invention will be described in more detail.

[0014] figure 1 It is a network structure diagram of the error detection method provided by the present invention, which includes:

[0015] Step S1: the vectorization of text words;

[0016] Step S2: The cyclic neural network forms text information related to each word vector;

[0017] Step S3: Text matrix reconstruction;

[0018] Step S4: the cyclic neural network extracts context information;

[0019] Step S5: the forward neural network calculates the error score of each word;

[0020] Step S6: use the error score to infer the error location;

[0021] Each step will be described in detail below:

[0022] Step S1: Vectorization of text words. The present invention first establishes a mapping table from words to word vector numbers, and maps each word in the text to a corresponding word number through the mapping table. Initialize the word vector matrix, each row in the word vector ma...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese grammatical error detection method and device adding word vectors of text information, belonging to the field of information processing. The characteristics of this method include: first vectorize the words of the input text to form a text matrix; then use the cyclic neural network to form text information related to each word vector; reconstruct the text matrix; use the cyclic neural network to extract context information; Compute error scores for each word to the neural network; use the error scores to infer error locations. The present invention improves the detection effect of Chinese grammar by combining text-based word vectors, and has great use value.

Description

technical field [0001] The invention relates to the field of information processing, in particular to a neural network-based Chinese grammatical error detection method. Background technique [0002] Due to the rapid development of China, more and more foreigners start to learn Chinese, so the task of Chinese grammatical error detection has attracted more and more attention. The purpose of the Chinese grammatical error detection task is to judge whether there are grammatical errors in the text written by non-Chinese native speakers, and to give an error message. [0003] Most of the current grammatical error detection models use sequence labeling. The model marks the wrong words in the text through calculation and gives an error message. Commonly used statistical learning methods for Chinese grammatical error diagnosis include n-gram, and machine learning methods include recurrent neural network methods. However, these networks require more artificial features to achieve be...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27G06N3/04
CPCG06F40/253G06F40/284G06N3/044
Inventor 赵建博李思李明正徐雅静
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products