BERT and feedforward neural network-based text error correction method

A feedforward neural network and text error correction technology, which is applied in the field of text error correction based on BERT and feedforward neural network, can solve problems such as inapplicability and time-consuming, and achieve the effect of improving time-consuming

Pending Publication Date: 2021-05-25
ZHEJIANG LAB +1
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Traditional text error correction mainly adopts the method based on rules or translation models. The rule-based method mainly relies on manually defining the replacement word dictionary, and can only correct several specific errors; using translation models to correct text errors is the current mainstream method , and the neural network-based translation model has replaced the statistical-based translation model for error co...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • BERT and feedforward neural network-based text error correction method
  • BERT and feedforward neural network-based text error correction method
  • BERT and feedforward neural network-based text error correction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The method proposed by the present invention will be further described in detail below in conjunction with the accompanying drawings and specific examples.

[0071] The present invention is a text error correction method based on BERT and feed-forward neural network, using deep learning method, combined with the pre-trained language model BERT, can effectively extract semantic information such as word part of speech and syntactic structure from the text, so as to obtain each Characteristic representation of a word in context. In addition, the three different feed-forward neural networks designed by the present invention can use the extracted feature information to perform the functions of text correctness and error judgment, error position detection and correct text generation respectively, and the organic combination of each module can realize The purpose of text error correction of the present invention; figure 1 shown, including the following steps:

[0072] 1. Dat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a BERT and feedforward neural network-based text error correction method, which can be used for quickly and accurately identifying and correcting errors of large-scale corpora. The method comprises the following steps: firstly, preprocessing a text, then carrying out semantic coding on the text by using BERT, then judging whether the text is correct or not by using semantic information of the whole text, then finding out a specific position of an error in the text which is judged to be wrong by using a sequence labeling method, and finally, combining context information of the error to determine whether the error occurs in the text; and generating a corresponding correct text by using the feedforward neural network. The text error correction method constructed by the invention has the characteristics of high reasoning speed and good interpretability.

Description

technical field [0001] The invention belongs to the fields of artificial intelligence and natural language processing, and in particular relates to a text error correction method based on BERT and a feedforward neural network. [0002] technical background [0003] Text error correction is a natural language processing technology that corrects erroneous content in texts, specifically including spelling error correction, grammatical error correction, and semantic-pragmatic error correction in specific scenarios. Among them, spelling error correction is characterized by not changing the length of the text, but only one-to-one correction of typos in the text; grammatical error correction and semantic pragmatic error correction need to deal with multiple word errors, few word errors, and word usage in the text. Mistakes, such as mistakes and wrong word order, may change the length of the text. [0004] In recent years, large-scale deep pre-trained language models such as BERT ha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/232G06F40/289G06F40/30G06N3/04
CPCG06F40/232G06F40/289G06F40/30G06N3/04
Inventor 潘法昱曹斌於其之
Owner ZHEJIANG LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products