Unlock instant, AI-driven research and patent intelligence for your innovation.

Text error correction method and device

A text error correction and text technology, applied in the field of data processing, can solve the problems of high model complexity, no consideration of the order of Encoder modules, and poor error correction result accuracy.

Active Publication Date: 2021-02-05
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the magnitude of the vocabulary is usually tens of thousands to hundreds of thousands, the solution space of the error correction model is too large when decoding the output, the complexity of the model is high, and the convergence speed of the model training is too slow.
And the attention mechanism does not consider the order of the original input sequence of the Encoder module, resulting in poor accuracy of error correction results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text error correction method and device
  • Text error correction method and device
  • Text error correction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

[0026] The text error correction method and device according to the embodiments of the present application are described below with reference to the accompanying drawings.

[0027] The embodiment of the present application is illustrated by taking a text error correction method configured in a text error correction device as an example, and the text error correction device may specifically be an improved NMT+Attention error correction model. The improved NMT+Attention error correction model is based on the existing NMT+Atten...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application proposes a text error correction method and device, wherein the method includes: input the word vector array corresponding to the text to be error corrected into the preset encoding module, obtain the first hidden state vector array and input it to the decoding module, and for each The decoding position determines the decoding vector according to the second hidden state vector, the attention vector and the first hidden state vector array corresponding to the decoding position; The limited word table determines the decoding result of the decoding position, and then determines the error-corrected text corresponding to the text. When determining the decoding vector in this method, the first hidden state vector array is used, thus taking into account the word order of the text, ensuring corrected text. In addition, the use of restricted vocabulary limits the size of the understanding space, reduces the complexity of the error correction model, and improves the convergence speed of the model.

Description

technical field [0001] The present application relates to the technical field of data processing, in particular to a text error correction method and device. Background technique [0002] The current end-to-end error correction model is the NMT error correction model based on the introduction of attention mechanism. The NMT error correction model is a Sequence-To-Sequence model based on Encoder-Decoder. Among them, the structure of the Encoder module and the Decoder module is a recurrent neural network (Recurrent Neural Network, RNN for short) network structure, and the vocabulary used by the two to map the words / segments in the text sequence to the word vector space is the same. However, since the magnitude of the vocabulary is usually tens of thousands to hundreds of thousands, the solution space of the error correction model is too large when decoding the output, the complexity of the model is high, and the convergence speed of the model training is too slow. Moreover, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/232G06F40/289G06F40/126G06N3/04
CPCG06F40/126G06F40/232G06F40/289G06N3/045
Inventor 罗希意邓卓彬赖佳伟付志宏何径舟
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD