Text error correction method and device

A text error correction and text technology, applied in the field of data processing, can solve problems such as not considering the order of Encoder modules, high model complexity, and slow convergence speed

Active Publication Date: 2019-08-30
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the magnitude of the vocabulary is usually tens of thousands to hundreds of thousands, the solution space of the error correction model is too large when decoding the output, the complexity of the model is high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text error correction method and device
  • Text error correction method and device
  • Text error correction method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0025] The embodiments of the present application are described in detail below. Examples of the embodiments are shown in the accompanying drawings, wherein the same or similar reference numerals indicate the same or similar elements or elements with the same or similar functions. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to explain the application, but should not be understood as a limitation to the application.

[0026] The text error correction method and device in the embodiments of the present application are described below with reference to the drawings.

[0027] In the embodiment of the present application, the text error correction method is configured in a text error correction device as an example. The text error correction device may specifically be an improved NMT+Attention error correction model. The improved NMT+Attention error correction model is based on the existing NMT+Attention error correction mo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text error correction method and device. The text error correction method comprises the following steps: inputting a word vector array corresponding to a text to be correctedinto a preset encoding module, obtaining a first hidden state vector array, inputting the first hidden state vector array into a decoding module, and for each decoding position, determining a decodingvector according to a second hidden state vector corresponding to the decoding position, an attention vector and the first hidden state vector array; and according to the decoding vector of the decoding position, a global word list and a limited word list corresponding to the words of the decoding position, determining the decoding result of the decoding position, so as to determine the text after being corrected corresponding to the text. In the text error correction method, the first hidden state vector array is adopted when the decoding vector is determined, so that the word sequence of the text is considered, and the accuracy of the error correction result is ensured; and due to the adoption of the limited word list, the size of the understanding space is limited, and the complexity of the error correction model is reduced, and the convergence speed of the model is increased.

Description

technical field [0001] The present application relates to the technical field of data processing, in particular to a text error correction method and device. Background technique [0002] The current end-to-end error correction model is the NMT error correction model based on the introduction of attention mechanism. The NMT error correction model is a Sequence-To-Sequence model based on Encoder-Decoder. Among them, the structure of the Encoder module and the Decoder module is a recurrent neural network (Recurrent Neural Network, RNN for short) network structure, and the vocabulary used by the two to map the words / segments in the text sequence to the word vector space is the same. However, since the magnitude of the vocabulary is usually tens of thousands to hundreds of thousands, the solution space of the error correction model is too large when decoding the output, the complexity of the model is high, and the convergence speed of the model training is too slow. Moreover, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/22G06N3/04
CPCG06F40/126G06F40/232G06F40/289G06N3/045
Inventor 罗希意邓卓彬赖佳伟付志宏何径舟
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products