Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text reconstruction training method and system

A training method and text technology, applied in the field of text generation and deep learning, can solve problems such as irregular words, mispronunciation of speech recognition results, miscommunication and misunderstanding of information, and achieve the effect of improving text quality and eliminating text errors

Pending Publication Date: 2020-09-08
北京中科汇联科技股份有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to problems such as similar characters in shape, similar in sound, clerical errors, slips of the tongue, inaccurate speech recognition, and uneven levels of authors, the phenomenon of wrong words and irregular words in the text occurs from time to time, causing misinformation and misunderstandings
The existing text error correction methods based on tree models or deep neural networks cannot effectively solve the problems of missing words, multiple words, mispronunciation of speech recognition results, and irregular words in the text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text reconstruction training method and system
  • Text reconstruction training method and system
  • Text reconstruction training method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052]The purpose of the present invention is to provide a text reconstruction training method and system, using text encoding and generating a neural network to convert a given original text into a target text, correct typos in the original text, supplement missing words, and remove redundant Words, and standardized words, in order to eliminate text errors and improve text quality.

[0053] In order to make the above objects, features and advantages of the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text reconstruction training method and system. The method comprises the following steps: constructing a training sample; encoding splicing of the original text and the pronunciation sequence by adopting a neural network text sequence model; adopting a sigmoid feedforward neural network classifier to obtain a word segmentation classification result; taking a preset GEN _LENGTH as the maximum generation sequence length, and circularly generating each character of the sequence from head to tail; optimizing parameters of the neural network, calculating word segmentationloss and text generation loss, performing weighted summation on the generation loss to obtain joint loss, and optimizing the parameters of the neural network for the joint loss by using a gradient descent algorithm. According to the text reconstruction training method and system provided by the invention, the text coding and the generative neural network are used for converting the given originaltext into the target text, correcting wrongly written characters in the original text, supplementing missing characters, removing redundant characters and standardizing words, so that the purposes ofeliminating text errors and improving the text quality are achieved.

Description

technical field [0001] The invention relates to the technical field of text generation and deep learning, in particular to a text reconstruction training method and system. Background technique [0002] In the information age, text is an important information carrier in Internet multimedia, with huge data and many sources and authors. Due to problems such as similar characters in shape, similar in sound, clerical errors, slips of the tongue, inaccurate speech recognition, and uneven levels of authors, the phenomenon of wrong words and irregular words in the text occurs from time to time, causing misinformation and misunderstandings . The existing text error correction methods based on tree models or deep neural networks cannot effectively solve the problems of missing words, multiple words, mispronunciation of speech recognition results, and irregular words in the text. Contents of the invention [0003] The purpose of the present invention is to provide a text reconstru...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/151G06F40/232G06F40/284G06N3/04G06N3/08
CPCG06F40/151G06F40/232G06F40/284G06N3/049G06N3/08G06N3/044
Inventor 王丙栋游世学
Owner 北京中科汇联科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products