Error correction method and system for real-time translated text, storage medium and device

An error correction method and storage medium technology, applied in the field of storage media and devices, systems, and error correction methods for real-time translation of text, can solve problems such as difficult statistics of ASR translation errors, increased errors, and ASR error correction methods that are difficult to reach the use level, etc. , to achieve the effect of improving the error range and accuracy, and improving the word accuracy

Active Publication Date: 2022-01-18
北京数美时代科技有限公司
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the one hand, the defect of this type of algorithm is that when the error correction position of the detector mark is wrong, additional errors will be added.
On the other hand, the maintenance of the two-stage error correction method is cumbersome, especially the construction of the candidate set of the error c

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Error correction method and system for real-time translated text, storage medium and device
  • Error correction method and system for real-time translated text, storage medium and device
  • Error correction method and system for real-time translated text, storage medium and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0086] The principles and features of the present invention will be described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0087] Such as figure 1 As shown, an error correction method for real-time translated text provided by an embodiment of the present invention includes:

[0088] S1, obtaining the ASR translation text of the real-time live broadcast;

[0089] S2, interpret the ASR translation text through the trained BERT error correction model, and output the first error correction text; interpret the ASR translation text through the trained GPT error correction model, and output the second error correction text;

[0090] In a certain embodiment, the training process of the BERT error correction model may include:

[0091] Use the alignment algorithm based on the Levenshtein distance to align the text strings of the standard t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an error correction method and system for a real-time translated text, a storage medium and a device, and relates to the field of voice error correction. The method comprises the steps that: an ASR translation text in real-time live broadcast is acquired, the ASR translation text is interpreted through a trained BERT error correction model, and a first error correction text is output; the ASR translation text is interpreted through the trained GPT error correction model, a second error correction text is output, the first error correction text and the second error correction text are combined to obtain an error correction target text, and end-to-end error correction is performed on the ASR translation text content in the live broadcast scene through the scheme, the character accuracy of the ASR on an audio translated text can be effectively improved, and the method can be quickly applied to the field of live broadcast.

Description

technical field [0001] The invention relates to the field of speech error correction, in particular to an error correction method, system, storage medium and device for real-time translated text. Background technique [0002] Due to the large storage capacity and complex content of voice information, it is not easy to store, monitor and analyze directly. Therefore, automatic speech recognition technology (ASR) is used for voice-text translation, and the text is further stored, monitored and analyzed. [0003] In recent years, with the rise of the webcast industry, a large amount of information with voice signals as the carrier has been disseminated on the Internet. When using ASR for speech-to-text translation, due to the uneven quality of the live broadcast environment and the insufficient capacity of the ASR model, it is enough to change the semantics. Mistranslated information, such as translating the audio information of "I want to go to Dali" into the text information o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/06G10L15/16G10L15/22G10L15/26G06N3/04G06N3/08
CPCG10L15/063G10L15/16G10L15/22G10L15/26G06N3/08G06N3/047G06N3/045
Inventor 孙晓兵齐路唐会军刘栓林
Owner 北京数美时代科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products