Speech recognition correction method, corresponding device, equipment and medium

A speech recognition correction method, applicable to speech recognition, speech analysis, instruments, and related fields, that addresses problems such as the lost time-series correspondence between audio text and audio data when preparing acoustic model training samples.

Pending Publication Date: 2021-10-22
GUANGZHOU HUADUO NETWORK TECH

AI Technical Summary

Problems solved by technology

However, the audio text part of these data has often lost its timing correspondence with the audio data. Data of this kind is generally called "dirty data"; it cannot be used directly to train an acoustic model and must be processed further before it yields useful training samples. The key problem is therefore how to construct an effective technical solution that produces acoustic model training samples efficiently.

Detailed Description of the Embodiments

[0062] Embodiments of the present application are described in detail below, and examples of them are shown in the drawings, in which the same or similar reference numerals denote, throughout, the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the figures are exemplary, serve only to explain the present application, and are not to be construed as limiting it.

[0063] Those skilled in the art will understand that, unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the stated features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be under...

Abstract

The invention discloses a speech recognition correction method and a corresponding device, equipment and medium. The method comprises the following steps: acquiring a preliminary audio text and confidence data recognized from original audio data by a selected acoustic model; replacing words in the preliminary audio text whose confidence is lower than a preset threshold with hole marks to obtain a marked audio text; aligning the marked audio text with the original audio text of the original audio data, so that the hole marks in the marked audio text are filled in from the original audio text to obtain a corrected audio text; and marking the original audio data as a training sample, marking the corrected audio text as the supervision label of the original audio data, and storing the training sample and the supervision label in the sample library required for acoustic model training. With this method and device, dirty data formed by audio text and audio data associated with the same voice content can be cleaned efficiently, so that the training data required for acoustic model training can be prepared.
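The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the hole mark string `<hole>`, the threshold value 0.8, the function names, and the use of the standard library's `difflib.SequenceMatcher` for text alignment are all assumptions made for illustration, and the policy for spans that disagree with the original text is a guess rather than something the source specifies.

```python
from difflib import SequenceMatcher

HOLE = "<hole>"  # hypothetical hole mark; the source does not specify its form


def mask_low_confidence(words, confidences, threshold=0.8):
    """Replace words whose confidence is below the threshold with a hole mark."""
    return [w if c >= threshold else HOLE for w, c in zip(words, confidences)]


def fill_holes(marked, original):
    """Align the marked text with the original text and fill holes from it.

    Assumed policy (not from the source): an aligned span that contains a hole
    takes the original words; confident words that differ from the original
    are kept; holes with no counterpart in the original are dropped.
    """
    matcher = SequenceMatcher(None, marked, original, autojunk=False)
    corrected = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        span = marked[i1:i2]
        if tag == "replace" and HOLE in span:
            corrected.extend(original[j1:j2])
        elif tag in ("equal", "replace", "delete"):
            corrected.extend(w for w in span if w != HOLE)
        # "insert": original words with no recognized counterpart are skipped
    return corrected


def make_sample(audio_path, corrected):
    """Pair the audio (training sample) with the corrected text (supervision label)."""
    return {"audio": audio_path, "label": " ".join(corrected)}


# Toy data: the recognizer misheard "sat" as "sad" with low confidence.
words = ["the", "cat", "sad", "on", "the", "mat"]
confidences = [0.95, 0.90, 0.40, 0.92, 0.97, 0.91]
original = ["the", "cat", "sat", "on", "the", "mat"]

marked = mask_low_confidence(words, confidences)
corrected = fill_holes(marked, original)
sample = make_sample("clip_0001.wav", corrected)  # hypothetical file name
print(marked)
print(sample)
```

In this sketch the recognized words act as anchors that tie the text to the audio, and the original text is consulted only where the recognizer was unsure. A production system would also need tokenization suited to the language and a way to reject samples with too many holes, neither of which is covered here.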

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of speech recognition, and in particular to a speech recognition correction method and corresponding devices, equipment, and media.

Background

[0002] A large amount of training data is essential material for an excellent acoustic model. At present, the mainstream way to produce ASR (Automatic Speech Recognition) training data is direct sampling: a person accurately reads a given text or dialogue aloud, which yields both the audio data formed by the reading and the audio text that was read. The audio file can be used as a training sample and its audio text as the supervision label, so this approach produces a high-quality training corpus that can be used directly for training. Obviously, the collection efficiency of this method is very low, and the data acquisition cost is very high. [000...

Application Information

IPC(8): G10L15/01, G10L15/06, G10L15/16, G10L15/18, G10L15/26
CPC: G10L15/063, G10L15/01, G10L15/26, G10L15/18, G10L15/16
Inventor 姜博怀
Owner GUANGZHOU HUADUO NETWORK TECH