Speech recognition correction method, corresponding device, equipment and medium

A speech recognition correction method, applied in speech recognition, speech analysis, instruments, etc., that addresses problems such as audio text losing its timing correspondence with the audio data and the preparation of acoustic model training samples.

Pending Publication Date: 2021-10-22

GUANGZHOU HUADUO NETWORK TECH



AI Technical Summary

Problems solved by technology

However, the audio text in such data has often lost its timing correspondence with the audio data. Data of this kind is generally called "dirty data". It cannot be used directly to train an acoustic model and must be processed further to produce useful training samples. The key problem is therefore how to construct an effective technical solution that produces acoustic model training samples efficiently.



Embodiment Construction

[0062] Embodiments of the present application are described in detail below, with examples shown in the drawings, in which the same or similar reference numerals denote, throughout, the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the figures are exemplary, serve only to explain the present application, and are not to be construed as limiting it.

[0063] Those skilled in the art will understand that, unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the stated features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be under...


Abstract

The invention discloses a speech recognition correction method, a corresponding device, equipment and a medium. The method comprises the following steps: acquiring a preliminary audio text and confidence data recognized from original audio data by a selected acoustic model; replacing words with confidence lower than a preset threshold value in the preliminary audio text with hole marks to obtain a marked audio text; performing text alignment on the marked audio text according to an original audio text of the original audio data, so that the hole marks in the marked audio text are correspondingly complemented according to the original audio text to obtain a corrected audio text; and marking the original audio data as a training sample, marking the corrected audio text as a supervision label of the original audio data, and storing the training sample and the supervision label in a sample library required by acoustic model training. According to the method and the device, dirty data formed by the audio text and the audio data associated with the same voice content can be efficiently cleaned, so that the training data required by acoustic model training can be prepared.
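The steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the `<hole>` token, the 0.6 threshold, the function names, the dictionary-based sample library, and the use of Python's `difflib.SequenceMatcher` for text alignment are all assumptions made for the example. The patent excerpt does not specify the alignment algorithm.

```python
from difflib import SequenceMatcher

HOLE = "<hole>"  # hypothetical hole-mark token, not named in the patent


def mask_low_confidence(words, confidences, threshold=0.6):
    """Replace words whose confidence is below the threshold with hole marks."""
    return [w if c >= threshold else HOLE for w, c in zip(words, confidences)]


def fill_holes(marked, original):
    """Align the marked text with the original text and complete the holes.

    Assumed policy: any aligned segment that contains a hole is replaced by
    the matching segment of the original text; segments without holes keep
    what the acoustic model recognized with high confidence.
    """
    matcher = SequenceMatcher(a=marked, b=original, autojunk=False)
    corrected = []
    for _tag, i1, i2, j1, j2 in matcher.get_opcodes():
        segment = marked[i1:i2]
        if HOLE in segment:
            corrected.extend(original[j1:j2])  # complete holes from original text
        else:
            corrected.extend(segment)  # keep confidently recognized words
    return corrected


def build_sample(audio_id, words, confidences, original, library, threshold=0.6):
    """Store the audio as a training sample with the corrected text as its label."""
    marked = mask_low_confidence(words, confidences, threshold)
    corrected = fill_holes(marked, original)
    library.append({"audio": audio_id, "label": " ".join(corrected)})
    return corrected


if __name__ == "__main__":
    sample_library = []  # stand-in for the sample library
    recognized = ["the", "quick", "brown", "fax", "jumps"]
    scores = [0.95, 0.90, 0.92, 0.30, 0.88]
    reference = ["well", "the", "quick", "brown", "fox", "jumps"]
    build_sample("clip-001", recognized, scores, reference, sample_library)
    print(sample_library)
```

Under this assumed policy, words that appear only in the original text and align with nothing in the recognized text (such as the leading "well" in the usage example) are dropped, so the label follows what the model actually recognized in the audio.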

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of speech recognition, and in particular to a speech recognition correction method and corresponding devices, equipment, and media.

Background technique

[0002] A large amount of training data is essential for an excellent acoustic model. At present, the mainstream way of producing ASR (Automatic Speech Recognition) training data is direct sampling: a person reads a given text or dialogue aloud accurately, which yields both the audio data formed by the reading and the audio text that was read. The audio file can be used as a training sample and its audio text as the supervision label, so a high-quality training corpus can be produced that can be used directly as training data. Obviously, the collection efficiency of this method is very low and the data acquisition cost is very high. [000...


Application Information


IPC(8): G10L15/01, G10L15/06, G10L15/16, G10L15/18, G10L15/26

CPC: G10L15/063, G10L15/01, G10L15/26, G10L15/18, G10L15/16

Inventor: 姜博怀

Owner: GUANGZHOU HUADUO NETWORK TECH
