Speech feature processing method, device, electronic equipment and storage medium

A speech feature processing technology, applied in speech analysis, speech recognition, and related fields. It addresses the problem that noise lowers speech recognition accuracy and degrades the user experience of speech recognition services, and it aims to remove the damage caused by speech distortion and improve accuracy.

Active Publication Date: 2021-07-23

BEIJING CENTURY TAL EDUCATION TECH CO LTD

7 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0003] Speech recognition encodes the speech features of the speech to be recognized into deep speech feature coding information, then decodes that coding information, with further processing, to convert it to text. In real application scenarios, however, the speech to be recognized may contain noise in addition to the speaker's own pure speech; that is, it is noisy speech. Speech features extracted from noisy speech therefore include noise speech features, and the speech feature coding information formed by subsequent processing is correspondingly noisy. This greatly reduces speech recognition accuracy and degrades the user experience of speech recognition services.

Embodiment Construction

[0030] The technical solutions in the embodiments of this application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of this application. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the scope of protection of this application.

[0031] At present, speech recognition is mainly implemented by a speech recognition model. To aid understanding, Figure 1 shows an example structure of a traditional speech recognition model. As shown in Figure 1, the model mainly includes an acoustic model and a language model; the acoustic model encodes the speech features corresponding to the speech to form deep speech ...

Abstract

Embodiments of the present application provide a speech feature processing method, device, electronic equipment, and storage medium. The method includes: removing noise speech features from the speech features of noisy speech to obtain a pure speech feature estimate; encoding the pure speech feature estimate to obtain first speech feature coding information, and encoding the speech features of the noisy speech to obtain second speech feature coding information; and obtaining, according to the first speech feature coding information and the second speech feature coding information, target speech feature coding information for decoding. The embodiments can form accurate speech feature coding information for noisy speech and provide a basis for improving the accuracy of speech recognition.
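The three steps in the abstract (denoise, encode both feature streams, combine) can be sketched as follows. This is a minimal illustration only, not the patented implementation: the abstract does not say how the noise features are estimated, what the encoder is, or how the two encodings are combined. The sketch assumes a noise feature estimate is supplied by the caller, uses a toy linear-plus-tanh encoder shared by both streams, and combines the encodings with a weighted average. The names `remove_noise`, `encode`, `fuse`, `process`, `weights`, and `alpha` are all hypothetical.

```python
import math


def remove_noise(noisy, noise):
    """Subtract assumed noise feature estimates frame by frame, flooring at 0."""
    return [[max(x - n, 0.0) for x, n in zip(frame, noise_frame)]
            for frame, noise_frame in zip(noisy, noise)]


def encode(features, weights):
    """Toy stand-in encoder: a linear projection of each frame, then tanh."""
    return [[math.tanh(sum(w * x for w, x in zip(row, frame)))
             for row in weights]
            for frame in features]


def fuse(first, second, alpha=0.5):
    """Assumed combination rule: element-wise weighted average."""
    return [[alpha * a + (1.0 - alpha) * b for a, b in zip(fa, fb)]
            for fa, fb in zip(first, second)]


def process(noisy, noise, weights, alpha=0.5):
    """Run the three steps: denoise, encode both streams, combine."""
    pure_estimate = remove_noise(noisy, noise)   # pure speech feature estimate
    first = encode(pure_estimate, weights)       # first coding information
    second = encode(noisy, weights)              # second coding information
    return fuse(first, second, alpha)            # target coding information


# Two frames of two-dimensional features, projected to three dimensions.
noisy = [[1.0, 2.0], [0.5, 0.2]]
noise = [[0.5, 0.5], [1.0, 0.1]]
weights = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
target = process(noisy, noise, weights)
```

Keeping the encoding of the original noisy features alongside the encoding of the denoised estimate means that information the denoising step removes by mistake can still reach the decoder; the weight `alpha` in this sketch simply controls how much each stream contributes.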

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of speech recognition, and in particular to a speech feature processing method, device, electronic equipment, and storage medium.

Background

[0002] Speech recognition is a technology that converts speech into text. It is widely used in human-computer voice interaction, intelligent control, communication, and other scenarios, so improving its accuracy is of great significance.

[0003] Speech recognition encodes the speech features of the speech to be recognized into deep speech feature coding information, then decodes that coding information, with further processing, to convert it to text. In real application scenarios, however, the speech to be recognized may contain noise in addition to the speaker's own pure speech ...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L15/20, G10L21/0208

CPC: G10L15/20, G10L21/0208

Inventor: 谷悦, 杨嵩, 王莎

Owner: BEIJING CENTURY TAL EDUCATION TECH CO LTD
