Speech feature processing method, device, electronic equipment and storage medium

A speech feature processing technology, applied in speech analysis, speech recognition, instruments, etc. It addresses the problems of degraded speech recognition accuracy and a reduced user experience of speech recognition services, and achieves the effect of removing speech distortion damage and improving accuracy.

Active Publication Date: 2021-07-23
BEIJING CENTURY TAL EDUCATION TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] Speech recognition requires encoding the speech features of the speech to be recognized into deep speech feature coding information, which is then decoded and further processed to produce text. In actual application scenarios, however, the speech to be recognized may contain noise in addition to the speaker's own pure speech; that is, it is noisy speech. The speech features extracted from noisy speech therefore include noise speech features, so the speech feature coding information formed by subsequent processing also contains noisy coding information. This greatly affects the accuracy of speech recognition and reduces the user experience of speech recognition services.

Embodiment Construction

[0030] The technical solutions in the embodiments of the application are described clearly and completely below with reference to the drawings in the embodiments of the application. Obviously, the described embodiments are only some of the embodiments of the application, not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in this application, without creative effort, fall within the scope of protection of this application.

[0031] At present, the speech recognition function is mainly realized by a speech recognition model. To facilitate understanding of speech recognition technology, Figure 1 shows an example structure of a traditional speech recognition model. As shown in Figure 1, the speech recognition model mainly includes an acoustic model and a language model, wherein the acoustic model is used to encode the speech features corresponding to the speech to form deep speech ...

Abstract

Embodiments of the present application provide a speech feature processing method, device, electronic equipment, and storage medium. The method includes: removing noise speech features from the speech features of noisy speech to obtain a pure speech feature estimate; encoding the pure speech feature estimate to obtain first speech feature coding information, and encoding the speech features of the noisy speech to obtain second speech feature coding information; and obtaining, according to the first speech feature coding information and the second speech feature coding information, target speech feature coding information for decoding. The embodiments of the present application can accurately form speech feature coding information for noisy speech, and provide a basis for improving the accuracy of speech recognition.
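The data flow summarized in the abstract can be illustrated with a small NumPy sketch. Everything concrete here is an assumption made for illustration, not the method of the application: the abstract does not say how noise features are removed, what the encoders look like, or how the two encodings are combined. The sketch uses a hypothetical subtraction-based `remove_noise`, a single-layer stand-in `encode`, and a weighted-sum `fuse`, with toy random features in place of real speech features.

```python
import numpy as np

rng = np.random.default_rng(0)

def remove_noise(noisy_feats, noise_est):
    """Hypothetical denoiser: subtract an estimated noise feature, floor at 0."""
    return np.maximum(noisy_feats - noise_est, 0.0)

def encode(feats, weight):
    """Stand-in encoder (assumption): one linear projection followed by tanh."""
    return np.tanh(feats @ weight)

def fuse(first_code, second_code, alpha=0.5):
    """Assumed fusion rule: weighted sum of the two encodings."""
    return alpha * first_code + (1.0 - alpha) * second_code

# Toy features: 20 frames, 40 dimensions per frame (illustrative sizes).
clean = np.abs(rng.normal(size=(20, 40)))
noise = 0.3 * np.abs(rng.normal(size=(20, 40)))
noisy = clean + noise

# Randomly initialized encoder weights; a real system would learn these.
w_clean = rng.normal(scale=0.1, size=(40, 16))
w_noisy = rng.normal(scale=0.1, size=(40, 16))

# Step 1: remove noise features to get a pure speech feature estimate.
clean_est = remove_noise(noisy, noise_est=noise.mean(axis=0))

# Step 2: encode the estimate and the noisy features separately.
first_code = encode(clean_est, w_clean)
second_code = encode(noisy, w_noisy)

# Step 3: combine both encodings into the target encoding for decoding.
target_code = fuse(first_code, second_code)
print(target_code.shape)
```

Keeping the second branch on the raw noisy features means that information discarded or distorted by the denoising step remains available to the fused encoding, which is consistent with the stated effect of removing speech distortion damage.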

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of speech recognition, and in particular to a speech feature processing method, device, electronic equipment, and storage medium.

Background

[0002] Speech recognition is a technology that converts speech into text. It is widely used in human-computer voice interaction, intelligent control, communication, and other scenarios, so improving the accuracy of speech recognition is of great significance.

[0003] Speech recognition requires encoding the speech features of the speech to be recognized into deep speech feature coding information, which is then decoded and further processed to produce text. In actual application scenarios, however, the speech to be recognized may contain noise in addition to the speaker's own pure speech, th...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/20, G10L21/0208
CPC: G10L15/20, G10L21/0208
Inventor: 谷悦, 杨嵩, 王莎
Owner: BEIJING CENTURY TAL EDUCATION TECH CO LTD