Far-field speech recognition enhancement method for intelligent water dispenser

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of speech recognition and water dispenser, applied in speech analysis, instruments, etc., can solve problems such as poor consistent effect, achieve the effect of improving effect, ensuring reliability, efficient noise and reverberation

Inactive Publication Date: 2021-08-06

SHENZHEN LIGHT LIFE TECH CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The invention provides a far-field speech recognition enhancement method for an intelligent water dispenser to solve the problem in the prior art that the consistent effect of noise and reverberation is not good when processing far-field speech signals

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0061] An embodiment of the present invention provides a far-field speech recognition enhancement method for an intelligent water dispenser, figure 1 It is a flow chart of a far-field speech recognition enhancement method for smart water dispensers in an embodiment of the present invention, please refer to figure 1 , the method includes the following steps:

[0062] Step S101, using a microphone array to acquire multi-channel far-field voice signals;

[0063] Step S102, using a Wiener filter to perform noise reduction preprocessing on the speech signal;

[0064] Step S103, obtaining the variance of the expected speech spectrum and the noise-free reverberation spectrum and the expected speech signal spectrum based on the deep learning of the long-short-term memory network;

[0065] Step S104, determine the coefficient of predictive filter to the variance of the expected speech spectrum of output based on WPE algorithm;

[0066] Step S105, determining the desired speech signa...

Embodiment 2

[0075] On the basis of Embodiment 1, the deep learning based on the long short-term memory network obtains the variance of the desired speech spectrum, including:

[0076] The speech signal after the noise reduction preprocessing is subjected to frame division processing;

[0077] Using the log magnitude spectrum of the current frame and adjacent frames as the input signal of the long short-term memory network;

[0078] Outputting the corresponding ideal masking value through the long short-term memory network;

[0079] determining a noise-free reverberation spectrum and an expected speech signal spectrum according to the ideal masking value;

[0080] A variance of the desired speech spectrum is determined from the desired speech signal spectrum.

[0081] The working principle of the above-mentioned technical solution is: the solution adopted in this embodiment is the process of obtaining the variance of the desired speech spectrum through deep learning based on the long-sho...

Embodiment 3

[0084] On the basis of Embodiment 2, the determination of the noise-free reverberation spectrum and the desired speech signal spectrum according to the ideal masking value includes:

[0085] multiplying the noise-reduction preprocessed speech signal by the ideal mask corresponding to the noise-free reverberation spectrum to obtain the noise-free reverberation spectrum;

[0086] The desired speech signal spectrum is obtained by multiplying the noise-reduced preprocessed speech signal by an ideal mask corresponding to the desired speech signal spectrum.

[0087] The working principle of the above-mentioned technical solution is: the solution adopted in this embodiment is the process of determining the noise-free reverberation spectrum and the expected speech signal spectrum according to the ideal masking value, specifically, the speech signal after the noise reduction preprocessing is multiplied by The ideal mask corresponding to the noise-free reverberation spectrum is obtained...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a far-field speech recognition enhancement method for an intelligent water dispenser, according to the scheme, the method comprises the steps: combining a long-short term memory network with a WPE algorithm, determining the variance of an expected speech spectrum, a noise-free reverberation spectrum, and an expected speech signal spectrum through calculation, and determining a coefficient of a prediction filter for the variance of the output expected speech spectrum based on the WPE algorithm; and in addition, optimizing the coefficient of the prediction filter by adopting a traditional WPE algorithm. By adopting the scheme provided by the invention, on one hand, the calculation time cost is saved by reducing the operand, and on the other hand, the parameter reliability can be ensured. In a word, the noise and reverberation can be efficiently suppressed by combining the long and short term memory network with the WPE algorithm, and the effect of suppressing noise and reverberation is improved.

Description

technical field [0001] The invention relates to the technical field of intelligent water dispensers, in particular to a far-field voice recognition enhancement method for intelligent water dispensers. Background technique [0002] Speech enhancement technology is mainly to remove the interference of noise and reverberation in the speech signal, and improve the performance of the speech recognition system. In the enhancement technology that takes multi-channel speech as input, the noise can be suppressed by using the different characteristics of speech and noise in the time-frequency domain, and the reverberation can be suppressed by using the spatial information of the target speech between channels. In the case of the coexistence of far-field environmental noise and reverberation, most traditional speech enhancement methods that take multi-channel as input cannot provide better noise and reverberation suppression effects, and there is no special treatment for the environmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L21/0208G10L21/0216G10L21/0232G10L25/30

CPCG10L21/0208G10L21/0216G10L21/0232G10L25/30G10L2021/02082G10L2021/02166

Inventor 陈芒

Owner SHENZHEN LIGHT LIFE TECH CO LTD

Far-field speech recognition enhancement method for intelligent water dispenser

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology