A method and system for intelligent speech recognition based on deep learning

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
An intelligent speech and deep learning technology, applied in speech recognition, neural learning methods, speech analysis, etc., can solve problems such as speaking style differences, speech loss, low intelligibility, etc., to eliminate noise, reduce speech distortion, computing small amount of effect

Active Publication Date: 2022-06-17

凯新创达(深圳)科技发展有限公司

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Compared with traditional speech, the speech enhancement algorithm based on DNN (Deep Neural Network) can achieve great performance improvement, especially in the case of dealing with non-stationary noise. However, the supervised speech enhancement algorithm based on DNN is in practice In the face of real noise scenes, differences in speaking styles, and low signal-to-noise ratio (Signal-to-Noise Ratio), there are generalization problems, such as voice loss, low intelligibility, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0056] The present invention will be further described below through specific embodiments.

[0057] The invention proposes an intelligent speech recognition method based on deep learning, which can eliminate noise while retaining necessary target speech, improve the robustness of speech enhancement for various complex environments, and has a small amount of computation.

[0058] like figure 1 , is a flowchart of a deep learning-based intelligent speech recognition method provided by an embodiment of the present invention, which specifically includes:

[0059] S101: obtain voice information;

[0060] Use microphones and other pickup devices to obtain voice information;

[0061] S102: Perform noise elimination on the acquired voice information using a fused noise elimination model to obtain denoised voice information, where the fused noise elimination model is obtained by merging two noise elimination models in combination with a voice endpoint detection algorithm;

[0062] T...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention proposes an intelligent speech recognition method based on deep learning. Firstly, the speech information is acquired; the noise elimination is performed on the acquired speech information by using a fusion noise elimination model, and the noise elimination is obtained. The fusion noise elimination The model is obtained by combining two noise elimination models with the voice endpoint detection algorithm; the voice information after noise elimination is input into the staged learning enhancement network structure, and the enhanced voice information is obtained; the staged learning enhancement network structure includes multiple The target layer, the target layer adopts a linear activation function, and the hidden layer is an LSTM-RNN network; the enhanced voice information is input into the voice model for voice recognition; the method provided by the invention can eliminate noise while retaining necessary Target voice, improve the robustness of voice enhancement in various complex environments, with a small amount of calculation.

Description

technical field [0001] The field of speech recognition of the present invention particularly refers to an intelligent speech recognition method and system based on deep learning. Background technique [0002] In recent years, the artificial intelligence boom triggered by deep learning is affecting and changing people's lifestyles. People are no longer satisfied with the human-computer interaction of a single text and instruction, but look forward to the more convenient voice interaction. Fast way to communicate. Voice has become an indispensable information medium. However, in the actual transmission process of speech, background noise and human voice interference will have a certain impact on speech, which will reduce the quality and intelligibility of speech, and also bring challenges to subsequent applications, such as speech recognition and speaker recognition. Wait. In a complex application environment, as the front-end interface of voice applications, voice signal p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/16G10L15/06G10L21/0208G10L25/87G06N3/04G06N3/08

Inventor任国斌

Owner凯新创达(深圳)科技发展有限公司

A method and system for intelligent speech recognition based on deep learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology