Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice recognition method and system

A speech recognition and speech signal technology, which is applied in speech recognition, speech analysis, instruments, etc., can solve problems such as unsatisfactory performance, great differences in different noise effects, and performance degradation of speech recognition systems, so as to suppress noise and improve Accuracy and anti-noise ability, the effect of suppressing noise interference

Active Publication Date: 2016-08-10
ALIBABA GRP HLDG LTD
View PDF5 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In recent years, with the development of signal processing and machine learning, speech recognition research has achieved great success, including Gaussian mixture model (Gaussion mixture model, referred to as GMM), hidden Markov model (Hidden markov model, referred to as Methods including HMM) and deep neural networks have achieved high recognition accuracy, but the performance in noisy environments is still not satisfactory, and the effects of existing algorithms for different noises vary greatly.
[0006] Therefore, it is necessary to solve the problem of performance degradation of the existing speech recognition system in a noisy environment, in order to improve the applicability and practicability of the speech recognition system, and try to approach and achieve the ability of human ear speech perception

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition method and system
  • Voice recognition method and system
  • Voice recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In the following description, many technical details are proposed in order to enable readers to better understand the application. However, those skilled in the art can understand that without these technical details and various changes and modifications based on the following implementation modes, the technical solution claimed in each claim of the present application can be realized.

[0035] In order to make the purpose, technical solution and advantages of the present invention clearer, the following will further describe the implementation of the present invention in detail in conjunction with the accompanying drawings.

[0036] The first embodiment of the present invention relates to a speech recognition method, figure 1 is a flow diagram of the speech recognition method. Specifically, as figure 1 As shown, the speech recognition method includes the following steps:

[0037] Step 101, acquire N voice signals, where N is an integer greater than 1.

[0038] Wher...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of voice recognition, and discloses a voice recognition method and system. The method comprises the following steps: carrying out the spectrum analysis of the obtained N voice signals, and obtaining multi-dimensional N preliminary frequency spectrum characteristic parameters, inputting the parameters into N samples of a pre-trained auditory perception model based on a deep neural network for characteristic transformation, and obtaining N refined auditory perception characteristics; enabling the combination of the N refined auditory perception characteristics to be inputted into a pre-trained acoustic classification model, and coding the output so as to recognize a text content corresponding to a voice signal. According to the invention, the method carries out the frequency spectrum analysis and the characteristic transformation of the obtained multipath voice signals, and achieves the supplementary effect for the auditory perception. The extracted auditory perception characteristics are more suitable for the auditory perception of human ears, and the method can improve the voice recognition accuracy and anti-noise capability.

Description

technical field [0001] The invention relates to the field of pattern recognition, in particular to the technical field of speech recognition. Background technique [0002] Speech is the acoustic performance of language, the most natural, effective and convenient means for human to exchange information, and also a kind of support for human thinking. In the era of mobile Internet, speech recognition is one of the very important human-computer interaction technologies. In today's information society and fast-paced life, using signal processing and pattern recognition technology makes it possible to use machines for automatic speech recognition, which is important for Improving production efficiency and quality of life is of great significance. Automatic speech recognition has a wide range of applications. It can turn handwritten documents into automatic dictation operations, use voice to remotely control home appliances, use voice to search for events of interest on the Intern...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L21/0208G10L21/16
Inventor 李宏言
Owner ALIBABA GRP HLDG LTD