Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and system for speech recognition of short words and sentences based on dtw and gmm

A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem that the Gaussian mixture model GMM cannot be applied to short-term speech recognition, the discrete vector set Gaussian mixture model GMM modeling, and the inability to use DTW distance, etc. problem, to achieve the effect of reducing the amount of calculation, speeding up the real-time performance, and high recognition efficiency

Active Publication Date: 2022-07-12
宁夏理工学院
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] The existing Gaussian mixture model GMM cannot use the DTW distance, so it is impossible to perform Gaussian mixture model GMM modeling on discrete vector sets of different lengths, so the Gaussian mixture model GMM cannot be applied to short word speech recognition
At present, the kNN method based on the DTW algorithm is basically adopted, but the recognition speed of this method is slow and the calculation consumption is large

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for speech recognition of short words and sentences based on dtw and gmm
  • Method and system for speech recognition of short words and sentences based on dtw and gmm
  • Method and system for speech recognition of short words and sentences based on dtw and gmm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0054] The embodiment of the present invention provides a short-word speech recognition method based on DTW and GMM, which enables the Gaussian mixture model GMM to use the DTW distance and directly process data of different lengths in the time domain, with high recognition efficiency and high recognition accuracy. The advantages.

[0055] like figure 1 As shown, a DTW and GMM-based speech recognition method for short words and sentences provided by an embodiment of the present invention includes the following steps.

[0056] Step 100: Acquire the voice and audio data collected at the current stage.

[0057] Step 110: Determine a recognition result corresponding to the voice and audio data collected at the current stage according to the voice and audio data collected at the current stage, the Gaussian mixture model with built-in DTW distance, and the voice and audio database.

[0058] Wherein, the voice and audio database includes n short words and sentences, the Gaussian mi...

Embodiment 2

[0076] In order to achieve the above purpose, the embodiment of the present invention provides a short phrase speech recognition system based on DTW and GMM, such as figure 2 shown, including:

[0077] The data acquisition module 200 is used for acquiring the voice and audio data collected in the current stage.

[0078] The recognition module 210 is configured to determine the recognition result corresponding to the voice and audio data collected at the current stage according to the voice and audio data collected at the current stage, the Gaussian mixture model with built-in DTW distance, and the voice and audio database.

[0079] Wherein, the voice and audio database includes n short words and sentences, the Gaussian mixture model of the built-in DTW distance includes K Gaussian models, and K=n; the Gaussian mixture model of the built-in DTW distance is based on the voice and audio database, DTW algorithm and Gaussian mixture model are constructed.

[0080] Wherein, the i...

Embodiment 3

[0086] The embodiment of the present invention provides a short-word speech recognition method using a Gaussian mixture model in the time domain, and the method has the advantages of high recognition accuracy and high computational efficiency.

[0087] Now, machine learning based on large-scale multi-level neural networks can be applied to the field of speech recognition, and has achieved successful accuracy that traditional hidden Markov models (HMMs) cannot achieve, but only for continuous speech recognition. In the field of short-word and short-sentence speech recognition, the traditional DTW algorithm (Dynamic Time Warping, DTW) still occupies a place, because compared with the deep learning method, the DTW algorithm has the advantages of simplicity, less dependence on data accumulation, and recognition accuracy theory. It can also meet the needs and other characteristics.

[0088] The short-word speech recognition method based on DTW algorithm is briefly described as foll...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for short-word speech recognition based on DTW and GMM, and relates to the technical field of speech recognition. The recognition result corresponding to the voice and audio data collected at the current stage; wherein, the voice and audio database includes n short phrases, the Gaussian mixture model with built-in DTW distance includes K Gaussian models, and K=n; the built-in Gaussian mixture model with DTW distance It is built from the speech audio database, DTW algorithm and Gaussian mixture model. By applying the present invention, the purpose of high recognition efficiency and high recognition accuracy can be achieved.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method and system for speech recognition of short words and sentences based on DTW and GMM. Background technique [0002] The existing Gaussian mixture model GMM cannot use the DTW distance, so it is impossible to model the Gaussian mixture model GMM for discrete vector sets of different lengths, so the Gaussian mixture model GMM cannot be applied to short-word speech recognition. At present, the kNN method based on the DTW algorithm is basically used, but this method has a slow recognition speed and a large computational consumption. SUMMARY OF THE INVENTION [0003] The purpose of the present invention is to provide a method and system for speech recognition of short words and sentences based on DTW and GMM, so as to achieve the purpose of high recognition efficiency and high recognition accuracy. [0004] For achieving the above object, the present invention pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/12G10L15/14G10L15/22
CPCG10L15/12G10L15/14G10L15/22
Inventor 陆成刚王庆月谢涛
Owner 宁夏理工学院