Isolated word speech recognition method based on HRSF and improved DTW algorithm

A sound recognition and algorithm technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of increased time spent, reduced recognition rate and recognition speed, etc.

Inactive Publication Date: 2013-03-20
SOUTH CHINA NORMAL UNIVERSITY +1
View PDF0 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this way, as the number of models increases, the time spent on one recognition will rise linearly, resulting in a significant reduction in the recognition rate and recognition speed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Isolated word speech recognition method based on HRSF and improved DTW algorithm
  • Isolated word speech recognition method based on HRSF and improved DTW algorithm
  • Isolated word speech recognition method based on HRSF and improved DTW algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0089] The above technical solutions of the present invention are clear to those skilled in the art. In order to facilitate the examiner’s understanding, the specific implementation of the present invention will be further described below in conjunction with the accompanying drawings and examples, but the implementation and protection scope of the present invention are not limited thereto. Those who are not particularly described in the present invention are those skilled in the art.

[0090] Such as figure 1 , an isolated word speech recognition method based on HRSF and improved DTW algorithm, the main process of this method is as follows:

[0091] (1) Digitization and preprocessing of voice signals: The input analog voice signals must first be preprocessed, including pre-filtering, sampling and quantization, windowing, pre-emphasis, endpoint detection, etc.;

[0092] (2) Parameter extraction of the voice signal: After the voice signal is preprocessed, the next very importan...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an isolated word speech recognition method based on an HRSF (Half Raised Sine Function) and an improved DTW (Dynamic Time Warping) algorithm. The isolated word speech recognition method comprises the following steps that (1), a received analog voice signal is preprocessed; preprocessing comprises pre-filtering, sampling, quantification, pre-emphasis, windowing, short-time energy analysis, short-time average zero crossing rate analysis and end-point detection; (2), a power spectrum X(n) of a frame signal is obtained by FFT (Fast Fourier Transform) and is converted into a power spectrum under a Mel frequency; an MFCC (Mel Frequency Cepstrum Coefficient) parameter is calculated; the calculated MFCC parameter is subjected to HRSF cepstrum raising after a first order difference and a second order difference are calculated; and (3), the improved DTW algorithm is adopted to match test templates with reference templates; and the reference template with the maximum matching score serves as an identification result. According to the isolated word speech recognition method, the identification of a single Chinese character is achieved through the improved DTW algorithm, and the identification rate and the identification speed of the single Chinese character are increased.

Description

technical field [0001] The present invention relates to the application field of speech recognition, in particular to a method for identifying isolated words based on Half Raised-Sine function (HRSF) and improved Dynamic Time Warping (DTW) algorithm. Background technique [0002] In the field of speech recognition, there are generally three speech recognition methods: methods based on vocal tract models and speech knowledge, template matching methods, and methods using artificial neural networks. [0003] 1. Methods based on phonetics and acoustics. The methods based on phonetics and acoustics started earlier, and there have been researches in this area since the speech recognition technology was proposed. However, due to the complexity of its model and speech knowledge, it has not reached the practical stage at this stage. [0004] It is generally believed that there are a finite number of different speech primitives in common languages, and they can be distinguished by th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/12
Inventor 胡晓晖李玉婷彭宏利薛云蔡倩华黄海东曾广祥
Owner SOUTH CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products