Speech signal dynamic characteristic extraction method based on formant curves

A speech signal dynamic feature technology, applied in speech analysis, speech recognition, instruments, etc. It addresses the problems of performance degradation, poor stability and discriminability, and the inability to fully mine dynamic information, thereby improving recognition performance.

Inactive Publication Date: 2016-10-12

BOHAI UNIV



Problems solved by technology

As recognition technology developed, researchers found that time-domain feature parameters are neither stable nor discriminative enough, so they began to use frequency-domain parameters as speech features, such as pitch period, formant frequency, linear prediction coefficients (LPC), line spectrum pairs (LSP), and cepstral coefficients. The most widely used feature parameter is the Mel-frequency cepstral coefficient (MFCC), which is based on a model of human hearing. Once these parameters are applied in a noisy environment, however, their performance drops sharply.

[0005] Moreover, the feature parameters mentioned above all reflect the static characteristics of speech. The dynamic characteristics of a speech signal are feature parameters extracted from several consecutive frames of speech. Acceleration parameters cannot fully mine this dynamic information, so they do not reflect the dynamic characteristics of the speech signal well.
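For context, difference-based (delta and acceleration) parameters are a conventional way to derive dynamic features from static per-frame features. The sketch below uses the common regression formula for a delta; the function name `delta`, the window `width`, and the edge handling are assumptions for illustration and are not taken from the patent text.

```python
def delta(frames, width=2):
    """Regression-based delta over a sequence of per-frame feature vectors.

    d_t = sum_{n=1..width} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..width} n^2)
    Indices are clamped at the edges (first/last frame repeated); this edge
    policy is an assumption, other choices are possible.
    """
    num_frames = len(frames)
    dim = len(frames[0])
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(num_frames):
        row = []
        for d in range(dim):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, num_frames - 1)][d]
                behind = frames[max(t - n, 0)][d]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


# Hypothetical static features: two coefficients that grow linearly per frame.
static = [[float(t), 2.0 * t] for t in range(6)]
velocity = delta(static)        # first-order (delta) parameters
acceleration = delta(velocity)  # second-order (acceleration) parameters
```

Each delta value only sees a window of `2 * width + 1` frames, which illustrates why such parameters capture local change rather than the shape of a whole trajectory.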



Embodiment Construction

[0042] An embodiment of the present invention is further described below with reference to the accompanying drawings.

[0043] A method for extracting dynamic features of speech signals based on formant curves. The flow chart of the method is shown in Figure 1, and the method includes the following steps:

[0044] Step 1, collecting voice signals;

[0045] In this embodiment, speech data is input through a microphone, then sampled at 11.025 kHz and quantized with 16-bit precision by a processing unit such as a computer, a single-chip microcomputer, or a DSP chip, yielding the corresponding speech signal. This embodiment uses a computer as the processing unit.
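As a rough illustration of the sampling and quantization described in this step, the sketch below samples a synthetic tone at 11.025 kHz and quantizes it to 16-bit signed integers. The function name, the 200 Hz test tone, and the use of a Python callable in place of a microphone are all assumptions; only the sampling rate and bit depth come from the embodiment.

```python
import math

SAMPLE_RATE_HZ = 11025  # sampling frequency stated in the embodiment
BITS = 16               # quantization precision stated in the embodiment


def sample_and_quantize(signal, duration_s, sample_rate=SAMPLE_RATE_HZ, bits=BITS):
    """Sample a continuous-time function and quantize it to signed integers.

    `signal` maps time in seconds to an amplitude expected in [-1, 1].
    """
    max_code = 2 ** (bits - 1) - 1   # 32767 for 16 bits
    min_code = -(2 ** (bits - 1))    # -32768 for 16 bits
    num_samples = int(round(duration_s * sample_rate))
    codes = []
    for n in range(num_samples):
        x = signal(n / sample_rate)
        code = int(round(x * max_code))
        codes.append(max(min_code, min(max_code, code)))  # clip to the valid range
    return codes


def tone(t):
    """Hypothetical stand-in for microphone input: a 200 Hz tone at half scale."""
    return 0.5 * math.sin(2 * math.pi * 200 * t)


pcm = sample_and_quantize(tone, duration_s=0.2)
```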

[0046] Step 2, preprocessing the speech signal, including pre-emphasis, framing and windowing, and endpoint detection;

[0047] In this embodiment, pre-emphasis is realized by a first-order digital pre-emphasis fi...
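The preprocessing in Step 2 can be sketched as follows. The source text is truncated before it gives any parameters, so the pre-emphasis coefficient (0.97), the Hamming window, the frame length, and the hop size below are illustrative assumptions, not values from the patent. Endpoint detection is omitted because the visible text does not describe how it is done.

```python
import math


def pre_emphasize(samples, alpha=0.97):
    """First-order pre-emphasis: y[n] = x[n] - alpha * x[n-1].

    alpha=0.97 is a commonly used value, assumed here.
    """
    if not samples:
        return []
    out = [float(samples[0])]
    for n in range(1, len(samples)):
        out.append(samples[n] - alpha * samples[n - 1])
    return out


def hamming(length):
    """Symmetric Hamming window (the window type is an assumption)."""
    if length == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (length - 1))
            for n in range(length)]


def frame_and_window(samples, frame_len, hop):
    """Split the signal into overlapping frames and apply the window to each."""
    window = hamming(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frames.append([samples[start + i] * window[i] for i in range(frame_len)])
    return frames


# Hypothetical input: a constant signal, so the effect of each stage is easy to see.
signal = [1.0] * 1000
emphasized = pre_emphasize(signal)
frames = frame_and_window(emphasized, frame_len=256, hop=128)
```

Trailing samples that do not fill a whole frame are dropped in this sketch; padding the last frame is an equally reasonable choice.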


Abstract

The invention provides a method for extracting dynamic features of speech signals based on formant curves, belonging to the technical field of dynamic feature extraction for Chinese speech signals. The method comprises the following steps: acquiring speech signals; preprocessing the speech signals; extracting the formant frequency features of the speech signals; combining, in order from the first frame to the last frame, the first formant frequency values of all frames of the preprocessed speech signal to obtain a first formant curve, and then obtaining a second, a third, and a fourth formant curve in the same manner; applying a fast Fourier transform to each formant curve to obtain a linear spectrum; obtaining an energy spectrum from the linear spectrum; obtaining the logarithmic energy from the energy spectrum; and applying a discrete cosine transform to the logarithmic energy. Compared with existing methods, the extracted dynamic features carry temporal correlation, so they reveal the close relationship between earlier and later parts of the speech signal and between adjacent speech segments, which improves speech recognition performance.
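The chain of transforms listed in the abstract (formant curve, Fourier transform, energy spectrum, logarithmic energy, discrete cosine transform) can be sketched as below. This is a minimal illustration under stated assumptions, not the patented implementation: the per-frame formant values are made up, a naive DFT stands in for the FFT (it yields the same values), the DCT is an unnormalized DCT-II, and the number of retained coefficients and the log floor are arbitrary choices.

```python
import cmath
import math


def formant_curves(frame_formants):
    """Turn per-frame [F1, F2, F3, F4] lists into four curves over time."""
    return [list(curve) for curve in zip(*frame_formants)]


def dft(x):
    """Naive O(N^2) DFT; an FFT computes the same linear spectrum faster."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def dct2(x):
    """Unnormalized DCT-II (the DCT variant is an assumption)."""
    n = len(x)
    return [sum(x[t] * math.cos(math.pi * k * (t + 0.5) / n) for t in range(n))
            for k in range(n)]


def curve_features(curve, num_coeffs=4, floor=1e-10):
    """Formant curve -> linear spectrum -> energy -> log energy -> DCT."""
    spectrum = dft(curve)
    energy = [abs(c) ** 2 for c in spectrum]
    log_energy = [math.log(e + floor) for e in energy]  # floor avoids log(0)
    return dct2(log_energy)[:num_coeffs]


# Hypothetical formant tracks (Hz) for 8 frames; real values would come from
# the formant extraction step, which is not shown here.
frames = [[700 + 10 * t, 1200 - 5 * t, 2500.0, 3500 + 2 * t] for t in range(8)]
curves = formant_curves(frames)
features = [curve_features(c) for c in curves]
```

Because each curve spans every frame of the utterance, the resulting coefficients describe how a formant moves over time rather than its value in any single frame.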

Description

Technical field

[0001] The invention belongs to the technical field of dynamic feature extraction for Chinese speech signals, and in particular relates to a method for extracting dynamic features of speech signals based on formant curves.

Background technique

[0002] Speech recognition research in China started in the 1950s, but it did not develop rapidly until the 1970s. The Chinese Academy of Sciences, Tsinghua University, Peking University, and many other research institutes are engaged in developing Chinese speech recognition systems. At present, research on large-vocabulary continuous speech recognition systems is close to the most advanced level abroad; "In the plan, the research on Chinese speech recognition has received strong support. The National 863 "Intelligent Computer Topics" expert group has established a dedicated project for speech recognition research. At the same time, due to China's growing international status and its important position...


Application Information


IPC(8): G10L25/15, G10L25/24, G10L15/02, G10L15/04, G10L25/18, G10L25/21

Inventor 韩志艳, 王健, 王东, 周建壮, 郭继宁, 刘继行, 曹丽

Owner BOHAI UNIV
