Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Acoustic model training and constructing method, acoustic model and speech recognition system

A technology of acoustic model and construction method, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as limited training data, influence of statistical information, training data cannot reflect statistical distribution, etc.

Active Publication Date: 2016-05-25
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF10 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in practical applications, it is found that, on the one hand, when used for speech recognition, the state class describing silence usually occupies a large amount of statistics, far exceeding the single state class describing speech, which makes the heteroscedasticity linearity calculated based on statistics Discriminant analysis, which is too biased towards silence, inhibits the distinction of speech parts to a certain extent; on the other hand, due to limited training data, the state distribution of some speech is relatively sparse, and the corresponding training data cannot reflect its true statistical distribution. , which leads to the statistical information in the calculation of heteroscedastic linear discriminant analysis is also affected accordingly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Acoustic model training and constructing method, acoustic model and speech recognition system
  • Acoustic model training and constructing method, acoustic model and speech recognition system
  • Acoustic model training and constructing method, acoustic model and speech recognition system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] Embodiment 1, constructing an acoustic model

[0045] Such as figure 1 As shown, the number of states of the acoustic model is denoted as N. Based on all the training data, the frame number statistics and scatter matrix of each state are counted, where the frame number statistics are recorded as occ(n):

[0046] occ(n) = the total number of frames belonging to state n in the training data

[0047] Based on the statistics of all states and the total number of states N, the average statistics of the state class can be calculated

[0048] occ ( N ) ‾ = Σ n = 1 N occ ( n ) N

[0049] The fram...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an acoustic model training and constructing method, a hidden Markov acoustic model based on the training method, and a speech recognition system. The training method comprises the following steps: (1) calculating the frame statistical number of each class and an intra-class divergence matrix based on training data and a pre-given state cluster; (2) for a non-speech state class in a model, inhibiting and smoothing the statistical number of the state class if the frame statistical number corresponding to the state class is much higher than the average statistical number of state classes; (2) for a speech state class in the model, inhibiting and smoothing the statistical number of the state class if the frame statistical number corresponding to the state class is much lower than the average statistical number of state classes; (4) calculating a heteroscedastic linear discriminant analysis matrix based on the intra-class divergence matrix and the smoothed class statistical number; and (5) using the calculated heteroscedastic linear discriminant analysis matrix in speech characteristic and model dimension reduction, and carrying out iteration again to get a dimension-reduced stable acoustic model. The recognition performance of the acoustic model is improved eventually.

Description

technical field [0001] The invention belongs to the field of speech recognition, and in particular relates to a smoothing method for heteroskedasticity linear discriminant analysis, which can be used for fast dimensionality reduction and decorrelation processing of high-dimensional feature vectors in language recognition. Background technique [0002] In large-vocabulary continuous speech recognition, heteroscedastic linear discriminant analysis (HLDA, Heteroscedastic Linear Discriminant Analysis) improves the recognition performance of the model by removing the correlation between features, which is widely used in acoustic modeling (N.Kumar. Investigation of silicon auditory models and generalization of linear Discriminant analysis for improved speech recognition.PhDthesis, Johns Hopkins University, Baltimore, Maryland, 1997.). The core of its algorithm is to divide the speech into different classes according to the state, and reduce the dimensionality of the original featu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06
Inventor 张晴晴潘接林颜永红
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products