Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for improving rejection capability of speech recognition system

A technology of speech recognition and ability, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as high algorithm complexity, difficulty in successful application, poor universality, etc., achieve uncomplicated calculation, improve recognition rejection effect, The effect of improving robustness

Active Publication Date: 2013-05-01
讯飞医疗科技股份有限公司
View PDF6 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

By defining an auxiliary space and performing decoding on it to obtain an effective competition path, either the effectiveness of the competition path is very dependent on the recognition grammar itself, and the universality is poor; Taking into account important knowledge such as time series information and language models when calculating the path, the effective competitive path can be obtained more accurately, but the algorithm complexity is high, and it is difficult to successfully apply it in a speech recognition system that requires a high real-time rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for improving rejection capability of speech recognition system
  • Method for improving rejection capability of speech recognition system
  • Method for improving rejection capability of speech recognition system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Such as figure 2 As shown, the present invention may improve the ability to reject invalid input such as out-of-set words, background voices, and other noises. The specific process is as follows:

[0030] (1) Collect a variety of noise data; then classify according to the type of noise, including background noise, background music, door closing and coughing; then train Gaussian mixture models (GMM) for different types of noise; finally combine All kinds of GMM models are the overall absorption model; the Gaussian mixture model GMM (Gaussian mixture model) is an extension of a single Gaussian density function, which can smoothly approximate the density distribution of any shape, which is why the GMM model is often used in the field of speech recognition. one of

[0031] (2) A statistical language model is trained through a variety of relatively random texts, and then a recognition network is constructed through weighted finite state machine (WFST) technology, which is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for improving rejection capability of a speech recognition system. The method comprises the following steps of collecting various types of noise data; classifying according to the noise types; for different types of noise, respectively training GMMs (Gauss mixed model); assembling various types of GMMs into an integral absorption model; training a statistic language model by various types of relatively random texts, and then establishing a recognition network by WFST (weighted finite state transducer) technique, which is called as an absorption network; connecting the absorption network, the absorption model and an original decoding network in parallel to form a new decoding network; enabling the input original audio frequency to pass endpoint detection and a feature extraction module, so as to generate feature vectors; and competing the feature vectors in the three parts of the decoding network according to an Viterbi algorithm, so as to generate a final recognition result, and effectively reject the noise and an out-of-vocabulary condition. The method has the advantage that on the premise of balancing the recognition efficiency, the effect of rejecting the out-of-vocabulary condition and the invalid input is well realized.

Description

technical field [0001] The invention relates to a method for improving recognition rejection ability in a speech recognition system, which is used in the technical field of command word recognition in the speech recognition system. Background technique [0002] The command word recognition system is a very important category in the current speech recognition system, which is widely used in navigation products of home appliances, vehicles, smart phones and call centers. The task of the command word recognition system is to find the most similar recognition result of the input speech within the scope of recognition grammar. Compared with the limited recognition grammar, the input speech is unlimited. When the actual content of the input speech is not within the recognition grammar range, the input is called an out-of-set word. In addition to out-of-collection words, there will be other invalid inputs such as background speech and noise. After these invalid inputs are sent to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/14G10L15/30G10L15/06
Inventor 鹿晓亮赵志伟陈旭尚丽吴晓如于振华
Owner 讯飞医疗科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products