Method and device for determining confidence of voice recognition result

A speech recognition and confidence technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as the influence of the accuracy of the recognition results, and the confidence that the recognition results cannot reflect the real situation.

Active Publication Date: 2014-05-21
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in Chinese, a word can be composed of two other words, so-called compound words, corresponding to this type of words, just as "Chinese people" is composed of the words "China" and "people", the confidence of the existing speech recognition results The degree determination method ignores the constituent factors of compound words, so that the confidence degree of the recognition result cannot reflect the real situation. Since the confidence degree of the recognition result may have an impact on the subsequent adaptive adjustment process of the acoustic model and language model, it will also have an impact on the accuracy of the recognition results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for determining confidence of voice recognition result
  • Method and device for determining confidence of voice recognition result
  • Method and device for determining confidence of voice recognition result

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0046] figure 2 The flow chart of the method provided by Embodiment 1 of the present invention, such as figure 2 As shown, the method may specifically include the following steps:

[0047] Step 201: Determine the confidence level of each arc in the decoded word graph.

[0048] The confidence of each arc in this step is equal to the value obtained by dividing the sum of the scores of all paths passing through the arc by the sum of the scores of all paths in the word graph, where the path score is the sum of the acoustic score and language score of the path .

[0049] still with figure 1 Take the word map shown in as an example, assuming that the path "Renmin University" has a score of 5, the path "China"-"People" has a score of 3, and the path "China"-"People" has a score of 2, then we can get:

[0050] The confidence of the arc "University of the People" is

[0051] The confidence of the arc "China" is

[0052] The confidence of the arc "people" is

[0053] The ...

Embodiment 2

[0075] Figure 4 The structure diagram of the device for determining the confidence level of the speech recognition result provided by Embodiment 2 of the present invention, as shown in Figure 4 As shown, the apparatus may include: an initial determination unit 400 , a set determination unit 410 and a confidence degree determination unit 420 .

[0076] First, the initial determination unit 400 determines the confidence of each arc in the decoded word graph, and determines the optimal path in the word graph. Specifically, the confidence of each arc is equal to the value obtained by dividing the sum of the scores of all paths passing through the arc by the sum of the scores of all paths in the word graph, where the path score is the sum of the acoustic score and language score of the path . The optimal path in the word graph is the path with the highest score among all paths.

[0077] Then the set determination unit 410 for each arc A on the optimal path i , and determine t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a device for determining the confidence of a voice recognition result, wherein the method comprises the following steps: determining the confidence of each arc in a word graph obtained by decoding, and determining an optimal path in the word graph; determining an arc assembly T competitive to an arc Ai in the word graph for each arc Ai on the optimal path; determining an arc Aj from the arc assembly T competitive to an arc Ai when the confidence of a word expressed by the arc Ai, wherein the arc Aj and the arc Ai represent the same words, or the arc Aj and the arc assembly connected with the arc Aj are formed into a word expressed as same as the arc Ai; determining the confidence of the word represented by the arc Ai according to the confidence of the arc Ai and the confidence of the arc Aj or further according to the confidence of the arc connected with the arc Aj. When the method and the device are used for determining the confidence of the voice recognition result, the component factors of a compound word are considered, so that the confidence can further reflect the real state accurately.

Description

【Technical field】 [0001] The invention relates to the field of speech recognition in computer application technology, in particular to a method and device for determining the confidence level of speech recognition results. 【Background technique】 [0002] In speech recognition, the confidence level is used to indicate the possibility of the recognition result being correct. The larger the value, the higher the possibility of the recognition result being the correct result. It is an important basis for speech recognition. The method for determining the confidence level of the speech recognition result is straightforward. affect the accuracy of speech recognition. [0003] The confidence degree determination of the speech recognition result is mainly obtained by processing the word map (Aattice) generated by decoding. Word graph is a more commonly used expression form of speech recognition results in recent years. It represents multiple decoded candidate results on a directed ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06
Inventor 李新辉
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products