Method and device for checking voice recognition results, voice recognition system and audio monitoring system
A sound recognition technology, applied in speech recognition, speech analysis, instruments, etc. It addresses the problems of recognition results not being taken into account, false alarms in sound recognition, and the difficulty of setting robust thresholds, thereby improving recognition performance and reducing false alarms.
Examples
Example 1
[0053] Figure 2 is a flowchart showing a method for checking a voice recognition result according to the first embodiment of the present invention.
[0054] As shown in Figure 2, in the receiving step 210, the N-best list of the voice recognition results of the current window is received from the voice recognition engine.
[0055] During the voice recognition process, voice signals are input into the voice recognition engine, which generates a series of potential recognition candidate sounds with their corresponding recognition scores. The engine sorts the candidate sounds into an N-best list according to their recognition scores and then outputs the N-best list.
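The way an engine might form the N-best list can be illustrated with a minimal Python sketch. The `Candidate` type, the `n_best` function, the sound labels, and the scores are all hypothetical and are not taken from the patent; the sketch also assumes that a higher recognition score means a better match.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    label: str    # hypothetical sound class name
    score: float  # recognition score; higher is assumed to be better


def n_best(candidates, n):
    """Return the n highest-scoring candidates, best first."""
    return sorted(candidates, key=lambda c: c.score, reverse=True)[:n]


# Illustrative candidates for one analysis window (made-up values).
window_candidates = [
    Candidate("door_slam", -41.7),
    Candidate("gunshot", -35.2),
    Candidate("glass_break", -38.9),
    Candidate("background", -50.3),
]

n_best_list = n_best(window_candidates, n=3)
```

The first entry of `n_best_list` is the engine's top recognition result for the window; the remaining entries are the runner-up candidates that the checking method also takes into account.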
[0056] In the first calculation step 220, a first probability distribution of all candidate sounds in the N-best list of the current window is calculated based on the N-best list of the current window.
[0057] For each candidate sound in...
Example 2
[0096] Figure 5 is a flowchart showing a method for checking a voice recognition result according to the second embodiment of the present invention. As can be seen from Figure 5, the method according to the second embodiment differs from the method according to the first embodiment in that a determination step 540 is added before the third calculation step is performed. The sound recognition process according to the second embodiment is intended to recognize (detect) a target sound (or a plurality of target sounds). If the first candidate sound in the N-best list of the current window (the one with the highest recognition score) is not a target sound, it is not necessary to check this sound recognition result.
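The determination step described above can be sketched as a simple gate. The function name `needs_check`, the representation of the N-best list as `(label, score)` tuples sorted best first, and the example labels are assumptions made for illustration only; the patent does not specify a data format.

```python
def needs_check(n_best_list, target_sounds):
    """Determination step: check the result only if the top candidate is a target sound.

    n_best_list   -- list of (label, score) tuples, assumed sorted best first
    target_sounds -- set of labels the system is meant to detect
    """
    if not n_best_list:
        return False  # nothing was recognized in this window
    top_label, _top_score = n_best_list[0]
    return top_label in target_sounds


# Illustrative usage with made-up labels and scores.
targets = {"gunshot", "glass_break"}
window_a = [("gunshot", -35.2), ("door_slam", -41.7)]
window_b = [("door_slam", -33.0), ("gunshot", -36.1)]

check_a = needs_check(window_a, targets)  # top candidate is a target sound
check_b = needs_check(window_b, targets)  # top candidate is not a target sound
```

Skipping the check when the top candidate is not a target sound avoids the later calculations for windows whose result could not raise an alarm in any case.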
[0097] As shown in Figure 5, in the receiving step 510, the N-best list of the voice recognition results of the current window is received from the voice recognition engine.
[0098] In the first calculation step 520...