Method and apparatus for speech recognition
A speech recognition and phoneme recognition technology, applied in speech recognition, speech analysis, instruments, etc., that addresses the increased difficulty of recognizing the speech of specific types of named entities and achieves the effect of improving recognition accuracy.
Specific embodiment approach
[0065] In a first implementation, the pinyin-based speech recognition is syllable recognition, and the pinyin sequence is a syllable sequence.
[0066] In the first implementation, step 120 specifically performs syllable recognition on the speech of the named entity to be recognized, yielding a syllable sequence as the syllable recognition result for that speech.
[0067] That is, a syllable recognition network formed from the acoustic model and the syllable-based language model performs syllable recognition on the speech of the named entity to be recognized and outputs a syllable sequence as the recognition result. For example, for the speech of the named entity "Zhang San", the syllable recognition network outputs the syllable sequence "zhang san".
[0068] In the second implementation ...
Embodiment approach
[0105] Where the overall edit distance is a weighted average, corresponding to the first or second implementation of step S120, the Chinese character sequence edit distance and the syllable sequence edit distance are weighted, or the Chinese character sequence edit distance and the phoneme sequence edit distance are weighted, each according to its corresponding weight. The resulting weighted average is used as the overall edit distance between each candidate named entity in the specific named entity list and the speech of the named entity to be recognized.
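The weighted combination described in [0105] can be illustrated with a minimal Python sketch for the syllable case. Everything specific here is an assumption made for illustration and is not taken from the source: the function names `edit_distance` and `overall_distance`, the equal weights of 0.5, normalizing by the sum of the weights, the sample recognition results, the candidate list, and the choice of the candidate with the smallest overall distance.

```python
from typing import Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def overall_distance(char_dist: int, syl_dist: int,
                     w_char: float = 0.5, w_syl: float = 0.5) -> float:
    """Weighted average of the two edit distances (weights are assumed)."""
    return (w_char * char_dist + w_syl * syl_dist) / (w_char + w_syl)


# Hypothetical recognition results for one utterance.
recognized_chars = list("张山")            # Chinese character sequence
recognized_syllables = ["zhang", "san"]    # syllable (pinyin) sequence

# Hypothetical named entity list: characters mapped to syllable sequences.
candidates = {
    "张三": ["zhang", "san"],
    "张珊": ["zhang", "shan"],
    "李四": ["li", "si"],
}

scores = {
    name: overall_distance(
        edit_distance(recognized_chars, list(name)),
        edit_distance(recognized_syllables, syllables),
    )
    for name, syllables in candidates.items()
}

# Assumed selection rule: the candidate with the smallest overall distance.
best = min(scores, key=scores.get)
print(best, scores)
```

In this sketch the syllable sequence distance separates two candidates whose Chinese character sequences are equally far from the recognized characters. The phoneme case would follow the same pattern with phoneme tokens in place of syllables.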
[0106] Where the overall edit distance is a weighted average, corresponding to the third implementation of step S120, according to the corresponding weights, including the weight corresponding to the phoneme sequence edit distance and the weight corresponding to the tone seque...


