Pronunciation dictionary construction method, speech recognition method and related device

A construction method and dictionary technology, which are applied in the fields of speech recognition methods, pronunciation dictionary construction methods, equipment and storage media, and devices, can solve problems such as improvement, poor fault tolerance of speech recognition schemes, and unfavorable speech recognition effects, and achieve good recognition effects, Improve speech recognition, improve fault tolerance and robustness

Pending Publication Date: 2022-01-14
IFLYTEK CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing speech recognition schemes have poor fault tolerance. Specifically, only phoneme strings that strictly match the pronunciations in the pronunciation dictionary will be recognized as texts corresponding to the pronunciations in the pronunciation dictionary. Although this can ensure speech recognition Absolute accuracy, but it reduces the robustness of speech recognition, which is not conducive to the improvement of speech recognition effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pronunciation dictionary construction method, speech recognition method and related device
  • Pronunciation dictionary construction method, speech recognition method and related device
  • Pronunciation dictionary construction method, speech recognition method and related device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The technical solutions of the embodiments of the present application are applicable to speech recognition application scenarios. By adopting the technical solutions of the embodiments of the present application, the robustness of speech recognition can be improved, and the effect of speech recognition can be further improved.

[0062] Speech recognition technology, also known as Automatic Speech Recognition (ASR, Automatic Speech Recognition), takes speech as its research object, and its goal is to convert human voice signals into words or instructions. Today, with the rapid development of artificial intelligence, speech recognition technology is the first step for machines to "understand" human language.

[0063] At present, mainstream commercial speech recognition systems are still based on the joint decoding framework of acoustic models and language models. The acoustic model is mainly responsible for the mapping of speech features to phonemes, and the language model...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a pronunciation dictionary construction method, a speech recognition method and a related device, and the method comprises the steps: carrying out the phoneme extraction of target audio data, and obtaining a corresponding phoneme set; wherein the target audio data is audio data covering all phonemes; according to the identification probability of each phoneme in the extracted phoneme set and a phoneme label corresponding to the target audio data, determining similar phonemes of the phonemes in the phoneme label from the phoneme set; and constructing a multi-pronunciation dictionary according to a rule of correspondingly storing the word and the standard pronunciation and the similar pronunciation corresponding to the word; wherein the standard pronunciation is composed of phonemes in the phoneme label, and the similar pronunciation is composed of similar phonemes of the phonemes in the phoneme label. The multi-pronunciation dictionary constructed through the above scheme can improve the fault tolerance and robustness of speech recognition, and further can improve the speech recognition effect.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a pronunciation dictionary construction method, speech recognition method, device, equipment and storage medium. Background technique [0002] At present, mainstream commercial speech recognition systems are still based on the joint decoding framework of acoustic models and language models. The acoustic model is mainly responsible for the mapping of speech features to phonemes, and the language model combines pronunciation dictionaries to convert phoneme strings into corresponding text strings. [0003] In the pronunciation dictionary, the correspondence between the pronunciation constituted by phonemes and the text is recorded. When the pronunciation of the phoneme string recognized by the acoustic model matches a certain pronunciation in the pronunciation dictionary, the text corresponding to the phoneme string can be determined according to the corresponde...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/18G10L15/26
CPCG10L15/02G10L15/08G10L15/063G10L15/1815G10L15/26G10L2015/025G10L2015/088G10L2015/0633
Inventor 方昕刘俊华
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products