Unlock instant, AI-driven research and patent intelligence for your innovation.

Method, device, electronic device and storage medium for predicting pronunciation of words

A technology for words and prediction models, applied in speech analysis, speech recognition, instruments, etc., can solve the problems such as the inability to guarantee the prediction efficiency of word pronunciation prediction models and the limited operation speed of LSTM neurons, so as to improve the operation speed and improve the The effect of predicting efficiency

Active Publication Date: 2022-04-19
BEIJING SINOVOICE TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, some existing word pronunciation prediction models are obtained based on conventional cyclic neural network training, for example: the hidden layer in the model uses conventional LSTM neurons for abstract learning, when it is necessary to predict the pronunciation of a large number of new words When , the operation speed of the LSTM neuron is limited, and the prediction efficiency of the entire word pronunciation prediction model cannot be guaranteed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, electronic device and storage medium for predicting pronunciation of words
  • Method, device, electronic device and storage medium for predicting pronunciation of words
  • Method, device, electronic device and storage medium for predicting pronunciation of words

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] The following will be combined with the drawings in the embodiment of the present application, the technical solution in the embodiment of the present application is clearly and completely described, obviously, the embodiment described is part of the embodiment of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without the premise of creative labor are within the scope of the present application.

[0085] In order to facilitate the understanding of the advantages of the method provided in this application, the following first word pronunciation prediction model in the relevant technique is briefly described. In related techniques, conventional LSTM neurons are used in the hidden layer of the word pronunciation prediction model. Figure 1 It is a schematic diagram of the internal structure of an LSTM neuron in a related technique.

[0086] Reference ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a method, device, electronic equipment and storage medium for predicting the pronunciation of a word. When predicting the pronunciation of a target word, first obtain each character vector of the target word, and then input the word pronunciation prediction model in turn to obtain the target word Predict pronunciation. Among them, the word pronunciation prediction model is obtained by using the corresponding relationship between multiple words and pronunciation factor sequences to train the deep neural network model in advance. The deep neural network model includes: an input layer, a recurrent neural network with at least two hidden layers, and output layer. This application improves the word pronunciation prediction model in the related art. Specifically, an improved LSTM neuron is used in the hidden layer. Compared with the conventional LSTM neuron, the improved LSTM neuron performs the input word vector The calculation speed can be significantly improved during abstract learning, therefore, when the pronunciation of words needs to be predicted, the method provided by this application can significantly improve the prediction efficiency.

Description

Technical field [0001] The present invention relates to the field of language processing technology, specifically to a method for predicting the pronunciation of words, apparatus, electronic devices and storage media. Background [0002] Nowadays, speech recognition technology has been widely used in people's daily lives, bringing great convenience to people's lives. Usually, within the system where speech recognition technology is applied, a speech pronunciation dictionary is pre-established, which includes: the correspondence between each word and the corresponding sequence of pronunciation factors. [0003] However, the phonetic pronunciation dictionary has limited coverage of words, and with the emergence of some emerging words, if the pronunciation of these words is not added to the phonetic pronunciation dictionary in time, the accuracy of speech recognition results will be greatly reduced. Related techniques are to predict the pronunciation of the word by entering a new w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/16G10L15/197G06N3/063G06N3/04
CPCG10L15/063G10L15/16G10L15/197G06N3/063G06N3/045
Inventor 邢启洲李健张连毅武卫东
Owner BEIJING SINOVOICE TECH CO LTD