Rhythm prediction method and device, equipment and medium

A prosody prediction technology applied in the field of data processing. It addresses the reduced accuracy of text prosody prediction and the loss of word-level semantic information, improving accuracy and recall while reducing the amount of annotated training data required.
CN110797005A · Active · Publication Date: 2020-02-14 · BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Publication Date
2020-02-14


Abstract

Embodiments of the invention disclose a prosody (rhythm) prediction method, apparatus, device and medium, and relate to the field of data processing, in particular to speech synthesis technology. The method comprises the following steps: segmenting a mixed Chinese-English text to be predicted to obtain a Chinese text and an English text; determining character vectors for the characters in the Chinese text and word vectors for the words in the English text; and determining a prosody prediction result for the mixed Chinese-English text according to the determined character vectors and word vectors. The method, apparatus, device and medium improve the accuracy of prosody prediction for mixed Chinese-English text.

Description

Technical field

[0001] The embodiments of the present application relate to the field of data processing, and in particular to speech synthesis technology. Specifically, the embodiments provide a prosody prediction method, apparatus, device and medium.

Background

[0002] Before speech synthesis, it is necessary to predict the prosody of the speech text.

[0003] Existing prosody prediction methods apply a pre-trained machine learning model to the text content to be predicted and obtain a pause prediction result for that text. The pause prediction result may include the pause position, the pause type (for example, long pause or short pause), and a probability value corresponding to the pause type.
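As a rough illustration of the pause prediction result described in [0003], the sketch below models its three fields as a small record. The names `PauseType`, `PausePrediction` and `confident_pauses`, the convention that a position is the index of the token after which the pause occurs, and the 0.5 threshold are all assumptions made for illustration; the source does not specify a data layout.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class PauseType(Enum):
    # The source names long and short pauses and implies there may be others.
    SHORT = "short"
    LONG = "long"


@dataclass(frozen=True)
class PausePrediction:
    position: int          # assumed: index of the token the pause follows
    pause_type: PauseType  # kind of pause predicted at this position
    probability: float     # model confidence for this pause type


def confident_pauses(
    predictions: List[PausePrediction], threshold: float = 0.5
) -> List[PausePrediction]:
    """Keep predictions at or above the threshold, ordered by position."""
    kept = [p for p in predictions if p.probability >= threshold]
    return sorted(kept, key=lambda p: p.position)


if __name__ == "__main__":
    preds = [
        PausePrediction(5, PauseType.LONG, 0.9),
        PausePrediction(2, PauseType.SHORT, 0.7),
        PausePrediction(3, PauseType.SHORT, 0.2),
    ]
    for p in confident_pauses(preds):
        print(p.position, p.pause_type.value, p.probability)
```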

[0004] The above scheme has the following defects:

[0005] The text content to be predicted does not distinguish between languages. When the text content includes both Chinese and English, t...
