Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Prosodic prediction model training method, prosody prediction method and related device

A prediction model and training method technology, applied in the computer field, can solve problems such as limited ability, inaccurate rhythm, and inability to reflect semantic relationships, and achieve the effect of improving the accuracy of division, ensuring data reading and writing, and reducing the difficulty of acquisition.

Active Publication Date: 2021-01-29
BEIJING CENTURY TAL EDUCATION TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] When performing prosodic prediction, it is necessary to extract feature information from text information, including shallow feature information and deep feature information. The shallow feature information includes feature information such as word length, part of speech, and punctuation. , can be intuitively extracted from the text analysis results, but because the language information it covers is relatively superficial, it cannot reflect the semantic relationship between the texts in the sentence, and it is easy to cause the predicted prosody to be inaccurate, and the semantics of the prosodic unit cannot be guaranteed Integrity; for deep feature information, the information covered by each deep feature information is single, resulting in limited ability to use any kind of deep feature information for prosodic prediction. When combining multiple deep feature information, different There will be side effects between deep feature information, which will affect the accuracy of prosody prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prosodic prediction model training method, prosody prediction method and related device
  • Prosodic prediction model training method, prosody prediction method and related device
  • Prosodic prediction model training method, prosody prediction method and related device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In the prior art, when prosody prediction is performed on text, the accuracy is low.

[0038] In order to improve the accuracy of text prosody prediction, an embodiment of the present invention provides a prosody prediction model training method, including:

[0039] Determining the current text unit and the previous text unit, wherein the current text unit is a text unit of the current training text, and the previous text unit is arranged according to the position of each text unit of the current training text, adjacent to and located in the current text unit The text unit before the text unit, and the text initial unit used to represent the beginning of the current training text, each of the text units is marked with a reference rhythm;

[0040] Obtaining the training current text prediction vector of the current text unit, obtaining the previous prosody prediction vector of the previous text unit, and fusing the training current text prediction vector and the previous...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An embodiment of the present invention provides a prosody prediction model training method, a prosody prediction method and related devices. The training method includes: using the prosody prediction model to be trained to determine the current text unit and the previous text unit; obtaining the training current text unit of the current text unit The prediction vector is to obtain the previous prosodic prediction vector of the previous text unit, and fuse the two to obtain the training prosody fusion prediction vector; obtain the first training prediction prosodic vector according to the training current text prediction vector, and obtain the second training prosody fusion prediction vector according to the training prosody fusion prediction vector 2. Training and predicting prosodic vectors: obtaining the prediction loss of the current text unit according to the first training predicting prosody vector, the second training predicting prosody vector and the current reference prosody vector, adjusting the parameters of the model, and obtaining the trained prosody prediction model. The prosody prediction model training method, prosody prediction method and related devices provided in the embodiments of the present invention can improve the accuracy of prosody prediction.

Description

technical field [0001] The embodiments of the present invention relate to the field of computers, and in particular, to a prosody prediction model training method, a prosody prediction method and related devices. Background technique [0002] With the development of computer technology and deep learning technology, speech synthesis technology has become an important research direction and has been widely used, such as: voice broadcast, voice navigation and smart speakers. In speech synthesis, it is necessary to predict the prosody of the text, and the quality of the prosody prediction directly affects the naturalness of the synthesized speech and the accuracy of meaning expression at the semantic level of the text. [0003] When performing prosodic prediction, it is necessary to extract feature information from text information, including shallow feature information and deep feature information. The shallow feature information includes feature information such as word length...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/10G06F40/289G06F40/216
CPCG10L13/10G06F40/216G06F40/289
Inventor 李成飞袁军峰杨嵩
Owner BEIJING CENTURY TAL EDUCATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products