Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech synthesis related method, speech stream speech change model training method and related device

A training method and technology of speech synthesis, applied in speech synthesis, speech analysis, speech recognition, etc., can solve the problems of difficulty in reducing the mechanical sound of synthesized speech, affecting the user's experience, and changing the speech flow and sound, so as to improve the user experience. Sense, improve naturalness, improve the effect of accuracy

Pending Publication Date: 2022-07-01
MASHANG CONSUMER FINANCE CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, there are strong mechanical sounds in the current synthetic speech, which seriously affects the user's experience. The mechanical sounds are mainly caused by two types of reasons. What should be read together should be read together; the second is the wrong pronunciation of language flow, such as: taking Mandarin Chinese as an example, the first upper tone of continuous reading (for example: "tiger") should be pronounced as Yangping tone (second tone) , but still read the upper tone (three tones)
[0004] Existing speech synthesis methods are difficult to reduce the mechanical sound of synthesized speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis related method, speech stream speech change model training method and related device
  • Speech synthesis related method, speech stream speech change model training method and related device
  • Speech synthesis related method, speech stream speech change model training method and related device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0031] see figure 1 , figure 1 It is a schematic diagram of an embodiment of the training process of the speech flow-to-sound transformation model of the present invention.

[0032] Step S11: Acquire training data, wherein the training data includes text data, pinyin annotation data of the text data, and phonetic transcription flow data of the text data.

[0033] First obtain multiple training data to train the initial model. The training data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech synthesis related method, a speech stream speech change model training method and a related device, and the speech synthesis method comprises the steps: carrying out the pinyin labeling of a to-be-processed text, and obtaining the pinyin labeling information of the to-be-processed text; inputting the to-be-processed text and the pinyin annotation information of the to-be-processed text into a speech stream sound change model to obtain first phonetic symbol stream data of the to-be-processed text; and performing speech synthesis on the to-be-processed text based on the first phonetic symbol stream data. Through the above steps, the speech synthesis method can improve the naturalness of the synthesized speech.

Description

technical field [0001] The present invention relates to the field of speech synthesis, in particular to a speech synthesis related method, a training method of a speech flow-to-speech model and a related device. Background technique [0002] TTS (text to speech, also known as speech synthesis, text-to-speech conversion) refers to the process by which a machine converts language from a text carrier to a sound carrier, and is a key module in systems such as man-machine dialogue and intelligent broadcast. With the maturity of related technologies, the competition of speech synthesis products of major manufacturers gradually focuses on the naturalness of the synthesized speech. [0003] However, there are strong mechanical sounds in the current synthesized speech, which seriously affect the user's experience, and the mechanical sounds are mainly caused by two types of reasons. Consecutive readings should be read consecutively; the second is that the pronunciation of the languag...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L15/06G10L15/26
CPCG10L13/02G10L15/063G10L15/26
Inventor 白安琪蒋宁王洪斌吴海英赵立军
Owner MASHANG CONSUMER FINANCE CO LTD