Voice synthesis method based on voice vector textual characteristics

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech synthesis and vector technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of inaccurate description of text features, too smooth spectrum parameter trajectory and fundamental frequency trajectory, flat listening of synthesized speech, etc., to reduce complexity The degree of human involvement, the description is simple and direct, and the effect of improving accuracy

Active Publication Date: 2016-06-08

中科极限元(杭州)智能科技股份有限公司

View PDF6 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] 1. The description of text features is not precise enough, requiring a lot of manpower and material resources to label the text, and a large part of the labeling results depends on the experience and background knowledge of the labelers, which requires professionals to complete, which greatly affects the construction of the system speed;

[0004] 2. There is still a big gap between the sound quality of synthetic speech and real people, especially the speech analysis and synthesis model that uses parametric description of speech and statistical modeling is not ideal, and the modeling of speech is not accurate enough; in addition, the use of statistical parameter modeling , the generated spectral parameter trajectory and fundamental frequency trajectory are too smooth, and the synthetic speech is too flat in the sense of hearing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0032] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0033] Such as figure 1 As shown, the present invention includes the following modules:

[0034] Text analysis module 1, text parameterization module 2, sound vector training module 3, language parameter training module 4, sound vector generation module 5, language parameter prediction module 6, speech synthesizer module 7;

[0035] Concrete implementation steps of the present invention are as follows:

[0036] The text analysis module 1 receives the input text to be analyzed, regularizes the text features, removes redundant symbols in the text, marks the consonants and tones of each syllable, corrects the pronunciation errors of polyphonic characters, and obtains the pronunciation unit corresponding to the input text sequence;

[0037] Text parameterization module 2 receives the pronunciation unit sequence corresponding to the abo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice synthesis method based on voice vector textual characteristics. The voice synthesis method comprises the following steps: receiving an input text by a text analyzing module; carrying out regular processing on the textual characteristics and transmitting obtained text data to a text parameterization module; obtaining a parameterized text by adopting a single-bit heat code encoding method; receiving the parameterized text by a voice vector training module, and training a linguistic model based on voice vectors; then transmitting to a linguistic parameter training model to train a mapping model from the text to voice parameters; receiving the output text of the text parameterization module and the voice vector training module through a voice vector generation module, so as to generate the voice vectors of the text data; and transmitting the voice vectors of the text data and the mapping model from the text to the voice parameters to a linguistic parameter predication module to obtain the voice parameters corresponding to the voice vectors; and finally, synthesizing voices by a voice synthesis module. According to the voice synthesis method based on the voice vector textual characteristics, the accuracy of modeling of a voice synthesis system is improved; and the complexity and the manual participation degree of system realization are greatly reduced.

Description

technical field [0001] The invention relates to a speech synthesis method, in particular to a speech synthesis method based on sound vector text features. Background technique [0002] Speech synthesis technology enables computers to generate high-definition, high-naturalness continuous speech, making human-machine communication more harmonious and natural. In the development of speech synthesis technology, the early research mainly used the speech synthesis method based on unit waveform splicing, but this method will cause speech distortion and sudden change at the splicing point. In recent years, the speech synthesis method based on statistical parameters has developed rapidly due to the rapid construction of the synthesis system, small corpus size requirements, and smooth and smooth synthesized speech. However, this method still has the following two shortcomings: [0003] 1. The description of text features is not precise enough, requiring a lot of manpower and material...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/02G10L13/08G10L17/04G10L17/02

Inventor徐明星车浩

Owner中科极限元(杭州)智能科技股份有限公司

Voice synthesis method based on voice vector textual characteristics

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology