Voice synthesis method and system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis and speech technology, applied in the field of data processing, can solve problems such as high cost and unsatisfactory speech consistency

Active Publication Date: 2018-11-06

BEIJING UNISOUND INFORMATION TECH

View PDF10 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Because the text with high frequency of occurrence will often find changes, it is usually necessary to find a speaker to re-record, which consumes a lot of manpower, material and financial resources, and the cost is relatively high; and the voice synthesized by this technology has a poor voice consistency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0048] The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0049] The present invention provides a method and system for speech synthesis, which does not require a specific speaker to perform supplementary recording of high-frequency text, which is convenient and fast, and the obtained speech is more than directly synthesized by using a corresponding parameter synthesis model (such as an LSTM parameter synthesis model) The naturalness of the speech is significantly high, thus improving the naturalness of the synthesized speech. Such as figure 1 as shown, figure 1 It is a schematic flow chart of an embodiment of the speech synthesis method of the present invention; a speech synthesis method of the present invention can be i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice synthesis method and system. The method includes steps: obtaining a recorded voice correspondingly recorded by a speaker according to a specific text; extracting fundamental frequency information f01 from the recorded voice, performing analysis processing on the recorded voice, and obtaining phoneme duration information corresponding to the recorded voice; generating fundamental frequency information f00 and frequency spectrum information cep0 by employing a preset parameter synthesis model according to the specific text and the obtained phoneme duration information; performing domain adjustment on the fundamental frequency information f01 of the recorded voice by employing the fundamental frequency information f01 of the recorded voice and the fundamental frequency information f00 generated by the preset parameter synthesis model; and obtaining final fundamental frequency information. According to the method and system, the corresponding synthesized voice is obtained through reconstruction of a vocoder by employing the obtained final fundamental frequency information and the frequency spectrum information cep0, the beneficial effect of reducing thevoice recording cost is achieved, and the natural degree of the synthesized voice is further improved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a speech synthesis method and system. Background technique [0002] The naturalness of the existing speech synthesis is not very ideal. In order to obtain a higher synthetic naturalness, some scenarios use the method of combining natural speech and synthetic speech for speech synthesis. For texts with high frequency, pre-recorded speech , for other text, use synthesized speech. Since the text with high frequency of occurrence will often find changes, it is usually necessary to find a speaker to re-record, which consumes a lot of manpower, material and financial resources, and the cost is relatively high; and the voice synthesized by this technology is not ideal for voice consistency. Contents of the invention [0003] The present invention provides a method and system for speech synthesis, aiming to directly use speakers of other pronunciation standards to record speec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/02G10L13/08

CPCG10L13/02G10L13/08

Inventor 孙见青

Owner BEIJING UNISOUND INFORMATION TECH

Voice synthesis method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology