Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice synthesis method and system

A speech synthesis and speech technology, applied in the field of data processing, can solve problems such as high cost and unsatisfactory speech consistency

Active Publication Date: 2018-11-06
BEIJING UNISOUND INFORMATION TECH
View PDF10 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because the text with high frequency of occurrence will often find changes, it is usually necessary to find a speaker to re-record, which consumes a lot of manpower, material and financial resources, and the cost is relatively high; and the voice synthesized by this technology has a poor voice consistency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice synthesis method and system
  • Voice synthesis method and system
  • Voice synthesis method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0049] The present invention provides a method and system for speech synthesis, which does not require a specific speaker to perform supplementary recording of high-frequency text, which is convenient and fast, and the obtained speech is more than directly synthesized by using a corresponding parameter synthesis model (such as an LSTM parameter synthesis model) The naturalness of the speech is significantly high, thus improving the naturalness of the synthesized speech. Such as figure 1 as shown, figure 1 It is a schematic flow chart of an embodiment of the speech synthesis method of the present invention; a speech synthesis method of the present invention can be i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice synthesis method and system. The method includes steps: obtaining a recorded voice correspondingly recorded by a speaker according to a specific text; extracting fundamental frequency information f01 from the recorded voice, performing analysis processing on the recorded voice, and obtaining phoneme duration information corresponding to the recorded voice; generating fundamental frequency information f00 and frequency spectrum information cep0 by employing a preset parameter synthesis model according to the specific text and the obtained phoneme duration information; performing domain adjustment on the fundamental frequency information f01 of the recorded voice by employing the fundamental frequency information f01 of the recorded voice and the fundamental frequency information f00 generated by the preset parameter synthesis model; and obtaining final fundamental frequency information. According to the method and system, the corresponding synthesized voice is obtained through reconstruction of a vocoder by employing the obtained final fundamental frequency information and the frequency spectrum information cep0, the beneficial effect of reducing thevoice recording cost is achieved, and the natural degree of the synthesized voice is further improved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a speech synthesis method and system. Background technique [0002] The naturalness of the existing speech synthesis is not very ideal. In order to obtain a higher synthetic naturalness, some scenarios use the method of combining natural speech and synthetic speech for speech synthesis. For texts with high frequency, pre-recorded speech , for other text, use synthesized speech. Since the text with high frequency of occurrence will often find changes, it is usually necessary to find a speaker to re-record, which consumes a lot of manpower, material and financial resources, and the cost is relatively high; and the voice synthesized by this technology is not ideal for voice consistency. Contents of the invention [0003] The present invention provides a method and system for speech synthesis, aiming to directly use speakers of other pronunciation standards to record speec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/08
CPCG10L13/02G10L13/08
Inventor 孙见青
Owner BEIJING UNISOUND INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products