Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method under a small amount of recording samples

A small amount of speech synthesis technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of unsatisfactory naturalness, waste of time, energy and financial resources, and failure to achieve the desired effect, so as to reduce the cost of speech recording and ensure smoothness The effect of sex and naturalness

Inactive Publication Date: 2019-12-06
广州九四智能科技有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Speech-related fields, especially speech synthesis, are developing extremely rapidly, but the process of preparing corpora is very difficult
During the recording process, when different recording personnel record the same content, due to the excessive recording content, a lot of time, energy and financial resources may be wasted, and it is very unnecessary to record the same recording text multiple times
The naturalness of the related speech synthesis technology for a small amount of speech synthesis is very unsatisfactory, and there will be obvious differences from the actual recorded speech, which cannot achieve the desired ideal effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method under a small amount of recording samples
  • Speech synthesis method under a small amount of recording samples

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0020] In the embodiment of the speech synthesis method under the situation of a small number of recording samples of the present invention, the flow chart of the speech synthesis method under the situation of a small number of recording samples is as follows figure 1 As shown, the frame diagram of the speech synthesis method in the case of a small number of recording samples is as follows figure 2 shown. The method for speech synthesis under the situation o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech synthesis method under a small amount of recording samples. The method comprises the following steps of A) obtaining a background speaker model by applying a small amount of training sentences recorded by target recording personnel; B) respectively establishing speaker models for the obtained current speaker sentence and the originally recorded complete speaker sentence through a self-adaptive algorithm; and C) realizing the speech synthesis by adjusting the mean and variance of the speaker models, and synthesizing a small amount of recorded audios into a complete recording sentence. The speech synthesis method under a small amount of recording samples disclosed by the invention has the advantages that an operation of repeatedly recording the same recordingtext by multiple customer service staff is avoided, the speed recording cost is reduced, and the fluency and the naturalness of the whole conversation process effect can be guaranteed.

Description

technical field [0001] The invention relates to the field of speech synthesis, in particular to a speech synthesis method under the condition of a small number of recording samples. Background technique [0002] The speech-related fields of Dang, especially speech synthesis, are developing extremely rapidly, but the preparation process of the corpus is very difficult. During the recording process, when different recording personnel record the same content, a lot of time, energy and financial resources may be wasted due to too much recording content, and it is very unnecessary to record the same recording text multiple times. The naturalness of the related speech synthesis technology for a small amount of speech synthesis is not ideal, and at the same time there will be obvious differences with the actual recorded speech, which cannot achieve the desired ideal effect. Contents of the invention [0003] The technical problem to be solved by the present invention is that, ai...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02
CPCG10L13/02
Inventor 刘嗣平陈孟达柯登峰
Owner 广州九四智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products