Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Singing synthetic method for tone conversion

A technology of timbre conversion and synthesis method, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as difficult acquisition, poor generalization ability, inapplicable singing timbre conversion, etc., to achieve breakthrough dependence and good effect

Inactive Publication Date: 2018-08-28
FUZHOU UNIVERSITY
View PDF1 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The so-called "one-to-one" means that the speech timbre conversion model can only convert the timbre of a specific source speaker into the timbre of a specific target speaker. Obviously, the generalization ability of this "one-to-one" model is very high. Poor; the so-called parallel data set means that the training data set used by the timbre conversion model in the training stage is strictly parallel, that is, the voice data of the source speaker and the target speaker have exactly the same semantic content. Said that it is very difficult to obtain such a parallel corpus due to the limitation of the singer's own singing repertoire
Therefore, the commonly used "one-to-one" speech timbre conversion model based on parallel corpora is not suitable for singing timbre conversion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Singing synthetic method for tone conversion
  • Singing synthetic method for tone conversion
  • Singing synthetic method for tone conversion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0023] The present invention provides a singing voice synthesis method oriented to timbre conversion, such as figure 1 Shown include the following steps:

[0024] Step S1: Obtain the singer's cappella audio file, and use the STRAIGHT algorithm to extract its acoustic features;

[0025] Step S2: Construct and train a variational autoencoder-generated confrontational network model to obtain a trained singing voice timbre conversion model;

[0026]Step S3: Obtain the a cappella audio file of the source singer, use the STRAIGHT algorithm to extract the acoustic features and input the singing voice timbre conversion model, the model output is the acoustic features after timbre conversion, and then use STRAIGHT to synthesize the acoustic features into the timbre converted singing voice.

[0027] In an embodiment of the present invention, the step S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a singing synthetic method for tone conversion. The method comprises the steps: calculating the frequency spectrum features (shown in the description), a fundamental tone frequency (shown in the description) and an aperiodic component (shown in the description) of a singing audio without makeup and acting of a singer through a STRAIGHT algorithm; constructing a singing tone conversion model, wherein the model is generated through a variational self-encoding-generative confrontation network; training the model through the spectrum features of the singer's singing data without makeup and acting, and obtaining a trained singing tone conversion model; finally inputting the spectrum of a to-be-converted singer's singing audio without makeup and acting into the trained singing tone conversion model, wherein the model output is the spectrum with the tone of a target singer (shown in the description), and synthesizing the converted spectrum (shown in the description),fundamental tone frequency (shown in the description) and aperiodic component (shown in the description) into the singing sound through the STRAIGHT algorithm after tone conversion. The method achieves the many-to-many singing conversion under a non-parallel data set.

Description

technical field [0001] The invention belongs to an audio signal processing method in the field of singing, in particular to a singing voice synthesis method oriented to timbre conversion. Background technique [0002] The American National Institute of Standardization defines timbre as follows, "timbre refers to a certain attribute of sound produced by hearing, and the listener can judge the difference between two sounds presented in the same way with the same pitch and loudness. ". Therefore, the vocal timbre during singing refers to the voice characteristics that people use to identify the specific singer when different singers sing the same song. [0003] The timbre conversion process can be understood as a process of modifying and reconstructing the original audio signal. Similar to voice timbre conversion, the so-called singing voice timbre conversion refers to converting the timbre of the singing voice of the source singer to the timbre of the target singer on the pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/02
CPCG10L13/02
Inventor 余春艳齐子铭胡进森张栋
Owner FUZHOU UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products