Singing voice synthesis method for timbre conversion

A timbre conversion and synthesis method, applied in speech synthesis, speech analysis, instruments, etc. It addresses problems such as parallel training data being difficult to acquire, poor generalization ability, and speech timbre conversion methods being inapplicable to singing, and it removes the dependence on parallel data sets while achieving a good conversion effect.

Status: Inactive. Publication date: 2018-08-28
FUZHOU UNIVERSITY

AI Technical Summary

Problems solved by technology

"One-to-one" means that a speech timbre conversion model can only convert the timbre of one specific source speaker into the timbre of one specific target speaker; the generalization ability of such a "one-to-one" model is clearly very poor. A parallel data set means that the training data set used by the timbre conversion model in the training stage is stric



Example Embodiment

[0022] The present invention will be further explained below in conjunction with the drawings and specific embodiments.

[0023] The invention provides a singing voice synthesis method oriented to timbre conversion. As shown in Figure 1, it includes the following steps:

[0024] Step S1: Obtain an a cappella audio file of the singer, and extract its acoustic features using the STRAIGHT algorithm;

[0025] Step S2: Construct and train a variational autoencoder-generative adversarial network (VAE-GAN) model to obtain a trained singing voice timbre conversion model;

[0026] Step S3: Obtain an a cappella audio file of the source singer, extract its acoustic features using the STRAIGHT algorithm, and input them into the singing voice timbre conversion model. The model output is the acoustic features after timbre conversion, which STRAIGHT then synthesizes into the timbre-converted singing voice.
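The conversion stage in step S3 can be sketched as a three-part pipeline: analysis, spectrum conversion, and synthesis. The sketch below is illustrative only. `AcousticFeatures`, `convert_singing_voice`, and the `toy_*` stand-ins are hypothetical names, not part of the patent or of any STRAIGHT implementation, and it assumes that only the spectral features pass through the conversion model while the fundamental frequency and the aperiodic component are reused unchanged.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence

Frame = List[float]


@dataclass(frozen=True)
class AcousticFeatures:
    """Hypothetical container for STRAIGHT-style analysis output."""
    spectrum: List[Frame]      # spectral features, one frame per row
    f0: List[float]            # fundamental frequency per frame
    aperiodicity: List[Frame]  # aperiodic component, one frame per row


def convert_singing_voice(
    audio: Sequence[float],
    analyze: Callable[[Sequence[float]], AcousticFeatures],
    convert_spectrum: Callable[[List[Frame]], List[Frame]],
    synthesize: Callable[[AcousticFeatures], List[float]],
) -> List[float]:
    """Analyze the source audio, convert only the spectrum, then resynthesize."""
    features = analyze(audio)
    # Assumption: f0 and aperiodicity are carried over as extracted.
    converted = replace(features, spectrum=convert_spectrum(features.spectrum))
    return synthesize(converted)


# Toy stand-ins so the sketch runs; a real system would call a vocoder
# and a trained conversion model here.
def toy_analyze(audio: Sequence[float]) -> AcousticFeatures:
    frames = [[abs(x), abs(x) / 2] for x in audio]
    return AcousticFeatures(
        spectrum=frames,
        f0=[220.0] * len(audio),
        aperiodicity=[[0.1, 0.1] for _ in audio],
    )


def toy_convert(spectrum: List[Frame]) -> List[Frame]:
    return [[2.0 * value for value in frame] for frame in spectrum]


def toy_synthesize(features: AcousticFeatures) -> List[float]:
    return [sum(frame) for frame in features.spectrum]


output = convert_singing_voice([0.5, -1.0], toy_analyze, toy_convert, toy_synthesize)
```

Passing the three stages in as callables keeps the sketch independent of any particular vocoder or model library.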

[0027] In an embodiment of the present invention, the step S1 specifically includes the f...


Abstract

The invention relates to a singing voice synthesis method for timbre conversion. The method comprises the following steps: calculating the spectral features (shown in the description), the fundamental frequency (shown in the description) and the aperiodic component (shown in the description) of a singer's a cappella singing audio through the STRAIGHT algorithm; constructing a singing timbre conversion model based on a variational autoencoder-generative adversarial network (VAE-GAN); training the model on the spectral features of the singers' a cappella singing data to obtain a trained singing timbre conversion model; and finally inputting the spectrum of the a cappella singing audio of the singer to be converted into the trained model, whose output is the spectrum with the timbre of the target singer (shown in the description), and synthesizing the converted spectrum (shown in the description), the fundamental frequency (shown in the description) and the aperiodic component (shown in the description) into the timbre-converted singing voice through the STRAIGHT algorithm. The method achieves many-to-many singing voice conversion with a non-parallel data set.
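The abstract does not give the network architecture. As a rough illustration of how a variational autoencoder can support many-to-many conversion, the sketch below shows three building blocks common in VAE-based conversion models: reparameterized sampling of a latent code, the KL penalty toward a standard normal prior, and a decoder conditioned on a one-hot target-singer code. All function names, the single linear decoder layer, and the one-hot conditioning are assumptions made for illustration. The adversarial discriminator of the VAE-GAN is omitted.

```python
import math
import random
from typing import List


def one_hot(index: int, size: int) -> List[float]:
    """Encode a singer identity as a one-hot vector."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def linear(weights: List[List[float]], bias: List[float], x: List[float]) -> List[float]:
    """Apply a dense layer: y = W x + b."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def reparameterize(mu: List[float], log_var: List[float], rng: random.Random) -> List[float]:
    """Sample z = mu + sigma * eps with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu: List[float], log_var: List[float]) -> float:
    """KL divergence between N(mu, diag(exp(log_var))) and N(0, I)."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, log_var))


def decode(z: List[float], singer_id: int, n_singers: int,
           weights: List[List[float]], bias: List[float]) -> List[float]:
    """Decode a latent code into a spectral frame for the chosen target singer."""
    return linear(weights, bias, z + one_hot(singer_id, n_singers))


# Toy decoder weights: 2 latent dimensions plus 2 singer dimensions, 2 outputs.
W = [[1.0, 0.0, 0.5, -0.5],
     [0.0, 1.0, -0.5, 0.5]]
b = [0.0, 0.0]

z = [0.2, 0.4]
frame_for_singer_0 = decode(z, 0, 2, W, b)
frame_for_singer_1 = decode(z, 1, 2, W, b)
```

Because the singer identity enters only at the decoder, the same latent code can be decoded toward any singer seen in training. This is one way a single model can cover many source-target pairs without parallel recordings.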

Description

Technical field

[0001] The invention belongs to audio signal processing methods in the field of singing, and in particular relates to a singing voice synthesis method oriented to timbre conversion.

Background art

[0002] The American National Standards Institute defines timbre as follows: "timbre is the attribute of auditory sensation by which a listener can judge that two sounds, presented in the same way and having the same pitch and loudness, are different." The vocal timbre in singing therefore refers to the voice characteristics by which people identify a specific singer when different singers sing the same song.

[0003] The timbre conversion process can be understood as a process of modifying and reconstructing the original audio signal. Similar to speech timbre conversion, singing voice timbre conversion refers to converting the timbre of the singing voice of the source singer to the timbre of the target singer on the pr...


Application Information

IPC(8): G10L13/02
CPC: G10L13/02
Inventor: 余春艳, 齐子铭, 胡进森, 张栋
Owner: FUZHOU UNIVERSITY