High-quality voice conversion method based on modeling of signal timing characteristics

A speech conversion technology based on signal timing, applied in speech analysis, speech recognition, instruments, and related fields. It addresses conversion methods that disregard the strong correlation in speech signals, which weakens the model's ability to describe the signal's time-varying characteristics and degrades the conversion result.

Status: Inactive | Publication Date: 2013-04-10

SHENZHEN TENGRUIFENG TECH CO LTD

Cites: 1 | Cited by: 15

AI Technical Summary

Problems solved by technology

However, this type of algorithm also has disadvantages: it artificially assumes that the data are independent and identically distributed, and during feature conversion it forces the mapping to be applied frame by frame.

Ignoring the correlation between the parameters of adjacent frames greatly simplifies the problem and makes it easier to solve, but it contradicts the strong correlation that actually exists in speech signals. The model therefore describes the time-varying characteristics of the signal less well, which ultimately degrades the quality of the converted voice.
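The cost of the independence assumption can be illustrated with a small sketch. It is not taken from the patent: the sinusoidal `trajectory` is a hypothetical stand-in for one slowly varying speech parameter, and `lag1_autocorrelation` is a helper introduced here only for illustration. Shuffling the frames keeps the per-frame distribution intact, which is all an i.i.d. frame-by-frame model can see, while removing the inter-frame correlation.

```python
import math
import random


def lag1_autocorrelation(values):
    """Sample lag-1 autocorrelation of a 1-D sequence."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values)
    cov = sum((values[i] - mean) * (values[i + 1] - mean) for i in range(n - 1))
    return cov / var


# Hypothetical feature trajectory: one slowly varying parameter over 200 frames.
trajectory = [math.sin(2 * math.pi * t / 50) for t in range(200)]

# Same values in random order: the frame-level distribution is unchanged,
# but the ordering (and with it the inter-frame correlation) is lost.
rng = random.Random(0)
shuffled = trajectory[:]
rng.shuffle(shuffled)

r_ordered = lag1_autocorrelation(trajectory)
r_shuffled = lag1_autocorrelation(shuffled)
print(f"ordered frames:  {r_ordered:.3f}")
print(f"shuffled frames: {r_shuffled:.3f}")
```

The ordered trajectory has a lag-1 autocorrelation close to 1, while the shuffled copy has one close to 0, even though both contain exactly the same frames.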

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings.

[0038] A high-quality speech conversion method based on signal timing feature modeling. For parallel data from a source and a target, the method models and tracks the timing features of the data with a hybrid Kalman filter, estimates the structural parameters of the model under the expectation-maximization criterion, and finally uses the model to map the feature parameter set of the speech, achieving a high-quality speech conversion effect. The specific steps are as follows:

[0039] (1) Analyze the original speech signal with a speech analysis model;

[0040] (2) Extract a phoneme-related feature parameter set from the analyzed parameters;

[0041] (3) Perform a normalization operation on the feature parameter sets of the source and the target to realize the alignment of the parameter sets;

[0042] (4) Use the aligned parameter sets as the input and output of the h...
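Step (3) can be sketched as follows. The excerpt does not say which normalization is used to align the source and target parameter sets; the sketch below assumes dynamic time warping (DTW), a common choice for aligning parallel utterances, and that assumption is not confirmed by the source. The function `dtw_align` and the one-dimensional toy sequences are hypothetical.

```python
def dtw_align(source, target):
    """Align two 1-D feature sequences with DTW.

    Returns the warping path as a list of (source_index, target_index) pairs.
    ASSUMPTION: DTW stands in for the unspecified normalization in step (3).
    """
    n, m = len(source), len(target)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning source[:i] with target[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(source[i - 1] - target[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])

    # Backtrack from the end, always moving to the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return path


# Toy parallel data: the target says the "same thing" at a different pace.
source_frames = [0.0, 1.0, 2.0, 1.0, 0.0]
target_frames = [0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 0.0]

path = dtw_align(source_frames, target_frames)
aligned_source = [source_frames[i] for i, _ in path]
aligned_target = [target_frames[j] for _, j in path]
print(path)
```

After alignment, `aligned_source` and `aligned_target` have equal length and can serve as paired input and output sequences for training a mapping model, as step (4) describes.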

Abstract

The invention discloses a high-quality voice conversion method based on modeling of signal timing characteristics. For parallel data from a source and a target, the method models and tracks the timing characteristics of the data with a hybrid Kalman filter, estimates the structural parameters of the model under the expectation-maximization criterion, and uses the model to map the characteristic parameter set of the voice, achieving a high-quality voice conversion effect. The method makes full use of the strong correlation between voice signal parameters: a novel hybrid Kalman filter is constructed by simulating the physical process by which the parameters change over time, and it is used for the parameter mapping step of voice conversion. A dedicated conversion algorithm is designed that associates the parameters of the Kalman filter with the physical properties of the voice signal, so that the individual characteristics of a speaker's voice can be converted.
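The idea of tracking a parameter over time can be sketched with a standard scalar Kalman filter. This is not the patent's hybrid Kalman filter, whose structure is not given in this excerpt. The state model, the noise variances `q` and `r`, and the synthetic data are all assumptions chosen for illustration, and the expectation-maximization estimation of the model parameters is not shown.

```python
import math
import random


def kalman_filter(observations, a=1.0, q=0.01, r=0.25, x0=0.0, p0=1.0):
    """Scalar Kalman filter for x[t] = a * x[t-1] + w,  y[t] = x[t] + v.

    q and r are the (assumed) variances of the process noise w and the
    observation noise v. Returns the filtered state estimate per frame.
    """
    x, p = x0, p0
    estimates = []
    for y in observations:
        # Predict: propagate the previous estimate through the state model.
        x = a * x
        p = a * p * a + q
        # Update: blend the prediction with the new observation.
        k = p / (p + r)  # Kalman gain
        x = x + k * (y - x)
        p = (1.0 - k) * p
        estimates.append(x)
    return estimates


# Synthetic data: a slowly varying parameter observed with frame-wise noise.
rng = random.Random(0)
true_values = [math.sin(2 * math.pi * t / 100) for t in range(300)]
noisy = [v + rng.gauss(0.0, 0.5) for v in true_values]

filtered = kalman_filter(noisy)


def mse(xs, ys):
    return sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


mse_raw = mse(noisy, true_values)
mse_filtered = mse(filtered, true_values)
print(f"raw MSE:      {mse_raw:.4f}")
print(f"filtered MSE: {mse_filtered:.4f}")
```

Because each estimate depends on the previous one, the filter exploits the inter-frame correlation that a frame-by-frame mapping discards, and its error against the underlying trajectory is lower than that of the raw noisy frames.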

Description

Technical field

[0001] The present invention relates to speech conversion technology, which combines speech recognition and speech synthesis to transform a speaker's voice so that it sounds like the voice of another specific speaker. In particular, it relates to a high-quality speech conversion method based on modeling of signal timing characteristics.

Background technique

[0002] Speech conversion is a research branch that has emerged in the field of speech signal processing in recent years, spanning speech recognition and speech synthesis. It alters a person's voice personality characteristics so that his or her speech is perceived by the listener as the speech of another specific speaker (called the target speaker). The main tasks of speech conversion include extracting characteristic parameters representing the speaker's personality and performing mathematical transformation, and then reconstructing the transformed p...

Application Information

IPC(8): G10L15/02, G10L15/06, G10L25/03

Inventor: 徐宁, 鲍静益, 汤一彬

Owner: SHENZHEN TENGRUIFENG TECH CO LTD
