High-quality voice conversion method based on modeling of signal timing characteristics

A technology of speech conversion and signal timing, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of violating the strong correlation of speech signals, affecting the effect of speech conversion, and reducing the ability of time-varying characteristics of signals.

Inactive Publication Date: 2013-04-10
SHENZHEN TENGRUIFENG TECH CO LTD
View PDF1 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this type of algorithm also has some disadvantages, such as artificially assuming that the data satisfies the condition of independent and identical distribution, and in the process of feature conversion, the forced conversion method is performed in a frame-by-frame order
Although this method of ignoring the inter-frame par

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-quality voice conversion method based on modeling of signal timing characteristics
  • High-quality voice conversion method based on modeling of signal timing characteristics
  • High-quality voice conversion method based on modeling of signal timing characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings.

[0038] A high-quality speech conversion method based on signal timing feature modeling, considering the parallel data of source and target, considering its timing feature modeling and tracking, using hybrid Kalman filter, and estimating the model structure under the expectation maximization criterion Parameters, and finally use the model to map the feature parameter set of speech to achieve high-quality speech conversion effect; the specific steps are as follows:

[0039] (1) adopting the speech analysis model to analyze the original speech signal;

[0040] (2) Extracting a phoneme-related feature parameter set from the analyzed parameters;

[0041] (3) Perform a normalization operation on the feature parameter sets of the source and the target to realize the alignment of the parameter sets;

[0042] (4) Use the aligned parameter sets as the input and output of the h...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a high-quality voice conversion method based on modeling of signal timing characteristics. The high-quality voice conversion method based on the modeling of the signal timing characteristics comprises the following steps: aiming at parallel data of a source and a target, considering modeling and tracing the timing characteristics of the source and the target, utilizing the hybrid Kalman filter, estimating structural parameters of a model under the criteria of expectation maximization, utilizing characteristic parameter set of mapping voice of the model and finally achieving a high-quality voice conversion effect. According to the high-quality voice conversion method based on the modeling of the signal timing characteristics, strong correlation between the voice signal parameters is fully utilized, a novel hybrid Kalman filter is constructed by means of a physical process that simulation parameters change with time and is used for a parameter mapping process of the voice conversion, a set of special conversion algorithm which associates the parameter of the Kalman filter with physical properties of a voice signal is designed, and therefore a conversion of personality traits of a speaker can be achieved.

Description

technical field [0001] The present invention relates to speech conversion technology, which is a technology that combines speech recognition and speech synthesis technology to realize the transformation of a speaker's voice so that it sounds like another specific speaker's voice, especially a technology based on signal timing characteristics. Modeled High-Quality Speech Transformation Methods. Background technique [0002] Speech conversion technology is an emerging research branch in the field of speech signal processing in recent years, covering the fields of speech recognition and speech synthesis. A person's voice personality characteristics, so that his (or her) speech is perceived by the listener as the speech of another specific speaker (called the target speaker). The main tasks of speech conversion include extracting characteristic parameters representing the speaker's personality and performing mathematical transformation, and then reconstructing the transformed p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/06G10L25/03
Inventor 徐宁鲍静益汤一彬
Owner SHENZHEN TENGRUIFENG TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products