Voice converting system and method using blind voice separation

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A voice conversion and voice separation technology, which is applied in voice analysis, voice recognition, instruments, etc., can solve the problems of large amount of calculation in the separation process, cannot meet the real-time requirements of the voice conversion system, and high algorithm complexity, so as to optimize the learning process , optimization of separation results, and the effect of overcoming dependencies

Inactive Publication Date: 2012-07-18

BEIJING JIAOTONG UNIV

View PDF4 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the current popular ICA-based blind signal source separation method has high algorithm complexity and a large amount of calculation in the separation process, which cannot meet the real-time requirements of the voice conversion system.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0037] Combine below figure 1 The specific steps of the speech conversion synthesis method of the present invention are described in detail.

[0038] Step 1: Use subband decomposition to obtain two-channel data

[0039] The speech of the source speaker or the target speaker required by the speech conversion system is usually obtained by the microphone, and the noisy speaker's speech collected by the microphone can only constitute a single-channel signal, while the independent component analysis (ICA) needs to observe the signal x i (t) (i.e. the mixed signal of background noise and desired speech) number is greater than or equal to the number of source signals (i.e. each independent source signal before mixing), so the collected speech signal is divided into two parts using existing sub-band decomposition technology : Low frequency part and high frequency part.

[0040] The speech converted from the low-frequency part is used as the first observation signal x of ICA 1 (t), ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a voice converting system and method using blind voice separation. The voice converting method comprises the steps of: applying a blind voice separation method which combines sub-band decomposition with ICA (Independent Component Analysis) in a voice input module at the front end of the voice converting system to separate a mixed signal including background noise and expected voice, and converting the expected voice obtained through separation and subjected to an unequal-length framing process to realize conversion of specific voice of people in a noise environment.

Description

technical field [0001] The invention relates to speech conversion, analysis and signal processing, in particular to a speech conversion system and method using blind speech separation. Background technique [0002] Speech conversion is to change a speaker's voice to make it sound like another person's voice, that is, to realize the conversion of timbre characteristics from a specific source speaker to a target speaker. The voice conversion system needs to perform two voice inputs. For the first time, it needs to collect a certain amount of source speaker's voice and target speaker's voice to establish two parallel voice databases, and then perform feature parameter extraction and Training, establish a source-to-target conversion function; the second time you need to input the source speaker's voice with any content into the conversion function, the conversion system can output the target speaker's voice with the same content. Speech conversion is a relatively new branch in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/02G10L15/14G10L11/06G10L19/04G10L21/0224

Inventor申艳汶跃龙张嘉驰范礼乾杨柳蒋诗慧

OwnerBEIJING JIAOTONG UNIV

Voice converting system and method using blind voice separation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology