Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice converting system and method using blind voice separation

A voice conversion and voice separation technology, which is applied in voice analysis, voice recognition, instruments, etc., can solve the problems of large amount of calculation in the separation process, cannot meet the real-time requirements of the voice conversion system, and high algorithm complexity, so as to optimize the learning process , optimization of separation results, and the effect of overcoming dependencies

Inactive Publication Date: 2012-07-18
BEIJING JIAOTONG UNIV
View PDF4 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current popular ICA-based blind signal source separation method has high algorithm complexity and a large amount of calculation in the separation process, which cannot meet the real-time requirements of the voice conversion system.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice converting system and method using blind voice separation
  • Voice converting system and method using blind voice separation
  • Voice converting system and method using blind voice separation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Combine below figure 1 The specific steps of the speech conversion synthesis method of the present invention are described in detail.

[0038] Step 1: Use subband decomposition to obtain two-channel data

[0039] The speech of the source speaker or the target speaker required by the speech conversion system is usually obtained by the microphone, and the noisy speaker's speech collected by the microphone can only constitute a single-channel signal, while the independent component analysis (ICA) needs to observe the signal x i (t) (i.e. the mixed signal of background noise and desired speech) number is greater than or equal to the number of source signals (i.e. each independent source signal before mixing), so the collected speech signal is divided into two parts using existing sub-band decomposition technology : Low frequency part and high frequency part.

[0040] The speech converted from the low-frequency part is used as the first observation signal x of ICA 1 (t), ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a voice converting system and method using blind voice separation. The voice converting method comprises the steps of: applying a blind voice separation method which combines sub-band decomposition with ICA (Independent Component Analysis) in a voice input module at the front end of the voice converting system to separate a mixed signal including background noise and expected voice, and converting the expected voice obtained through separation and subjected to an unequal-length framing process to realize conversion of specific voice of people in a noise environment.

Description

technical field [0001] The invention relates to speech conversion, analysis and signal processing, in particular to a speech conversion system and method using blind speech separation. Background technique [0002] Speech conversion is to change a speaker's voice to make it sound like another person's voice, that is, to realize the conversion of timbre characteristics from a specific source speaker to a target speaker. The voice conversion system needs to perform two voice inputs. For the first time, it needs to collect a certain amount of source speaker's voice and target speaker's voice to establish two parallel voice databases, and then perform feature parameter extraction and Training, establish a source-to-target conversion function; the second time you need to input the source speaker's voice with any content into the conversion function, the conversion system can output the target speaker's voice with the same content. Speech conversion is a relatively new branch in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/02G10L15/14G10L11/06G10L19/04G10L21/0224
Inventor 申艳汶跃龙张嘉驰范礼乾杨柳蒋诗慧
Owner BEIJING JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products