Method for separating single-channel mixed voice based on compressed sensing and K-SVD

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A K-SVD algorithm and mixed voice technology, applied in voice analysis, instruments, etc., can solve problems such as high algorithm complexity, large differences, and poor practicability

Active Publication Date: 2011-06-01

NANJING UNIV OF POSTS & TELECOMM

View PDF3 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Although the current research on single-channel mixed speech separation has achieved certain results, the overall complexity of the algorithm is relatively high, and the performance varies greatly despite the difference in the source speech signal. In addition, there are special requirements for the training data in the training phase, so Overall, the practicability is not strong and needs to be improved for specific application

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0048] figure 1 In order to realize the system block diagram of this scheme. As shown in the figure: the present invention first adopts the K-SVD algorithm, trains and constructs a complete dictionary, and then based on the constructed K-SVD dictionary, adopts the method based on l in compressed sensing 0 - Norm-optimized signal reconstruction algorithm for separation of single-channel mixed speech.

[0049] The speech used in the experiment is a speech with a sampling rate of 16KHZ. There are four speakers, two men and two women. Each speaker takes 40 sentences of Chinese phrase structure to construct training speech. Each speaker randomly selects 5 sentences of Chinese phrase structure as the test speech, and the test speech is different from the training speech. Single channel mixed speech x by two sources Test speech s 1 ,s 2 Overlay acquisition, that is, x=s 1 +s 2 , a total of 100 mixed voices of men and women, 25 mixed voices of men and women, and 25 mixed voice...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a method for separating single-channel mixed voice based on compressed sensing and kernel singular value decomposition (K-SVD), which comprises the following steps of: constructing a universally applicable overcomplete dictionary, namely a K-SVD dictionary for each of three types of man-man, man-woman and woman-woman mixed training voice by using a K-SVD algorithm throughmixed training voice frames; making a signal sparse under the dictionary while a reconfiguration error is in a certain range; on the basis of the constructed K-SVD dictionary, starting from the similarity of compressed sensing observation and single-channel mixed voice expressions, separating the single-channel mixed voice by using a l0-norm optimization-based signal reconfiguration algorithm in a compressed sensing theory; solving the estimation of sparse representation of each source voice frame under the K-SVD dictionary on the basis of the expression of each single-channel mixed voice frame, and reconfiguring each separated voice frame through the estimation of the sparse representation and the K-SVD dictionary; and sequentially connecting the separated voice frames to acquire a separated voice signal.

Description

technical field [0001] The present invention relates to a special category of speech enhancement—speech separation, in particular to a single-channel hybrid speech separation method based on compressed sensing and K-SVD, which belongs to the technical field of speech signal processing. Background technique [0002] Speech is the most convenient, direct and commonly used communication method for human beings. However, in the actual environment, people will inevitably be disturbed by the surrounding noise while acquiring speech signals. These disturbances will affect the performance of speech processing systems (such as speech recognition systems) on the one hand, and affect the performance of human ears on the other hand. Perception and understanding of speech. Therefore, speech enhancement is particularly necessary. Speech separation is a special kind of speech enhancement method. Its noise objects are generally speech-like noises that are difficult to process. It is based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/02G10L19/00G10L21/028

Inventor郭海燕杨震

OwnerNANJING UNIV OF POSTS & TELECOMM

Method for separating single-channel mixed voice based on compressed sensing and K-SVD

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology