
Method for separating single-channel mixed voice based on compressed sensing and K-SVD

A single-channel mixed-speech separation technique based on the K-SVD algorithm, applied in speech analysis, instruments, etc. It addresses problems of existing methods such as high algorithm complexity, large performance differences across source signals, and poor practicability.

Active Publication Date: 2011-06-01
NANJING UNIV OF POSTS & TELECOMM
Cites: 3. Cited by: 25.

AI Technical Summary

Problems solved by technology

Although current research on single-channel mixed speech separation has achieved certain results, the overall complexity of the algorithms is relatively high, and performance varies greatly with differences in the source speech signals. In addition, the training phase places special requirements on the training data. Overall, the practicability is not strong and needs to be improved for specific applications.




Embodiment Construction

[0048] Figure 1 is the system block diagram of this scheme. As shown in the figure, the present invention first uses the K-SVD algorithm to train and construct an overcomplete dictionary. Then, based on the constructed K-SVD dictionary, it uses the l0-norm-optimization-based signal reconstruction algorithm from compressed sensing to separate single-channel mixed speech.
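The dictionary-training stage described in [0048] can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names `omp` and `ksvd`, the atom count, the sparsity level `k`, the iteration count, and the use of orthogonal matching pursuit (OMP) for the sparse-coding step are all choices made here for illustration, and random vectors stand in for real training speech frames.

```python
import numpy as np

def omp(D, x, k):
    """Orthogonal matching pursuit: greedy approximation of an l0-constrained
    fit, using at most k atoms (columns) of D to represent x."""
    residual = x.astype(float).copy()
    support, coef = [], np.zeros(0)
    for _ in range(k):
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:          # no new atom helps; stop early
            break
        support.append(idx)
        # Least-squares refit on all atoms selected so far
        coef, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    theta = np.zeros(D.shape[1])
    theta[support] = coef
    return theta

def ksvd(Y, n_atoms, k, n_iter=10, seed=0):
    """K-SVD sketch. Y holds one training frame per column.
    Returns dictionary D (unit-norm atoms) and sparse codes X with Y ~ D @ X."""
    rng = np.random.default_rng(seed)
    # Assumption: initialise atoms from randomly chosen training frames
    D = Y[:, rng.choice(Y.shape[1], n_atoms, replace=False)].astype(float)
    D /= np.linalg.norm(D, axis=0, keepdims=True) + 1e-12
    for _ in range(n_iter):
        # Sparse-coding step: code every frame against the current dictionary
        X = np.column_stack([omp(D, y, k) for y in Y.T])
        # Dictionary-update step: refit one atom at a time
        for j in range(n_atoms):
            users = np.nonzero(X[j])[0]      # frames that use atom j
            if users.size == 0:
                continue
            # Residual of those frames with atom j's contribution added back
            E = Y[:, users] - D @ X[:, users] + np.outer(D[:, j], X[j, users])
            U, s, Vt = np.linalg.svd(E, full_matrices=False)
            D[:, j] = U[:, 0]                # best rank-1 fit gives the new atom
            X[j, users] = s[0] * Vt[0]       # and its updated coefficients
    return D, X

# Stand-in data: 200 random "frames" of length 16 (not real speech)
rng = np.random.default_rng(0)
Y = rng.standard_normal((16, 200))
D, X = ksvd(Y, n_atoms=32, k=3, n_iter=5)
```

With 32 atoms for 16-dimensional frames the dictionary is overcomplete, and each column of `X` has at most `k` nonzero entries, which is the sparsity property the separation stage relies on.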

[0049] The speech used in the experiment is sampled at 16 kHz. There are four speakers, two men and two women. For each speaker, 40 sentences of Chinese phrase structure are used to construct the training speech, and 5 sentences of Chinese phrase structure are randomly selected as test speech; the test speech is different from the training speech. The single-channel mixed speech x is obtained by superimposing two source test utterances s_1 and s_2, that is, x = s_1 + s_2, a total of 100 mixed voices of men and women, 25 mixed voices of men and women, and 25 mixed voice...
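The mixing model x = s_1 + s_2 in [0049] can be illustrated with a short sketch. Only the 16 kHz sampling rate comes from the text; the helper names `mix` and `frame`, the frame length of 256 samples, the non-overlapping framing, and the sinusoids standing in for speech are assumptions made for illustration.

```python
import numpy as np

def mix(s1, s2):
    """Single-channel mixture x = s1 + s2, truncated to the shorter source."""
    n = min(len(s1), len(s2))
    return s1[:n] + s2[:n]

def frame(x, frame_len):
    """Cut x into non-overlapping frames, one frame per column.
    Trailing samples that do not fill a whole frame are dropped."""
    n_frames = len(x) // frame_len
    return x[:n_frames * frame_len].reshape(n_frames, frame_len).T

fs = 16000                                # sampling rate stated in the text
t = np.arange(fs) / fs                    # one second of samples
s1 = np.sin(2 * np.pi * 220 * t)          # stand-in for source utterance 1
s2 = 0.5 * np.sin(2 * np.pi * 330 * t)    # stand-in for source utterance 2
x = mix(s1, s2)
frames = frame(x, frame_len=256)          # assumed frame length
```

Each column of `frames` is one mixed frame; a frame-wise method processes these columns independently and joins the results in order.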



Abstract

The invention relates to a method for separating single-channel mixed speech based on compressed sensing and K-SVD. The method comprises the following steps: using the K-SVD algorithm on mixed training speech frames, construct a universally applicable overcomplete dictionary (a K-SVD dictionary) for each of three types of mixed training speech (male-male, male-female, and female-female), such that the signal is sparse under the dictionary while the reconstruction error stays within a certain range; on the basis of the constructed K-SVD dictionary, and starting from the similarity between the compressed sensing observation model and the expression for single-channel mixed speech, separate the single-channel mixed speech using an l0-norm-optimization-based signal reconstruction algorithm from compressed sensing theory; from the expression of each single-channel mixed speech frame, solve for an estimate of the sparse representation of each source speech frame under the K-SVD dictionary, and reconstruct each separated speech frame from that estimate and the K-SVD dictionary; finally, connect the separated speech frames in sequence to obtain the separated speech signals.
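The separation steps in the abstract can be sketched as below, under several loudly stated assumptions. The excerpt does not show how the patent assigns dictionary atoms to the two sources, so this sketch assumes two hypothetical per-source dictionaries `D1` and `D2`, writes each mixed frame as x = [D1 D2][theta1; theta2], and uses orthogonal matching pursuit (OMP) as a greedy stand-in for the l0-norm-optimization-based reconstruction algorithm, which the text does not name. The dictionaries here are synthetic orthonormal bases, not trained K-SVD dictionaries.

```python
import numpy as np

def omp(A, x, k):
    """Greedy l0 approximation: represent x with at most k columns of A."""
    residual = x.astype(float).copy()
    support, coef = [], np.zeros(0)
    for _ in range(k):
        idx = int(np.argmax(np.abs(A.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        coef, *_ = np.linalg.lstsq(A[:, support], x, rcond=None)
        residual = x - A[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    theta = np.zeros(A.shape[1])
    theta[support] = coef
    return theta

def separate(x, D1, D2, k):
    """Frame-wise separation sketch: sparse-code each mixed frame over
    [D1 D2], rebuild each source from its own part of the code, and join
    the separated frames in order."""
    L, m1 = D1.shape
    A = np.hstack([D1, D2])
    est1, est2 = [], []
    for i in range(len(x) // L):
        theta = omp(A, x[i * L:(i + 1) * L], k)
        est1.append(D1 @ theta[:m1])      # source 1 frame estimate
        est2.append(D2 @ theta[m1:])      # source 2 frame estimate
    return np.concatenate(est1), np.concatenate(est2)

# Synthetic check: two orthonormal dictionaries and exactly sparse sources
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.standard_normal((16, 16)))
D1, D2 = Q[:, :8], Q[:, 8:]

def sparse_source(D, n_frames, nnz):
    """Build a signal whose every frame uses nnz atoms of D."""
    out = []
    for _ in range(n_frames):
        theta = np.zeros(D.shape[1])
        idx = rng.choice(D.shape[1], nnz, replace=False)
        theta[idx] = rng.uniform(1.0, 2.0, nnz) * rng.choice([-1.0, 1.0], nnz)
        out.append(D @ theta)
    return np.concatenate(out)

s1 = sparse_source(D1, n_frames=3, nnz=2)
s2 = sparse_source(D2, n_frames=3, nnz=2)
x = s1 + s2
e1, e2 = separate(x, D1, D2, k=4)
```

In this idealised setting the combined dictionary is orthonormal, so the greedy search recovers both sources exactly. With learned, overcomplete dictionaries the recovery is only approximate and depends on how distinct the two sources' sparse representations are.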

Description

Technical field

[0001] The present invention relates to a special category of speech enhancement, namely speech separation, and in particular to a single-channel mixed speech separation method based on compressed sensing and K-SVD. It belongs to the technical field of speech signal processing.

Background

[0002] Speech is the most convenient, direct, and commonly used means of human communication. In real environments, however, speech signals are inevitably disturbed by surrounding noise during acquisition. These disturbances degrade the performance of speech processing systems (such as speech recognition systems) on the one hand, and impair the human ear's perception and understanding of speech on the other. Speech enhancement is therefore particularly necessary. Speech separation is a special kind of speech enhancement method. Its noise objects are generally speech-like noises that are difficult to process. It is based...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/02, G10L19/00, G10L21/028
Inventors: 郭海燕 (Guo Haiyan), 杨震 (Yang Zhen)
Owner: NANJING UNIV OF POSTS & TELECOMM