Multi-channel Speech Separation System Based on Gated Recursive Fusion of Deep Embedded Features

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of speech separation and feature fusion, which is applied in the field of signal processing, can solve problems such as damage to speech separation performance, difficulty in learning space and amplitude spectrum mutual information, unfavorable network learning and optimization, etc., to ensure performance, reduce distance, The effect of improving performance

Active Publication Date: 2022-03-01

中科极限元(杭州)智能科技股份有限公司

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

First, MDC only stitches the spatial information as an auxiliary feature to the magnitude spectrum feature, which makes it difficult to learn the mutual information between the space and the magnitude spectrum, and the distribution of IPDs and the magnitude spectrum feature is different. It is also not conducive to the learning and optimization of the network

Second, the training objective function of MDC is defined on the deep embedding vector, not on the real separation target. These deep embedding vectors cannot perfectly represent the target speech, so it will damage the performance of speech separation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0043] Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0044] Such as figure 1 As shown, the multi-channel speech separation system based on gated recursive fusion deep embedded features, including gated recursive fusion module, deep embedded feature extraction module, speech separation module, discriminative training module and joint training module, gated recursive fusion (GRF, Gated recurrent fusion) module, which uses the spatial information and amplitude spectrum information provided by the microphone array as two modes for deep fusion of spatial information and amplitude spectrum features, and outputs gated recurrent fusion features; deep embedded features The extraction module communicates with the gated recur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-channel speech separation system based on gated recursive fusion of deep embedded features, including a gated recursive fusion module, a deep embedded feature extraction module, a speech separation module, a discriminative training module and a joint training module, and the gated recursive The fusion module deeply fuses the spatial information and the amplitude spectrum information, and outputs the gated recursive fusion feature; the deep embedded feature extraction module, through the deeply embedded feature loss objective function, extracts more distinguishing features from the gated recursive fusion feature The deep embedded features; the voice separation module separates the deep embedded features to obtain each source and target speech signal; the discriminative training module obtains the discriminative loss objective function through the differentiated source and target speech signals; the joint training module passes Joint Training of Discriminative Loss Objective Function and Deeply Embedded Feature Loss Objective Function.

Description

technical field [0001] The invention relates to the technical field of signal processing, in particular to a multi-channel speech separation system based on gated recursive fusion of deep embedded features. Background technique [0002] Speech is one of the main means for human to exchange information, and speech separation has always occupied an important position in speech signal processing. Speech separation is also known as the cocktail conference problem, and its goal is to separate each target source speech signal from a speech signal containing multiple mixed speakers. When a piece of speech contains multiple speakers at the same time, it will seriously affect the performance of systems such as speech recognition, speaker recognition and hearing aids, so speech separation technology is particularly important. In the development process of speech separation technology, many speech separation methods based on deep learning have achieved good results, such as deep clust...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L21/0272G10L21/0308G10L25/30G06N3/04G06N3/08

CPCG10L21/0272G10L21/0308G10L25/30G06N3/08G06N3/045

Inventor 范存航温正棋

Owner 中科极限元(杭州)智能科技股份有限公司

Multi-channel Speech Separation System Based on Gated Recursive Fusion of Deep Embedded Features

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology