An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A heterogeneous network and speech enhancement technology, applied in speech analysis, instruments, etc., can solve problems such as missing information, achieve the effects of reducing input dimensions, increasing the diversity of base models, and reducing training parameters

Active Publication Date: 2022-04-22

SOUTH CHINA UNIV OF TECH

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the present invention is to solve the technical defects of the existing multi-objective learning speech enhancement system and the integrated learning speech enhancement system, and provide an integrated speech enhancement system based on a multi-objective heterogeneous network, which can effectively alleviate the parameters of multi-objective learning Optimize the collision problem and avoid the loss of information in the original input in the deep network propagation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0050] figure 1 A schematic structural diagram of an integrated speech enhancement system based on a multi-target heterogeneous network in this embodiment is shown in . Such as figure 1 As shown, an integrated speech enhancement system based on a multi-target heterogeneous network disclosed in this embodiment is composed of a feature extraction module, a feature dimensionality reduction module, m heterogeneous networks, and n gating units, wherein the original input and feature The extraction module and the feature dimension reduction module are connected, the feature extraction module is respectively connected to m heterogeneous networks, and the feature dimension reduction module and the m heterogeneous networks are respectively connected to n gating units.

[0051] This embodiment is specifically composed of a feature extraction module, a feature dimensionality reduction module, 3 heterogeneous networks, and 2 gating units. The original input is a noisy speech signal, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an integrated speech enhancement system based on a multi-target heterogeneous network. The system includes a feature extraction module, a feature dimensionality reduction module, m heterogeneous networks, and n gating units, and uses m heterogeneous networks as an integrated The sub-model of the speech enhancement system, each heterogeneous network leads to multi-objective branches from the first network layer, and connects the first and last layers of the heterogeneous network in a symmetrical manner, which can effectively alleviate the parameter optimization conflict problem of multi-objective learning, and can Avoiding the loss of information in the original input in the deep network propagation can improve the diversity of the base model of the integrated speech enhancement system, thereby improving the quality and intelligibility of the enhanced speech. The feature dimensionality reduction module calculates the correlation information between the original input speech frames and concatenates it with the current input frame as the input of n gating units, which greatly reduces the input dimension of n gating units without losing the original Correlation information between frames in the input.

Description

technical field [0001] The invention relates to the technical field of speech enhancement, in particular to an integrated speech enhancement system based on a multi-target heterogeneous network. Background technique [0002] Speech is the most important and direct information carrier in people's daily communication. However, voice signals are often polluted by various noises in life, such as speaker noise in restaurants, machine noise in factories, construction site noise in construction sites, car noise on the road, noisy crowd noise, etc. These noises will affect our acquisition and understanding of useful speech, resulting in a decrease in the quality and intelligibility of speech. [0003] Speech enhancement technology refers to eliminating noise components from noisy speech, extracting and recovering clean speech components, so as to improve the listening quality and intelligibility of speech. The algorithms in it include traditional statistics-based augmentation tech...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L21/02G10L21/0208

CPCG10L21/02G10L21/0208

Inventor张军吴悦宁更新冯义志杨萃余华季飞

OwnerSOUTH CHINA UNIV OF TECH

An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology