An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network

A heterogeneous network and speech enhancement technology, applied in speech analysis, instruments, etc., can solve problems such as missing information, achieve the effects of reducing input dimensions, increasing the diversity of base models, and reducing training parameters

Active Publication Date: 2022-04-22
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the technical defects of the existing multi-objective learning speech enhancement system and the integrated learning speech enhancement system, and provide an integrated speech enhancement system based on a multi-objective heterogeneous network, which can effectively alleviate the parameters of multi-objective learning Optimize the collision problem and avoid the loss of information in the original input in the deep network propagation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network
  • An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network
  • An Integrated Speech Enhancement System Based on Multi-objective Heterogeneous Network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0050] figure 1 A schematic structural diagram of an integrated speech enhancement system based on a multi-target heterogeneous network in this embodiment is shown in . Such as figure 1 As shown, an integrated speech enhancement system based on a multi-target heterogeneous network disclosed in this embodiment is composed of a feature extraction module, a feature dimensionality reduction module, m heterogeneous networks, and n gating units, wherein the original input and feature The extraction module and the feature dimension reduction module are connected, the feature extraction module is respectively connected to m heterogeneous networks, and the feature dimension reduction module and the m heterogeneous networks are respectively connected to n gating units.

[0051] This embodiment is specifically composed of a feature extraction module, a feature dimensionality reduction module, 3 heterogeneous networks, and 2 gating units. The original input is a noisy speech signal, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an integrated speech enhancement system based on a multi-target heterogeneous network. The system includes a feature extraction module, a feature dimensionality reduction module, m heterogeneous networks, and n gating units, and uses m heterogeneous networks as an integrated The sub-model of the speech enhancement system, each heterogeneous network leads to multi-objective branches from the first network layer, and connects the first and last layers of the heterogeneous network in a symmetrical manner, which can effectively alleviate the parameter optimization conflict problem of multi-objective learning, and can Avoiding the loss of information in the original input in the deep network propagation can improve the diversity of the base model of the integrated speech enhancement system, thereby improving the quality and intelligibility of the enhanced speech. The feature dimensionality reduction module calculates the correlation information between the original input speech frames and concatenates it with the current input frame as the input of n gating units, which greatly reduces the input dimension of n gating units without losing the original Correlation information between frames in the input.

Description

technical field [0001] The invention relates to the technical field of speech enhancement, in particular to an integrated speech enhancement system based on a multi-target heterogeneous network. Background technique [0002] Speech is the most important and direct information carrier in people's daily communication. However, voice signals are often polluted by various noises in life, such as speaker noise in restaurants, machine noise in factories, construction site noise in construction sites, car noise on the road, noisy crowd noise, etc. These noises will affect our acquisition and understanding of useful speech, resulting in a decrease in the quality and intelligibility of speech. [0003] Speech enhancement technology refers to eliminating noise components from noisy speech, extracting and recovering clean speech components, so as to improve the listening quality and intelligibility of speech. The algorithms in it include traditional statistics-based augmentation tech...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/02G10L21/0208
CPCG10L21/02G10L21/0208
Inventor 张军吴悦宁更新冯义志杨萃余华季飞
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products