A Speech Enhancement System Based on Temporal Modeling Generative Adversarial Networks

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A first-time, time-sensitive technology, applied in biological neural network models, speech analysis, instruments, etc., can solve the problems of time dependence of speech time-domain features and insufficient consideration of global aspects, and achieve reliable design principles, wide application prospects, simple structure

Active Publication Date: 2022-07-05

QILU UNIV OF TECH

View PDF9 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Aiming at the above-mentioned deficiencies in the prior art, the present invention provides a speech enhancement system based on temporal modeling to generate an adversarial network to solve the problem of insufficient consideration of the time-dependence and global aspects of generating adversarial network speech time-domain features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0036] In order to make those skilled in the art better understand the technical solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described The embodiments are only some of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0037] Key terms appearing in the present invention are explained below.

[0038] GRU: Gated Recurrent Unit, a gated recurrent unit, uses a gated mechanism to control information such as input and memory, and makes predictions at the current time.

[0039] figure 1 Shown is a speech enhancement system based on temporal m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a speech enhancement system based on a time modeling generative adversarial network, belonging to the technical field of speech signal processing, comprising: a data acquisition unit for acquiring a noisy speech signal and down-sampling the noisy speech signal; The signal enhancement unit is used for inputting the noisy speech signal into a generative adversarial network based on time modeling, compressing and extracting the global time domain feature of the speech signal, linking the time domain feature and random noise into a feature vector, and correcting the The feature vector is decoded to obtain an enhanced speech signal. The invention solves the problems of insufficient time dependence and global consideration of speech time domain features, reduces the influence of noise in the speech signal, and improves the hearing quality of the enhanced speech.

Description

technical field [0001] The invention belongs to the technical field of speech signal processing, and in particular relates to a speech enhancement system based on a temporal modeling generative confrontation network. Background technique [0002] Speech enhancement is a key technology to improve speech quality and intelligibility, that is, a technology that uses audio signal processing technology to remove noise and extract pure speech signals from an observation signal containing noise. Introducing significant speech distortion remains a formidable challenge. [0003] In recent years, with the rapid development of artificial intelligence technology and computer processing capabilities, deep learning has become a hot technology in many research fields and has achieved many remarkable research results. Because of the limited performance of traditional speech enhancement algorithms such as Wiener filtering and spectral subtraction, deep learning technology has been introduced...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L21/02G10L25/30G06N3/04

CPCG10L21/02G10L25/30G06N3/045

Inventor董安明张德辉禹继国韩玉冰李素芳张丽邱静刘洋张滕刘宗银

OwnerQILU UNIV OF TECH

A Speech Enhancement System Based on Temporal Modeling Generative Adversarial Networks

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology