Check patentability & draft patents in minutes with Patsnap Eureka AI!

Speech enhancement method and device, equipment and storage medium

A speech enhancement and audio technology, applied in speech analysis, instruments, etc., can solve the problems of gradient disappearance, sequence modeling, affecting speech enhancement effect, etc., to achieve the effect of improving generalization ability, improving speech enhancement effect, and improving generation efficiency

Pending Publication Date: 2021-09-28
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When performing speech enhancement on audio with a long time sequence, the traditional audio enhancement method will have the problem of gradient disappearance and long-term dependence when dealing with long-term information, making it impossible to effectively model long speech sequences. At the same time, in When the receptive field is smaller than the sequence length, it is impossible to perform sequence modeling at the speech level, so it has a certain impact on the modeling accuracy of the actual speech sequence, thus affecting the effect of speech enhancement

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method and device, equipment and storage medium
  • Speech enhancement method and device, equipment and storage medium
  • Speech enhancement method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0070] Such as figure 1 Shown is a flowchart of a preferred embodiment of the speech enhancement method of the present invention. According to different requirements, the order of the steps in the flowchart can be changed, and some steps can be omitted.

[0071] The speech enhancement method is applied to one or more electronic devices, and the electronic device is a device that can automatically perform numerical calculation and / or information processing according to preset or stored computer-readable instructions, and its hardware includes But not limited to microprocessors, application specific integrated circuits (Application Specific Integrated Circuit, ASIC), programmable gate arrays (Field-Programmable Gate Array, FPGA), digital signal pro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to artificial intelligence, and provides a speech enhancement method and device, equipment and a storage medium. According to the method, the pure audio can be expanded to obtain audio samples, the audio samples comprise audio with noise, the audio with noise is preprocessed to obtain a plurality of sequence features, each sequence feature is analyzed based on a time sequence processing network to obtain a plurality of output features, and time frequency features are generated according to the sequence features and the output features, frequency band information is extracted from the time-frequency features, the frequency band information is analyzed based on a frequency band processing network to obtain frequency band features, predicted audio is generated according to the frequency band features and the time-frequency features, network parameters are adjusted based on the predicted audio and pure audio to obtain an audio enhancement model, request audio is obtained, and enhancement processing is performed on the request audio based on the audio enhancement model, and the target audio is obtained. The enhancement effect of the target audio can be improved. In addition, the invention also relates to a block chain technology, and the target audio can be stored in a block chain.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a speech enhancement method, device, equipment and storage medium. Background technique [0002] Speech enhancement involves extracting target speech sources from reverberant and noisy speech environments. When performing speech enhancement on long-sequence audio, the traditional audio enhancement method will have the problem of gradient disappearance and long-term dependence when dealing with long-term information, making it impossible to effectively model long speech sequences. At the same time, in When the receptive field is smaller than the sequence length, it cannot perform sequence modeling at the utterance level, so it has a certain impact on the modeling accuracy of the actual speech sequence, thereby affecting the effect of speech enhancement. Contents of the invention [0003] In view of the above, it is necessary to provide a speech enhanceme...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0264
CPCG10L21/0264
Inventor 张之勇王健宗
Owner PING AN TECH (SHENZHEN) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More