Speech enhancement method and device thereof, equipment and medium
A voice enhancement and voice technology, applied in the field of signal processing, can solve problems such as long calculation time, high calculation cost, and unsatisfactory voice enhancement effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] Figure 1a It is a flow chart of a speech enhancement method provided by Embodiment 1 of the present invention. The embodiment of the present invention is applicable to the situation where the speech noise suppression model based on the attention mechanism is used to perform speech enhancement processing on noisy speech signals. The method can be implemented by this The voice enhancement device provided by the embodiment of the invention can be implemented by means of software and / or hardware, and can generally be integrated into computer equipment, such as vehicle-mounted terminal equipment.
[0029] Such as Figure 1a As shown, the speech enhancement method provided in this embodiment specifically includes:
[0030] S110. Acquire a target noisy speech signal, and perform a short-time Fourier transform on the target noisy speech signal to obtain a target frequency domain signal corresponding to the target noisy speech signal.
[0031] The target noisy speech signal ref...
Embodiment 2
[0075] Figure 2a It is a flowchart of a speech enhancement method provided by Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above embodiments, wherein, before acquiring the target noisy speech signal, it may also include:
[0076] Short-time Fourier transform is performed on the speech noise sample signal and the speech sample signal to obtain a first frequency domain signal corresponding to the speech noise sample signal and a second frequency domain signal corresponding to the speech sample signal; wherein, the speech The noisy sample signal is generated by superimposing the noise signal on the basis of the speech sample signal;
[0077] When the speech noise suppression model is trained, the feature of the current signal frame of the first frequency domain signal is input in the encoder to obtain the encoding feature corresponding to the current signal frame of the first frequency domain signal;
[0078] Inputting the encoding fea...
Embodiment 3
[0124] image 3 It is a schematic structural diagram of a speech enhancement device provided in Embodiment 3 of the present invention. The embodiment of the present invention is applicable to the situation where the speech noise suppression model based on the attention mechanism is used to perform speech enhancement processing on noisy speech signals. The device can use It can be implemented in the form of software and / or hardware, and can generally be integrated in computer equipment.
[0125] Such as image 3 As shown, the data query device specifically includes: a target frequency domain signal generation module 310 , an encoding feature generation module 320 , a decoding feature generation module 330 and a target enhanced speech signal generation module 340 . in,
[0126] The noisy speech signal processing module 310 is configured to obtain a target noisy speech signal, perform a short-time Fourier transform on the target noisy speech signal, and obtain a target frequenc...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com