Microphone array speech enhancement system and method based on multi-task network
A microphone array and voice enhancement technology, which is applied in voice analysis, instruments, etc., can solve problems such as difficult training, achieve voice enhancement, strong noise reduction performance, and overcome performance deficiencies
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0066] This embodiment discloses a microphone array speech enhancement system based on a multi-task network, the system structure is as follows figure 1 As shown, the system consists of a speech preprocessing module, a multi-task network module, a multi-task loss statistics module, a network weight calculation module and a speech reconstruction module. The speech preprocessing module is connected with the multi-task network module and the multi-task loss statistics module. This module obtains the array speech, the reference echo speech and the target speech of each task as the input speech, and preprocesses these input speeches. The preprocessing work includes speech signals. The logarithmic amplitude spectrum of each channel speech and the reference echo speech is extracted; the multi-task network module is connected with the speech preprocessing module, the multi-task loss statistics module and the network weight calculation module to complete the removal of each channel of t...
Embodiment 2
[0070] Based on the multi-task network-based microphone array speech enhancement system disclosed in the above embodiment, this embodiment continues to disclose a multi-task network-based microphone array speech enhancement method. The method adopts the following steps to complete training and testing, training and testing Process such as image 3 shown:
[0071] S1. Construct an array speech training set, preprocess the speech, and obtain the input features of each channel and the labels of the de-reverberation task, the echo cancellation task, the noise reduction task, and the fusion task; the process is as follows:
[0072] S1.1. Construct noisy array speech and corresponding de-reverberated array speech, de-reverberated and de-echoed array speech and noise-free array speech:
[0073] The noisy array speech is x(n)=[x 1 (n),x 2 (n),...,x m (n),...,x M (n)] T ,m∈[1,M], where M=4 is the total number of array elements, the generation of noisy speech is as follows Figur...
Embodiment 3
[0112] Based on the multi-task network-based microphone array speech enhancement system disclosed in the above embodiment, this embodiment continues to disclose a multi-task network-based microphone array speech enhancement method. The method adopts the following steps to complete training and testing, training and testing Process such as image 3 shown:
[0113] S1. Construct an array speech training set, preprocess the speech, and obtain the input features of each channel and the labels of the de-reverberation task, the echo cancellation task, the noise reduction task, and the fusion task; the process is as follows:
[0114] S1.1. Construct noisy array speech and corresponding de-reverberated array speech, de-reverberated and de-echoed array speech and noise-free array speech:
[0115] The noisy array speech is x(n)=[x 1 (n),x 2 (n),...,x m (n),...,x M (n)] T ,m∈[1,M], where M=4 is the total number of array elements, the generation of noisy speech is as follows Figur...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com