Voice separation method based on parameterized multi-phase gammatone filter bank
A filter bank, speech separation technology, used in speech analysis, instrumentation, etc., to solve problems such as suboptimal performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment
[0064] (1) Experimental settings:
[0065] The Conv-Tasnet network is trained for 200 epochs on 4-second long segments. The optimizer adopts Adam optimizer, and the initial learning rate is 0.001. If the performance does not improve for 5 consecutive epochs on the validation set, the learning rate is halved. Also, when the performance on the validation set has not improved in the past 10 epochs, the network training will be stopped. The hyperparameter setting of the network follows the network hyperparameters in Conv-Tasnet, where the number of filters N is 512. The mask functions of Temporal Convolutional Networks (TCN) are set as sigmoid function and rectified linear unit (ReLU) respectively. For ParaMPGTF, the order n is set to 2 and the magnitude α is set to 1. will c 1 and c 2 The initial value of is set to its empirical value, namely c 1 =24.7,c 2 =9.265. SI-SNR is used as the evaluation index. The reported results are the average results of 3000 sentences of t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com