Voice processing method and device thereof, electronic equipment and computer storage medium
A technology of voice processing and voice data, applied in the direction of voice analysis, instruments, etc., can solve the problems that affect the noise separation effect and cannot deeply dig out the correlation and difference between normal signal and noise signal
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0043] An embodiment of the present invention provides a speech processing method, such as figure 1 As shown, including:
[0044] Step 101, construct a training pair of first voice data and second voice data.
[0045] Among them, the first voice data can be clean speech data, referred to as Clean Audio, no noise voice data; the second voice data can be a voice data after increasing noise on the first voice data, referred to as Noisyaudio.
[0046] Constructing the training pair of first voice data and second voice data, can include:
[0047] Data enhancement processing is performed on the first voice data to obtain a corresponding second voice data; the first voice data and its corresponding second voice data composition training pair. Data enhancements include at least one of the following methods: the same category enhancement, noise enhancement, time shift enhancement, and pitch transformation enhancement.
[0048] NOISY AUDIO data is its corresponding Clean Audio generated by ...
Embodiment 2
[0074] Embodiment of the present invention provides a voice processing device, such as figure 2 As shown, including:
[0075] Building module 10 for building a training pair of first voice data and second voice data;
[0076] Generating module 20, configured to generate a generating model for generating the first voice data and the second speech data, generating a first embedding data corresponding to the first voice data, and corresponding to the second voice data Second embedding data;
[0077] The discrimination module 30 is used to train the first embedded data and the second embedded data input discriminant model to obtain the result of the discrimination;
[0078] The learning module 40 is configured to confront model learning according to the discriminant model, a random gradient decreased manner to obtain a speech noise reduction model;
[0079] Processing module 50 for noise reduction processing on the target voice data according to the speech noise reduction model.
[00...
Embodiment 3
[0088] The embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, wherein the processor, a communication interface, a memory that performs communication between each other through a communication bus; a memory for storing a computer program; processor The method steps of the embodiments of the present invention are implemented when the program is stored on the memory is executed.
[0089] The embodiment of the present invention further provides a computer readable storage medium, the computer readable storage medium stores a computer program, the method of implementing the method of the embodiment of the present invention when executed by the processor.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com