Audio processing method, device, storage medium and computer program
An audio processing and audio technology, applied in speech analysis, instruments, etc., can solve the problem of low audio clarity, achieve the effect of suppressing reverberation and improving clarity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] According to an embodiment of the present invention, an embodiment of an audio processing method is provided. It should be noted that the steps shown in the flow charts of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, and, although in The flowcharts show a logical order, but in some cases the steps shown or described may be performed in an order different from that shown or described herein.
[0037] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. figure 1 A hardware structural block diagram of a computer terminal (or mobile device) for implementing the audio processing method is shown. Such as figure 1As shown, the computer terminal 10 (or mobile device 10) may include one or more (shown by 102a, 102b, ..., 102n in the figure) processor 102 (the processor 102 may include but not limited ...
Embodiment 2
[0071] According to an embodiment of the present invention, an audio processing method is also provided, such as Figure 5 As shown, the method includes:
[0072] When acquiring sample data, combine the sampled speech in the speech database (language excluding noise and reverberation) with the reverberation feature in the reverberation feature database and the noise in the noise database to obtain multiple reverberation audio, and also That is, observe signal 1-observe signal M, and combine the sampled speech with the early reflections in the reverberation feature to obtain the target speech.
[0073] Further, the observation signal and the target speech are subjected to STFT transformation respectively to obtain the feature vectors of the observation signal and the target speech, and the feature vectors of the observation signal and the feature vectors of the target speech constitute the training set data, and the model is trained through the training set data , the obtained...
Embodiment 3
[0075] According to an embodiment of the present invention, an audio processing method is also provided, such as Figure 6 As shown, the method includes:
[0076] S61. The cloud server receives audio to be tested.
[0077] Specifically, the audio to be tested can be the audio obtained by the pickup collecting the sound from the sound source. The pickup and the sound source are in the same target space, and the target space can be a room. Due to the existence of reverberation in the room, the audio to be tested for reverb audio.
[0078] S62. The cloud server obtains the feature vector of the audio to be tested, uses the target model to process the feature vector of the audio to be tested, obtains the target time-frequency masking information, and processes the audio to be tested according to the target time-frequency masking information to obtain the target audio, wherein, the target The model is used to determine the time-frequency masking information corresponding to the r...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


