Audio processing method, device, storage medium and computer program

An audio processing technology, applied in speech analysis, instruments, and related fields, that addresses the problem of low audio clarity by suppressing reverberation and improving clarity.

Active Publication Date: 2022-02-18

ALIBABA DAMO (HANGZHOU) TECH CO LTD


Problems solved by technology

[0005] Embodiments of the present invention provide an audio processing method, device, storage medium, and computer program to at least solve the technical problem that audio collected by sound pickup equipment has low clarity because of reverberation in the space.


Examples


Embodiment 1

[0036] According to an embodiment of the present invention, an embodiment of an audio processing method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system, for example as a set of computer-executable instructions. Although the flowcharts show a logical order, in some cases the steps shown or described may be performed in a different order.

[0037] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Figure 1 shows a hardware structural block diagram of a computer terminal (or mobile device) for implementing the audio processing method. As shown in Figure 1, the computer terminal 10 (or mobile device 10) may include one or more processors 102 (shown as 102a, 102b, ..., 102n in the figure) (the processor 102 may include but is not limited ...

Embodiment 2

[0071] According to an embodiment of the present invention, an audio processing method is also provided. As shown in Figure 5, the method includes:

[0072] When acquiring sample data, the sampled speech in the speech database (speech free of noise and reverberation) is combined with the reverberation features in the reverberation feature database and the noise in the noise database to obtain multiple reverberant audio signals, namely observation signal 1 through observation signal M. The sampled speech is also combined with the early reflections in the reverberation feature to obtain the target speech.
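The data synthesis described in [0072] can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the "reverberation feature" is a room impulse response (RIR) that is convolved with the clean speech, that the early reflections are the first `early_taps` samples of that RIR, and that noise is mixed in at a chosen signal-to-noise ratio. The names `make_training_pair`, `early_taps`, and `snr_db` are hypothetical; the source states no cut-off or SNR.

```python
def convolve(signal, kernel):
    """Full linear convolution of two sequences of floats."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def power(x):
    """Mean squared value of a sequence."""
    return sum(v * v for v in x) / len(x)


def make_training_pair(speech, rir, noise, snr_db, early_taps):
    """Return (observation, target) for one clean utterance.

    observation = speech convolved with the full RIR, plus scaled noise
    target      = speech convolved with the early part of the RIR
                  (direct sound and early reflections)
    early_taps is an assumed cut-off; the source does not give one.
    """
    reverberant = convolve(speech, rir)
    target = convolve(speech, rir[:early_taps])
    # Pad the target so both signals have the same length.
    target += [0.0] * (len(reverberant) - len(target))
    # Repeat the noise to cover the signal, then scale it to the requested SNR.
    tiled = [noise[i % len(noise)] for i in range(len(reverberant))]
    gain = (power(reverberant) / (power(tiled) * 10 ** (snr_db / 10))) ** 0.5
    observation = [r + gain * n for r, n in zip(reverberant, tiled)]
    return observation, target
```

Calling `make_training_pair` once per combination of RIR and noise recording would yield the M observation signals for a single utterance, all sharing targets built from the same clean speech.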

[0073] Further, the observation signals and the target speech are each transformed with the short-time Fourier transform (STFT) to obtain their feature vectors. The feature vectors of the observation signals and the feature vectors of the target speech constitute the training set data, and the model is trained on the training set data, the obtained...
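As a rough illustration of the STFT step in [0073], the sketch below computes a naive STFT with a Hann window using only the standard library, then derives one feature vector per frame. The frame length, hop size, and the choice of log-magnitude features are assumptions made for the example; the source only says that feature vectors are obtained from the STFT. A real system would normally use an FFT library rather than this direct DFT.

```python
import cmath
import math


def stft(signal, frame_len=8, hop=4):
    """Naive short-time Fourier transform with a periodic Hann window.

    Returns a list of frames; each frame holds frame_len // 2 + 1 complex
    bins (the non-negative frequencies of a real-valued signal).
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        chunk = [signal[start + n] * window[n] for n in range(frame_len)]
        bins = []
        for k in range(frame_len // 2 + 1):
            acc = 0j
            for n, x in enumerate(chunk):
                acc += x * cmath.exp(-2j * math.pi * k * n / frame_len)
            bins.append(acc)
        frames.append(bins)
    return frames


def log_magnitude_features(spec, floor=1e-8):
    """One log-magnitude feature vector per frame (an assumed feature choice)."""
    return [[math.log(abs(b) + floor) for b in frame] for frame in spec]
```

Applying `stft` and `log_magnitude_features` to each observation signal and to the matching target speech would give the input and label sides of a training pair.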

Embodiment 3

[0075] According to an embodiment of the present invention, an audio processing method is also provided. As shown in Figure 6, the method includes:

[0076] S61. The cloud server receives audio to be tested.

[0077] Specifically, the audio to be tested can be audio obtained by a pickup that collects sound from a sound source. The pickup and the sound source are in the same target space, which can be a room. Because reverberation exists in the room, the audio to be tested is reverberant audio.

[0078] S62. The cloud server obtains the feature vector of the audio to be tested, processes the feature vector with the target model to obtain target time-frequency masking information, and processes the audio to be tested according to the target time-frequency masking information to obtain the target audio, wherein the target model is used to determine the time-frequency masking information corresponding to the r...
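The masking step in S62 can be pictured with the short sketch below. It assumes the time-frequency masking information is a real-valued gain per time-frequency bin that is multiplied onto the STFT of the audio to be tested; the source does not specify the mask's form, and `apply_mask` is a hypothetical helper, not a function named in the patent.

```python
def apply_mask(spec, mask):
    """Multiply each time-frequency bin of spec by the matching mask gain.

    spec: list of frames, each a list of complex STFT bins.
    mask: list of frames, each a list of real gains (assumed to lie in [0, 1]).
    """
    if len(spec) != len(mask):
        raise ValueError("spec and mask must have the same number of frames")
    masked = []
    for frame, gains in zip(spec, mask):
        if len(frame) != len(gains):
            raise ValueError("each frame must have one gain per frequency bin")
        masked.append([b * g for b, g in zip(frame, gains)])
    return masked
```

Under this assumption, the target audio would then be recovered by an inverse STFT of the masked spectrum, with bins dominated by late reverberation attenuated toward zero.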


Abstract

The invention discloses an audio processing method, device, storage medium and computer program. The method includes: obtaining the feature vector of the audio to be tested; inputting the feature vector into the target model to obtain target time-frequency masking information, wherein the target model is used to determine the time-frequency masking information corresponding to reverberant audio, and the time-frequency masking information is used to process the reverberant audio into target-type audio, which contains the direct sound and early reflections of the sound source corresponding to the reverberant audio; and processing the audio to be tested according to the target time-frequency masking information to obtain the target audio. The invention solves the technical problem that audio collected by a sound pickup device has low clarity because of reverberation in the space.

Description

Technical field

[0001] The present invention relates to the technical field of audio processing, in particular to an audio processing method, device, storage medium and computer program.

Background

[0002] Reverberation is an acoustic phenomenon in which sound persists in a space after the sound source has stopped. Reverberation lowers the clarity of speech collected by audio collection equipment and reduces the intelligibility of the collected speech.

[0003] In a larger space, two or more sound pickup devices must work together to collect the sound from each area of the space. However, because the space is large, reverberation in the sound collected by the pickup devices is very noticeable, which reduces the intelligibility of the captured audio content.

[0004] For the above problems, no effective solution has been proposed yet.

Contents of the...


Application Information


Patent Type & Authority: Patents (China)

IPC (8): G10L21/0208

CPC: G10L21/0208, G10L2021/02082

Inventor: 王子腾, 纳跃跃, 刘章, 田彪, 付强

Owner ALIBABA DAMO (HANGZHOU) TECH CO LTD
