Audio signal processing method and device and electronic equipment
An audio signal processing and audio signal technology, applied in the field of audio signal recognition, can solve problems such as high error rate, achieve real-time guarantee, improve efficiency and accuracy, and reduce workload
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0078] This embodiment one is aimed at figure 1 The multi-person speech scene shown provides an audio signal processing method, see figure 2 , the method may specifically include:
[0079] S210: Perform speech recognition and sound source localization on the audio signal collected in a multi-person speaking scene; wherein, when performing sound source localization on the audio signal, the following steps are respectively performed in units of signal frames in the audio signal deal with:
[0080] Obtaining the DOA spectrogram information of the current signal frame and the signal frames of the number of targets before and after it to form a matrix spectrogram, and smoothing the matrix spectrogram;
[0081] The sound source localization result of the current signal frame is determined according to the angle corresponding to the value satisfying the target condition in the smoothed DOA spectrogram corresponding to the current signal frame.
[0082] Among them, with regard to ...
Embodiment 2
[0105] In the first embodiment above, an information processing method in a specific multi-person speaking scene is introduced, which involves a specific sound source localization method, which can also be used in other application scenarios. For this reason, in Embodiment 2 of the present application, a sound source localization method is separately provided, see Figure 4 , the method may specifically include:
[0106] S410: Determine the audio signal to be processed;
[0107] There may be many kinds of audio signals to be processed, for example, they may be audio signals collected in real time in a certain scene, or they may be recording results, and so on.
[0108] S420: Obtain DOA spectrogram information of the current signal frame in the audio signal and the signal frames of the number of targets before and after it;
[0109] S430: Perform smoothing processing on a matrix spectrogram composed of direction-of-arrival spectrogram information of the signal frame and signa...
Embodiment 3
[0112] The third embodiment provides a specific application solution for a conference scene in which multiple people speak. Specifically, the third embodiment provides a method for generating meeting minutes, see Figure 5 , the method can include:
[0113] S510: Perform speech recognition and sound source localization on the audio signal collected in a conference scene where many people speak; wherein, when performing sound source localization on the audio signal, the signal frame in the audio signal is used as a unit, respectively Do the following:
[0114] Obtaining the DOA spectrogram information of the current signal frame and the signal frames of the number of targets before and after it to form a matrix spectrogram, and smoothing the matrix spectrogram;
[0115] Determine the sound source localization result of the current signal frame according to the angle corresponding to the value satisfying the target condition in the smoothed DOA spectrogram corresponding to the...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


