Speech signal detection method, device, equipment and storage medium
A voice signal and detection method technology, applied in the direction of voice analysis, instruments, etc., can solve the problems of low accuracy, undetectable end endpoint, difficult detection of voice signal start endpoint, etc., to achieve self-adaptive background noise, improve The effect of accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0035] Figure 1A It is a flow chart of a voice signal detection method provided by Embodiment 1 of the present invention. This embodiment is applicable to how to accurately detect a voice signal from an audio signal including noise. The method can be executed by the device provided in the embodiment of the present invention, and the device can be implemented in the form of software and / or hardware, and the device can be integrated in a computing device, or can be independently used as a device. see Figure 1A , the method may specifically include:
[0036] S101. Acquire an audio signal, where the audio signal includes a voice signal.
[0037] In this embodiment, the audio signal may be obtained in real time from a recording device, an audio collection device such as a microphone, a communication device, or an audio storage device. The voice signal refers to an effective signal in the audio signal, which may specifically be a voice signal that needs to occupy call resources. ...
Embodiment 2
[0062] Figure 2A It is a flow chart of a speech signal detection method provided by Embodiment 2 of the present invention. On the basis of Embodiment 1 above, this embodiment further determines the long-term eigenvalue and short-term eigenvalue based on the eigenvalue of each frame signal in the audio signal. eigenvalues are explained in detail. see Figure 2A , the method may specifically include:
[0063] S201. Acquire an audio signal, where the audio signal includes a voice signal.
[0064] S202. Extract feature values of each frame of the audio signal.
[0065] Specifically, after the audio signal is acquired, the feature value of each frame signal can be extracted through the VAD detection method. Optionally, the eigenvalue of each frame signal may be any one of time-domain energy, time-domain zero-crossing rate, logarithmic energy, spectral entropy, frequency-domain subband, and frequency-domain variance, which can be selected according to actual conditions.
...
Embodiment 3
[0079] image 3 It is a flow chart of a speech signal detection method provided by Embodiment 3 of the present invention. On the basis of the above-mentioned embodiments, this embodiment further determines the The starting point of the speech signal is explained in detail. see image 3 , the method may specifically include:
[0080] S301. Acquire an audio signal, where the audio signal includes a voice signal.
[0081] S302. Determine a long-term feature value and a short-term feature value according to the feature value of each frame of the audio signal.
[0082] S303. If the eigenvalue of the current frame signal is greater than the long-term eigenvalue or the short-term eigenvalue, and starting from the current frame signal, the eigenvalues of each frame signal within the first duration are greater than the long-term eigenvalue or the short-term eigenvalue, Then the current frame signal is taken as the starting point of the speech signal.
[0083] In order to ensure ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


