Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech endpoint detection method and system

An endpoint detection and voice technology, applied in voice analysis, instruments, etc., can solve problems such as large amount of calculation, slow detection speed, poor practicability, etc., and achieve the effect of simplifying calculation, reducing memory requirements, and simple statistical comparison.

Active Publication Date: 2022-05-03
AISPEECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Detection method based on wavelet transform: slow detection speed and poor practicability
[0008] In the above prior art, there are also problems of large amount of calculation and high requirements for power consumption, processor performance and memory on embedded devices.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech endpoint detection method and system
  • Speech endpoint detection method and system
  • Speech endpoint detection method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention. It should be noted that, in the case of no conflict, the embodiments of the present invention and the features in the embodiments can be combined with each other.

[0033] The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech endpoint detection method, comprising: obtaining multiple speech existence probabilities of multiple frequency points of the current audio frame obtained during the noise reduction process of the audio signal; determining the current audio frequency according to the multiple speech existence probabilities The existence probability of the voice signal of the frame; obtain the probability that the voice signal exists in each of the preceding L1 audio frames of the current audio frame, and determine the distance D1 of the audio frame with the largest probability value from the current audio frame; determine that there is a voice signal in the current audio frame Whether the average value of the probability and the sum of the probabilities of voice signals existing in each of the previous D1 audio frames is greater than the set threshold; if yes, it is determined that there are voice signals in the D1 audio frames behind the current audio frame. The present invention uses the voice existence probability of the audio frame during the audio signal noise reduction process to detect the voice endpoint, realizes simple statistical comparison using the signal processing result, greatly simplifies the calculation, and reduces the memory requirement.

Description

technical field [0001] The invention relates to the technical field of voice signal processing, in particular to a voice endpoint detection method and system. Background technique [0002] Voice activity detection (Voice Activity detection, VAD), also known as voice detection, is used in voice processing to detect the presence or absence of voice, thereby separating voice segments and non-voice segments in a signal. [0003] The current voice endpoint detection methods include: neural network method, double-threshold detection method, detection method based on autocorrelation maximum, and detection method based on wavelet transform. in, [0004] Neural network method: the features need to be designed manually, the implementation is more complicated, and the amount of calculation is relatively large. [0005] Double-threshold detection method: utilizes the short-term energy and short-term zero-crossing rate of speech, is suitable for scenes with high signal-to-noise ratio, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/87G10L25/84
CPCG10L25/87G10L25/84
Inventor 彭文超姜友海沈小正
Owner AISPEECH CO LTD