Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice endpoint detection method, apparatus, device, and computer-readable storage medium

An endpoint detection and voice technology, applied in the field of equipment, computer-readable storage media, devices, and voice endpoint detection methods

Active Publication Date: 2021-04-13
ONE CONNECT SMART TECH CO LTD SHENZHEN
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to provide a voice endpoint detection method, device, equipment and computer-readable storage medium, aiming to solve the technical problem of how to improve the accuracy of voice endpoint detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice endpoint detection method, apparatus, device, and computer-readable storage medium
  • Voice endpoint detection method, apparatus, device, and computer-readable storage medium
  • Voice endpoint detection method, apparatus, device, and computer-readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] Such as figure 1 as shown, figure 1 It is a schematic structural diagram of a voice endpoint detection device in the hardware operating environment involved in the solution of the embodiment of the present invention.

[0042] Such as figure 1 As shown, the voice endpoint detection device may include: a processor 1001 , such as a CPU, a network interface 1004 , a user interface 1003 , a memory 1005 , and a communication bus 1002 . Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. Optionally, the network interface 1004 may include a stand...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to the technical field of voice signal processing, and discloses a voice endpoint detection method, device, equipment and computer-readable storage medium. The time-domain signal is converted into a frequency-domain spectrum signal; each of the frequency-domain spectrum signals is traversed in turn, the current frequency-domain spectrum signal corresponding to the traversed current data frame is determined, and the current data frame is calculated according to the current frequency-domain spectrum signal short-term energy entropy ratio; detect whether the short-term energy entropy ratio is greater than the initial detection threshold of the speech signal; if the short-term energy entropy ratio is greater than the initial detection threshold of the speech signal, the current data frame Moving to a preset speech frame buffer, and determining the speech segment endpoint of the speech signal according to all the data frames in the speech frame buffer. The invention improves the accuracy of detecting the end point of the image and voice.

Description

technical field [0001] The present invention relates to the technical field of voice signal processing, in particular to a voice endpoint detection method, device, equipment and computer-readable storage medium. Background technique [0002] Speech endpoint detection, as a front-end processing method, requires a small amount of calculation and can output speech paragraphs in real time. Existing methods are mainly divided into two types: methods based on signal statistical characteristics, and methods based on deep networks. The former has fewer parameters and higher interpretability; the latter can solve speech segment detection under non-stationary noise to some extent, but the algorithm performance is highly dependent on the training set, requiring a large amount of data for training, and the generalization is poor . Statistical methods are mostly used in real-time systems, mainly based on the sub-band energy, zero-crossing rate, and spectral characteristics of the signa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/87G10L25/78G10L25/18
CPCG10L25/87G10L25/78G10L25/18G10L2025/783
Inventor 赵沁徐国强
Owner ONE CONNECT SMART TECH CO LTD SHENZHEN