Voice endpoint determination method and device, storage medium and electronic device

A technology to determine the method and voice, applied in voice analysis, voice recognition, instruments, etc., can solve the problems of low accuracy rate, achieve the effect of improving recognition accuracy and solving the effect of low accuracy rate

Active Publication Date: 2020-01-17
ZHEJIANG DAHUA TECH CO LTD
View PDF7 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a method and device, a storage medium, and an electronic device for determining a voice endpoint, so as to at least solve the problem in the related art that the voice endpoint detection only detects a single feature, resulting in low accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice endpoint determination method and device, storage medium and electronic device
  • Voice endpoint determination method and device, storage medium and electronic device
  • Voice endpoint determination method and device, storage medium and electronic device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0025] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking running on a mobile terminal as an example, figure 1 It is a hardware structural block diagram of a mobile terminal according to a method for determining a voice endpoint in an embodiment of the present invention. Such as figure 1 As shown, the mobile terminal 10 may include one or more ( figure 1 Only one is shown in the figure) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data. Optionally, the above-mentioned mobile terminal also A transmission device 106 for communication functions as well as input and output devices 108 may be included. Those of ordinary skill in the art can understand that, figure 1 The shown structure is only for illustration, and does not li...

Embodiment 2

[0079] An embodiment of the present invention also provides a storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.

[0080] Optionally, in this embodiment, the above-mentioned storage medium may be configured to store a computer program for performing the following steps:

[0081] S1, preprocessing the acquired audio signal to obtain a plurality of subbands, wherein the audio signal includes N audio signal frames, N is an integer greater than 1, and the subbands are obtained by dividing the audio signal frame based on frequency bands;

[0082] S2, according to the ratio of the signal-to-noise ratio and the spectral entropy of the subband, obtain the ratio of the signal-to-noise ratio and the spectral entropy of the audio signal frame;

[0083] S3, according to the ratio of the signal-to-noise ratio of the audio signal frame to the spectral entropy, use a double-t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a voice endpoint determination method and device, a storage medium and an electronic device, and the method comprises the following steps: carrying out the preprocessing of an obtained audio signal, obtaining a plurality of sub-bands, enabling the audio signal to comprise N audio signal frames, enabling N to be an integer greater than 1, and enabling the sub-bands to be obtained through the division of the audio signal frames based on a frequency band; obtaining the ratio of the signal-to-noise ratio to the spectral entropy of the audio signal frame according to the ratio of the signal-to-noise ratio to the spectral entropy of the sub-band; judging whether the audio signal frame is a voice frame or not by using a double-threshold detection algorithmaccording to the ratio of the signal-to-noise ratio to the spectral entropy of the audio signal frame; and if so, respectively determining the first voice frame and the last voice frame of the audiosignal as a voice starting endpoint and a voice ending endpoint of the audio signal. The problem of low accuracy due to the fact that voice endpoint detection only aims at a certain single feature inthe prior art is solved.

Description

technical field [0001] The present invention relates to the field of audio, video and communication technologies, in particular, to a method and device for determining a voice endpoint, a storage medium, and an electronic device. Background technique [0002] Speech endpoint detection is an important link in the field of speech information processing. In many practical applications such as speech response systems, speaker recognition systems and speech recognition systems, it is required to first judge the input signal of the system and accurately find out the location of the speech segment. The start point and the end point, so that the collected data can really be a valid voice signal, which can reduce the amount of transmitted data and calculation and reduce the processing time. In the current related technology, the voice endpoint detection only detects a single feature, and the accuracy rate is low. [0003] Aiming at the problem in related technologies that the voice ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/04G10L25/18G10L25/60G10L25/84
CPCG10L15/04G10L25/18G10L25/60G10L25/84
Inventor 陈烈
Owner ZHEJIANG DAHUA TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products