Voice endpoint determination method and device, storage medium and electronic device

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology to determine the method and voice, applied in voice analysis, voice recognition, instruments, etc., can solve the problems of low accuracy rate, achieve the effect of improving recognition accuracy and solving the effect of low accuracy rate

Active Publication Date: 2020-01-17

ZHEJIANG DAHUA TECH CO LTD

View PDF7 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Embodiments of the present invention provide a method and device, a storage medium, and an electronic device for determining a voice endpoint, so as to at least solve the problem in the related art that the voice endpoint detection only detects a single feature, resulting in low accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0025] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking running on a mobile terminal as an example, figure 1 It is a hardware structural block diagram of a mobile terminal according to a method for determining a voice endpoint in an embodiment of the present invention. Such as figure 1 As shown, the mobile terminal 10 may include one or more ( figure 1 Only one is shown in the figure) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data. Optionally, the above-mentioned mobile terminal also A transmission device 106 for communication functions as well as input and output devices 108 may be included. Those of ordinary skill in the art can understand that, figure 1 The shown structure is only for illustration, and does not li...

Embodiment 2

[0079] An embodiment of the present invention also provides a storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.

[0080] Optionally, in this embodiment, the above-mentioned storage medium may be configured to store a computer program for performing the following steps:

[0081] S1, preprocessing the acquired audio signal to obtain a plurality of subbands, wherein the audio signal includes N audio signal frames, N is an integer greater than 1, and the subbands are obtained by dividing the audio signal frame based on frequency bands;

[0082] S2, according to the ratio of the signal-to-noise ratio and the spectral entropy of the subband, obtain the ratio of the signal-to-noise ratio and the spectral entropy of the audio signal frame;

[0083] S3, according to the ratio of the signal-to-noise ratio of the audio signal frame to the spectral entropy, use a double-t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a voice endpoint determination method and device, a storage medium and an electronic device, and the method comprises the following steps: carrying out the preprocessing of an obtained audio signal, obtaining a plurality of sub-bands, enabling the audio signal to comprise N audio signal frames, enabling N to be an integer greater than 1, and enabling the sub-bands to be obtained through the division of the audio signal frames based on a frequency band; obtaining the ratio of the signal-to-noise ratio to the spectral entropy of the audio signal frame according to the ratio of the signal-to-noise ratio to the spectral entropy of the sub-band; judging whether the audio signal frame is a voice frame or not by using a double-threshold detection algorithmaccording to the ratio of the signal-to-noise ratio to the spectral entropy of the audio signal frame; and if so, respectively determining the first voice frame and the last voice frame of the audiosignal as a voice starting endpoint and a voice ending endpoint of the audio signal. The problem of low accuracy due to the fact that voice endpoint detection only aims at a certain single feature inthe prior art is solved.

Description

technical field [0001] The present invention relates to the field of audio, video and communication technologies, in particular, to a method and device for determining a voice endpoint, a storage medium, and an electronic device. Background technique [0002] Speech endpoint detection is an important link in the field of speech information processing. In many practical applications such as speech response systems, speaker recognition systems and speech recognition systems, it is required to first judge the input signal of the system and accurately find out the location of the speech segment. The start point and the end point, so that the collected data can really be a valid voice signal, which can reduce the amount of transmitted data and calculation and reduce the processing time. In the current related technology, the voice endpoint detection only detects a single feature, and the accuracy rate is low. [0003] Aiming at the problem in related technologies that the voice ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/04G10L25/18G10L25/60G10L25/84

CPCG10L15/04G10L25/18G10L25/60G10L25/84

Inventor陈烈

OwnerZHEJIANG DAHUA TECH CO LTD

Voice endpoint determination method and device, storage medium and electronic device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology