Speech VAD tail point determination method and device, electronic equipment and computer readable medium

A tail point determination technology, applied in speech analysis, instruments, and similar fields, that addresses problems such as slow response speed and degraded user experience

Pending Publication Date: 2020-09-04
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

To improve user experience with a faster response, the non-speech length at the tail of the VAD has to be shortened, which hurts the experience when the user pauses briefly. If the non-speech length at the tail of the voice VAD is configured too long, short pauses are tolerated, but the overall response time becomes slower.
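
The trade-off can be illustrated with a minimal Python sketch of an endpointer that uses a fixed tail silence length. The per-frame speech flags, the 100 ms frame length, the two tail lengths, and the function name `fixed_tail_endpoint` are all assumptions made for illustration; none of them come from the patent text.

```python
from typing import Optional, Sequence


def fixed_tail_endpoint(
    is_speech: Sequence[bool], frame_ms: int, tail_ms: int
) -> Optional[int]:
    """Return the time (ms) at which a fixed-tail VAD declares the end point,
    or None if the trailing silence never reaches tail_ms."""
    silence_ms = 0
    heard_speech = False
    for i, speech in enumerate(is_speech):
        if speech:
            heard_speech = True
            silence_ms = 0  # any speech frame resets the silence counter
        elif heard_speech:
            silence_ms += frame_ms
            if silence_ms >= tail_ms:
                return (i + 1) * frame_ms
    return None


# Hypothetical utterance with 100 ms frames:
# 500 ms speech, 400 ms pause, 500 ms speech, then 1000 ms of silence.
frames = [True] * 5 + [False] * 4 + [True] * 5 + [False] * 10

short_tail = fixed_tail_endpoint(frames, frame_ms=100, tail_ms=300)
long_tail = fixed_tail_endpoint(frames, frame_ms=100, tail_ms=800)
```

With the assumed 300 ms tail, the end point is declared at 800 ms, inside the pause, so the second half of the request is lost. With the assumed 800 ms tail, the pause is tolerated, but the end point is declared at 2200 ms although the speech ended at 1400 ms.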



Examples


Detailed Description of Embodiments

[0091] To help those skilled in the art better understand the technical solution of the present disclosure, the method and device for determining a voice VAD end point, the electronic equipment, and the computer-readable medium provided by the present disclosure are described in detail below with reference to the accompanying drawings.

[0092] Example embodiments are described more fully below with reference to the accompanying drawings, but they may be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0093] The terminology used herein is for describing particular embodiments only and is not intended to limit the present disclosure. As used herein, the singular forms "a" and "the" are intended to include the plural forms as well, unless the cont...


Abstract

The invention provides a method for determining the tail point in speech VAD. The method comprises the following steps: receiving speech information from a user, dividing the speech information into data packets, and uploading the data packets to a server in time order; when the current data packet is judged to be a silent packet, calculating the current silence duration t; and, according to the current silence duration t and a preset first threshold T1, triggering the server to detect the semantic integrity of the speech information, so that the server determines the tail point of the speech information according to the semantic integrity detection result. The intelligent equipment no longer cuts off the speech at the VAD tail point itself. Instead, it uploads the data packets divided from the speech information to the server in time order and triggers the server to detect the semantic integrity of the speech information, and the server determines the tail point according to the detection result. The silence duration at the tail point of the speech information thus changes from a fixed duration to a dynamically adjustable value.
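
The flow in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the packet shape (partial transcript plus a silence flag), the rule that the check is triggered once the silence (mute) duration t reaches T1, the callback `is_semantically_complete` standing in for the server, and the toy completeness rule are all hypothetical.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical packet shape: (transcript recognized so far, is_silent flag).
Packet = Tuple[str, bool]


def find_tail_point(
    packets: Sequence[Packet],
    packet_ms: int,
    t1_ms: int,
    is_semantically_complete: Callable[[str], bool],
) -> Optional[int]:
    """Return the time (ms) at which the tail point is declared, or None."""
    silence_ms = 0  # current silence duration t
    for i, (transcript, is_silent) in enumerate(packets):
        # Every packet would be uploaded here in time order (upload omitted).
        if not is_silent:
            silence_ms = 0
            continue
        silence_ms += packet_ms
        # Assumed trigger rule: ask the server once t has reached T1.
        if silence_ms >= t1_ms and is_semantically_complete(transcript):
            return (i + 1) * packet_ms
    return None


def toy_completeness(transcript: str) -> bool:
    # Toy stand-in for the server: a request ending in "by" is incomplete.
    return bool(transcript) and not transcript.endswith(" by")


packets = (
    [("play", False), ("play a song by", False)]
    + [("play a song by", True)] * 4      # short pause while the user thinks
    + [("play a song by Alice", False)]
    + [("play a song by Alice", True)] * 3
)
tail_ms = find_tail_point(packets, packet_ms=100, t1_ms=200,
                          is_semantically_complete=toy_completeness)
```

In this sketch the 400 ms pause after "play a song by" exceeds the assumed T1 of 200 ms, yet no tail point is declared because the request is judged incomplete. The tail point is declared at 900 ms, 200 ms after the completed request, so the effective tail silence depends on the content rather than on a fixed duration.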

Description

Technical Field

[0001] The present disclosure relates to the technical field of voice interaction, and in particular to a method and device for determining a voice VAD end point, electronic equipment, and a computer-readable medium.

Background

[0002] With the popularization of smart hardware, voice interaction has become a main means of interaction. Especially in the smart speaker scenario, a large number of users request resources by voice. A large amount of data shows that when people request a certain song by a certain singer, they often forget the name of the song and pause briefly, so the speech is cut off and the returned resource is not the expected one. The main reason for this phenomenon is that existing voice interaction technology uses VAD (Voice Activity Detection) to judge the end point of the speech, usually through signal and acoustic techniques; the pause is judged to be the end of the VAD and the speech is cut off.

[0003] The VAD technology...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/78, G10L25/87
CPC: G10L25/78, G10L25/87, G10L2025/786
Inventor: 郭启行, 崔亚峰, 孟宪海, 杜春明, 都伟, 李亚男, 邹赛赛
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD