Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice endpoint detection method and device

An endpoint detection and voice technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of low detection accuracy and inaccurate endpoint detection technology, and achieve the effect of improving the accuracy.

Active Publication Date: 2020-06-30
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the existing endpoint detection technology has the problem of inaccuracy, and the detection accuracy is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice endpoint detection method and device
  • Voice endpoint detection method and device
  • Voice endpoint detection method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0070] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above drawings are used to distinguish similar objects and not necessarily Describe a particular order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the invention described herein are, for example, capable of practice in sequences other th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a speech endpoint detection method and apparatus. The method includes steps: performing framing processing on a to-be-detected voice, obtaining a plurality ofto-be-detected voice frames, obtaining acoustic characteristics of the to-be-detected voice frames, and inputting the acoustic characteristics of the to-be-detected voice frames to a voice activity detection (VAD) model in sequence. The VAD model is used for outputting probabilities of the to-be-detected voice frames to be classified into initials, finals and noise, and the VAD model can accurately classify the acoustic characteristics of the to-be-detected voice frames so that the starting point and the terminal point of a voice segment are determined according to an output result of the VADmodel, and the accuracy of speech endpoint detection can be improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of voice recognition, and in particular, to a voice endpoint detection method and device. Background technique [0002] With the development of human-computer interaction technology, speech recognition technology shows its importance. In a speech recognition system, a speech endpoint detection technology is a very important technology, and is usually also called a voice activity detection technology (voice activity detection, VAD). Speech endpoint detection refers to finding the start and end points of a speech segment in a continuous audio signal. [0003] In the prior art, the starting point and the ending point of a speech segment in an audio signal can be determined by VAD technology. In the specific implementation, the audio signal is divided into frames, and the energy and zero-crossing rate of each audio frame are extracted based on traditional signal processing methods. Then, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/87G10L25/30G10L15/14G10L15/05G10L15/02G10L15/18
CPCG10L15/02G10L15/05G10L15/144G10L15/1807G10L25/87G10L2015/025
Inventor 李超朱唯鑫
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD