Voice endpoint detection method, device and equipment and storage medium

An endpoint detection and voice technology, applied in the field of artificial intelligence, can solve problems such as poor detection effect and low accuracy of voice recognition

Pending Publication Date: 2019-10-15
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to provide a voice endpoint detection method, device, equipment and storage m...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice endpoint detection method, device and equipment and storage medium
  • Voice endpoint detection method, device and equipment and storage medium
  • Voice endpoint detection method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0075] The invention provides a voice endpoint detection device.

[0076] refer to figure 1 , figure 1 It is a schematic structural diagram of the operating environment of the voice endpoint detection device involved in the solution of the embodiment of the present invention.

[0077] Such as figure 1 As shown, the voice endpoint detection device includes: a processor 1001 , such as a CPU, a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 . Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard), and the network interface 1004 may optionally include a standard wired interface or a wireless interface (such a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of artificial intelligence, and discloses a voice endpoint detection method. The method comprises the following steps of: obtaining input voice to be detected and a preset voice frame detection model; performing framing processing on the input voice to obtain a plurality of voice frames with time sequences; sequentially inputting each voice frame of theinput voice into the voice frame detection model for detection, and outputting a first detection result corresponding to each voice frame; sequentially performing harmonic energy detection on each voice frame of the input voice to obtain a second detection result corresponding to each voice frame; determining a frame type corresponding to each voice frame based on the first detection result and the second detection result; and determining a voice starting endpoint and a voice ending endpoint of the input voice based on the frame type corresponding to each voice frame. The invention further discloses a voice endpoint detection device and equipment and a computer readable storage medium. According to the invention, the accuracy of voice endpoint detection is improved.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence, in particular to a voice endpoint detection method, device, equipment and storage medium. Background technique [0002] In the existing speech recognition technology, speech endpoint detection is often required, that is, to detect the start position and end position of the speech. At present, the speech endpoint detection algorithm is usually only suitable for speech and speech in relatively quiet scenes. This method is suitable for relatively stable noise. (such as white noise, siren sound, etc.), the effect is better, but the effect is poor for noisy environments (such as public places with many people talking). The reason is that the noise in such situations also has the characteristics of speech, so it is difficult to accurately Distinguishes noise from speech, resulting in poor speech recognition rates. Contents of the invention [0003] The main purpose of the present inv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/05G10L15/06G10L25/78
CPCG10L15/05G10L15/063G10L25/78
Inventor 魏韬马骏王少军
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products