Voice activity detection and wake-up method and device

A voice endpoint detection and wake-up technology, applied in speech analysis, speech recognition, instruments, etc. It addresses the difficulty of achieving accurate, fast, low-latency, small-model and low-power voice endpoint detection, and achieves a low-latency effect.

Active Publication Date: 2018-05-08
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a voice endpoint detection and wake-up method and device to solve the problem in the prior art that accurate, fast, low-delay, small-model and low-power voice endpoint detection and voice wake-up are difficult to implement.

Examples

Embodiment Construction

[0050] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0051] As shown in Figure 1, an embodiment of the present invention provides a voice endpoint detection and wake-up method, including:

[0052] Step 101. Acquire voice endpoint detection data and wake-up data, and perform Fbank feature extraction on the voice endpoint detection data and wake-up data to obtain voice Fbank feature data.

[0053] Step 102. Input the speech Fbank feature data into the binarized neural network model to obtain output result d...
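The two steps above can be illustrated with a minimal sketch. This is not the patent's implementation: the frame sizes, the 40-filter mel bank, and the sign-based binarization of weights and activations are all assumptions chosen for illustration, and the function names (`fbank`, `binary_dense`) are hypothetical.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            fb[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            fb[i - 1, k] = (right - k) / (right - center)
    return fb

def fbank(signal, sample_rate=16000, frame_len=400, frame_step=160,
          n_fft=512, n_filters=40):
    """Log mel filterbank energies, one row per frame (assumed parameters).

    The signal must contain at least frame_len samples.
    """
    n_frames = 1 + (len(signal) - frame_len) // frame_step
    idx = (np.arange(frame_len)[None, :]
           + frame_step * np.arange(n_frames)[:, None])
    frames = signal[idx] * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    return np.log(energies + 1e-10)        # small offset avoids log(0)

def binarize(x):
    """Map values to +1 / -1 by sign (zero maps to +1)."""
    return np.where(x >= 0, 1.0, -1.0)

def binary_dense(x, weights, bias):
    """One dense layer with binarized inputs and weights (assumed scheme)."""
    return binarize(x) @ binarize(weights) + bias
```

With these assumed parameters, one second of 16 kHz audio yields a feature matrix with 40 columns, which `binary_dense` can consume with a weight matrix of 40 rows. Binarizing both operands keeps every product at +1 or -1, which is what makes such models small and cheap to evaluate.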

Abstract

The invention provides a voice activity detection and wake-up method and device, and relates to the technical field of machine learning speech recognition. The method includes the steps of: acquiring voice activity detection data and wake-up data, and performing Fbank feature extraction on them to obtain voice Fbank feature data; inputting the voice Fbank feature data to a binarized neural network model to obtain binarized neural network output result data; and, according to a preset backend evaluation strategy, processing the binarized neural network output result data, determining a voice start position and a voice end position of the voice activity detection data, and detecting wake-up word data in the wake-up data. The system framework of the invention can be applied to voice activity detection and voice wake-up at the same time, and can implement both with high accuracy, high speed, low delay, a small model and low power consumption.
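The text shown here does not spell out the preset backend evaluation strategy. As a hedged illustration only, one common approach is to smooth per-frame speech scores with a moving average, threshold them, and report the first sufficiently long run of speech frames as the start and end positions. The function names, the threshold, the window size, and the minimum run length below are all assumptions, not values taken from the patent.

```python
import numpy as np

def smooth(scores, window=5):
    """Centered moving average (use an odd window no longer than the input)."""
    kernel = np.ones(window) / window
    return np.convolve(scores, kernel, mode="same")

def find_endpoints(scores, threshold=0.5, min_frames=3, window=5):
    """Return (start_frame, end_frame) of the first speech run, or None.

    scores: per-frame speech scores in [0, 1] (hypothetical network output).
    A run counts as speech only if it lasts at least min_frames frames.
    """
    active = smooth(np.asarray(scores, dtype=float), window) >= threshold
    start = None
    for i, is_speech in enumerate(active):
        if is_speech and start is None:
            start = i                       # candidate speech onset
        elif not is_speech and start is not None:
            if i - start >= min_frames:
                return start, i - 1         # run was long enough
            start = None                    # too short: discard as noise
    if start is not None and len(active) - start >= min_frames:
        return start, len(active) - 1       # speech runs to the last frame
    return None
```

Frame indices convert to time by multiplying by the frame step used during feature extraction. The minimum run length trades latency against robustness: a larger value rejects more short noise bursts but delays the decision.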

Description

Technical field

[0001] The present invention relates to the technical field of machine learning speech recognition, in particular to a speech endpoint detection and wake-up method and device.

Background

[0002] With the development of speech recognition technology, digital equipment and multimedia technology, speech endpoint detection technology has advanced considerably. Voice Activity Detection (VAD for short) is a technology for detecting voice fragments in continuous signals. Voice activity detection is often combined with Automatic Speech Recognition (ASR) systems and voiceprint recognition systems, and efficient, accurate voice endpoint detection has become an essential part of these systems. Voice wake-up refers to the process of detecting predefined keywords in the audio stream. Once the keyword is detected, it wakes up embedded devices such as mobile phones and speakers. To achieve accurate, fast, low-latency, small-model and low-power v...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/02, G10L15/04, G10L15/14, G10L15/16, G10L15/26
CPC: G10L15/02, G10L15/04, G10L15/142, G10L15/16, G10L15/26
Inventor: 尹首一, 宋丹丹, 欧阳鹏, 刘雷波, 魏少军
Owner: TSINGHUA UNIV