Universal voice awakening identification method and system under full-phoneme frame

A voice wake-up and recognition method technology, applied in voice recognition, voice analysis, instruments, etc., can solve the problem of high cost of replacing customized words, achieve the effect of eliminating false alarm rate increase, wide application, and good performance

Inactive Publication Date: 2018-07-13
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF7 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to overcome the above-mentioned problems existing in the current voice wake-up recognition system, realize the rapid customization of the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Universal voice awakening identification method and system under full-phoneme frame
  • Universal voice awakening identification method and system under full-phoneme frame
  • Universal voice awakening identification method and system under full-phoneme frame

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0039] Such as figure 1 As shown, a general voice wake-up recognition method under the whole phoneme framework, the method specifically includes:

[0040] Step 1) training DNN (deep neural network) acoustic model;

[0041] At present, the deep neural network has achieved outstanding results in the field of continuous speech recognition. The deep neural network (DeepNeural Network, DNN) is a neural network with strong modeling capabilities, which can learn features of any dimension well. Compared with traditional The Gaussian mixture model is more suitable for acoustic modeling. It is structured like figure 1 As shown, including the input layer, hidden layer and output layer, the weighted line connection between the nodes of adjacent layers, the number of nodes in the output layer is determined by the number of clustering categories...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a universal voice awakening identification method and a system under a full-phoneme frame. The method comprises the following steps: firstly, training a deep neural network acoustic model, modifying a dictionary according to awakening words, constructing a decoding network based on the filler, and training a support vector machine classifier according to training samples; preprocessing the input voice, inputting processed voice characteristics into a decoding network for decoding the processed voice characteristics, calculating an acoustic score according to the deep neural network acoustic model, and obtaining decoding results; inputting the statistical magnitude of successfully recognized decoding results into the support vector machine classifier for classification, and obtaining a final recognition result. According to the method disclosed by the invention, triphonon states obtained through the extension of all atonal phonemes are subjected to modeling, andthen a universal acoustic model is obtained. During the decoding process, a decoding path is limited. Therefore, the awakening performance can be improved. Meanwhile, the later-stage processing part is combined to analyze the multi-dimensional statistics of the phoneme posterior probability and the like on each path. As a result, the hidden danger that the false alarm rate is increased is eliminated.

Description

technical field [0001] The invention relates to the field of voice wake-up, in particular to a universal voice wake-up recognition method and system under the whole phoneme framework. Background technique [0002] With the rapid development of speech recognition technology, speech recognition technology has been widely used, and the application of voice wake-up technology in smart phones and smart homes is increasingly changing human life and production methods. The traditional voice wake-up recognition system is generally constructed for a specific wake-up word (that is, the system is triggered by a user-predetermined wake-up word), and there are mainly two existing methods, one is based on dynamic time warping (dynamic time warping, The DTW) method uses the acoustic features of speech for dynamic matching. First, it is necessary to record several speeches of the same wake-up word in advance, and then dynamically match the speech of the wake-up word collected on-site with t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/183G10L17/02G10L17/04G10L19/16G10L25/30
CPCG10L15/02G10L15/063G10L15/08G10L15/183G10L17/02G10L17/04G10L19/173G10L25/30
Inventor 徐及张震李文凤李鹏颜永红
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products