Universal voice awakening identification method and system under full-phoneme frame

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voice wake-up and recognition method technology, applied in voice recognition, voice analysis, instruments, etc., can solve the problem of high cost of replacing customized words, achieve the effect of eliminating false alarm rate increase, wide application, and good performance

Inactive Publication Date: 2018-07-13

INST OF ACOUSTICS CHINESE ACAD OF SCI +1

View PDF7 Cites 41 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The purpose of the present invention is to overcome the above-mentioned problems existing in the current voice wake-up recognition system, realize the rapid customization of the voice wake-up system by training a general acoustic model, and combine the limited decoding path and post-processing to overcome the problem of high cost of replacing customized words

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0038] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0039] Such as figure 1 As shown, a general voice wake-up recognition method under the whole phoneme framework, the method specifically includes:

[0040] Step 1) training DNN (deep neural network) acoustic model;

[0041] At present, the deep neural network has achieved outstanding results in the field of continuous speech recognition. The deep neural network (DeepNeural Network, DNN) is a neural network with strong modeling capabilities, which can learn features of any dimension well. Compared with traditional The Gaussian mixture model is more suitable for acoustic modeling. It is structured like figure 1 As shown, including the input layer, hidden layer and output layer, the weighted line connection between the nodes of adjacent layers, the number of nodes in the output layer is determined by the number of clustering categories...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a universal voice awakening identification method and a system under a full-phoneme frame. The method comprises the following steps: firstly, training a deep neural network acoustic model, modifying a dictionary according to awakening words, constructing a decoding network based on the filler, and training a support vector machine classifier according to training samples; preprocessing the input voice, inputting processed voice characteristics into a decoding network for decoding the processed voice characteristics, calculating an acoustic score according to the deep neural network acoustic model, and obtaining decoding results; inputting the statistical magnitude of successfully recognized decoding results into the support vector machine classifier for classification, and obtaining a final recognition result. According to the method disclosed by the invention, triphonon states obtained through the extension of all atonal phonemes are subjected to modeling, andthen a universal acoustic model is obtained. During the decoding process, a decoding path is limited. Therefore, the awakening performance can be improved. Meanwhile, the later-stage processing part is combined to analyze the multi-dimensional statistics of the phoneme posterior probability and the like on each path. As a result, the hidden danger that the false alarm rate is increased is eliminated.

Description

technical field [0001] The invention relates to the field of voice wake-up, in particular to a universal voice wake-up recognition method and system under the whole phoneme framework. Background technique [0002] With the rapid development of speech recognition technology, speech recognition technology has been widely used, and the application of voice wake-up technology in smart phones and smart homes is increasingly changing human life and production methods. The traditional voice wake-up recognition system is generally constructed for a specific wake-up word (that is, the system is triggered by a user-predetermined wake-up word), and there are mainly two existing methods, one is based on dynamic time warping (dynamic time warping, The DTW) method uses the acoustic features of speech for dynamic matching. First, it is necessary to record several speeches of the same wake-up word in advance, and then dynamically match the speech of the wake-up word collected on-site with t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/183G10L17/02G10L17/04G10L19/16G10L25/30

CPCG10L15/02G10L15/063G10L15/08G10L15/183G10L17/02G10L17/04G10L19/173G10L25/30

Inventor徐及张震李文凤李鹏颜永红

OwnerINST OF ACOUSTICS CHINESE ACAD OF SCI

Universal voice awakening identification method and system under full-phoneme frame

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology