Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model

A convolutional network and acoustic model technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as low stability and single receptive field, achieve high wake-up rate, increase receptive field, and improve wake-up accuracy. Effect

Active Publication Date: 2021-07-16
AISPEECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to at least solve the problem that the stacked one-dimensional convolutional network wakes up the acoustic model with a relatively single receptive field and low stability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model
  • Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model
  • Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0023] Such as figure 1 Shown is a flow chart of an optimization method for a stacked one-dimensional convolutional network wake-up acoustic model provided by an embodiment of the present invention, including the following steps:

[0024] S11: Adjust the expansion coefficient of the time-domain convolution layer in the stacked one-dimensional con...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides an optimization method of a stacked one-dimensional convolutional network wake-up acoustic model. The method comprises the following steps: adjusting an expansion coefficient of a time domain convolutional layer in a stacked one-dimensional convolutional network wake-up acoustic model, and increasing a receptive field output by the time domain convolutional layer; and setting an activation function of the time domain convolution layer as a gated linear unit, and combining the gated linear unit with the output of the time domain convolution layer to reduce the dimension of the output of the time domain convolution layer so as to optimize the stacked one-dimensional convolution network wake-up acoustic model. The embodiment of the invention further provides a system for optimizing the stacked one-dimensional convolutional network wake-up acoustic model. According to the embodiment of the invention, the interval of the convolution kernel causes the increase of the receptive field, so that the receptive field of the model is effectively increased, the wake-up precision is improved, meanwhile, after the gating linear unit is combined with the S1DCNN model, the output dimension can be reduced to half of the original dimension, the model parameter quantity is better compressed, and a higher wake-up rate can be achieved under the same parameter quantity.

Description

technical field [0001] The present invention relates to the field of intelligent speech, in particular to a stacked one-dimensional convolutional network wake-up acoustic model optimization method and system. Background technique [0002] S1DCNN (Stacked 1D convolutional networks, stacked one-dimensional convolutional network) awakens the acoustic model, which is composed of several different S1DCNN layers; each S1DCNN layer is mainly composed of two one-dimensional convolutional layers (cnn). The first convolutional layer is frequency-domain convolution, the second convolutional layer is time-domain convolution, and each channel is independent of each other, which is called depth-wise (depth) convolution. Compared with the traditional two-dimensional CNN, S1DCNN can achieve the same level of performance while reducing the amount of calculation, or the performance loss is small, and it has a higher performance in the lightweight acoustic model (for example, the acoustic mode...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/16G10L19/008
CPCG10L15/063G10L15/16G10L19/008Y02T90/00
Inventor 王蒙薛少飞唐健
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products