Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional network and acoustic model technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as low stability and single receptive field, achieve high wake-up rate, increase receptive field, and improve wake-up accuracy. Effect

Active Publication Date: 2021-07-16

AISPEECH CO LTD

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] In order to at least solve the problem that the stacked one-dimensional convolutional network wakes up the acoustic model with a relatively single receptive field and low stability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0022] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0023] Such as figure 1 Shown is a flow chart of an optimization method for a stacked one-dimensional convolutional network wake-up acoustic model provided by an embodiment of the present invention, including the following steps:

[0024] S11: Adjust the expansion coefficient of the time-domain convolution layer in the stacked one-dimensional con...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides an optimization method of a stacked one-dimensional convolutional network wake-up acoustic model. The method comprises the following steps: adjusting an expansion coefficient of a time domain convolutional layer in a stacked one-dimensional convolutional network wake-up acoustic model, and increasing a receptive field output by the time domain convolutional layer; and setting an activation function of the time domain convolution layer as a gated linear unit, and combining the gated linear unit with the output of the time domain convolution layer to reduce the dimension of the output of the time domain convolution layer so as to optimize the stacked one-dimensional convolution network wake-up acoustic model. The embodiment of the invention further provides a system for optimizing the stacked one-dimensional convolutional network wake-up acoustic model. According to the embodiment of the invention, the interval of the convolution kernel causes the increase of the receptive field, so that the receptive field of the model is effectively increased, the wake-up precision is improved, meanwhile, after the gating linear unit is combined with the S1DCNN model, the output dimension can be reduced to half of the original dimension, the model parameter quantity is better compressed, and a higher wake-up rate can be achieved under the same parameter quantity.

Description

technical field [0001] The present invention relates to the field of intelligent speech, in particular to a stacked one-dimensional convolutional network wake-up acoustic model optimization method and system. Background technique [0002] S1DCNN (Stacked 1D convolutional networks, stacked one-dimensional convolutional network) awakens the acoustic model, which is composed of several different S1DCNN layers; each S1DCNN layer is mainly composed of two one-dimensional convolutional layers (cnn). The first convolutional layer is frequency-domain convolution, the second convolutional layer is time-domain convolution, and each channel is independent of each other, which is called depth-wise (depth) convolution. Compared with the traditional two-dimensional CNN, S1DCNN can achieve the same level of performance while reducing the amount of calculation, or the performance loss is small, and it has a higher performance in the lightweight acoustic model (for example, the acoustic mode...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/06G10L15/16G10L19/008

CPCG10L15/063G10L15/16G10L19/008Y02T90/00

Inventor 王蒙薛少飞唐健

Owner AISPEECH CO LTD

Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

Agents

Company

Optimization method and system of stacked one-dimensional convolutional network wake-up acoustic model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

Agents

Company

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology