Model training method and device based on reversible separation convolution and computer equipment

A model training and convolution technology, applied in the computer field, can solve the problems of unfavorable speech synthesis "system popularization and use, high cost of one-way mapping system construction and training, and achieve the effect of improving the model training effect and reducing the amount of calculation.

Active Publication Date: 2020-07-17
深圳市友杰智新科技有限公司
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of this application is to provide a model training method based on reversible separable convolution, which aims to solve the problem that the construction and training costs of the existing one-way mapping system are very high, which is not conducive to the general promotion of "speech recognition" and "speech synthesis" systems Technical issues used

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device based on reversible separation convolution and computer equipment
  • Model training method and device based on reversible separation convolution and computer equipment
  • Model training method and device based on reversible separation convolution and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053]In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0054] refer to figure 1 , a model training method based on reversible separation convolution according to an embodiment of the present application, the model includes a network for processing audio and a network for processing text, and both the network for processing audio and the network for processing text include reversible separation convolution layer, the method includes:

[0055] S1: Obtain the speech data of the specified data pair in the training set of the network processing audio, obtain the first high-dimensional vector, obtain the text data of the specified d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a model training method based on reversible separation convolution. The model comprises a network for processing audios and a network for processing texts; the network for processing the audio and the network for processing the text respectively comprise a reversible separation convolution layer; the method comprises the following steps: acquiring voice data of a specifieddata pair in a network computing training set for processing the audio, obtaining a first high-dimensional vector, obtaining text data of a network calculation specified data pair for processing a text, and obtaining a second high-dimensional vector, wherein the training set is composed of a data pair formed by voice data and text data, and the specified data pair is any data pair in the trainingset; training an audio processing network and a text processing network on the training set through a loss function, wherein the loss function is a spatial distance between the first high-dimensionalvector and the second high-dimensional vector; judging whether the loss function reaches a minimum value or not; and if so, judging that the training converges to obtain a twin network structure composed of the network for processing the audio and the network for processing the text. The model construction and training cost is saved.

Description

technical field [0001] This application relates to the field of computers, in particular to a model training method, device and computer equipment based on reversible separation convolution. Background technique [0002] "Speech recognition" and "speech synthesis" are two "sequence-to-sequence" prediction tasks in a dual relationship, which can be modeled using the encoder-decoder framework. Since the training data of "speech recognition" and "speech synthesis" are not universal, the existing speech recognition system only achieves a one-way mapping of aligning speech information to text information, and speech synthesis only achieves aligning text information to speech information one-way mapping. Due to the diversity of sequences, the scale of each one-way mapping system is very large, and the amount of data required for training the system is also very large, so the construction and training costs of each one-way mapping system are very high, which is not conducive to "s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08G10L13/02G10L15/06G10L15/16G10L25/30
CPCG06N3/08G10L15/063G10L15/16G10L13/02G10L25/30G06N3/045
Inventor 徐泓洋太荣鹏温平
Owner 深圳市友杰智新科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products