Unlock instant, AI-driven research and patent intelligence for your innovation.

Regularization of recurrent machine-learned architectures

A machine learning model and recursive technology, applied in the field of recursive machine learning models, can solve the problems of difficult parameter training, loss of context information, etc.

Pending Publication Date: 2021-09-14
THE TORONTO DOMINION BANK
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, as the complexity and size of the model increases, the parameters are often difficult to train, which can lead to overfitting the model to the dataset or losing contextual information that might be useful for generating predictions
Although regularization methods have been applied to reduce model complexity, training recurrent machine learning models to preserve important contextual information and control sensitivity to continuous input data remains a challenging problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Regularization of recurrent machine-learned architectures
  • Regularization of recurrent machine-learned architectures
  • Regularization of recurrent machine-learned architectures

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] figure 1 is a high-level block diagram of a system environment for document analysis system 110, according to an embodiment. Depend on figure 1 The illustrated system environment 100 includes one or more client devices 116 , a network 120 , and a modeling system 110 . In alternate configurations, different and / or additional components may be included in system environment 100 .

[0019] Modeling system 110 is a system for training various machine learning models. Modeling system 110 may provide the trained model to a user of client device 116 or may use the trained model to perform inference for various tasks. In one embodiment, the modeling system 110 trains a recursive machine learning model that can be used to generate sequential predictions. Sequential predictions are ordered sets of predictions in which predictions in a sequence can depend on the values ​​of previous or subsequent predictions with respect to space or time. For example, sequential prediction ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A modeling system trains a recurrent machine-learned model by determining a latent distribution and a prior distribution for a latent state. The parameters of the model are trained based on a divergence loss that penalizes significant deviations between the latent distribution the prior distribution. The latent distribution for a current observation is a distribution for the latent state given a value of the current observation and the latent state for the previous observation. The prior distribution for a current observation is a distribution for the latent state given the latent state for the previous observation independent of the value of the current observation, and represents a belief about the latent state before input evidence is taken into account.

Description

[0001] CROSS-REFERENCE TO RELATED APPLICATIONS [0002] This application claims the benefit of and priority to US Provisional Application No. 62 / 778,277, filed on December 11, 2018, the entire contents of which are incorporated herein by reference. Background technique [0003] The present invention relates generally to recursive machine learning models, and more particularly to regularization of recursive machine learning models. [0004] Modeling systems typically use recursive machine learning models such as recurrent neural networks (RNNs) or long short-term memory models (LSTMs) to generate sequential predictions. Recursive machine learning models are configured to generate subsequent predictions based on latent states for the current prediction, sometimes incorporating an initial sequence of actual inputs. The current latent state represents contextual information about predictions generated up to the current prediction, and is generated based on the latent state for t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00G06F17/18G06N3/08
CPCG06N3/084G06N3/047G06N3/044G06N3/045G06N20/20
Inventor M·J·R·拉瓦特K·K·梁H·萨迪吉M·沃尔克弗斯
Owner THE TORONTO DOMINION BANK